Conversation
|
I also added instructions for building and testing the repo. |
|
My workflow is as follows.
|
|
That won't work because you have to tame the LLM; otherwise it starts spewing nonsense. Task it with resolving this PR's test failures and you will see it. |
|
Let me give it a try; I will update you soon. |
|
@ilayn I have made a commit to a feature branch. Changes:
|
|
I appreciate the effort, but you need to tame that LLM so that it does not do irrelevant things like adding uv support or changing the formatting. It just needs to fix the test failures by running the tests itself and adjusting the wrapper and the code, and then maybe offer recommendations. Typically this is only possible by writing a very long description of the problem at hand; mine is almost a full page. Then I set it free so it starts iterating over those precise goals. As far as I can see, your LLM is running wild right now and adding random stuff, which is not something we want. LLMs are terrible at precise goals without precise explanations. You have to restrict their context to the minimum. |
|
Need to tune the CLAUDE.md file. |
|
@ilayn what was happening was the agent was running |
|
@ilayn can you review this test fix plan? It is planning to remove some tests. jamestjsp#3 |
These tests were generated with an LLM, but they need modifications because I used the wrong prompt. Moreover, the translations can still have mistakes. Fun.
CC @jamestjsp if you want to stretch your test-writing muscle.