▲ | the_mitsuhiko 4 days ago | |||||||
Technically speaking, you can get away with just a Bash tool, and I had some success with this. It's actually quite interesting to take away tools from agents and see how creative they are with the use. One of the reasons why you get better performance if you give them the other tools is that there has been some reinforcement learning on Sonne with all these tools. The model is aware of how these tools work, it is more token-efficient and it is generally much more successful at performing those actions. The Bash tool, for instance, at times gets confused by bashisms, not escaping arguments correctly, not handling whitespace correctly etc. | ||||||||
▲ | 4 days ago | parent | next [-] | |||||||
[deleted] | ||||||||
▲ | normie3000 4 days ago | parent | prev | next [-] | |||||||
> The model is aware of how these tools work, it is more token-efficient and it is generally much more successful at performing those actions. Interesting! This didn't seem to be the case in the OP's examples - for instance using a list_files tool and then checking if the json result included README vs bash [ -f README ] | ||||||||
| ||||||||
▲ | dotancohen 4 days ago | parent | prev [-] | |||||||
This was the only informative sentence in the reply. Can you please go on in this manner - it was an important question. |