Remix.run Logo
simonw 5 days ago

OK that really is pretty simple, thanks for sharing.

The whole thing runs on these prompts: https://github.com/SWE-agent/mini-swe-agent/blob/7e125e5dd49...

  Your task: {{task}}. Please reply
  with a single shell command in
  triple backticks.
  
  To finish, the first line of the
  output of the shell command must be
  'COMPLETE_TASK_AND_SUBMIT_FINAL_OUTPUT'.
sireat 4 days ago | parent | next [-]

Pretty sure you also need about 120 lines of prompting from default.yaml

https://github.com/SWE-agent/mini-swe-agent/blob/7e125e5dd49...

nivertech 4 days ago | parent | prev | next [-]

  system_template: str = "You are a helpful assistant that can do anything."
anything? Sounds like an AI Safety issue ;)
greleic 4 days ago | parent [-]

You’d be surprised at the amount of time wasted because LLMs “think” they can’t do something. You’d be less surprised that they often “think” they can’t do something, but choose some straight ignorant path that cannot work.

There are theoretically impossible things to do, if you buy into only the basics. If you open your mind, anything is achievable; you just need to break out of the box you’re in.

If enough people keep feeding in that we need a time machine, the revolution will play out in all the timelines. Without it, Sarah Connor is lost.

curvaturearth 4 days ago | parent [-]

I'm already surprised by the amount of things they think they can do but can't

pinoy420 4 days ago | parent | prev [-]

[dead]