koakuma-chan 4 days ago
Do you mind sharing which tasks you achieved great results on?
tlarkworthy 4 days ago
It's all written up and linked in the notebook, which is executable in your browser (if you dare to insert your OPEN_AI_KEY; my results are included on the assumption that you won't). The evals were Observable notebook coding challenges: simple things like creating a drop-down, but to solve them you need to know the Observable standard library and some of its unique syntax, like "viewof". There is a table of the cases here: https://observablehq.com/@tomlarkworthy/robocoop-eval#cell-2... So it's important that the prompt encodes enough of the programming model. The seed prompt did not, but the reflect function managed to figure it all out. At the top of the notebook is the final optimized prompt, the product of a fair bit of web-search research into the programming model.
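For anyone who wants the shape of the loop without opening the notebook, here is a rough Python sketch. It is not the notebook's actual code: `optimize_prompt`, `run_case`, and `reflect` are hypothetical names, and the two stub functions stand in for what would really be LLM calls (plus web search, in the reflect step).

```python
from typing import Callable, List


def optimize_prompt(
    seed_prompt: str,
    cases: List[str],
    run_case: Callable[[str, str], bool],
    reflect: Callable[[str, List[str]], str],
    max_rounds: int = 5,
) -> str:
    """Repeatedly evaluate a prompt and revise it based on the failing cases."""
    prompt = seed_prompt
    for _ in range(max_rounds):
        # Evaluate the current prompt against every eval case.
        failures = [case for case in cases if not run_case(prompt, case)]
        if not failures:
            break  # every case passes, so stop early
        # Ask the (hypothetical) reflect step for an improved prompt.
        prompt = reflect(prompt, failures)
    return prompt


# Stand-in stubs so the sketch runs offline. A real setup would call a model.
def stub_run_case(prompt: str, case: str) -> bool:
    # Pretend a case passes once the prompt mentions the concept it needs.
    return case in prompt


def stub_reflect(prompt: str, failures: List[str]) -> str:
    # Pretend reflection learns one missing concept per round.
    return prompt + " " + failures[0]


if __name__ == "__main__":
    final = optimize_prompt("seed", ["viewof", "stdlib"], stub_run_case, stub_reflect)
    print(final)
```

The point of the sketch is only that the seed prompt can start out missing the programming model, as long as each round feeds the failures back into the rewrite.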