▲ | anerli a day ago | |||||||||||||||||||||||||
So it's key to still have a big model that is devising the overall strategy for executing the test case. Moondream on its own is pretty limited and can't handle complex queries. The planner gives very specific instructions to Moondream, which is just responsible for locating different targets on the screen. It's basically just the layer between the big LLM doing the actual "thinking" and grounding that to specific UI interactions. Where it gets interesting, is that we can save the execution plan that the big model comes up with and run with ONLY Moondream if the plan is specific enough. Then switch back out to the big model if some action path requires adjustment. This means we can run repeated tests much more efficiently and consistently. | ||||||||||||||||||||||||||
▲ | grbsh a day ago | parent [-] | |||||||||||||||||||||||||
Ooh, I really like the idea about deciding whether to use the big or small model based on task specificity. | ||||||||||||||||||||||||||
|