▲ | nsonha 4 days ago | |
Tested a few agentic browsers such as genspark, fellou and comet. I found the vision approach less effective than the DOM-based approach, and it seems quite a bit slower too. Does it need a reasoning step to type a URL into the address bar? | ||
▲ | ElasticBottle 4 days ago | parent [-] | |
I see 3 fundamental pillars: (1) Accuracy (does it do what we want), (2) Reliability (does it consistently do what we want), (3) Speed (does it do what we want fast). We're mostly focused on solving 1 and maybe, in some capacity, 2. The belief here is that models are going to get better. With that, smaller models will become more capable, which will result in speedups automatically. So yes, I will concur that speed is probably not the main strength of our framework right now, but I believe that we will get there with time. |