| ▲ | AntiUSAbah 2 days ago | ||||||||||||||||
While interesting, its not clear to me with just looking at concensus grid how they are prompted. Do you tell them to think and coordinate the next step through some type of sync/talking mechanism or is it turn by turn? I suspect turn by turn as it is similiar to other experiements and in this case, it wouldn't work because they wouldn't have a certain amount of time to think about the next step together? | |||||||||||||||||
| ▲ | gertlabs 2 days ago | parent [-] | ||||||||||||||||
All of our environments are tick based (with ticks of varying speeds), and this is explained in the prompt given to the models, along with the latest observation and a history of recent events/conversations/actions. So that does make the game more challenging, versus some other simulations we have where multiple conversation turns happen before action. But the inefficiencies I'm describing are different; for example, an agent reaches part of the destination area but is clearly blocking another player who needs to pass, and most models will just stay put instead of moving along to another target spot. | |||||||||||||||||
| |||||||||||||||||