Remix.run Logo
elikoga 7 hours ago

> this means that autoresearch will find the most optimal model for your platform in that time budget

I'm looking forward to finding out what model is optimal on my rtx3090

One thing I'm concerned with is that the model with best bpb after 5 minutes in smaller setups are only about ~10M Parameters in size which is too small for some emergent effects.