spmurrayzzz 3 days ago

Might end up being some confusion with the RULER benchmark from NVIDIA given the (somewhat shared) domain: https://github.com/NVIDIA/RULER

EDIT: by shared I only mean the adjacency to LLMs/AI/ML; RL is a pretty big differentiator though, and the project looks great

kcorbitt 3 days ago

Dang, hadn't seen that. Namespace collision strikes again.

swyx 3 days ago

yeah unfortunately for you this is one of the well-known long-context benchmarks. too late tho, soldier on.