spiderfarmer 6 days ago

On that note, I want to see benchmarks for which LLMs are best at translating between languages. To me, it's an entire product category.

pbmango 6 days ago | parent | next [-]

There are probably many more small battles being fought or emerging. I think voice and PDF parsing are growing battles too.

oezi 5 days ago | parent | prev [-]

I would love to see a stackexchange-like site where humans ask questions and we get to vote on the reply by various LLMs.

anotherengineer 5 days ago | parent [-]

Is this like what you're thinking of? https://lmarena.ai

oezi 5 days ago | parent [-]

Kind of. But lmarena.ai has no way to see the responses to questions other people have asked, and it only lets you look at two responses side by side.