| ▲ | eugene3306 5 hours ago | |
This makes a good benchmark LLMs: ``` look at this paper: https://arxiv.org/pdf/2603.21852 now please produce 2x+y as a composition on EMLs ``` Opus(paid) - claimed that "2" is circular. Once I told it that ChatGPT have already done this, finished successfully. ChatGPT(free) - did it from the first try. Grok - produced estimation of the depth of the formula. Gemini - success Deepseek - Assumed some pre-existing knowledge on what EML is. Unable to fetch the pdf from the link, unable to consume pdf from "Attach file" Kimi - produced long output, stopped and asked to upgrade GLM - looks ok | ||
| ▲ | theanonymousone 17 minutes ago | parent | next [-] | |
I changed the prompt to this: """ Consider a mathematical function EML defined as `eml(x,y)=exp(x)−ln(y)` Please produce `sin(x)/x` as a composition on EMLs and constant number 1 (one). """ | ||
| ▲ | fc417fc802 2 hours ago | parent | prev | next [-] | |
> Once I told it that ChatGPT have already done this, finished successfully. TIL you can taunt LLMs. I guess they exhibit more competitive spirit than I thought. | ||
| ▲ | eru an hour ago | parent | prev | next [-] | |
I copy and pasted the abstract into DeepSeek and asked your question. It's a bit unfair to penalise it for not knowing PDFs. It got a result. | ||
| ▲ | spuz 2 hours ago | parent | prev [-] | |
So what is the correct answer? | ||