▲ | tough a day ago | |||||||
right but that's system prompting / in context not really -trained- into the weights. the point is you can't ask a model what's his training cut off date and expect a reliable answer from the weights itself. closer you could do is have a bench with -timed- questions that could only know if had been trained for that, and you'd had to deal with hallucinations vs correctness etc just not what llm's are made for, RAG solves this tho | ||||||||
▲ | stingraycharles a day ago | parent [-] | |||||||
What would the benefits be of actual time concepts being trained into the weights? Isn’t just tokenizing the dates and including those as normal enough to yield benefits? E.g. it probably has a pretty good understanding between “second world war” and the time period it lasted. Or are you talking about the relation between “current wall clock time” and questions being asked? | ||||||||
|