| ▲ | eknkc an hour ago | |
BTW, the openai responses api has a store parameter and a thread id input. Makes it possible to send a thread id and append a new message, ask for completion. So it feels like keeping the conversation going. Technically it does retrieve the entire history and reevaulate it since the LLM is stateless. Just more ergonomic for the developer. And prompt caching helps cut the costs down when a conversation drags on. | ||