	▲	visarga 2 years ago
		You can achieve the same effect by using an LLM to do question answering over the document prior to embedding. That is much more flexible but slower, and it lets you use chain-of-thought (CoT) prompting or even GraphRAG. Late chunking is a faster, implicit alternative.
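
For anyone unfamiliar with the pooling step behind late chunking, here is a minimal sketch, not a definitive implementation. It assumes you have already run a long-context embedding model over the whole document and have one vector per token; the function name `late_chunk_pool` and the `(start, end)` span format are hypothetical, chosen only for illustration. Because each token vector was computed with the full document in view, the pooled chunk vectors carry surrounding context without any extra LLM call.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def late_chunk_pool(
    token_embeddings: Sequence[Sequence[float]],
    spans: Sequence[Tuple[int, int]],
) -> List[Vector]:
    """Mean-pool token vectors into one vector per chunk.

    token_embeddings: one vector per token, assumed to come from a single
        forward pass over the entire document (hypothetical input).
    spans: half-open token index ranges [start, end), one per chunk.
    """
    chunk_vectors: List[Vector] = []
    for start, end in spans:
        if not 0 <= start < end <= len(token_embeddings):
            raise ValueError(f"invalid span: ({start}, {end})")
        window = token_embeddings[start:end]
        dim = len(window[0])
        # Average each dimension over the tokens that fall inside the chunk.
        chunk_vectors.append(
            [sum(vec[i] for vec in window) / len(window) for i in range(dim)]
        )
    return chunk_vectors


# Toy example: four 2-dimensional token vectors split into two chunks.
tokens = [[1.0, 0.0], [3.0, 2.0], [5.0, 4.0], [7.0, 6.0]]
pooled = late_chunk_pool(tokens, [(0, 2), (2, 4)])
```

The LLM-based route would instead rewrite or annotate each chunk before embedding it, which costs one model call per chunk but lets you control exactly what context gets added.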