| ▲ | kbrisso 4 hours ago | |||||||
I built this for local RAG https://github.com/kbrisso/byte-vision it uses llama.cpp and Elasticsearch. On a laptop with 8 GB GPU it can handle a 30K token size and summarize a fairly large PDF. | ||||||||
| ▲ | busssard 3 hours ago | parent [-] | |||||||
elasticsearch is the true limitation of rag systems... | ||||||||
| ||||||||