Remix.run Logo
jacksonastone 12 hours ago

Made this in a free evening. Index an permissive license translation of the Bible (WEB) into a RAG database to allow returning passages of similar semantic meaning. Lots of fun. For example, "more money more problems" returns Ecclesiastes 5:9-13 which, I'll just say, is spot on..

"Moreover the profit of the earth is for all. The king profits from the field. He who loves silver shall not be satisfied with silver, nor he who loves abundance, with increase. This also is vanity. When goods increase, those who eat them are increased; and what advantage is there to its owner, except to feast on them with his eyes? The sleep of a laboring man is sweet, whether he eats little or much; but the abundance of the rich will not allow him to sleep. There is a grievous evil which I have seen under the sun: wealth kept by its owner to his harm."

Anyway - thought it was fun enough to share. It's slow and I vibe coded it so I haven't sorted out how to make it not take 15 seconds to vector search against the full 4GB index.

mcswell 12 hours ago | parent | next [-]

Slow, but interesting. I used the query "government" and got back passages in Romans 13 (as I expected), but also passages in Daniel and Ezra describing decrees by government officials, which made sense.

kordlessagain 8 hours ago | parent | prev | next [-]

This is really cool...great job! It's a favorite pastime of mine to index various large corpora.

As for speed, this might help for code referencing: https://github.com/deepbluedynamics/lume

Blog post: https://deepbluedynamics.com/blog/lume-retrieval-primitives

I use a small local model to extract entities for the graph, but it's not necessary.

You can optionally use GTR-T5 which is a few years old now, but still good for generating fast and free embeddings. That step is only run once if you run it in hybrid mode.

Feel free to take and remix or use!

ReactiveJelly 11 hours ago | parent | prev | next [-]

> The king profits from the field

For the solution, read Henry George!

atmanactive 3 hours ago | parent | prev | next [-]

For completeness, this should include all possible books, including Ethiopian, and then it should include a drop-down with pre-defined sets one could choose from (Protestant, Catholic, Orthodox...).

yjftsjthsd-h 8 hours ago | parent | prev | next [-]

Any chance you wrote it up anywhere? I'm somewhat interested in RAG and a worked example would be nice.

jacksonastone 3 hours ago | parent [-]

I'll just open source it, just a sec (keep in mind, vibe coded :D)

https://github.com/jacksonStone/cross-cannon

jnamaya 12 hours ago | parent | prev | next [-]

The site is not loading for me! Interesting project

mcswell 12 hours ago | parent [-]

You have to wait awhile. It came up for me after 30 seconds or so.

jacksonastone 10 hours ago | parent [-]

it's basically running on a toaster, sorry

TeMPOraL 9 hours ago | parent [-]

If that toaster didn't have a plan already, then feeding it with monotheistic holy scripture will definitely give it some ideas.

jacksonastone 3 hours ago | parent | next [-]

I wonder if polytheistic toaster would be more or less... ambitious.

a96 6 hours ago | parent | prev [-]

A toaster is just a death ray with a smaller power supply

TeMPOraL 16 minutes ago | parent [-]

Less power, more hatred towards humankind.

cdong 9 hours ago | parent | prev | next [-]

pg-vector with hnsw is still 15 seconds?

anamnesis 12 hours ago | parent | prev [-]

[dead]