ccgreg 8 days ago
At the end, the author thinks about adding Common Crawl data. Our ranking information, generated from our web graph, would probably be a big help in picking which pages to crawl. I love seeing the worked-out example at scale -- I'm surprised at how cost-effective the vector database was.