Crawl the pages, optionally convert the HTML to Markdown, and insert them into a search index such as OpenSearch or Elasticsearch. Then vibe-code a script that queries the index with whatever filtering you need.
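As a rough sketch of what that could look like in Python, using only the standard library: the code below turns already-fetched HTML into simple Markdown, builds a request body for the `_bulk` endpoint, and builds a search body with an optional filter. Everything named here is an assumption for illustration, not a fixed recipe: the function names, the `url`/`site`/`body` field names, the `pages` index, the `http://localhost:9200` address, and the reliance on default dynamic mapping (which is what makes `site.keyword` available). The crawler itself is left out, and the tiny converter only handles headings, paragraphs, and links, so a real converter library would be a better fit for messy pages.

```python
import json
import re
import urllib.request
from html.parser import HTMLParser
from urllib.parse import urlparse


class MarkdownishParser(HTMLParser):
    """Tiny HTML-to-Markdown converter: h1-h3, paragraphs, and links only."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self.href = None
        self.skip = 0  # depth inside <script>/<style>, whose text is dropped

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self.skip += 1
        elif tag in ("h1", "h2", "h3"):
            self.parts.append("\n\n" + "#" * int(tag[1]) + " ")
        elif tag == "p":
            self.parts.append("\n\n")
        elif tag == "a":
            self.href = dict(attrs).get("href")
            self.parts.append("[")

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            self.skip = max(0, self.skip - 1)
        elif tag == "a":
            self.parts.append("](%s)" % (self.href or ""))
            self.href = None

    def handle_data(self, data):
        if not self.skip:
            # Collapse runs of whitespace but keep word boundaries.
            self.parts.append(re.sub(r"\s+", " ", data))


def html_to_markdown(html):
    parser = MarkdownishParser()
    parser.feed(html)
    parser.close()
    text = "".join(parser.parts)
    # Normalize block breaks to exactly one blank line.
    return re.sub(r"[ \t]*\n\s*", "\n\n", text).strip()


def bulk_payload(index, pages):
    """Build the NDJSON body for POST /_bulk from (url, html) pairs."""
    lines = []
    for url, html in pages:
        # Action line, then the document itself; the URL doubles as the ID.
        lines.append(json.dumps({"index": {"_index": index, "_id": url}}))
        lines.append(json.dumps({
            "url": url,
            "site": urlparse(url).netloc,
            "body": html_to_markdown(html),
        }))
    return "\n".join(lines) + "\n"  # _bulk requires a trailing newline


def build_query(text, site=None, size=10):
    """Full-text match on body, optionally filtered to one site."""
    query = {"bool": {"must": [{"match": {"body": text}}]}}
    if site:
        # Assumes default dynamic mapping, which adds a .keyword subfield.
        query["bool"]["filter"] = [{"term": {"site.keyword": site}}]
    return {"size": size, "query": query}


def post(url, body, content_type):
    """Send a request body to the cluster (no auth; adjust for your setup)."""
    req = urllib.request.Request(
        url, data=body.encode("utf-8"),
        headers={"Content-Type": content_type}, method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())
```

With a cluster assumed at `http://localhost:9200`, indexing would be `post("http://localhost:9200/_bulk", bulk_payload("pages", pages), "application/x-ndjson")`, and a filtered search would be `post("http://localhost:9200/pages/_search", json.dumps(build_query("install guide", site="example.com")), "application/json")`. Putting the site restriction in `filter` rather than `must` keeps it out of relevance scoring.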