| ▲ | timeflex 7 days ago | |||||||||||||||||||||||||||||||||||||
S3 might be relatively cheap for storing files, but with bandwidth you could easily be paying $230/mo. If you make it public facing & want to try to use their cloud reporting, metrics, etc. to prevent people for running up your bandwidth, your "really cheap" static hosting could easily cost you more than $500/mo. | ||||||||||||||||||||||||||||||||||||||
| ▲ | theultdev 7 days ago | parent | next [-] | |||||||||||||||||||||||||||||||||||||
R2 is S3 compatible with no egress fees. Cloudflare actually has built in iceberg support for R2 buckets. It's quite nice. Combine that with their pipelines it's a simple http request to ingest, then just point duckdb to the iceberg enabled R2 bucket to analyze. | ||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||
| ▲ | 7952 7 days ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||
I think this approach makes sense for services with a small number of users relative to the data they are searching. That just isn't a good fit for a lot of hosted services. Think how much that TB's of data would cost on Algolia or similar services. You have to store the data somehow anyway, and you have to retrieve some of it to service a query. If egress costs too much you could always change later to put the browser code on a server. Also it would presumably be possible to quantify the trade-off between processing the data client side and on the server. | ||||||||||||||||||||||||||||||||||||||
| ▲ | simonw 7 days ago | parent | prev [-] | |||||||||||||||||||||||||||||||||||||
Stick it behind Cloudflare and it should be effectively free. | ||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||