| ▲ | boundlessdreamz 2 hours ago | |
How do you copy all the relevant data? Doesn't this create unnecessary load on your source DB? | ||
| ▲ | cjonas 2 hours ago | parent [-] | |
We have various data sources (which is another benefit of this approach). Data from the application DB is currently pulled using the FE apis which handle tenant isolation and allow the application database to deal with the load. I think pg_duckdb could be a good solution here as well, but haven't gotten around to testing it. Other data come from analytics DB. Most of this is landed on an interval via pipeline scripts. | ||