| ▲ | munk-a 7 hours ago | ||||||||||||||||
In fact - if you're building a very large distributed system the goal is usually to shrink that centralized component to the smallest and most robust surface you can. If the system is well designed it is amazing just how much consistency power you can get from a tiny component of centralization. There are always tradeoffs of course, but building a truly decentralized system requires some really difficult compromises to correctness. The two general's problem is a great piece of reading on this topic - distribution always requires compromises in general, but to fully remove an authority on truth gets quite tricky. | |||||||||||||||||
| ▲ | brentjanderson 6 hours ago | parent | next [-] | ||||||||||||||||
I think Ducklake[1] is a terrific example of this. They said "look, let's build a lake house over S3, but for the bit that needs strong consistency (the manifest of which S3 blobs are in play), let's use Postgres". Postgres as a metadata catalog or control plane is brilliant for this, since you get strong consistency and the scaling story around a metadata catalog is far different than the volume of data you need to store. Use S3 for volume, Postgres for consistent metadata. A similar pattern has spilled out of projects like Warpstream[2], which I suspect is using Postgres behind the scenes of their control plane. | |||||||||||||||||
| |||||||||||||||||
| ▲ | mrkeen 5 hours ago | parent | prev [-] | ||||||||||||||||
> The two general's problem is a great piece of reading on this topic It is! And the solution is to add an extra general on the left side. Let's call him Outus Boxus. The two generals on the left side can communicate in perfect lockstep. Then if you need the general on the right to find out about something, you can send a few workers to tell him or something... More seriously though, you can have a DS for two reasons: tech or political. Tech means scaling or reliability. So clients can be serviced by any of the nodes. Political means different actors don't have a central authority. You can't stick two banks into one db. This technique doesn't seem to address either aspect. | |||||||||||||||||