The author discusses the switch from Cockroach DB due to a license change, opting to explore PostgreSQL for smaller data sizes. They delve into PostgreSQL’s lack of native high availability solutions, highlighting the need for personal HA setups for replication. The blog series will test different HA solutions using Jepsen, focusing on PostgreSQL. Part one introduces Jepsen testing, using Patroni as an example. The tests uncover issues like read committed isolation violations and failed cluster recovery with one lost node. The author struggled to reproduce specific failures due to network constraints but ultimately triggers and analyzes the expected data inconsistencies.
https://www.binwang.me/2024-12-02-PostgreSQL-High-Availability-Solutions-Part-1.html