Posted in

应对 PostgreSQL 逻辑复制的挑战_AI阅读总结 — 包阅AI

包阅导读总结

1. `PostgreSQL`、`pgEdge`、`逻辑复制`、`分布式`、`数据库`

2. pgEdge 最新的“Constellation Release”版本解决了 PostgreSQL 逻辑复制的复杂问题,提供了增强的功能,如并行处理、大对象支持和错误处理等,使其成为替代传统数据库的开源选择,并将在四季度增加高性能并行复制。

3.

– pgEdge 新版本

– 解决 PostgreSQL 逻辑复制的复杂性

– 被称为“Constellation Release”

– 版本特性

– 增强并行处理

– 支持大对象逻辑复制(LOLOR)

– 具备复制异常处理和日志记录功能

– 新增复制修复模式

– 四月份宣布自动化数据定义语言(DDL)复制和 Snowflake 序列

– 未来计划

– 四季度增加高性能并行复制

– 提升高流量、跨区域事务工作负载的复制吞吐量

思维导图:

文章地址:https://thenewstack.io/tackling-the-challenges-of-logical-replication-in-postgresql/

文章来源:thenewstack.io

作者:Susan Hall

发布时间:2024/8/7 19:19

语言:英文

总字数:759字

预计阅读时间:4分钟

评分:88分

标签:PostgreSQL,逻辑复制,pgEdge,数据库管理,开源


以下为原文内容

本内容来源于用户推荐转载,旨在分享知识与观点,如有侵权请联系删除 联系邮箱 media@ilingban.com

Distributed Postgres vendor pgEdge continues to tackle the complexities of logical replication with its latest version, known as the “Constellation Release,” which offers enhanced parallel processing, large object support and error handling.

The greater throughput, flexibility and control make pgEdge a viable open source alternative for legacy database workloads requiring multimaster capability, according to Phillip Merrick, pgEdge CEO. Today these workloads are typically running on legacy platforms such as Oracle Goldengate, he said in an email.

These workloads need the ability to take both read and write traffic at multiple nodes in a distributed database cluster to achieve low data latency or very high levels of availability (four or five nines) or both. An example might be a financial services application that operates across multiple regions or a critical e-commerce application that can never be down, he explained.

Merrick and his cofounder Denis Lussier, who together previously created EnterpriseDB, built pgEdge on open source PostgreSQL with the idea that a few nodes in a geographically distributed cluster, each handling reads and writes, can offer low latency, high availability, resiliency and performance.

Though a powerful feature, logical replication in Postgres presents challenges including consistency, synchronization, conflict resolution and overhead, affecting performance.

The Constellation Release features include:

  • Large object logical replication (LOLOR): This PostgreSQL plugin replacement makes media assets for existing applications, such as binary files, images and other non-relational data types, compatible with logical replication. These large files in PostgreSQL databases can now run on pgEdge without modification. Though Postgres supports large objects as chunks in catalog tables, replicating these tables requires special handling, according to its GitHub page. With LOLOR, this data is stored in non-catalog tables to make replication across multiple database instances or servers easier. It replicates data based on logical changes such as insert, update and delete operations rather than physical changes at the storage level and employs change data capture to ensure synchronization in near real-time to other database instances. This is especially useful in distributed systems where consistency, availability and fault tolerance are critical, according to pgEdge.
  • Replication exception handling and logging: With an updated error handling and logging mechanism, replication errors are logged into a new exception table to prevent them from blocking subsequent changes. This enhances visibility into replication errors for easier troubleshooting without interrupting overall system operation.
  • Replication repair mode: A new function allows users to use or opt out of using “repair mode” on a specific database node. This extra control can be used to prevent replication changes during error resolution or while modifying the state of a single database node. It also supports error remediation by external tools without affecting the entire cluster.

While counting these features as part of this release, the company announced automated Data Definition Language (DDL) replication and Snowflake sequences in April.

DDL is used to create and modify Postgres objects through command statements like CREATE, ALTER and DROP.Postgres traditionally required modifications to table definitions through DDL commands be made manually on each node. With this automation, you can update the database schema on a single node, and the changes are seamlessly propagated to other nodes within the cluster.

Snowflake sequences address the complexity of managing sequences in multimaster replication scenarios. In a distributedmultimaster Postgres system,the sequence must be updated across different regions and if each node updates the sequence independently it creates conflicts that cannot be resolved. This replacement for PostgreSQL sequence definitionsprovides a unique sequence — a timestamp, a counter and a unique node identifier — within a cluster that can be used across different regions without the need to write code or modify schema.

The company announced its free tier Cloud Developer Edition in January and its vector search capability using open source extension pgvector last October.

In the fourth quarter, pgEdge said it will add high-performance parallel replication, which it calls “a game-changer in the industry” to its capabilities.

High-performanceparallel replication is the ability to run multiple data replication streams on each networkconnection between nodes, using multipleCPUs at each node for significantly higher levels of data throughput, Merrick explained.Current replication architectures for Postgres only permit one stream between nodes, placing an upper limit on replication performance.

This boost in replication throughput in high-traffic, cross-region transactional workloads will enable users to manage larger volumes of data replication across distributed clusters in high-demand environments while reducing lag times and ensuring timely synchronization.

YOUTUBE.COM/THENEWSTACK

Tech moves fast, don’t miss an episode. Subscribe to our YouTubechannel to stream all our podcasts, interviews, demos, and more.

GroupCreated with Sketch.