Replication & Upgrades - PostgreSQL 9 Administration

Database Reference

In-Depth Information

F Data movement: Allowing data to be used by additional applications or workloads

on additional hardware. Examples are Reference Data Management, where a

single central server might provide information to many other applications, and also

Business Intelligence/Reporting Systems.

Of course, both of those topics are complex areas, and there are many architectures and

possibilities for doing each of those.

What we will talk about here is data movement, where there is no transformation of the

data—we simply copy the data from one PostgreSQL database server to another. So, we are

specifically avoiding all discussion on ETL tools, EAI tools, inter-database migration, data

warehousing strategies, and so on. Those are valid topics in IT architecture, we just don't

cover them here.

Let's look at the basic architecture. Typically, the individual database servers are referred to

as nodes. The whole group of database servers involved in replication is known as a Cluster.

That is the common usage of the term, though be careful that the term Cluster is also used for

two other quite separate meanings elsewhere in PostgreSQL. First, cluster is sometimes used

to refer to the whole database instance, though I prefer the term "database server". Second,

there is a command named CLUSTER , which is designed to sort data into a specific order

within a table.

The first database server is also known as the Master , Primary , Provider , Sender , or the

Source server .

The second database server is also known as the Standby , Slave or Subscriber , or Receiver .

There can be multiple Standbys, if desired.

The replication systems described here are all Single Master, Multiple Standby systems. Multi-

Master architectures are discussed briefly later.

The designations Master and Standby are just roles that any node can take at some point.

To move the Master role to another node, we perform a procedure named Switchover. If the

Master dies, and does not recover, then the more severe role change is known as a Failover.

In many ways, they may be similar, but it helps to use different words for the two situations.

Software that manages the cluster, and in some cases automatically initiates the failover

process, is sometimes referred to as clusterware. Clusterware may also perform other

functions, such as load balancing.

The key aspect of replication is that data changes are captured on the Master, and then

transferred to the Standby nodes. In some cases, a Standby node may send data changes

onto later Standbys, a process known as relay.

Search WWH ::

Custom Search

Home