Skip Navigation Links | |
Exit Print View | |
![]() |
Oracle Solaris Cluster Geographic Edition Data Replication Guide for Oracle Data Guard Oracle Solaris Cluster 4.0 |
1. Replicating Data With Oracle Data Guard Software
2. Administering Oracle Data Guard Protection Groups
3. Migrating Services That Use Oracle Data Guard Data Replication
Detecting Cluster Failure on a System That Uses Oracle Data Guard Data Replication
Migrating Services That Use Oracle Data Guard With a Switchover
How to Switch Over an Oracle Data Guard Protection Group From the Primary to the Standby Cluster
Actions Performed by the Geographic Edition Software During a Switchover
Forcing a Takeover on Systems That Use Oracle Data Guard
How to Force Immediate Takeover of Oracle Data Guard Services by a Standby Cluster
Actions Performed by the Geographic Edition Software During a Takeover
Recovering Oracle Data Guard Data After a Takeover
How to Resynchronize and Revalidate the Protection Group Configuration
How to Perform a Failback Switchover or Failback Takeover
Recovering From an Oracle Data Guard Data Replication Error
How to Recover From a Data Replication Error
A. Geographic Edition Properties for Oracle Data Guard Broker Configurations
This section describes the internal processes that occur when failure is detected on a primary or a standby cluster.
When the primary cluster for a given protection group fails, the standby cluster in the partnership detects the failure. If the cluster that fails is a member of more than one partnership, multiple failure detections might occur.
The following actions occur when the overall state of a protection group changes to the Unknown state:
Heartbeat failure is detected by a partner cluster.
The heartbeat is activated in emergency mode to verify that the heartbeat loss is not transient and that the primary cluster has failed. The heartbeat remains in the OK state during this default timeout interval, while the heartbeat mechanism continues to retry the primary cluster. Only the heartbeat plug-ins appear in the Error state.
You set this query interval by setting the Query_interval property of the heartbeat. If the heartbeat still fails after four attempts due to the Query_interval that you configured (three retries and one emergency-mode probing), a heartbeat-lost event is generated and logged in the system log. When you specify the default interval, the emergency-mode retry behavior might delay the notification of heartbeat-loss for about nine minutes. Messages are displayed in the output of the geoadm status command.
For more information about logging, see Viewing the Geographic Edition Log Messages in Oracle Solaris Cluster Geographic Edition System Administration Guide.
When a standby cluster for a given protection group fails, a cluster in the same partnership detects the failure. If the cluster that failed is a member of more than one partnership, multiple failure detections might occur.
During failure detection, the following actions occur:
Heartbeat failure is detected by a partner cluster.
The heartbeat is activated in emergency mode to verify that the standby cluster failed.
The cluster notifies the administrator by issuing messages. The system detects all protection groups for which the cluster that failed was acting as standby. The state of these protection groups is set to the Unknown state.