Configure the Distributed Database Topology
After the databases for the shard catalog and all of the shards are
configured, along with corresponding TNS listeners, you can add the distributed database metadata to the shard catalog database using GDSCTL
. The distributed database metadata describes the topology used for the distributed database.
The distributed database topology consists of the data distribution method, replication (high availability) technology, the default number of chunks to be present in the distributed database, the location and number of shard directors, the numbers of shardgroups, shardspaces, regions, and shards in the distributed database, and the global services that will be used to connect to the distributed database.
Keep the Global Data Services Control Utility (GDSCTL)
Command Reference in the Oracle Database Global
Data Services Concepts and Administration
Guide on hand for
information about usage and options for the GDSCTL
commands used in
the configuration procedures.
Follow the procedures listed below, in order, to complete your distributed database topology configuration.
Run the commands from a shard director host, because the
GDSCTL
command line interface is installed there as part of the
shard director (global service manager) installation.
Create the Shard Catalog
Use the GDSCTL CREATE SHARDCATALOG
command to create
metadata describing the distributed database topology in the shard catalog database.
Note that once you run CREATE SHARDCATALOG
, and the rest
of the distributed database metadata has been created, there are several metadata properties that cannot be
modified without recreating the entire distributed database from scratch. These include the distributed database method (system-managed, user-defined, composite), replication technology
(Oracle Data Guard or Raft replication), default number of chunks in the shardspace,
and others. Make sure that you consult the GDSCTL
reference
documentation for the complete list of possible command options and their
defaults.
Shard Catalog Connect String
When you run the CREATE SHARDCATALOG
command,
GDSCTL
connects to the shard catalog database with the user
name and connect string specified.
If your shard catalog database has an associated standby database for high availability or disaster recovery purposes, the connection string, catalog_connect_string in the examples that follow, should specify all primary and standby databases. If you don't include the standby databases in the connect string, then the shard director processes will not be able to connect to the standby if the primary shard catalog is unavailable.
Note that catalog_connect_string should
specify the PDB for the shard catalog database, not the
CDB$ROOT
.
The following is a simple tnsnames.ora
entry.
CATALOG_CONNECT_STRING=
(DESCRIPTION =
(ADDRESS_LIST =
(ADDRESS = (PROTOCOL = tcp)(HOST = primary_catalog)(PORT = 1521))
(ADDRESS = (PROTOCOL = tcp)(HOST = standby_catalog)(PORT = 1521))
)
(CONNECT_DATA =
(SERVICE_NAME = catpdb.example.com)
)
)
Creating the Shard Catalog
Run CREATE SHARDCATALOG
with the settings appropriate
for your planned distributed database topology.
System-Managed Distribution Method
In the following example, the distributed database metadata is created for a system-managed configuration with two regions named
region1 and region2. Because
system-managed is the default distribution method, it does not need to be specified
with the -sharding
parameter.
GDSCTL> create shardcatalog -database catalog_connect_string
-user mysdbadmin/mysdbadmin_password -repl DG -region region1,region2
Note also that if -shardspace
is not specified, a
default shardspace named shardspaceora is created. If
-region
is not specified, the default region named
regionora is created. If the single default region is
created along with the default shardspace, then a default shardgroup named
shardspaceora_regionora is also created in the
shardspace.
For replication (-repl
) with system-managed
distribution, you can choose either Oracle Data Guard (DG
) or Raft
replication (native
).
Composite Distribution Method
The following example shows you how to create shard catalog metadata for
a composite distributed database with Data Guard replication in MaxAvailability
protection mode,
60 chunks per shardspace, and two shardspaces.
GDSCTL> create shardcatalog -database catalog_connect_string
-user mysdbadmin/mysdbadmin_password -sharding composite -chunks 60
-protectmode maxavailability -shardspace shardspace1,shardspace2
User-Defined Distribution Method
The next example shows you how to create shard catalog metadata for a user-defined distributed database with Data Guard replication.
GDSCTL> create shardcatalog -database catalog_connect_string
-user mysdbadmin/mysdbadmin_password -sharding user
-protectmode maxperformance
Consult the GDSCTL
documentation or run GDSCTL
HELP CREATE SHARDCATALOG
for more details about the command usage.
Replication Settings
Oracle Data Guard can be used with any data distribution method, and is
configured in the CREATE SHARDCATALOG
command with -repl
DG
.
Raft replication requires a bit more planning, but it is also enabled in
CREATE SHARDCATALOG
command with -repl
native
. See Raft Replication Configuration and Management for additional configurable attributes.
Future Connections to the Shard Catalog
GDSCTL
stores the credentials for the shard catalog
administrator in a wallet on the local host. However, for subsequent
GDSCTL
sessions on other hosts, it may be necessary to
explicitly connect to the shard catalog in order to perform administrative tasks by
running the GDSCTL CONNECT
command, as shown here.
GDSCTL> connect mysdbadmin/mysdbadmin_password@catalog_connect_string
Add and Start Shard Directors
Add to the configuration the shard directors, which will monitor the distributed database system and run background tasks in response to GDSCTL
commands and other
events, and start them.
The following commands must be run on the host where the shard director processes are to run. This can be the shard catalog host or a dedicated host for the shard director processes.
For later GDSCTL
sessions, you might need to explicitly
specify the shard director to be administered. If an error message is shown
referencing the default GSMORA shard director, run
GDSCTL SET GSM
before continuing, as shown here.
GDSCTL> set gsm -gsm sharddirector1
Add Shardspaces If Needed
If you are using composite or user-defined data distribution, and you need
to add more shardspaces to complete your desired distributed database topology, use the ADD SHARDSPACE
command to add additional
shardspaces.
Add Shardgroups If Needed
If your distributed database topology uses the system-managed or composite data distribution method, you can add any necessary additional shardgroups for your application.
Each shardspace must contain at least one primary shardgroup and may contain any number or type of standby shardgroups. Shardgroups are not used in the user-defined distribution method.
Verify the Distributed Database Topology
Before adding information about your shard databases to the catalog, verify
that your distributed database topology is correct before proceeding by using the various GDSCTL CONFIG
commands.
Once shards are added and deployed, it is no longer possible to change much of the shard catalog metadata, so validating your configuration is an important task at this point.
Add the Shard CDBs
Add the CDBs containing the shard PDBs to the distributed database configuration with the ADD CDB
command.
Add the Shard PDBs
Use the ADD SHARD
command to add the shard PDB information
to the shard catalog, then verify it with the CONFIG SHARD
command.
Add Host Metadata
Add all of the host names and IP addresses of your shard hosts to the shard catalog.
As part of the deployment process, the shard director contacts the shards and directs them to register with the shard director’s TNS listener process. This listener process only accepts incoming registration requests from trusted sources and will reject registration requests from unknown hosts.
If your shard hosts have multiple host names or network interfaces
assigned to them, it is possible that the incoming registration request to the shard
director may come from a host that was not automatically added during ADD
SHARD
. In this case, the registration request is rejected and the shard
will not deploy correctly. The visible symptom of this problem will be that
CONFIG SHARD
shows PENDING
for the shard’s
Availability after DEPLOY
has completed.
To avoid this issue, use the GDSCTL ADD INVITEDNODE
command to manually add all host names and IP addresses of your shard hosts to the
shard catalog metadata.
Check Free DB_FILES
Verify that there are enough free data files in each shard to make sure there is enough capacity to create the number of chunks and tablespace sets you need for the distributed database.
To check free DB_FILES
and parameter setting:
SQL> select count(*) from v$datafile;
COUNT(*)
----------
XxxXX
SQL> show parameter db_files
NAME TYPE VALUE
------------------------------------ ----------- ------------------------------
db_files integer 200
Formulas to calculate the DB_FILES
setting can be found in Create the Shard Databases.