5 Configuring Enterprise Data Quality with Oracle WebLogic Server
Note:
These instructions apply to Oracle WebLogic Server environments only. If you are using Apache Tomcat, you must follow the directions in Configuring Enterprise Data Quality with Apache Tomcat
This chapter includes the following sections:
Prerequisites for these Procedures
Before performing the procedures in this section, you must first read and satisfy the steps in:
Creating an EDQ Database Repository
EDQ makes use of some database schemas. These schemas are the configuration schema (EDQCONFIG
), the results schema (EDQRESULTS
), and the staging schema (EDQSTAGING). You create them with the Oracle Repository Creation Utility (RCU).
The person who runs RCU must be able to log into the database with DBA privileges. If you cannot run with DBA privileges, RCU can create a script for a DBA to run later.
Note:
As of Oracle Fusion Middleware 14c (14.1.2.0.0), new schemas are created with editions-based redefinition (EBR) views enabled by default. When EBR is enabled, the schema objects can be upgraded online to a future Fusion Middleware release without any downtime. For more information about using editions-based redefinition, see Using Edition-based Redefinition.
To run RCU:
-
Make certain the repository database is running.
-
Run the command shell or console of the operating system.
-
Start RCU from the
FMW_HOME/oracle_common/bin
directory, whereFMW_HOME
is the Oracle Fusion Middleware installation directory.On Linux:
./rcu
On Windows:
rcu.bat
Complete the RCU configuration screens by following the instructions in Table 5-1.
Table 5-1 Running the RCU Program
Screen | Action to Perform |
---|---|
Welcome |
Click Next to proceed with the installation. You may cancel the installation at any time by clicking Cancel. |
Create Repository |
Click Next to continue. This uses the default Create Repository and System Load and Product Load options. This requires the person running RCU to have DBA privileges. |
Database Connection Details |
Select Oracle Database from the Database Type list. Specify the host name where your Oracle database is running. Enter the port number for your database. The default port number for Oracle Database is 1521. Specify the service name for the database. Typically, the service name is the same as the global database name. For example, Enter the user name for your database. The user name could either be Enter the password for your database user. Select SYSDBA from the Role list. This is automatically selected when the user is Click Next to continue. |
Checking Global Prerequisites |
When the prerequisites checking progress has reached completion, click OK to continue. |
Select Components |
Select Create new prefix and enter a unique prefix name for all the database schemas you are creating in this session. For example, Select the Oracle AS Repository Components check box. The Oracle EDQ check boxes that create the EDQ configuration, results, and staging schemas in the database repository are then automatically selected. When you select the Oracle EDQ component, all the necessary schemas are installed. Not only will the EDQ schema be selected but the common schema needed to support audit and OPSS, which EDQ is integrated with will also be selected. This ensure that EDQ will function correctly with these integrated components. The three EDQ schemas that are to be installed are Config, Results and Staging. The Staging schema is also used by CDS, with or without the Fusion Connector. However installing the Staging schema when not necessary will not interfere with operation and will consume minimal database resources. Click Next to continue. |
Checking Component Prerequisites |
When the prerequisites checking progress has reached completion, click OK to continue. |
Schema Passwords |
Ensure that Use same passwords for all schemas is selected. Enter the password that you want to use for all of the EDQ database schemas in all password fields, then click Next to continue. |
Map Tablespaces |
The default EDQ tablespaces that will be created by RCU are displayed by component. You can change the tablespaces by clicking Manage Tablespaces and then modifying the information. Oracle recommends using one user tablespace for the
Click Next to continue. |
Validating and Creating Tablespaces |
Click OK to create any non-existent tablespaces in your schema, then click OK when the operation completes. |
Summary |
Review the database details, then click Create to continue. A status screen is displayed that shows the progress of creating the repository components. |
Completion Summary |
Click Close to exit the RCU program. |
Creating the WebLogic Server EDQ Domain
These instructions use the Configuration Wizard to create a Basic WebLogic Server domain for EDQ, with the following:
-
One Administration Server and one managed server (no additional managed servers or clusters).
Note:
If there is the possibility you may add more managed servers in the future, you should choose a cluster deployment, even if it starts with a single server. Otherwise it will be necessary to manually re-target the Data Sources, Applications and Deployments in WebLogic from a single server to the cluster.
-
One (non-RAC) data source for the EDQ configuration schema, one data source for the results schema, and one data source for the staging schema. You can convert the datasources to RAC data sources with the Configuration Wizard, or you can do so later through the WebLogic Server Administration Console.
-
A Node Manager configuration that is predefined within the EDQ domain as
edq/nodemanager
. You cannot edit the Node Manager home in this configuration. You can change this configuration during this procedure, if desired.
Note:
Oracle recommends the use of managed servers that are administered by Oracle WebLogic Node Manager. You can configure Managed Servers, Clusters, and other advanced features through the Configuration Wizard. For more information, see Running Multiple EDQ Servers in the Same Domain
Starting the WebLogic Server Domain Configuration Wizard
To start the Domain Configuration wizard, follow these steps. You will run the Configuration Wizard in graphical mode.
Navigating the Domain Configuration Wizard Screens
Table 5-2 describes the screens in the configuration wizard. Certain screens are displayed only in certain situations depending on your selections. For help with any screen, click the Help button.
Table 5-2 Configuration Screens for Creating a New EDQ WebLogic Server Domain
Screen | Action to Perform |
---|---|
Create Domain |
Select Create a new domain. In the Domain Location box, enter the path to the new domain (for example, Click Next to continue. |
Templates |
Select Oracle Enterprise Data Quality. The Oracle JRF and WebLogic Coherence Cluster Extension are automatically selected. Keep these selections. Click Next to continue. |
Administrator Account |
Specify the user name and password for the EDQ domain's administrator account. This account is used to administer the domain and to log into the EDQ application. Click Next to continue. |
Application Location |
Specify the directory in which the applications of the EDQ domain are to be stored. Click Next to continue. |
Domain Mode and JDK |
Select Production in the Domain Mode field. Note: As of WebLogic Server 14.1.2.0.0, when you select Production mode, WebLogic Server automatically sets some security configurations of Secured Production to more secure values. However, there are certain security configurations (such as SSL/TLS) that require manual configuration. See Using Secured Production Mode in Administering Security for Oracle WebLogic Server. If you want to disable the more secure default settings, then you may select Disable Secure Mode. This will enable the non-SSL listen ports. If you want to retain the more secure default
settings of Secured Production mode in general, but
want to change which ports (listen ports, SSL listen ports,
or administration ports) will be enabled by default in your
domain, then you may:
For more information, see Understand How Domain Mode Affects the Default Security Configuration in Securing a Production Environment for Oracle WebLogic Server. Select the Oracle HotSpot JDK in the JDK field. Click Next to continue. |
Database Configuration Type |
Ensure that RCU Data is selected. This populates the connection information you supplied when you ran the Repository Creation Utility (see Table 5-1). If you must change any of these fields, ensure that you use the schema prefix ( When done, click Get RCU Configuration to connect to the Oracle Database and bind the EDQ schemas. Click Next to continue. |
Component Datasources |
Accept the defaults and then click Next. |
JDBC Test |
All schemas are selected and automatically tested. Return to the previous screen to alter the connection configuration if necessary. Click Next to continue. |
Advanced Configuration |
Select the Administration Server, Node Manager, Topology (which includes Server Templates, Managed Servers, Clusters, Virtual Targets and Coherence) option. |
Administration Server |
On the Administration Server screen, change the listen address from "All Local Addresses" to the IP address of the host where the Administration Server will reside. Do not use All Local Addresses. Do not specify any server groups for the Administration Server. |
Node Manager |
Let the Per Domain Default Location option remain selected, and enter a Username and Password for the Node Manager. |
Managed Servers |
Clone the EDQ server to create a copy of the server. If you do not want to clone the EDQ server, select Add to add additional EDQ servers. Select the IP address of the host on which the Managed Server will reside. For configuration procedures for a clustered mode installation, see Configuring EDQ for High-Availability in a WebLogic Server Cluster |
Clusters You must use this screen if EDQ is deployed in a cluster. |
Select Add to add a cluster. If both servers are being deployed on the same machine, enter the machine name as the cluster address. For additional configuration procedures for a clustered mode installation and a description of those configuration screens, see Configuring EDQ for High-Availability in a WebLogic Server Cluster To deploy EDQ in a non-clustered mode, select the appropriate components as needed. |
Assign Servers to Clusters You must use this screen if EDQ is deployed in a cluster. |
Assign the managed servers that you created to the cluster that you created. For configuration procedures for a clustered mode installation, see Configuring EDQ for High-Availability in a WebLogic Server Cluster |
Coherence Clusters You must use this screen if EDQ is deployed in a cluster. |
Configure the Coherence cluster that is automatically added to the domain. Leave the default port number 7574 as the Coherence cluster listen port. |
Machines |
Create a new machine in the domain. A machine is required so that the Node Manager can start and stop servers. Click Add to create a new machine. Specify In the Node Manager Listen Address field, select the IP address of the machine in which the Managed Servers are being configured. You must select a specific interface and not "localhost." This allows Coherence cluster addresses to be dynamically calculated. Verify the port in the Node Manager Listen Port field. The port number |
Assign Servers to Machines |
Assign the Administration Server and Managed Servers to the new machine you just created. In the Machines pane, select the machine you want to assign the servers to; in this case, In the Servers pane, assign
Repeat to assign both |
Virtual Targets This screen is optional and you can use the screen, if required. |
Add or delete Virtual Targets for use in a WebLogic Server Multitenant (MT) environment. Virtual targets define the locations where a partition or resource group runs. They also provide instructions on how to route traffic, including addresses, protocol settings, and targeting. Request routing is determined by the host name and optional URI. WebLogic Server Multitenant virtual targets are deprecated in WebLogic Server 12.2.1.4.0 and will be removed in the next release. |
Partitions |
For each partition you want to add, click Add and enter the name you want to use for the partition. The partition name must be unique within the domain. WebLogic Server Multitenant domain partitions are deprecated in WebLogic Server 12.2.1.4.0 and will be removed in the next release. |
Configuration Summary |
Review the configuration for your domain by selecting a view and then selecting individual items in the list for that view. If the domain is configured as you want it, click Create to create the domain. If you need to make changes to the configuration, click Back to return to the appropriate screen for the settings you want to change, or click on the links on the left to go to that screen. |
Configuration Progress |
Shows the progress of the domain creation. When the process completes, click Next. |
End of Configuration |
Review the domain creation results. Click Finish to exit the Configuration Wizard. |
Start Oracle WebLogic Server
You must start your Administration Server, Managed Servers, and clusters to complete the installation. For information about starting managed servers using Node Manager and Administration Servers, see Starting and Stopping Administration Servers in Administering Oracle Fusion Middleware.
Depending on your existing security settings, you may need to perform additional configuration before you can manage a domain with secured production mode enabled. For more information, see Connecting to the Administration Server using WebLogic Remote Console
Note:
Secured Production Mode enforces more restrictive and stringent security settings to ensure less vulnerability to threats. To make sure that your domain is secure, after enabling Secured Production Mode, you will have to choose the security configuration options that are appropriate for the environment in which the domain runs, such as obtaining and storing certificates, protecting user accounts, and securing the network on which the domain runs. If these options are not properly configured, you will be blocked from using WebLogic Server.
After you have created your WebLogic domain, several key steps remain to ensure its integrity such as selecting appropriate security configurations. For more information, see Securing the Domain After You Have Created It in Administering Security for Oracle WebLogic Server.
Running Multiple EDQ Servers in the Same Domain
Note:
You must have an appropriate license of WebLogic Server and Coherence to configure EDQ to operate in a cluster.Multiple EDQ managed servers can be configured to run in the same WebLogic Server domain either in a cluster or not. If all the servers are on the same machine, each server must listen on a different port.
The Java Required Files (JRF) Template must be applied to any managed servers that were created using the WebLogic Server Administration Console. This is equivalent to the library targeting performed automatically by the WebLogic Server Configuration Wizard.
The final step is:
-
Use the WebLogic Server Administration Console to modify the managed server settings for the additional EDQ servers.
Once multiple EDQ servers have been configured, you can leave them un-clustered and accessed directly using their respective Launchpad URLs to the relevant port, or you can configure them as part of a cluster using standard WebLogic Server practices. You can configure a separate front-end load balancer to handle incoming web service requests through a single cluster URL.
Configuring EDQ for High-Availability in a WebLogic Server Cluster
You can install and configure Oracle Enterprise Data Quality for high availability in an Oracle WebLogic Server cluster environment. The high availability features in Oracle Enterprise Data Quality have been enhanced to make the system function in a clustered environment, and tolerate individual RAC node failures and reconnect after complete database failures.
For more information on the high-availability features in Enterprise Data Quality, please see Oracle Fusion Middleware Understanding Oracle Enterprise Data Quality.
Configuring EDQ for Clustered Mode Deployment
To install and configure Oracle Enterprise Data Quality for high availability in an Oracle WebLogic Server cluster environment:
-
Plan the installation, and install the prerequisites and software, as described in Prerequisites for these Procedures.
-
Run the Repository Creation Utility (RCU), as described in Creating an EDQ Database Repository. To get a full high availability implementation, you must deploy the schema to an Oracle RAC instance. Deploying to a standard Oracle database does not provide HA capability for the database.
-
Run the Domain Configuration wizard to provision a domain, as described in Creating the WebLogic Server EDQ Domain and with the configuration steps for a cluster noted in Running the Domain Configuration Wizard for a Clustered Deployment.
-
Start EDQ in a cluster as described in Starting EDQ in a Cluster.
-
Enable or configure options for high availability in EDQ. See the sections:
Starting EDQ in a Cluster
Once the domain has been created, start the Admin Server by running the following command:
FMW_HOME/user_projects/domains/edq_domain/bin/startWebLogic.sh
If you select Production Mode for the Domain Mode and JDK screen when you create the domain, you will be prompted for the Administrator user login credentials. Enter the same credentials that you received in the Administrator Account screen
Then start the Node Manager with the command:
FMW_HOME/user_projects/domains/edq_domain/bin/startNodeManager.sh
You can then log onto the Admin Console through a browser and start both of the Managed servers that were created during the configuration steps.
You can access the servers as follows:
-
You can access any of the servers' launchpads. Connect to one of the servers and start the Director application. Or,
-
You can connect via a load balancer in front of the cluster, and will be connected to whichever managed server the load balancer picks.
To display a dialog indicating which server the GUI is connected to, right-click the server in the project browser and select Server Information.
Enabling JMX API and Command Line for HA Clusters
By default, the EDQ high availability deployment templates come with the internal JMX server enabled, but each server uses a dynamically assigned port.
If you are running EDQ as a single managed server and not using a WebLogic cluster, the default management port is 8090.
If you are running EDQ in a WebLogic cluster, EDQ uses random ports which are allocated on server start. You can examine the Server Access MBean to find the current ports, but the ports change when the EDQ servers are restarted.
The standard EDQ JMX client has been updated to query these server access beans automatically, using the MBean server in the Administration server. Specify the host name and port for the Administration server using the command: $ java -jar jmxtools.jar runjob ... adminhost:7001
, and the connector code redirects the request to one of the managed servers on the correct port.
The username and password with jmxtools must be valid in WebLogic and EDQ. Use a user from the authentication provider setup in OPSS.
If a single EDQ server runs on a host, do not run the servers in a cluster. In this case, the default port 8090 is used.
If multiple EDQ managed servers run on the same host, you can configure different fixed port for each server in a cluster. (If the servers are running on different machines, they can use the same port). Edit the oedq.local.home
version of director.properties
file and replace the current management port setting with: [expr]management.port = 8090 + servernum - 1
servernum
is a precomputed value, which is "1" for edq_server1, "2" for edq_server2 and so on.
This setting causes the first EDQ managed server to run the JMX server on port 8090, second EDQ managed server to run the JMX server on port 8091, and so on.
The Siebel connector does not support the use of the admin server/port for JMX calls, but needs an explicit port like 8090.
You can determine the current JMX ports for each running server using the command:
$ java -jar jshell.har scripts/system/serveraccess.groovy -server host:port
-user user -pw pw
host:port
is the domain admin server.
Landing Area
The landing area is a feature that allows EDQ to read and write data to the server file system. If the landing area is to be used in a cluster then some consideration needs to be given as to how the landing area is shared amongst the managed servers. By default the landing area is located in the oedq.local.home area. If this is shared amongst the various hosts supporting EDQ managed servers then the landing area will continue to work as in a non-clustered system.
If the landing area file system is not shared amongst the managed servers, but is required for use then a number of options are available:
-
The location of the landing area can be changed using the
landingarea
property in thedirector.properties
file to a location that is shared amongst the hosts running EDQ managed servers. -
Since an EDQ job runs all its tasks and processes on the same managed server, any files consumed or generated by the job are written to its local landing area. External tasks can be added to the job to transfer any incoming or outgoing files to an appropriate shared location.
-
If an EDQ job is consuming external files then these could be copied to all managed servers before the job is started.
-
If an EDQ job is generating files for consumption by further EDQ jobs then the landing area can be synchronized across the various managed server between the various job run by use of an external tool such as rsync.
If files are being generated and consumed within the same job then a shared landing area may not necessary. This is because the entire job will run on the same managed server and so access the local landing area.