8 Configuring Search in WebCenter Portal
Configure search to index and search objects in WebCenter Portal.
Note:
Before performing any search, the Portal scheduler needs to be run in order to ensure that the results of the search are shown correctly.Permissions:
To perform the tasks in this chapter, you must be granted the WebLogic Server Admin
role through the Oracle WebLogic Server Administration Console and the Administrator
role granted through WebCenter Portal Administration.
For more information about roles and permissions, see Understanding Administrative Operations, Roles, and Tools.
Note:
Beginning with Release 12c (12.2.1.4.0), Oracle WebCenter Portal has deprecated the support for Oracle SES. If you have upgraded from a prior release, your upgraded instance might be configured to use Oracle SES. In this case, you must configure WebCenter Portal to use Elasticsearch.8.1 Creating a Crawl Admin User in WebCenter Portal
You can designate an existing user as crawl admin or create a crawl admin user (for example, mycrawladmin
) in WebCenter Portal and in your back-end identity management server to search using Elasticsearch. You must create a crawl admin user only once.
Note:
See your identity management system documentation for information on creating users.
The following example uses Oracle Directory Services Manager to create the mycrawladmin
user:
8.2 Integrating with Elasticsearch
Configure Elasticsearch to index and search objects in WebCenter Portal.
8.2.1 Understanding Search with Elasticsearch
Elasticsearch is a highly scalable search engine. It allows you to store, search, and analyze big volumes of data quickly and provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents.
Advantages of Elasticsearch
-
Elasticsearch provides full-text search capabilities as it is built on Lucene.
-
Elasticsearch is document-oriented. It stores data as structured JSON documents and indexes all fields by default, with a higher performance result.
-
Elasticsearch is API driven; actions can be performed using a simple Restful API.
-
Elasticsearch retrieves search results fast because it searches an index instead of searching the text directly.
You can configure Elasticsearch to search the following resources in WebCenter Portal:
-
Documents, including wikis and blogs
-
Portals, page metadata, page content (contents of HTML, Text and Styled Text components), lists, and people resources
8.2.2 Prerequisites for Configuring Elasticsearch
Ensure the following requirements:
-
Oracle WebCenter Portal is installed.
-
Optional. If you choose to use WebCenter Content for search, ensure that WebCenter Content is configured and all required components are enabled. See Managing Connections to Oracle WebCenter Content Server.
8.2.3 Configuration Roadmap for Elasticsearch in WebCenter Portal
Table 8-1 Roadmap - Setting Up Elasticsearch in WebCenter Portal
Actor | Task |
---|---|
Administrator |
|
Administrator |
|
Administrator |
|
Administrator |
Configuring WebCenter Portal for Search |
Administrator | Configuring Search Crawlers |
Administrator |
(Optional) Customizing Search Settings in WebCenter Portal Administration |
Administrator |
(Optional) Modifying Search Global Attributes |
Administrator |
(Optional) Configuring Search Custom Attributes |
Administrator |
(Optional) Creating Custom Facets |
8.2.4 Installing Elasticsearch
The following topics describe how to set up a single server and secure using HTTPS:
8.2.4.1 Setting Up Single Server
Note:
If Elasticsearch 7.17.x is already installed and configured for WebCenter Content Server, the same can be used for WebCenter Portal and the following can be skipped. But, the username used to connect to Elasticsearch in WebCenter Content should be set as the crawl admin user in WebCenter Portal. See Creating a Crawl Admin User in WebCenter Portal.8.2.4.2 Securing Elasticsearch Using HTTPS
8.2.4.2.1 Adding Certificate to WebCenter Portal Keystore
Download Certificate from Elasticsearch Server
- Open the Firefox browser and connect to the Elasticsearch server with the
following command:
https://host_name:9200
where
host_name
is the name of the Elasticsearch server. - Accept the security exception and continue. Provide the login credentials when prompted.
- Click the Lock icon in the URL field and navigate to Connection not secure and then More Information.
- In the pop-up window, click the View Certificate button.
- Click on the link PEM (cert) to download the certificate in the .PEM format.
Add the Certificate to WebCenter Portal Keystore
Once the certificate is downloaded, it should be imported to WebCenter Portal Keystore.
To import:
- Execute the following command in WebCenter Portal server and enter the Keystore
password when
prompted:
keytool -importcert -alias es_cert -file /filepath/sslcertificate/es_certificate.pem -keystore <JDK_path>/jre/lib/security/cacerts
- Restart the Managed server.
Adding certificate to WebCenter Portal Custom Trust Key Store
To add:
- Execute the following command:
keytool -importcert -alias es_cert -file /filepath/sslcertificate/es_certificate.pem -keystore <Custom Trust Key Store File Name>
- Restart Admin and WC_Portal server.
8.2.4.2.2 Search Connection to Elasticsearch Server Running on HTTPS
createSearchConnection(appName='webcenter', name='<conn_name>', url='https://<ES_host>:<ES_port>', indexAliasName='<index_alias>', appUser='<app_user>', appPassword='<app_user_password>')
If the search connection already exists, it should be updated to use HTTPS and can be done using the following command:
setSearchConnection(appName='webcenter', name='<existing_conn_name>', indexAliasName='<existing_index_alias>', appUser='<app_user>', appPassword='<app_user_password>', useSSL=1)
8.3 Configuring WebCenter Content for Search
Enable Faceted Search and Sort
Note:
The desktop client application should be used for the following steps.Navigate to the Configuration Manager Applet, Advanced Search Design, and then select the following field.
- dDocAccount: Click Edit and select the check box Is a filter category (if not selected). Click Ok.
- dDocAuthor: Click Edit and select the check boxes Is a filter category and Is sortable (if not selected). Click Ok.
- dDocLastModifiedDate: Click Edit and select the check boxes Is a filter category and Is sortable (if not selected). Click Ok.
- dDocTitle: Click Edit and select the check box Is sortable (if not selected). Click Ok.
- dFormat: Click Edit and select the check box Is a filter category (if not selected). Click Ok.
- xWCTags: Click Edit and select the check box Is a filter category (if not selected). Click Ok.
Full Collection Rebuild of the Search Index in WebCenter Content
To perform a full collection rebuild:
- Log in to Oracle WebCenter Content server as a system administrator.
- On the Oracle WebCenter Content home page, expand Administration and select Admin Applets.
- On the Administration Applets page, click Repository Manager.
- On the Repository Manager page, click the Indexer tab.
- Under Collection Rebuild cycle section, click Start. In the dialog, deselect the Use fast rebuild option and then click OK to proceed.
8.4 Configuring WebCenter Portal for Search
To configure WebCenter Portal for search, you need to configure the connection between WebCenter Portal and Elasticsearch.
8.5 Synchronizing Users in WebCenter Portal
Before performing a portal full crawl, we recommend you to run the LDAP synchronization WLST command to ensure that all users are available in portal.
8.6 Configuring Search Crawlers
You can configure the following types of crawlers to index WebCenter Portal resources:
-
Portal Crawler: This uses the Portal crawl source to crawl certain objects, such as lists, page metadata, page content (contents of HTML, Text, and Styled Text components), portals, and profiles.
The following topics describe how to create different crawl sources using Scheduler UI in WebCenter Portal Administration:
8.7 Configuring Search Custom Attributes
When you search using WebCenter Portal, only
certain predefined attributes show up in the search results. WebCenter Portal allows you to see additional
attributes in your search results. This can be achieved from the Search Setting page
in portal administration, where the Custom Attributes section lets you select which
custom search attributes should appear in search results and the order in which they
appear. This list in the Search Setting page is driven by
search-service-attributes.xml
. It contains list of all
attributes that we crawl for each service. Types in search index is defined by this
metadata. You can add a new custom attribute or modify the existing one in the
search-service-attributes.xml
file.
The following procedure describes how to add a new search custom attribute using Document service as an example:
8.8 Creating Custom Facets
Oracle WebCenter Portal supports faceted search to refine the search results without running a new search. With faceted search, search results are grouped under a predefined category and thus help users to narrow down the search results based on a relevant category. for example, Author
, Portal
, Last Modified date
.
In Oracle WebCenter Portal, by default, certain
predefined facets are provided in the Search Setting page. The list of facets is driven
by the search-service-custom-facets.xml
metadata file and each facet in
the file is mapped with search-service-attributes.xml
metadata file.
The following is the sample of the search-service-custom-facets.xml
metadata file:
<custom-facet dataType="keyword" displayName="Author" displayNameKey="ES_FACET_AUTHOR" mappedSearchAttribute="author" name="Author" itemsToDisplay="20"/> <custom-facet dataType="keyword" displayName="Service ID" displayNameKey="ES_FACET_SERVICEID" mappedSearchAttribute="wc_serviceId" name="Service ID"/> <custom-facet dataType="keyword" displayName="Portal" displayNameKey="ES_FACET_SCOPE" mappedSearchAttribute="wc_scopeGuid" name="Scope GUID" itemsToDisplay="20"/> <custom-facet dataType="keyword" displayName="Tags" displayNameKey="ES_FACET_TAGS" mappedSearchAttribute="wc_tagWords" name="Tags"itemsToDisplay="20"/> <custom-facet dataType="keyword" displayName="Mimetype" displayNameKey="ES_FACET_MIMETYPE" mappedSearchAttribute="mimetype" name="Mimetype" itemsToDisplay="20"/> <custom-facet dataType="date" displayName="Last Modified Date" displayNameKey="ES_FACET_LASTMODIFIED" mappedSearchAttribute="lastmodified" name="Last Modified Date" />
where,
-
name
is the name of the facet. -
displayNameKey
is the value of the custom facet metadata field. -
displayName
is the display name of the facet.Note:
If your business is supported in multiple languages, you can translate the newly added custom facets to the desired languages. See Translating Strings for Search Facets. -
mappedSearchAttribute
is used to map the custom facet with the search attribute. This value should be same asname
attribute value in thesearch-service-attributes.xml
metadata file. -
dataType
is the type of data. The available data types arekeyword
anddate
. This value should be same as the type value in thesearch-service-attributes.xml
metadata file. itemsToDisplay
is an optional attribute that defines the maximum number of items to be displayed under a facet. If the value is not specified, the default value is configured using option Administration > Tools and Services > Search in WebCenter Portal.Note:
This attribute is supported only if dataTypekeyword
is used.
Based on your business needs, you can add or modify the list of the facets in the
search-service-custom-facets.xml
metadata file for any of the
services available in Oracle WebCenter Portal, for
example, you can add custom facets for documents, people and other services listed in
the search-service-custom-facets.xml
metadata file.
This section shows how to add a custom facet for the document service. To add custom facet for the document service, you need to first add a custom metadata field in Oracle WebCenter Content, and then rebuild the content index.
The following steps shows you how to add the custom facet for the document service:
8.8.1 Adding a Custom Metadata Field in Oracle WebCenter Content
8.8.2 Enable Faceted Search in Configuration Manager Applet
Desktop client application should be used to enable faceted search for the Custom Metadata field.
- Navigate to Configuration Manager Applet, and then to Advanced Search Design, select the field on which you want to enable faceted search and then click Edit.
- In the Advanced Options, select the field Is a filter category to enable the variable to participate in faceted search.
- After OK is clicked, the Rebuild Search Index button will be enabled. Click this button to rebuild the search index.
8.8.3 Configuring the Search Setting Metadata
You need to add the defined custom attribute in Oracle WebCenter Portal. For the new custom attribute to appear in the search settings page, you need to add or update the search-service-attributes.xml
.
8.8.4 Configuring the Search Custom Facet Metadata
In WebCenter Portal Search Setting page, you can select which facets to display with search results. This list in the Search Setting page is driven by the search-service-custom-facets.xml
metadata file. It contains a list of facets used in WebCenter Portal. Each facet in the search-service-custom-facets.xml
metadata file is mapped with a custom attribute using the mappedSearchAttribute attribute.
8.8.5 Update the Metadata for the Document in Your Portal
CustomMetadataField
.
8.8.6 Configuring the File Exclusion in Search Results
You can configure to exclude certain file types from the search results. In
order to exclude the file types, the file types have to be specified in the metadata file
search-service-exclude-file-types.xml
.
- Export Metadata Service Document
- Update the File Types to Filter
- Import Metadata Service Document
- Verifying File Exclusion in Search Results
Export Metadata Service Document
The location of the file in MDS is
/oracle/webcenter/search/scopedMD/s8bba98ff_4cbb_40b8_beee_296c916a23ed/search-service-exclude-file-types.xml
.
search-service-exclude-file-types.xml
, enter the following
command:exportMetadata(application='webcenter', server='WC_Portal', toLocation='/tmp/search', docs='/oracle/webcenter/search/scopedMD/s8bba98ff_4cbb_40b8_beee_296c916a23ed/search-service-exclude-file-types.xml')
In the metadata service document
search-service-exclude-file-types.xml
, you can customize the
file types to be excluded.
Update the File Types to Filter
The metadata describes how we can specify the files types which have to be excluded during the search.
<?xml version='1.0' encoding='UTF-8'?>
<search-service-exclude-file-types xmlns="http://xmlns.oracle.com/webcenter/search/excludefiletypes">
<exclude-file-in-search>
<fileType extension=".xml" customType="site-studio-file" description="Exclude Site Studio data files that have extension .xml"/>
<fileType extension=".wav" description="Exclude all the file types with extension .wav"/>
<fileType extension=".jpeg" description="Exclude all the file types with extension .jpeg"/>
</exclude-file-in-search>
</search-service-exclude-file-types>
- The extension of the file which we want to filter from the search has to be entered in this metadata file.
- The
customType
attribute is optional. If we want to exclude the Site Studio data files in the search results, specify thecustomType
assite-studio-file
for extension ".xml". - The
description
attribute is optional. It explains the details of the files being excluded.
Import Metadata Service Document
To import the metadata service document
search-service-exclude-file-types.xml
, enter the following
command:
importMetadata(application='webcenter', server='WC_Portal', fromLocation='/tmp/search', docs='/oracle/webcenter/search/scopedMD/s8bba98ff_4cbb_40b8_beee_296c916a23ed/search-service-exclude-file-types.xml')
Verifying File Exclusion in Search Results
To verify the file types exclusion, click the Search icon and specify the file type, for example, *.wav or *.jpeg, and notice that these file types are excluded in the search result. Similarly, search on the .xml file, and notice that the .xml files created in the Site Studio are excluded.
8.9 Scheduling a Crawl
You can schedule an incremental search crawl or manually start a full crawl or manually start a fast crawl or manually start a reindex crawl. The topics in this section describe how to schedule a crawl and how to start, enable, or disable a crawl.
8.9.1 Scheduling an Incremental Crawl
8.9.2 Enabling and Disabling a Scheduled Crawl
8.10 Customizing Search Settings in WebCenter Portal Administration
You can customize Result Types and Filtering, Search Scope, Facets, and Custom Attributes on the Search Settings page in WebCenter Portal Administration. Portal managers can reset only the search scope for the portals that they manage.
To customize search settings for Elasticsearch:
8.11 Modifying Search Global Attributes
Note:
The attributes discussed below are of Elastisearch.The attributes wcESConnectionTimeoutPeriod
and wcESReadTimeoutPeriod
are used to configure the interaction between WebCenter Portal and Elasticsearch.
The following are the attributes:
-
wcESConnectionTimeoutPeriod is the connection timeout interval, in seconds. This is the amount of time WebCenter Portal will wait to establish the connection to the Elasticsearch server. The default value is 30 seconds.
-
wcESReadTimeoutPeriod is the read timeout interval, in seconds. Once WebCenter Portal is connected to the Elasticsearch server, this specifies the amount of time allowed for the Elasticsearch server to respond in a given request. The default value is 30 seconds.
You can modify the default value of the attributes in Attributes page in WebCenter Portal administration. After you modify the value, you must restart the WebCenter Portal server for the changes to take effect.