8 Starting and Stopping Coherence
This chapter includes the following sections:
- Using the Bootstrap API
Coherence has a simple bootstrap API that enables you to configure and start a Coherence application by building acom.tangosol.net.Coherence
instance and starting this instance. - Using the com.tangosol.net.Coherence Class
Cache servers are cluster members that are responsible for storing cached data. - Using the Legacy CacheFactory Client
Cache clients are cluster members that join the cluster to interact with the cluster's services. - Stopping Cluster Members
You can stop cluster members from the command line or programmatically. - Performing a Rolling Restart
A rolling restart is a technique for restarting cache servers in a cluster that ensures no data is lost during the restart.
Parent topic: Using Coherence Clusters
Using the Bootstrap API
Coherence has a simple bootstrap API that enables you to configure and start a Coherence application by building a com.tangosol.net.Coherence
instance and starting this instance.
com.tangosol.net.Session
instances. A
com.tangosol.net.Session
gives access to Coherence
clustered resources, such as NamedMap
,
NamedCache
, NamedTopic
, and so on.
Sessions can be of different types. For example, a session can be:
- related to a
ConfigurableCacheFactory
. - configured from a configuration file.
- a client-side gRPC session.
Example 8-1 Application Bootstrap Code
import com.tangosol.net.Coherence;
import com.tangosol.net.CoherenceConfiguration;
import com.tangosol.net.SessionConfiguration;
public class Main
{
public static void main(String[] args)
{
// Create a session configuration
SessionConfiguration session = SessionConfiguration.builder()
.named("Carts")
.withConfigUri("cache-config.xml")
.build();
// Create a Coherence instance configuration
CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.withSession(SessionConfiguration.defaultSession())
.withSession(session)
.build();
// Create the Coherence instance from the configuration
Coherence coherence = Coherence.clusterMember(cfg);
// Start Coherence
coherence.start();
}
}
- The 'Create a session configuration' part creates
SessionConfiguration
from thecache-config.xml
configuration file with theSession
nameCarts
. - The 'Create a Coherence instance configuration' part creates
CoherenceConfiguration
to configure theCoherence
instance. This configuration contains theCarts
session configuration. - The 'Create the Coherence instance from the configuration' part
creates the
Coherence
cluster member instance fromCoherenceConfiguration
. - The 'Start Coherence' part starts the
Coherence
instance.
This section includes the following topics:
- Running a Coherence Server
- Session Configurations
- Configuring a Coherence Instance
- Creating a Coherence Instance
- Starting Coherence
- Obtaining a Coherence Instance
- Ensuring Coherence Has Started
- Adding Coherence Lifecycle Interceptors
Parent topic: Starting and Stopping Coherence
Running a Coherence Server
com.tangosol.net.Coherence
contains a main
method that you can use to run a Coherence server. This method is a more powerful
alternative to the legacy DefaultCacheServer
class.$ java -cp coherence.jar com.tangosol.net.Coherence
Without any other configuration, the default Coherence
instance started using the above command runs a server that is identical to the server
that is run using DefaultCacheServer
.
Parent topic: Using the Bootstrap API
Session Configurations
You can create a create a session configuration
named SessionConfiguration
by using
the SessionConfiguration
builder.
See the example in Using the Bootstrap API.
This section includes the following topics:
- About the Default Session
- Naming a Session
- Specifying the Session Configuration URI
- Adding Session Event Interceptors
- Scoping a Session Configuration
Parent topic: Using the Bootstrap API
About the Default Session
When running Coherence, if you have not specified any configuration, the default configuration file is used to configure Coherence. This behavior continues to apply to the bootstrap API.
If you start a Coherence
instance without specifying any session
configurations, it creates a single default Session
. This default
Session
wraps the cache factory interface
ConfigurableCacheFactory
that is created from the default
configuration file. See Interface ConfigurableCacheFactory. The
default file name of the default configuration file is
coherence-cache-config.xml
unless the name is overridden with
the coherence.cacheconfig
system property.
When creating a Coherence configuration instance named
CoherenceConfiguration
, you can add the default session by using
the SessionConfiguration.defaultSession()
helper method. This method
returns a SessionConfiguration
that is configured to create the default
session Session
.
CoherenceConfiguration
:CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.withSession(SessionConfiguration.defaultSession())
.build();
Parent topic: Session Configurations
Naming a Session
All sessions have a name that must be unique within the application. If you have not
specified a name when the SessionConfiguration
is built, the default
name of $Default$
will be used. A Coherence
instance
fails to start if duplicate Session
names exist.
Examples:
- This configuration will have the default name
(
$Default$
):SessionConfiguration session = SessionConfiguration.builder() .build();
- This configuration will have the name
Test
:SessionConfiguration session = SessionConfiguration.builder() .name("Test") .build();
Parent topic: Session Configurations
Specifying the Session Configuration URI
The most common type of session is a wrapper around
ConfigurableCacheFactory
. When using the
SessionConfiguration
builder, you can specify the configuration
file URI by using the withConfigUri()
method. This method accepts a
string value for specifying the location of the configuration file.
cache-config.xml
:SessionConfiguration session = SessionConfiguration.builder()
.withConfigUri("cache-config.xml")
.build();
If you do not specify a configuration URI, the default value will be used.
The default value is coherence-cache-config.xml
unless this value
is overridden with the coherence.cacheconfig
system property.
Parent topic: Session Configurations
Adding Session Event Interceptors
Coherence provides many types of events. For example, life-cycle events for Coherence
itself, cache life-cycle events, cache entry events, partition events, and so on. These
events can be listened to by implementing an EventInterceptor
that
receives specific types of event. Event interceptors can be registered with a
Session
as part of its configuration.
CacheInterceptor
that listens to the
CacheLifecycleEvent
event when caches get created or destroyed. You
can add this interceptor to the session as shown
below:SessionConfiguration session = SessionConfiguration.builder()
.withInterceptor(new CacheInterceptor())
.build();
The interceptor will receive cache life-cycle events for all caches that are created using the session.
Parent topic: Session Configurations
Scoping a Session Configuration
Scope is a concept that helps you scope services and isolate them from other services
with the same name. For example, you can have multiple
ConfigurableCacheFactory
instances loaded from the same XML
configuration file but with different scope names. Scoping ensures that each
ConfigurableCacheFactory
instance has its own services in the
cluster.
Unless you require multiple sessions, a scope will not generally be used in a configuration.
withScopeName()
method, as shown in the following
example:SessionConfiguration session = SessionConfiguration.builder()
.withScopeName("Test")
.build();
The session (and any ConfigurableCacheFactory
it wraps)
created from the above configuration will have a scope name of
Test
.
<defaults>
section of the XML configuration file. An example of
scoped-configuration.xml
file is shown
below:<?xml version="1.0"?>
<cache-config xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xmlns="http://xmlns.oracle.com/coherence/coherence-cache-config"
xsi:schemaLocation="http://xmlns.oracle.com/coherence/coherence-cache-config coherence-cache-config.xsd">
<defaults>
<scope-name>Test</scope-name>
</defaults>
A ConfigurableCacheFactory
instance created from this XML file, and any
Session
that wraps it, will have a scope name of
Test
.
Note:
When using the bootstrap API, any scope name specifically configured in
SessionConfiguration
(that is not the default scope name)
overrides the scope name in the XML file.
For example, the following scenarios show the different ways of defining the scope
name using the scoped-configuration.xml
file (see the above
example):
- In this case, the scope name will be
Foo
because the scope name has been explicitly set inSessionConfiguration
:SessionConfiguration session = SessionConfiguration.builder() .withConfigUri("scoped-configuration.xml") .withScopeName("Foo") .build();
- In this case, the scope name will be
Foo
because, although no scope name has been explicitly set inSessionConfiguration
, the name has been set toFoo
. Therefore, the scope name will default toFoo
.SessionConfiguration session = SessionConfiguration.builder() .named("Foo") .withConfigUri("scoped-configuration.xml") .build();
- In this case, the scope name will be
Test
as no scope name or session name has been explicitly set inSessionConfiguration
. Therefore, the scope name ofTest
will be used from the XML configuration.SessionConfiguration session = SessionConfiguration.builder() .withConfigUri("scoped-configuration.xml") .build();
- In this case, the scope name will be
Test
as the session name has been set toFoo
. However, the scope name has been explicitly set to the default scope name using the constantCoherence.DEFAULT_SCOPE
. Therefore, the scope name ofTest
will be used from the XML configuration.SessionConfiguration session = SessionConfiguration.builder() .named("Foo") .withScopeName(Coherence.DEFAULT_SCOPE) .withConfigUri("scoped-configuration.xml") .build();
Parent topic: Session Configurations
Configuring a Coherence Instance
CoherenceConfiguration
. An instance of
CoherenceConfiguration
is created using the builder. For
example:CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.build();
This section includes the following topics:
- Adding Sessions
- Configuring a Session for Auto-Discovery
- Naming Coherence Instances
- Adding Global Event Interceptors
Parent topic: Using the Bootstrap API
Adding Sessions
A Coherence
instance manages one or more Session
instances. You can add these session instances to
CoherenceConfiguration
by adding the
SessionConfiguration
instances to the builder.
Coherence
instance runs a single Session
that uses the default configuration
file.CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.build();
The above configuration configures a Coherence
instance with the default
name and with a single Session
that uses the default configuration
file.
CoherenceConfiguration
:CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.withSession(SessionConfiguration.defaultSession())
.build();
CoherenceConfiguration
:SessionConfiguration session = SessionConfiguration.builder()
.named("Carts")
.withConfigUri("cache-config.xml")
.build();
CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.withSession(session)
.build();
While there is no limit to the number of sessions you can configure, the majority of applications require only a single session, which is most likely the default session.
Parent topic: Configuring a Coherence Instance
Configuring a Session for Auto-Discovery
You can configure a CoherenceConfiguration
to automatically discover
the SessionConfiguration
instances. These instance are discovered using
the Java ServiceLoader
. Any instances of
SessionConfiguration
or
SessionConfiguration.Provider
that are configured as services in
the META-INF/services/
files, will be loaded.
This feature is useful if you are building modular applications where you want to include
the functionality in a separate application module that uses its own
Session
. The SessionConfiguration
instance for the
module is made discoverable by the Java ServiceLoader
. When the
module’s jar file is on the classpath, the Session
gets created and the
module’s functionality becomes available to the application.
CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.discoverSessions()
.build();
In this example, the call to discoverSessions()
loads the
SessionConfiguration
instances that have been discovered by the
Java ServiceLoader
.
Parent topic: Configuring a Coherence Instance
Naming Coherence Instances
Each Coherence
instance must be uniquely named. You can specify a
name by using the named()
method on the builder. If you do not specify
a name, then the default name of $Default$
will be used.
In the majority of use-cases, an application requires only a single
Coherence
instance. Therefore, there will be no requirement to
specify a name.
Coherence
instance with the name
Carts
.CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.named("Carts")
.build();
Parent topic: Configuring a Coherence Instance
Adding Global Event Interceptors
You can add event interceptors to a SessionConfiguration
instance to
receive events for a session. See Adding Session Event Interceptors. Event interceptors can also be added to the Coherence
instance to
receive events for all Session
instances managed by that
Coherence
instance.
CacheInterceptor
class for
caches in all sessions, then the interceptor receives events for both the default
session and the Certs
session:SessionConfiguration cartsSession = SessionConfiguration.builder()
.named("Carts")
.withConfigUri("cache-config.xml")
.build();
CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.withSession(SessionConfiguration.defaultSession())
.withSession(cartsSession)
.withInterceptor(new CacheInterceptor())
.build();
Parent topic: Configuring a Coherence Instance
Creating a Coherence Instance
You can use the CoherenceConfiguration
instance to
create a Coherence
instance.
Coherence
instance in one of the following two
modes:
- cluster member
- client
The mode chosen affects how some types of Session
are
created and whether auto-start services have started.
As the name suggests, a "cluster member" is a Coherence
instance that expects to start or join a Coherence cluster. In a cluster member, any
Session
that wraps a ConfigurableCacheFactory
has its services auto-started and monitored (this is the same behavior as seen when
you use the legacy DefaultCacheServer
to start a server).
A "client" Coherence
instance is typically not a cluster member, it
is a Coherence*Extend or a gRPC client. As such, Session
instances
that wrap a ConfigurableCacheFactory
are not auto-started. Instead,
they start on demand as resources such as maps, caches, or topics are requested from
them.
The com.tangosol.net.Coherence
class has static factory methods to
create Coherence
instances in different modes.
- To create a
Coherence
instance that is a cluster member, use theCoherence.clusterMember
method, as shown below:CoherenceConfiguration cfg = CoherenceConfiguration.builder() .build(); Coherence coherence = Coherence.clusterMember(cfg);
- To create a
Coherence
instance that is a client, use theCoherence.client
method, as shown below:CoherenceConfiguration cfg = CoherenceConfiguration.builder() .build(); Coherence coherence = Coherence.client(cfg);
This section includes the following topic:
Parent topic: Using the Bootstrap API
Creating a Default Coherence Instance
Coherence
instance without specifying any
configuration.Coherence coherence = Coherence.clusterMember();
Coherence coherence = Coherence.client();
In the above examples, the Coherence
instance will have
the default Session
and any discovered sessions.
Parent topic: Creating a Coherence Instance
Starting Coherence
Coherence
instance must be started to start all the
sessions that the Coherence
instance is managing. You can start the
Coherence
instance by calling the start()
method, as shown
below:Coherence coherence = Coherence.clusterMember(cfg);
coherence.start();
Parent topic: Using the Bootstrap API
Obtaining a Coherence Instance
To avoid having to pass around the instance of Coherence
that was
used to bootstrap an application, the Coherence
class has some
static methods that make it simple to retrieve an instance.
Coherence
is being used in
an application (which is true for most use-cases), use the
getInstance()
method:Coherence coherence = Coherence.getInstance();
CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.named("Carts")
.build();
Coherence.create(cfg);
Coherence coherence = Coherence.getInstance("Carts");
Parent topic: Using the Bootstrap API
Ensuring Coherence Has Started
If an application code needs to ensure that a Coherence
instance has
started before doing work, use the whenStarted()
method. This
method obtains a CompletableFuture
that will be completed when the
Coherence
instance starts.
Coherence coherence = Coherence.getInstance("Carts");
CompletableFuture<Void> future = coherence.whenStarted();
future.join();
There is also a corresponding whenStopped()
method that returns a
future that will be completed when the Coherence
instance
stops.
Parent topic: Using the Bootstrap API
Adding Coherence Lifecycle Interceptors
Besides using the future methods as described in Ensuring Coherence Has Started, you can add an EventInterceptor
to the
configuration of a Coherence
instance to receive life-cycle events.
Coherence.LifecycleListener
.public class MyInterceptor implements Coherence.LifecycleListener {
public void onEvent(CoherenceLifecycleEvent event) {
// process event
}
}
CoherenceConfiguration cfg = CoherenceConfiguration.builder()
.withSession(SessionConfiguration.defaultSession())
.withInterceptor(new MyInterceptor())
.build();
When you start or stop a Coherence
instance created from
this configuration, the MyInterceptor
instance starts receiving the
life-cycle events.
Parent topic: Using the Bootstrap API
Using the com.tangosol.net.Coherence Class
Cache servers are cluster members that are responsible for storing cached data.
A cluster may be comprised of many cache servers. Each cache server runs in its own JVM.
This section includes the following topics:
- Overview of the Coherence Class
- Starting Cache Servers From the Command Line
- Starting Cache Servers Programmatically
- Coherence Lifecycle Listeners
Parent topic: Starting and Stopping Coherence
Overview of the Coherence Class
The com.tangosol.net.Coherence
class is used to start a cache
server. A cache server can be started from the command line or can be started
programmatically. The following arguments are used when starting a cache server:
-
The
--version
argument will display the version of Coherence being used. If this is the only parameter passed to the Coherence main method, then the version will be displayed and Coherence will exit. -
The name of a cache configuration file that is found on the classpath or the path to a Grid ARchive (GAR). If both are provided, then the GAR takes precedence. A GAR includes the artifacts that comprise a Coherence application and adheres to a specific directory structure. A GAR can be left as a directory or can be archived with a
.gar
extension. See Building a Coherence GAR Module in Administering Oracle Coherence. -
An optional application name for the GAR. If no name is provided, then the archive name is used (the directory name or the file name without the
.gar
extension). The name provides an application scope that is used to separate applications on a cluster.
Parent topic: Using the com.tangosol.net.Coherence Class
Starting Cache Servers From the Command Line
Cache servers are often started from the command line using the Coherence class main
method. Using the various lifecycle listeners in Coherence to
initialize application code means that there is not typically a
need for any application specific main class. Use the Java
-cp
option to indicate the location
of the coherence.jar
file, any application
.jar
files, and the location where
any Coherence configuration files are located. The location of
the configuration files must precede the
coherence.jar
file on the
classpath; otherwise, the default configuration files that are
located in the coherence.jar
file are used to
start the cache server instance. See Understanding Configuration.
The following example starts a cache server member, uses any configuration files
that are placed in the
COHERENCE_HOME
\config
directory.
java -server -Xms512m -Xmx512m -cp COHERENCE_HOME
\config;COHERENCE_HOME\lib\coherence.jar com.tangosol.net.Coherence
The following example starts a cache server member and uses the Coherence application artifacts that are packaged in the MyGar.gar
file. The default name (MyGAR
) is used as the application name.
java -server -Xms512m -Xmx512m -cp COHERENCE_HOME
\config;COHERENCE_HOME\lib\coherence.jar com.tangosol.net.Coherence D:\example\MyGAR.gar
Note:
The cache configuration file that is packaged in a GAR file takes precedence over a cache configuration file that is located on the classpath.
The COHERENCE_HOME
\bin\cache-server
script,
from the Oracle Coherence commercial version installer, is
provided as a convenience and example script to start a cache
server instance. The script is available for both Windows
(cache-server.cmd
) and UNIX-based
platforms (cache-server.sh
). The script sets up
a basic environment and then runs the Coherence
class. The scripts are typically modified as required for a
particular application’s requirements.
Tip:
During testing, it is sometimes useful to create multiple scripts with different names that uniquely identify each cache server. For example: cahe-server-a
, cache-server-b
, and so on.
Lastly, a cache server can be started on the command line by using the java -jar
command with the coherence.jar
library. Cache servers are typically started this way for testing and demonstration purposes. For example:
java -jar COHERENCE_HOME\lib\coherence.jar
Parent topic: Using the com.tangosol.net.Coherence Class
Starting Cache Servers Programmatically
Coherence cluster members that are started using the Bootstrap API, or by running the
com.tangosol.net.Coherence.main()
method, can be stopped by calling the
static com.tangosol.net.Coherence.closeAll()
method.
The legacy DefaultCacheServer
class provides the following two methods
that are used to shutdown a cache server:
Note:
Shutdown is supposed to be called in a standalone application where it shuts down the instance which theDefaultCacheServer
class itself maintains as a static
member.
shutdown()
- This is a static method that is used to shut down a cache server that was started on a different thread using theDefaultCacheServer.main()
orDefaultCacheServer.start()
methods.shutdownServer()
– This method is called on aDefaultCacheServer
instance which an application keeps hold of.
Parent topic: Using the com.tangosol.net.Coherence Class
Coherence Lifecycle Listeners
When starting Coherence cluster members or clients, Coherence offers various lifecycle events that can be used to trigger application specific logic before, during, and after the Coherence start-up lifecycle. Using lifecycle event listeners can often be used in place of a custom application main class to perform application specific start-up and shutdown logic.
Lifecycle events are available for the following instances:
Coherence
instancesSession
instancesConfigurableCacheFactory
instancesDefaultCacheServer
instances
For more information about descriptions of these events and listeners, see Coherence Lifecycle Listeners.
Parent topic: Using the com.tangosol.net.Coherence Class
Using the Legacy CacheFactory Client
This section includes the following topics:
Parent topic: Starting and Stopping Coherence
Disabling Local Storage
Cache clients that use the partition cache service (distributed caches) should not maintain any partitioned data. Cache clients that have storage disabled perform better and use less resources. Partitioned data should only be distributed among cache server instances.
Local storage is disabled on a per-process basis using the coherence.distributed.localstorage
system property. This allows cache clients and servers to use the same configuration descriptors. For example:
java -cp COHERENCE_HOME\config;COHERENCE_HOME\lib\coherence.jar -Dcoherence.distributed.localstorage=false com.MyApp
Parent topic: Using the Legacy CacheFactory Client
Using the CacheFactory Class to Start a Cache Client
Applications that use the com.tangosol.net.CacheFactory
class to get an instance of a cache become cluster members and are considered cache clients. The following example demonstrates the most common way of starting a cache client:
CacheFactory.ensureCluster();
NamedCache cache = CacheFactory.getCache("cache_name");
When starting an application that is a cache client, use the Java -cp
option to indicate the location of the coherence.jar
file and the location where the tangosol-coherence-override.xml
and coherence-cache-config.xml
files are located. The location of the configuration files must precede the coherence.jar
file on the classpath; otherwise, the default configuration files that are located in the coherence.jar
file are used to start the cache server instance. See Understanding Configuration.
The following example starts an application that is a cache client, uses any configuration files that are placed in the COHERENCE_HOME
\config
directory, and disables storage on the member.
java -cp COHERENCE_HOME\config;COHERENCE_HOME\lib\coherence.jar -Dcoherence.distributed.localstorage=false com.MyApp
The COHERENCE_HOME
\bin\coherence
script is provided for testing purposes and can start a cache client instance. The script is available for both Windows (coherence.cmd
) and UNIX-based platforms (coherence.sh
). The script sets up a basic environment, sets storage to be disabled, and then runs the CacheFactory
class, which returns a prompt. The prompt is used to enter commands for interacting with a cache and a cluster. The scripts are typically modified as required for a particular cluster. The class can also be started directly from the command line instead of using the script. For example:
java -cp COHERENCE_HOME\config;COHERENCE_HOME\lib\coherence.jar -Dcoherence.distributed.localstorage=false com.tangosol.net.CacheFactory
If a Coherence application is packaged as a GAR, the GAR can be loaded by the CacheFactory
instance using the server
command at the prompt after the client member starts.
server [<path-to-gar>] [<app-name>]
The following example loads the Coherence application artifacts that are packaged in the MyGar.gar
file. The default name (MyGAR
) is used as the application name.
Map (?) server D:\example\MyGAR.gar
Parent topic: Using the Legacy CacheFactory Client
Stopping Cluster Members
This section includes the following topics:
- Prerequisites for Stopping All Cluster Members
- Stopping Cluster Members From the Command Line
- Stopping Cache Servers Programmatically
Parent topic: Starting and Stopping Coherence
Prerequisites for Stopping All Cluster Members
To ensure there is no data loss during a controlled shutdown, you can leverage the service suspend feature. A service is considered suspended only after all the data is fully written, including active persistence mode, asynchronous persistence tasks, entries in the write-behind queue of a read-write backing map, and other asynchronous operations. Outstanding operations are completed and no new operations are allowed against the suspended services.
Thus, for a controlled complete shutdown of a cluster, Oracle recommends to execute
the Coherence ClusterMBean operation suspendService("Cluster")
,
which shuts down all services gracefully before shutting down the cluster
members.
Parent topic: Stopping Cluster Members
Stopping Cluster Members From the Command Line
Cluster members are most often shutdown using the kill
command when on the UNIX platform and Ctrl+c
when on the Windows platform. These commands initiate the standard JVM shutdown hook which is invoked upon normal JVM termination.
Note:
Issuing the kill -9
command triggers an abnormal JVM termination and the shutdown hook does not run. However, a graceful shutdown is generally not required if a service is known to be node-safe (as seen using JMX management) before termination.
The action a cluster member takes when receiving a shutdown command is configured in the operational override file within the <shutdown-listener>
element. The following options are available:
-
none
- Perform no explicit shutdown actions. -
force
- Perform a hard-stop on the node by callingCluster.stop()
. -
graceful
- (Default) Perform a normal shutdown by callingCluster.shutdown(
)
. -
true
- Same asforce
. -
false
- Same asnone
.
The coherence.shutdown.timeout
system property configures the duration to
wait for shutdown to complete before timing out. The default value is 2 minutes. If a
Coherence application uses persistence, write-behind cache store, or any other asynchronous
operation that takes longer than 2 minutes, then it is necessary to configure the timeout
to be longer to allow the pending asynchronous operations to complete. The system property
value is configured as a time duration such as "3s" for 3 seconds, "5m" for 5 minutes, or
"1hr" for 1 hour.
When the time duration to complete graceful shutdown exceeds the
coherence.shutdown.timeout
time, the JVM process is considered hung and
the JVM process terminates abruptly using halt. There exists a chance that not all
outstanding asynchronous operations completed when the shutdown times out. Therefore, it is
important to ensure that coherence.shutdown.timeout
is configured to a
time duration that is sufficiently long for the number of outstanding asynchronous
operations an application environment may have.
The following example sets the shutdown hook to none
.
<?xml version='1.0'?> <coherence xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://xmlns.oracle.com/coherence/coherence-operational-config" xsi:schemaLocation="http://xmlns.oracle.com/coherence/coherence-operational-config coherence-operational-config.xsd"> <cluster-config> <shutdown-listener> <enabled system-property="coherence.shutdownhook">none</enabled> </shutdown-listener> </cluster-config> </coherence>
The coherence.shutdownhook
system property is used to specify the shutdown hook behavior instead of using the operational override file. For example:
-Dcoherence.shutdownhook=none
Parent topic: Stopping Cluster Members
Stopping Cache Servers Programmatically
The DefaultCacheServer
class provides two methods that are used to shutdown a cache server:
Note:
Shutdown is supposed to be called in a standalone application where it shuts down the instance which the DefaultCacheServer
class itself maintains as a static member.
-
shutdown()
– This is a static method that is used to shut down a cache server that was started on a different thread using theDefaultCacheServer.main()
orDefaultCacheServer.start()
methods. -
shutdownServer()
– This method is called on aDefaultCacheServer
instance which an application keeps hold of.
Parent topic: Stopping Cluster Members
Performing a Rolling Restart
Rolling restarts are commonly performed when a cache server or its host computer must be updated or when upgrading a cache server to a new patch set release or patch set update release. However, the technique can also be used whenever you want to restart a cache server that is currently managing a portion of cached data.
Note:
When upgrading a cluster, a rolling restart can only be used to upgrade patch set releases or patch set update releases, but not major or minor releases. See Release Number Format in Administering Oracle Fusion Middleware.
This section includes the following topics:
- Prerequisites for a Rolling Restart
- Considerations for Application Upgrades
- Restarting Cache Servers for a Rolling Restart
- Persistence Roll Back After or During a Rolling Restart
Parent topic: Starting and Stopping Coherence
Prerequisites for a Rolling Restart
A rolling restart requires initial considerations and setup before you restart a cluster. A rolling restart cannot be performed on a cluster that does not meet the following prerequisites:
-
When changing the JDK version, you can do a rolling restart only between patch versions of the same major JDK version, for example, patches within JDK 17, such as 17.07 to 17.0.8. However, you cannot perform a rolling upgrade if you move between major JDK versions, for example, from JDK 11 to JDK 17.
-
Do not change the serialization type, for example, moving from Java to POF serialization or conversely because it is not supported.
-
If you add fields to the cache classes and they do not support evolvability (see Class Versioning and Evolution), then you cannot perform a rolling upgrade.
-
If you are performing an application upgrade, to determine if your application upgrade can be done using a rolling restart, see Considerations for Application Upgrades.
-
If you are changing any operational or cache configuration during the upgrade, to determine if it is a supported change during a rolling upgrade, see the documentation on the operational or cache configuration elements. If it is not able to be changed, then note similar to "You cannot change this element during a rolling restart." will be present.
-
The cache servers in a cluster must provide enough capacity to handle the shutdown of a single cache server (n minus 1 where n is the number of cache servers in the cluster). An out-of-memory exception or data eviction can occur during a redistribution of data if the cache servers are running at capacity. See Cache Size Calculation Recommendations in Administering Oracle Coherence.
-
Remote JMX management must be enabled on all cache servers and at least two cache servers must contain an operational MBean server. Ensure that you can connect to the MBean servers using an MBean browser such as JConsole. See Using JMX to Manage Oracle Coherence in Managing Oracle Coherence.
- If your environment is using static lambda serialization, then it is recommended that you configure dynamic lambda serialization mode before attempting a rolling restart. See Considerations for a Rolling Upgrade for details.
If a cache service is configured to use asynchronous backups, use the
shutdown
method to perform an orderly shut down instead of the
stop
method or kill -9
. Otherwise, a member may
shutdown before asynchronous backups are complete. The shutdown
method
guarantees that all updates are complete.
If using persistence, while a rolling restart is agnostic to the format of the data stored on disk, there are scenarios where you can take precautionary steps to ensure that there is a 'savepoint' to which the cluster can be rolled back if faced with a catastrophic event. On-disk persisted files may become unreadable when going from a later to an earlier version.
Therefore, Oracle recommends you to perform a persistent snapshot of the relevant services before the roll. See Using Snapshots to Persist a Cache Service. This can be performed while suspending the service (to ensure global consistency) or not based on the tolerance of the application. The snapshot offers a 'savepoint' to which the cluster can be rolled back if an issue occurs during the roll.
Note:
Coherence may change the format of the data saved on disk. When rolling from a version that includes a change in the storage format, Oracle strongly recommends creating a snapshot before roll/upgrade.Parent topic: Performing a Rolling Restart
Considerations for Application Upgrades
-
Considerations for Building Upgradable Coherence Applications
-
Federation Upgrade Considerations
-
Persistence Upgrade Considerations
Parent topic: Performing a Rolling Restart
Restarting Cache Servers for a Rolling Restart
Use these instructions to restart a cache server. If you are restarting the host computer, then make sure all cache server processes are shutdown before shutting down the computer.
Note:
The following instructions assume that none of the cache servers share the same Coherence JAR or ORACLE_HOME
. If some of the cache servers share the same Coherence JAR or ORACLE_HOME
, then treat the servers as a logical unit and perform the steps on each of the servers at the same time.
To restart a cache server:
Parent topic: Performing a Rolling Restart
Persistence Roll Back After or During a Rolling Restart
If an issue is observed during the rolling restart, it is advisable to roll back the entire cluster. You can roll back the cluster by rolling the nodes with the new version of the application (and/or Coherence) back to their prior versions, or in more severe cases, a complete restart of the cluster.
If the new version being migrated to includes a Coherence patch that modifies the storage format (as highlighted in the Oracle Coherence Release Notes) the persistent stores could be in a mixed state of the old and new format.
During a full cluster restart, the mixed state can result in a failure to
recover, causing the failed stores being moved into the trash. At this point, you can
choose the forceRecovery
operation of the
PersistenceManagerMBean
(in JConsole, for example,
Coherence/Persistence/<servicename>/PersistenceCoordinator
) to force Coherence to
reinstate empty versions of those partitions that could not be recovered. If you have
created a snapshot of the relevant services, it is possible to recover the snapshot,
reverting the state of those services.
Parent topic: Performing a Rolling Restart