Event Management

Intuitively, you monitor for specific events in your monitored environment. An event is a significant occurrence on a managed target that typically indicates something has occurred outside normal operating conditions--they provide a uniform way to indicate that something of interest has occurred in an environment managed by Enterprise Manager. Examples of events are:

  • Metric Alerts

  • Compliance Violations

  • Job Events

  • Availability Alerts

Existing Enterprise Manager customers may be familiar with metric alerts and metric collection errors. For Enterprise Manager 12c, metric alerts are a type of event, one of many different event types. The notion of an event unifies the different exception conditions that are detected by Enterprise Manager, such as monitoring issues or compliance issues, into a common concept. It is backed by a consistent and uniform set of event management capabilities that can indicate something of interest has occurred in a datacenter managed by Enterprise Manager.

All events have the following attributes:

Table 5-1 Event Attributes

Attribute Description

Type

Type of event that is being reported. All events of a specific type share the same set of attributes that describe the exact nature of the problem. For example, Metric Alert, Compliance Standard Score Violation, or Job Status Change.

Severity

Event severity. For example, Fatal, Warning, or Critical.

Internal Name

An internal name that describes the nature of the event and can be used to search for events. For example, you can search for all tablespacePctUsed events.

Entity on which the event is raised.

An event can be raised on a target, a non-target source object (such as a job) or be related to a target and a non-target source object. Note: This attribute is important when determining what privileges are required to manage the event.

Message

Informational text associated with the event.

Reported Date

Time the event was reported.

Category

Functional or operational classification for an event.

Available Categories:

  • Availability

  • Business

  • Capacity

  • Configuration

  • Diagnostics

  • Error

  • Fault

  • Jobs

  • Load

  • Performance

  • Security

Causal Analysis Update

Used for Root Cause Analysis of target down events.

Possible Values: Root Cause or Symptom

Event Types

The type of an event defines the structure and payload of an event and provides the details of the condition it is describing. For example, a metric alert raised by threshold violation has a specific payload whereas a job state change has a different structure. As shown in the following table, the range of events types greatly expands Enterprise Manager's monitoring flexibility.

Event Type Description

Target Availability

The Target Availability Event represents a target's availability status (Example: Up, Down, Agent Unreachable, or Blackout).

Metric Alert

A metric alert event is generated when an alert occurs for a metric on a specific target (Example: CPU utilization for a host target) or metric on a target and object combination Example: Space usage on a specific tablespace of a database target.

Metric Evaluation Error

A metric evaluation error is generated when the collection for a specific metric group fails for a target.

Job Status Change

All changes to the status of an Enterprise Manager job are treated as events, and these events are made available via the Job Status Change event class.

Note: A prerequisite to creating Incident Rules, is to enable the relevant job status and add required targets to job event generation criteria. To change this criteria, from the Setup menu, select Incidents, and then Job Events.

Compliance Standard Rule Violation

Events are generated for compliance standard rule violations. Each event corresponds to a violation of a compliance rule on a specific target.

Compliance Standard Score Violation

Events are generated for compliance standard score violations. An event is generated when the compliance score for a compliance standard on a specific target falls below predefined thresholds.

High Availability

High Availability events are generated for database availability operations (shutdown and startup), database backups and Data Guard operations (switchover, failover, and other state changes).

Service Level Agreement Alert

These events are generated when a service level or service level objective is violated for a service. occurs for a Service Level Agreement or a Service Level Objective.

User-reported

These events are created by end-users.

Application Performance Management KPI Alert

An Application Performance Management (APM) Key Performance Indicator (KPI) alert event is generated when a KPI violation alert occurs for a metric on an APM managed entity associated with a Business Application target.

JVM Diagnostics Threshold Violation

A JVMD Diagnostics event is raised when a JVMD metric exceeds its threshold value on a Java Virtual Machine target.

Application Dependency and Performance Alert

Alerts are raised by ADP monitoring when metrics related to a J2EE application or component have crossed some thresholds.

Blackout Infrastructure Alert

The Blackout Event represents blackout infrastructure events such as execution failure of a blackout operation as the agent is not reachable. The blackout operations include blackout start, blackout stop, and blackout edit.

Service Infrastructure Alert

These alerts are generated when there is a problem in the service infrastructure.

Target Monitoring Disruption

This event is generated when normal monitoring of the target cannot proceed. This is due to issues such as too many hung threads or higher than expected resource consumption during metric collection. These issues might result in limited target monitoring such as monitoring of target Status only, delayed collection evaluation, or no monitoring of the target.

Event Severity

The severity of an event indicates the criticality of a specific issue. The following table shows the various event severity levels along with the associated icon.

Icon Severity Description
failed

Fatal

Corresponding service is no longer available. For example, a monitored target is down (target down event). A Fatal severity is the highest level severity and only applies to the Target Availability event type.

critical

Critical

Immediate action is required in a particular area. The area is either not functional or indicative of imminent problems.

warning

Warning

Attention is required in a particular area, but the area is still functional.

advisory

Advisory

While the particular area does not require immediate attention, caution is recommended regarding the area's current state. This severity can be used, for example, to report Oracle best practice violations.

Clear

Clear

Conditions that raised the event have been resolved.

information

Informational

A specific condition has just occurred but does not require any remedial action.

Events with an informational severity:

  • do not appear in the incident management UI.

  • cannot create incidents.

  • are not stored within Enterprise Manager.