1 Oracle Exadata

This chapter provides information about the Oracle Exadata metrics.

For each metric, it provides the following information:

  • Description

  • Metric table

    The metric table can include some or all of the following: target version, default collection frequency, default warning threshold, default critical threshold, and alert text.

It includes the metrics collected for the following target types:

Oracle Exadata Storage Server

The Oracle Exadata target monitors the software and hardware performance of an individual Oracle Exadata Storage Server in the database.

Aggregated Exadata Capacity

This metric category contains the aggregated metrics of the Exadata Capacity metric category. It is collected every 60 minutes.

Target Version: All Versions

Collection Frequency: Every 60 Minutes

Metric Description
Disk Size (GB) This metric gives an indication of the size of the disk in GB.
Disk Type This metric reports the disk type that the metrics apply to: hard disk, flash disk, flash cache, or grid disk.
Allocated (%) This metric gives an indication of the percentage allocation of the total number of bytes for the hard disk, flash disk, flash cache, and grid disk.

Aggregated Exadata CellDisk

This metric category contains the aggregated cell disk performance metrics. The metric values are aggregated over all the cell disks in a cell. They are mainly aggregated via averaging and totaling.
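As an illustration of aggregation by averaging and totaling, the following minimal sketch derives an average, a maximum, and a total from per-cell-disk values. The cell disk names, the sample values, and the `aggregate` helper are hypothetical; this is not how Enterprise Manager itself computes the metrics.

```python
from statistics import mean

# Hypothetical per-cell-disk I/O load samples for a single cell.
# Names and values are invented for illustration only.
cell_disk_io_load = {
    "CD_00_cell01": 4.0,
    "CD_01_cell01": 6.0,
    "CD_02_cell01": 11.0,
}


def aggregate(values):
    """Return the average, maximum, and total of per-cell-disk values."""
    samples = list(values)
    return {
        "average": mean(samples),
        "maximum": max(samples),
        "total": sum(samples),
    }


summary = aggregate(cell_disk_io_load.values())
print(summary)
```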

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Average CellDisk IO Load This metric gives an indication of the average input/output load to the cell disk.
Average CellDisk Read IOPS This metric gives an indication of the average number of read input/output operations per second.
Average CellDisk Read Response Time This metric gives an indication of the average read response time to the cell disk.
Average CellDisk Read Throughput This metric gives an indication of the average number of bytes read from the cell disk.
Average CellDisk Write IOPS This metric gives an indication of the average number of write input/output operations per second to the cell disk.
Average CellDisk Write Response Time This metric gives an indication of the average write response time to the cell disk.
Average CellDisk Write Throughput This metric gives an indication of the average number of bytes written to the cell disk.
Maximum CellDisk IO Load This metric gives an indication of the maximum input/output load to the cell disk.
Total CellDisk IO Load This metric gives an indication of the total input/output load to the cell disk. The Total CellDisk IO Load is the aggregated number of I/O requests waiting to be serviced by the storage server disks at any given point in time. You can think of this as the length of the queue for I/O requests.

Because requests can be for either small or large reads, no single number indicates a potential performance issue. Oracle cannot recommend a number because each customer environment is often unique. Monitor the value of the I/O load over time; a value that correlates with poor response time is a good candidate for a metric threshold. An Exadata system is underutilized if the I/O load is less than 20.

Total CellDisk Read IOPS This metric gives an indication of the total number of read input/output operations per second to the cell disk.
Total CellDisk Read Throughput This metric gives an indication of the total number of bytes read from the cell disk.
Total CellDisk Write IOPS This metric gives an indication of the total number of write input/output operations per second to the cell disk.
Total CellDisk Write Throughput This metric gives an indication of the total number of bytes written to the cell disk.
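
The guidance for Total CellDisk IO Load suggests choosing a threshold from the I/O load values that coincide with poor response time. A minimal sketch of that idea follows, assuming you have already exported paired samples of I/O load and response time. The `candidate_threshold` function, the sample data, and the 8 ms target are hypothetical and are not part of Enterprise Manager.

```python
def candidate_threshold(samples, max_response_ms):
    """Return the lowest observed I/O load whose response time exceeded
    max_response_ms, or None if no sample was slower than the target."""
    slow_loads = [load for load, response_ms in samples
                  if response_ms > max_response_ms]
    return min(slow_loads) if slow_loads else None


# Hypothetical (total I/O load, average read response time in ms) pairs.
history = [(12, 3.1), (25, 4.0), (48, 9.5), (60, 14.2), (33, 5.2)]

# Treat anything slower than 8 ms as "poor" for this environment (an assumption).
print(candidate_threshold(history, max_response_ms=8.0))
```

The returned value is only a starting point. Review it against a longer history before setting it as a warning or critical threshold.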

Aggregated Exadata Diskgroup Capacity

This metric category contains the aggregated capacity metrics for ASM instances and disk groups.

Target Version: All Versions

Collection Frequency: Every 60 Minutes

Metric Description
ASM Instance This metric reports the ASM instance name for the aggregated Exadata disk group.
Diskgroup Name This metric reports the name of the aggregated Exadata disk group.
Count This metric reports the total number of grid disks for the specific disk group.
Size (GB) This metric reports the size in GB of the aggregated Exadata disk group.

Aggregated Exadata FlashDisk and HardDisk

This metric category contains metrics that are aggregated over either the hard disks or flash disks in a cell.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Average CellDisk IO Load This metric reports the average input and output load to the cell disk.
Average CellDisk IO Utilization This metric indicates the average utilization for I/O requests from the cell disk.
Average CellDisk Large Read IOPS This metric indicates the average number of read input and output operations from large blocks in a cell disk.
Average CellDisk Large Read Response Time This metric reports the average response time to read large blocks from the cell disk.
Average CellDisk Large Read Throughput This gives an indication of the average number of bytes read from the large blocks from the hard disks or flash disks in a cell.
Average CellDisk Large Write IOPS This gives an indication of the average number of input and output operations written to large blocks of the hard disks or flash disks in a cell.
Average CellDisk Large Write Response Time This gives an indication of the average response time when writing large blocks to the cell disk.
Average CellDisk Large Write Throughput This gives an indication of the average number of bytes written when writing large blocks to the cell disk.
Average CellDisk Read IOPS This metric gives an indication of the average number of read input/output operations from the hard disks or flash disks in a cell.
Average CellDisk Read Response Time This metric reports the average read response time to the cell disk.
Average CellDisk Read Throughput This metric gives an indication of the average number of bytes read from the hard disks or flash disks in a cell.
Average CellDisk Small Read IOPS This gives an indication of the average number of read input and output operations from small blocks in a cell disk.
Average CellDisk Small Read Response Time This metric reports the average response time when reading small blocks from the cell disk.
Average CellDisk Small Read Throughput This gives an indication of the average number of bytes read from the small blocks from the hard disks or flash disks in a cell.
Average CellDisk Small Write IOPS This gives an indication of the average number of input and output operations written to small blocks of the hard disks or flash disks in a cell.
Average CellDisk Small Write Response Time This metric reports the average response time when writing small blocks to the cell disk.
Average CellDisk Small Write Throughput This gives an indication of the average number of bytes written when writing small blocks to the cell disk.
Average CellDisk Write IOPS This metric gives an indication of the average number of input/output operations written to the hard disks or flash disks in a cell.
Average CellDisk Write Response Time This metric reports the average response time when writing to the cell disk.
Average CellDisk Write Throughput This metric gives an indication of the average number of bytes written to the hard disks or flash disks in a cell.
CellDisk Type This metric reports the type of Cell disk, either hard disk or flash disk.
Maximum CellDisk Small Read Response Time This metric reports the maximum response time when reading small blocks from the cell disk.
Maximum CellDisk Small Write Response Time This metric reports the maximum response time when writing small blocks to the cell disk.
Total CellDisk IO Load This metric reports the total input/output load to the cell disk. The Total CellDisk IO Load is the aggregated number of I/O requests waiting to be serviced by the storage server disks at any given point in time. You can think of this as the length of the queue for I/O requests.

Because requests can be for either small or large reads, no single number indicates a potential performance issue. Oracle cannot recommend a number because each customer environment is often unique. Monitor the value of the I/O load over time; a value that correlates with poor response time is a good candidate for a metric threshold. An Exadata system is underutilized if the I/O load is less than 20.

Total CellDisk IO Utilization This metric reports the total utilization for I/O requests to the cell disk.
Total CellDisk Read IOPS This metric reports the total number of read input/output operations per second from the hard disks or flash disks in a cell.
Total CellDisk Read Throughput This metric reports the total number of bytes read from the hard disks or flash disks in a cell.
Total CellDisk Write IOPS This metric reports the total number of write input/output operations per second to the hard disks or flash disks in a cell.
Total CellDisk Write Throughput This metric reports the total number of bytes written to the hard disks or flash disks in a cell.

Aggregated Exadata Sparse Diskgroup Capacity

This metric category contains the sparse aggregated capacity metrics for ASM instances and disk groups.

Target Version: All Versions

Collection Frequency: Every 60 Minutes

Metric Description
Count This metric reports the total number of grid disks for the specific disk group.
Size (GB) This metric reports the size in GB of the aggregated Exadata sparse disk group.
Virtual Size (GB) This metric reports the virtual size in GB of the aggregated Exadata sparse disk group.

Cell Generated Alert

This metric category contains the cell generated alert metrics. It is shown whenever the Exadata Storage Server (cell) generates an alert and Enterprise Manager subscribes to the cell's SNMP alert.

Target Version: 11g, 12c

Collection Frequency: N/A

Metric Description
ADR Incident ID This metric shows the alert Automatic Diagnostic Repository (ADR) unique identifier for Enterprise Manager Incident Manager.
ADR Problem Key This metric shows the alert ADR problem key.
ADR Trace File Name This metric shows the name of the alert ADR trace file.
Action This metric shows the recommended action to perform for this alert.
Alert Begin Time This metric shows the time stamp when an alert changes its state.
Alert Object This metric shows the Alert Object Name, such as cell disk or grid disk, for which a metric threshold has caused an alert.
Alert Type This metric shows the type of the alert. Values are stateful or stateless.

Default Warning Threshold: Warning

Default Critical Threshold: Critical

Alert Text: Alert from %target% is cleared: %msg%

Alert Name This metric shows the name of the alert.
Alert Sequence This metric shows the alert sequence.
ECID This metric shows the alert ADR Execution Context ID (ECID).
Examined By This metric shows the administrator who reviewed the alert.
Msg This metric shows a brief explanation of the alert.
Notification This metric shows the number indicating progress in notifying subscribers to alert messages.
Sequence Begin Time This metric shows the time stamp when an alert sequence ID is first created.
Severity This metric shows the Severity level. Possible values are clear, info, warning, or critical.

Cell ILOM Generated Alert

This metric category contains the cell ILOM generated alert metrics. It is shown whenever the Exadata Storage Server (cell) ILOM generates an alert and Enterprise Manager subscribes to the cell's SNMP alert.

Target Version: 11g, 12c

Collection Frequency: N/A

Metric Description
Chassis Id This metric shows the chassis ID of the cell ILOM.
Fault Class This metric shows the fault class of the cell ILOM alert.
Fault Message Id This metric shows the fault message ID of the cell ILOM alert.
Fault Status This metric shows the fault status of the cell ILOM alert.
Fault Unique Id (UUID) This metric shows the unique fault ID (UUID) of the cell ILOM alert.
Product Name This metric shows the product name.

Exadata Services Status

This metric category contains the Exadata services status metric.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
CellSrv Status This metric shows the status of the Cell Services (CellSrv) service.
MS Status This metric shows the status of the Management Server service.
RS Status This metric shows the status of the Restart Server service.

Exadata Cell Metric

This metric category contains the performance metrics collected at the cell level for each cell, such as CPU utilization and memory utilization.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
CPU Utilization This metric provides information about the CPU utilization.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: CPU Utilization for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Cell Name This is the short name of the Exadata Storage Server without domain suffix.
Disk I/O Objective This metric provides the optimization objective which IORM is configured to achieve. For example, "Low Latency" or "Balanced" for OLTP-oriented databases, or "High Throughput" for data warehouses.
Exadata Run Queue Length This metric provides information about the Exadata run queue length.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Exadata Run Queue Length for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Exadata Temperature Lower Threshold This metric reports the lower or minimum temperature threshold for the ambient operating temperature for the Exadata machine.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Exadata Temperature Lower Threshold for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Exadata Temperature Reading This metric reports the ambient operating temperature for the Exadata machine.
Exadata Temperature Upper Threshold This metric reports the upper or maximum temperature threshold for the ambient operating temperature for the Exadata machine.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Exadata Temperature Upper Threshold for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

IORM Boost This metric reports the ratio of the cumulative number of positions in the I/O queue that were skipped because of IORM scheduling to the number of I/Os that were scheduled.
LED Status This metric provides the status of the locator LED (on or off).
Memory Utilization This metric provides information about the memory utilization.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Memory Utilization for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Network Received This metric reports the total number of I/O packets received by interconnections per second.
Network Sent This metric reports the total number of I/O packets transmitted by interconnections per second.
Offload Efficiency This metric provides information about the offload efficiency.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Offload Efficiency for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.
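
The alert text entries in this chapter use placeholders delimited by percent signs, such as %target% and %value%. To show how such a template reads once the placeholders are filled in, here is a minimal sketch. The `render_alert_text` helper and the sample values are hypothetical, and the substitution logic is an assumption made for illustration, not Enterprise Manager's implementation.

```python
import re


def render_alert_text(template, values):
    """Replace each %name% placeholder with values[name].

    Placeholders that have no entry in values are left unchanged.
    """
    def substitute(match):
        name = match.group(1)
        return str(values.get(name, match.group(0)))

    return re.sub(r"%(\w+)%", substitute, template)


# Template copied from the CPU Utilization metric in this chapter.
template = (
    "CPU Utilization for %target%:%object_name%:%cell_name% is %value%, "
    "crossed warning (%warning_threshold%) or critical "
    "(%critical_threshold%) threshold."
)

# Hypothetical values for one alert.
sample = {
    "target": "cell01.example.com",
    "object_name": "cell01",
    "cell_name": "cell01",
    "value": 92,
    "warning_threshold": 80,
    "critical_threshold": 90,
}

print(render_alert_text(template, sample))
```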

Exadata CellDisk Metric

This metric category contains performance metrics for each cell disk. The metric values are collected for each cell disk.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Average Large Read Response Time This metric reports the average response time to read large blocks from the cell disk.
Average Large Write Response Time This metric reports the average response time when writing large blocks to the cell disk.
Average Read Response Time This metric reports the average read response time to the cell disk.
Average Response Time This metric reports the average response time to the cell disk.
Average Small Read Response Time This metric reports the average response time when reading small blocks from the cell disk.
Average Small Write Response Time This metric reports the average response time when writing small blocks to the cell disk.
Average Write Response Time This metric reports the average response time when writing to the cell disk.
CellDisk Type This metric reports the celldisk type, either hard disk or flash disk.
IO Load This metric reports the average input/output load to the cell disk.
IO Utilization This metric reports the percentage utilization for I/O requests.
Large Read Bytes This metric reports the number of MB read in large blocks from a cell disk.

Default Critical Threshold: Not Defined

Default Warning Threshold: Not Defined

Alert Text: Large Read Bytes for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Large Read Requests This metric reports the number of requests to read large blocks from a cell disk.

Default Critical Threshold: Not Defined

Default Warning Threshold: Not Defined

Alert Text: Large Read Requests for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Large Write Bytes This metric reports the number of MB written in large blocks to a cell disk.

Default Critical Threshold: Not Defined

Default Warning Threshold: Not Defined

Alert Text: Large Write Bytes for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Large Write Requests This metric reports the number of requests to write large blocks to a cell disk.

Default Critical Threshold: Not Defined

Default Warning Threshold: Not Defined

Alert Text: Large Write Requests for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Object Name This metric reports the cell disk name.
Read IOPS This metric reports the number of read input/output operations per second to a cell disk.
Read Throughput (MBPS) This metric reports the number of bytes in MB per second read from a cell disk.
Small Read Bytes This metric reports the number of MB read in small blocks from a cell disk.

Default Critical Threshold: Not Defined

Default Warning Threshold: Not Defined

Alert Text: Small Read Bytes for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Small Read Requests This metric reports the number of requests to read small blocks from a cell disk.

Default Critical Threshold: Not Defined

Default Warning Threshold: Not Defined

Alert Text: Small Read Requests for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Small Write Bytes This metric reports the number of MB written in small blocks to a cell disk.

Default Critical Threshold: Not Defined

Default Warning Threshold: Not Defined

Alert Text: Small Write Bytes for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Small Write Requests This metric reports the number of requests to write small blocks to a cell disk.

Default Critical Threshold: Not Defined

Default Warning Threshold: Not Defined

Alert Text: Small Write Requests for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Write IOPS This metric reports the number of write input/output operations per second to a cell disk.
Write Throughput (MBPS) This metric reports the number of bytes in MB per second written to a cell disk.

Exadata CellDisk Load Imbalance

This metric category contains the Exadata CellDisk Load Imbalance metrics.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
IO Load Imbalance This metric gives an indication of the percentage of maximum average I/O load from the cell disk.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: CellDisk %object_name% is %cd_io_load_imbalance%% load imbalance, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Object Name This metric reports the name of the object, such as the hard disk or flash disk name.

Exadata Disk Status Metric

This metric category contains the status of the physical Exadata disk.

Target Version: All Versions

Collection Frequency: Every 1 Hour

Metric Description
Disk Status This metric reports the status of the physical disk.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Physical Disk Status %target%:%object_name% is %value%, equaled to warning (%warning_threshold%) or critical (%critical_threshold%) value.

Exadata Flash Cache IORM Database Metric

This metric category contains the IO statistics for the flash cache by database.

Metric Description
Cell Name This is the short name of the Exadata Storage Server without domain suffix.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Size (MB) This metric shows the disk size in MB of the Flash Cache IORM Database.

Target Version: 10g, 11g, 12cR1

Collection Frequency: Every 24 hours

Exadata Flash Cache IORM Pluggable Database Metric

This metric category contains the IO statistics for the flash cache by pluggable database.

Metric Description
Cell Name This is the short name of the Exadata Storage Server without domain suffix.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Size (MB) This metric shows the disk size in MB of the Flash Cache IORM Pluggable Database.

Target Version: 13c

Collection Frequency: Every 15 Minutes

Exadata Flash Cache Metric

This metric category contains the performance metrics for the flash cache in a cell.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
All I/O Requests This metric reports the cumulative number of read requests to flash cache since the metric was created.
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
Default Hits This metric reports the number of read requests satisfied from flash cache non-keep objects since the last metric collection.
Default Hits (%) This metric reports the percentage of read requests to non-keep objects that are satisfied from flash cache since the last metric collection. Exadata Storage Server automatically decides which objects will be put in flash cache as non-keep objects. In general, the higher the hit rate, the better the performance.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Default hits rate for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Default Misses This metric reports the number of read requests to non-keep objects which did not find all data in flash cache since the last metric collection.
Default Misses (%) This metric reports the percentage of read requests to non-keep objects which did not find all data in flash cache since the last metric collection. In general, a low number of read misses indicates better performance. However, in cases where it is not beneficial to put data objects of a large size into flash cache, a high number of read misses does not necessarily indicate performance issues.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Default misses rate for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Default Read IOPS This metric reports the number of read requests per second which were satisfied from flash cache non-keep objects.
Default Read Throughput (MBPS) This metric reports the size of data read per second from flash cache non-keep objects.
Default Used (GB) This metric reports the space used for non-keep objects on flash cache.
Destage Write To Disk Per Second This metric reports the cumulative number of requests per second to write to flash cache since the metric was created.
First Writes This metric reports the cumulative number of requests to write new data to flash cache since the metric was created.
First Writes Per Second This metric reports the number of requests per second to write new data to flash cache since the last metric collection.
Flash Cache Population Writes Per Second This metric reports the number of requests per second that are population writes into the flash cache due to a read miss.
I/O Requests Keep Pool Misses This metric reports the cumulative number of read requests to keep objects which did not find all data in flash cache since the metric was created.
I/O Requests Read Misses This metric reports the cumulative number of read requests which did not find all data in flash cache since the metric was created.
I/O Requests for keep This metric reports the cumulative number of read requests to keep objects since the metric was created.
Keep Hits This metric shows the number of read requests satisfied from Flash Cache keep objects since the last metric collection.
Keep Hits (%) This metric reports the percentage of read requests to keep objects that are satisfied from Flash Cache since the last metric collection. In general, the higher the keep hit rate, the better the performance.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Keep hits rate for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Keep Misses This metric reports the number of read requests to keep objects which did not find all data in Flash Cache since the last metric collection.
Keep Misses (%) This metric reports the percentage of read requests to keep objects which did not find all data in Flash Cache since the last metric collection. In general, a low number of read misses indicates better performance. However, in cases where it is not beneficial to put data objects of a large size into flash cache, a high number of read misses does not necessarily indicate performance issues.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Keep misses rate for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Keep Overwrites Per Second This metric reports the number of megabytes per second pushed out of the flash cache because of space limits for keep objects.
Keep Pool Read IOPS This metric reports the number of read requests per second which were satisfied from Flash Cache keep objects.
Keep Pool Read Throughput (MBPS) This metric reports the size of data read per second from Flash Cache keep objects.
Keep Pool Used (GB) This metric reports the space used for keep objects on Flash Cache.
Overwrites This metric reports the cumulative number of requests to overwrite existing data in flash cache.
Overwrites Per Second This metric reports the cumulative number of requests per second to overwrite existing data in flash cache.
Read Hit Ratio for Random I/O This metric reports the read hit ratio, which is calculated by dividing Read IOPS by the sum of Read IOPS and disk reads per second.
Read IOPS for Random I/O This metric reports the number of read requests per second from flash cache, for random I/O.
Read IOPS for Scan This metric reports the number of IO read per second from flash cache, for scan data.
Read Misses (MB) This metric reports the cumulative size of data read from disk which did not find all data from Flash Cache since the metric was created.
Read Throughput Redirected to Disk for Scan (MBPS) This metric reports the size of data read per second from disk, for scan data.
Read Throughput for Random I/O (MBPS) This metric reports the throughput of data read from flash cache for random I/O.
Read Throughput for Scan (MBPS) This metric reports the number of megabytes read per second from flash cache, for scan data.
Reads (MB) This metric reports the cumulative size of data read from Flash Cache since the metric was created.
Reads for Keep (MB) This metric reports the cumulative size of data read from Flash Cache keep objects since the metric was created.
Used (GB) This metric reports the size of used space on flash cache.
Write IO requests that bypass Flash Cache This metric reports the cumulative number of writes that bypass flash cache due to the large size of requested objects since the metric was created.
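
The hit, miss, and read hit ratio metrics above are simple ratios. The sketch below works through the arithmetic with invented numbers. It assumes that the total number of read requests in a collection interval equals hits plus misses, which this chapter does not state explicitly; the function names are hypothetical.

```python
def percentage(part, whole):
    """Return part / whole as a percentage, or 0.0 when whole is zero."""
    return 100.0 * part / whole if whole else 0.0


def hit_and_miss_pct(hits, misses):
    """Return (hits %, misses %), assuming total requests = hits + misses."""
    total = hits + misses
    return percentage(hits, total), percentage(misses, total)


def read_hit_ratio(flash_read_iops, disk_reads_per_sec):
    """Read IOPS divided by the sum of Read IOPS and disk reads per second."""
    denominator = flash_read_iops + disk_reads_per_sec
    return flash_read_iops / denominator if denominator else 0.0


# Hypothetical values for one collection interval.
hits_pct, misses_pct = hit_and_miss_pct(hits=900, misses=100)
ratio = read_hit_ratio(flash_read_iops=4500, disk_reads_per_sec=500)
print(hits_pct, misses_pct, ratio)
```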

Exadata Flash IORM Consumer Group Metric

This metric category contains the IO statistics of flash by consumer group.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Average I/O Throughput (MB/Sec) This metric reports the number of megabytes of I/O per second for this consumer group to flash.
Average Wait Time for I/O (ms/req) This metric reports the average IORM wait time per request issued by a consumer group.
Average IORM Wait Time for Large I/O (ms/req) This metric reports the average IORM wait time per request issued by a consumer group for large I/O.
Average IORM Wait Time for Small I/O (ms/req) This metric reports the average IORM wait time per request issued by a consumer group for small I/O.
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
I/O Requests per Second (IO/sec) This metric reports the number of IO requests issued by a consumer group to flash per second.
I/O Requests per Second - Large (IO/Sec) This metric reports the number of large IO requests issued by a consumer group to flash per second.
I/O Requests per Second - Small (IO/Sec) This metric reports the number of small IO requests issued by a consumer group to flash per second.
I/O Utilization (%) This metric reports the percentage of flash resources utilized by requests from this Consumer Group.

Exadata Flash IORM Database Metric

This metric category contains the IO statistics of flash by database.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Average I/O Throughput (MB/Sec) This metric reports the average number of megabytes of I/O per second for this database to flash.
Average Wait Time for I/O (ms/req) This metric reports the average IORM wait time per request issued to the flash by a database.
Average IORM Wait Time for Large I/O (ms/req) This metric reports the average IORM wait time per request issued to the flash by a database for large I/O.
Average IORM Wait Time for Small I/O (ms/req) This metric reports the average IORM wait time per request issued to the flash by a database for small I/O.
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
I/O Requests per Second (IO/sec) This metric reports the number of IO requests issued by a database to the flash per second.
I/O Requests per Second - Large (IO/Sec) This metric reports the number of large IO requests issued by a database to the flash per second.
I/O Requests per Second - Small (IO/Sec) This metric reports the number of small IO requests issued by a database to the flash per second.
I/O Utilization (%) This metric reports the percentage of flash resources utilized by requests from this database.

Exadata Flash IORM Pluggable Database Metric

This metric category contains the IO statistics of flash by pluggable database.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Average I/O Throughput (MB/Sec) This metric reports the average number of megabytes of I/O per second for this pluggable database to flash disks.
Average IORM Wait Time for I/O (ms/req) This metric reports the average IORM wait time per request issued by a pluggable database to the flash disks.
Average IORM Wait Time for Large I/O (ms/req) This metric reports the average IORM wait time per request issued by a pluggable database to the flash disks for large I/O.
Average IORM Wait Time for Small I/O (ms/req) This metric reports the average IORM wait time per request issued by a pluggable database to the flash disks for small I/O.
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
I/O Requests per Second (IO/sec) This metric reports the number of IO requests issued by a pluggable database to the flash disk per second.
I/O Requests per Second - Large (IO/Sec) This metric reports the number of large IO requests issued by a pluggable database to the flash disk per second.
I/O Requests per Second - Small (IO/Sec) This metric reports the number of small IO requests issued by a pluggable database to the flash disk per second.
I/O Utilization (%) This metric reports the percentage of flash resources utilized by requests from this pluggable database.

Exadata Flash Log Metric

This metric category contains the Exadata Flash Log metrics.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
Cumulative Disk Write Errors This metric reports the cumulative number of write errors encountered while writing to hard disks.
Cumulative Flash Write Errors This metric reports the cumulative number of write errors encountered while writing to flash disks.
Efficiency of Smart Flash Log Over the Past Hour This metric provides the efficiency of Smart Flash Log over the past hour, that is, the ratio between the number of redo log writes completed by Smart Flash Log in the past hour and the total number of redo log writes in the past hour.
Efficiency of Smart Flash Logging (%) This metric provides the efficiency of Smart Flash Logging expressed as a percentage, that is, the ratio between the number of redo log writes completed by Smart Flash Log and the total number of redo log writes.
Megabytes per second Written to Flash This metric reports the number of megabytes per second written to flash disk.
Megabytes per second Written to Hard Disk This metric reports the number of megabytes per second written to hard disk.
Redo Data Kept This metric provides the number of bytes of redo data kept over time.
Redo Writes Exceeding Outlier Threshold This metric provides the number of redo writes that exceed the outlier threshold over time.
Redo Writes Prevented from Exceeding Outlier Threshold This metric provides the number of redo writes that were prevented from exceeding the outlier threshold over time.
Skipped Large Writes This metric provides the number of write operations that were skipped for Large I/O.
Skipped Writes Due to Slow Disk This metric provides the number of write operations that were skipped because the hard disk was slow to respond.
Skipped Writes Due to Slow Disk During Last Minute This metric provides the number of write operations that were skipped in the last minute because the hard disk was slow to respond.
Skipped Writes Due to Unavailable Buffer This metric provides the number of write operations that were skipped due to the unavailability of the buffer.
Writes Serviced This metric provides the number of write operations that were serviced over the selected time range.
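
The Efficiency of Smart Flash Logging (%) metric in the table above is described as a ratio of two write counts. The following is a minimal Python sketch of that arithmetic only. The function name, its parameters, and the handling of an empty interval are assumptions made for illustration and are not part of any Oracle interface.

```python
def smart_flash_log_efficiency(flash_log_writes: int, total_redo_writes: int) -> float:
    """Return the share of redo log writes completed by Smart Flash Log, as a percentage."""
    if total_redo_writes <= 0:
        # Assumption: with no redo writes in the interval, report 0 rather than fail.
        return 0.0
    return 100.0 * flash_log_writes / total_redo_writes


# Hypothetical counts: 750 of 1000 redo log writes completed by Smart Flash Log.
print(smart_flash_log_efficiency(750, 1000))  # prints 75.0
```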

Exadata IORM Consumer Group Metric

This metric category contains the Exadata IORM Consumer Group metrics.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Average I/O Throughput (MB/Sec) This metric reports the number of megabytes of I/O per second for this consumer group to hard disks.
Average Wait Time for I/O (ms/req) This metric reports the average IORM wait time per request issued by a consumer group.
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
I/O Requests per Second (IO/sec) This metric reports the number of IO requests issued by a consumer group to hard disks per second.
I/O Utilization (%) This metric reports the percentage of disk resources utilized by requests from this Consumer Group.

Exadata IORM DB

This metric category contains the metrics collected for the IORM databases.

Target Version: All Versions

Collection Frequency: Every 15 Minutes

Metric Description
Average I/O Load This metric reports the average I/O load from this database for hard disks.
Average I/O Throughput (MB/Sec) This metric reports the number of megabytes of I/O per second for this database to hard disks.
Average Wait Time for I/O (ms/req) This metric reports the average wait time for I/O requests.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Average Throttle Time per Disk I/O by Database for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Wait Time for Large I/O (ms/req) This metric reports the average wait time for large I/O requests.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Average IORM wait time of Large Request in seconds for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Wait Time for Small I/O (ms/req) This metric reports the average wait time for small I/O requests.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Average IORM wait time of Small Request in seconds for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average latency of reading or writing blocks/request from flash disks This metric reports the average latency of reading or writing blocks per request by a database from or to flash disks.
Average latency of reading or writing blocks/request from hard disks This metric reports the average latency of reading or writing blocks per request by a database from or to hard disks.
Average latency of reading or writing large blocks/request from hard disks This metric reports the average latency of reading or writing large blocks per request by a database from or to hard disks.
Average latency of reading or writing small blocks/request from hard disks This metric reports the average latency of reading or writing small blocks per request by a database from or to hard disks.
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
Cumulative latency of reading or writing blocks from flash disks This metric reports the cumulative latency of reading or writing blocks by a database from or to flash disks.
Cumulative latency of reading or writing large blocks from hard disks This metric reports the cumulative latency of reading or writing large blocks by a database from or to hard disks.
Cumulative latency of reading or writing small blocks from hard disks This metric reports the cumulative latency of reading or writing small blocks by a database from or to hard disks.
I/O Requests per Second (IO/Sec) This metric reports the number of IO requests issued by a database to hard disks per second.
I/O Requests per Second - Large (IO/Sec) This metric reports the number of large IO requests issued by a database to hard disks per second.
I/O Requests per Second - Small (IO/Sec) This metric reports the number of small IO requests issued by a database to hard disks per second.
IO Utilization (%) This metric reports the percentage utilization for I/O requests.
Large I/O Utilization (%) This metric reports the percentage utilization for large I/O requests.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Database IO Utilization for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Small IO Utilization (%) This metric reports the percentage utilization for small I/O requests.
Wait Time for Large I/O (ms) This metric specifies the average number of milliseconds that large I/O requests issued by the database have waited to be scheduled by IORM in the past minute. A large value indicates that the I/O workload from this database is exceeding the allocation specified for it in the inter-database plan.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Wait Time of Large Requests for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Wait Time for Small I/O (ms) This metric specifies the average number of milliseconds that small I/O requests issued by the database have waited to be scheduled by IORM in the past minute. A large value indicates that the I/O workload from this database is exceeding the allocation specified for it in the inter-database plan.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Wait Time of Small Requests for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.
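
The alert texts in this category use %name% placeholders such as %target%, %value%, and %warning_threshold%. The sketch below shows one way such a template could be expanded in Python when post-processing alert text, for example in a custom report. It is an illustration only: how Enterprise Manager expands these placeholders internally is not described here, and the function name and sample values are hypothetical.

```python
import re


def render_alert(template: str, values: dict) -> str:
    """Replace each %name% placeholder with its value.

    Placeholders with no entry in `values` are left unchanged.
    """
    def substitute(match):
        key = match.group(1)
        return str(values.get(key, match.group(0)))

    return re.sub(r"%(\w+)%", substitute, template)


template = (
    "Wait Time of Small Requests for %target%:%object_name%:%cell_name% "
    "is %value%, crossed warning (%warning_threshold%) or critical "
    "(%critical_threshold%) threshold."
)

# Hypothetical sample values.
print(render_alert(template, {
    "target": "cell01",
    "object_name": "SALESDB",
    "cell_name": "cell01",
    "value": 42,
    "warning_threshold": 20,
    "critical_threshold": 40,
}))
```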

Exadata IORM Pluggable Database Metric

This metric category contains the metrics collected for the IORM pluggable databases.

Target Version: 12c

Collection Frequency: Every 15 Minutes

Metric Description
Average I/O Load This metric reports the average I/O load from this pluggable database for hard disks.
Average I/O Throughput (MB/Sec) This metric reports the number of megabytes of I/O per second for this pluggable database to hard disks.
Average Wait Time for Large I/O (ms/req) This metric reports the average wait time for large I/O requests.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Average IORM Wait time per Large request in milliseconds for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Wait Time for Small I/O (ms/req) This metric reports the average wait time for small I/O requests.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Average IORM Wait time per Small request in milliseconds for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Wait Time for I/O (ms/req) This metric reports the average wait time for I/O requests.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Average IORM Wait time per I/O request in milliseconds for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Cell Name This is the short name of the Exadata Storage Server without domain suffix.
I/O Requests per Second (IO/Sec) This metric reports the number of IO requests issued by a pluggable database to hard disks per second.
I/O Requests per Second - Large (IO/Sec) This metric reports the number of large IO requests issued by a pluggable database to hard disks per second.
I/O Requests per Second - Small (IO/Sec) This metric reports the number of small IO requests issued by a pluggable database to hard disks per second.
I/O Utilization (%) This metric reports the percentage utilization for I/O requests.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: IO Utilization for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Large I/O Utilization (%) This metric reports the percentage of disk resources utilized by large requests from this Pluggable Database.
Small I/O Utilization (%) This metric reports the percentage of disk resources utilized by small requests from this Pluggable Database.
Wait Time for Large I/O (ms) This metric specifies the average number of milliseconds that large I/O requests issued by the database have waited to be scheduled by IORM in the past minute. A large value indicates that the I/O workload from this database is exceeding the allocation specified for it in the inter-database plan.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Wait Time of Large requests for %target%:%object_name%:%cell_name% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Wait Time for Small I/O (ms) This metric specifies the average number of milliseconds that small I/O requests issued by the database have waited to be scheduled by IORM in the past minute.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Wait Time of Small requests for %target%:%object_name%:%cell_name% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Exadata Key Performance Indicators

The following key performance indicator metrics are displayed for the Exadata Storage Server:

Metric Description Alert Message Clear Message

Exadata Key Performance Indicators

Key performance indicators for the Exadata Storage Server.

-

-

Total Flash Disk IOPS

Aggregated total read and write IOPS of all flash disks on the Exadata Storage Server.

Total flash disk IOPS for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total flash disk IOPS for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total Hard Disk IOPS

Aggregated total read and write IOPS of all hard disks on the Exadata Storage Server.

Total hard disk IOPS for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total hard disk IOPS for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total Flash Disk Throughput

Aggregated total read and write throughput of all flash disks on the Exadata Storage Server.

Total flash disk throughput for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total flash disk throughput for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total Hard Disk Throughput

Aggregated total read and write throughput of all hard disks on the Exadata Storage Server.

Total hard disk throughput for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total hard disk throughput for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Flash Disk IO Load

Average IO load across all flash disks on the Exadata Storage Server.

Average flash disk IO load for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average flash disk IO load for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Hard Disk IO Load

Average IO load across all hard disks on the Exadata Storage Server.

Average hard disk IO load for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average hard disk IO load for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Flash Disk Response Time

Average read and write latency across all flash disks on the Exadata Storage Server.

Average flash disk response time for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average flash disk response time for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Hard Disk Response Time

Average read and write latency across all hard disks on the Exadata Storage Server.

Average hard disk response time for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total hard disk response time for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Flash Disk IO Health Exceptions

Number of flash disk Key Performance Indicators exceeding their critical thresholds for the Exadata Storage Server.

%target% has %value% flash disk Key Performance Indicators exceeding their critical thresholds.

%target% has %value% flash disk Key Performance Indicators exceeding their critical thresholds.

Hard Disk IO Health Exceptions

Number of hard disk Key Performance Indicators exceeding their critical thresholds for the Exadata Storage Server.

%target% has %value% hard disk Key Performance Indicators exceeding their critical thresholds.

%target% has %value% hard disk Key Performance Indicators exceeding their critical thresholds.
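
The two Health Exceptions metrics count how many key performance indicators exceed their critical thresholds. A minimal Python sketch of that count follows. The strict greater-than comparison, the function name, and all sample values and thresholds are assumptions for illustration. Actual thresholds are whatever is configured for the target.

```python
def count_health_exceptions(kpi_values: dict, critical_thresholds: dict) -> int:
    """Count KPIs whose current value is above its critical threshold.

    KPIs without a configured critical threshold are ignored.
    """
    exceptions = 0
    for name, value in kpi_values.items():
        threshold = critical_thresholds.get(name)
        if threshold is not None and value > threshold:
            exceptions += 1
    return exceptions


# Hypothetical sample values and thresholds for the flash disk KPIs.
flash_kpis = {
    "Total Flash Disk IOPS": 120000,
    "Total Flash Disk Throughput": 900,
    "Average Flash Disk IO Load": 3,
    "Average Flash Disk Response Time": 2.5,
}
flash_critical = {
    "Total Flash Disk IOPS": 100000,
    "Average Flash Disk Response Time": 2.0,
}

print(count_health_exceptions(flash_kpis, flash_critical))  # prints 2
```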

Exadata Smart IO Metric

This metric category contains the Exadata smart IO metrics.

Target Version: 11gR2, 12c

Collection Frequency: Every 15 Minutes

Metric Description
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
Megabytes per second of pass through IOs This metric reports the number of megabytes per second of pass-through I/Os.
Megabytes per second read from flash cache This metric reports the number of megabytes per second read from flash cache by smart I/O.
Megabytes per second read from hard disk This metric reports the number of megabytes per second read from hard disk by smart I/O.
Megabytes per second saved by storage index This metric reports the number of megabytes per second saved by the storage index.

Exadata Storage Type

This metric provides information on the available storage types.

Target Version: All versions

Collection Frequency: Every 24 Hours (1440 Minutes)

Metric Description
Physical Disk Type This metric column lists the storage types available as physical disks on the Exadata Storage Server, for example, HardDisk and FlashDisk.
Number of Physical Disks This metric column provides the count of physical disks for each storage type on the Exadata Storage Server, for example, 12 for HardDisk and 16 for FlashDisk.
Number of Cell Disks This metric column provides the count of physical disks that are configured as cell disks for each storage type on the Exadata Storage Server, for example, 12 for HardDisk and 16 for FlashDisk.

Filesystem Utilization

This metric category contains the metrics relating to the filesystem utilization.

Target Version: All versions

Collection Frequency: Every 24 Hours

Metric Description
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
Filesystem Utilization % This metric provides the percentage of file system usage on the target.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: File system usage on %target%: %name%:%cell_name% is %value%, which has crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

HCA Port Configuration Monitor

This metric category contains the HCA port configuration monitor metrics.

HCA Port Errors

This metric category contains the HCA port error metrics.

Target Version: All versions

Collection Frequency: Every 15 Minutes

Metric Description
Excessive buffer overruns This metric reports the number of buffer overruns exceeding the threshold since the last metric collection.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% excessive buffer overruns, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Incoming VL15 packets dropped due to resource limitation This metric reports the number of incoming VL15 packets dropped due to lack of buffers since the last metric collection.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% incoming VL15 packets dropped, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Link integrity errors This metric displays the number of link integrity errors, that is, errors on the local link.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% link integrity errors, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Link recovers This metric reports the number of times the link error recovery process was completed successfully since the last metric collection.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% link recovers, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Packets not transmitted due to constraints This metric reports the number of packets not transmitted due to constraints since the last metric collection.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% packets not transmitted due to constraints, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Received packets discarded due to constraints This metric reports the number of packets discarded due to constraints since the last metric collection.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% received packets discarded due to constraints, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Received packets marked with the EBP delimiter This metric reports the number of packets marked with the EBP delimiter received on the port.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% received packets marked with the EBP delimiter, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Received packets with error This metric reports the number of packets received with errors since the last metric collection.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% received packets containing an error, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Symbol errors This metric reports the number of symbol errors detected since the last metric collection.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% symbol errors, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total errors This metric reports the sum total of all errors listed in this section.

Default Warning Threshold: Not Defined

Default Critical Threshold: Not Defined

Alert Text: Port %PortNumber% has %value% total errors, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.
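
Total errors is described as the sum of the other error counters in this category. A minimal Python sketch of that sum follows, assuming the per-port counters are already available as a mapping from metric name to count. The function name and sample counts are hypothetical.

```python
def total_port_errors(error_counts: dict) -> int:
    """Sum the per-category error counters collected for one HCA port."""
    return sum(error_counts.values())


# Hypothetical counters for one port since the last metric collection.
port_errors = {
    "Excessive buffer overruns": 0,
    "Link integrity errors": 2,
    "Link recovers": 1,
    "Symbol errors": 5,
}

print(total_port_errors(port_errors))  # prints 8
```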

HCA Port State

This metric category contains the HCA port state metrics.

Target Version: All versions

Collection Frequency: Every 15 Minutes

Metric Description
Active link width of port based on cable connectivity (e.g., 1X) This metric displays the active link width of the port based on the cable connectivity.
Is the link degraded? (active speed or width less than enabled) This metric reports whether or not the link is degraded. If the active speed or width of a link is less than the enabled speed or width, the link is considered degraded and this column value is set to 1.

Default Warning Threshold: Not Defined

Default Critical Threshold: 1

Alert Text: Port %PortNumber%(%ca_disp_name%) is running in degraded mode.

Link state (0 = Down, 1 = Active) This metric reports the link state. The link is down if the physical link state is 0.
Physical link state (0 = Disabled/Polling, 1 = LinkUp) This metric reports the physical link state. The physical link state is 0 if the port is in polling or disabled state.
The active link speed (Gbps) This metric reports the speed of the active link.
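
The degraded-link flag in the table above compares the active link speed and width with the enabled values. The following Python sketch shows that comparison under stated assumptions: widths are given as labels such as "1X" or "4X", speeds are in Gbps, and the function names are hypothetical rather than part of any Oracle tool.

```python
def parse_link_width(width: str) -> int:
    """Convert a width label such as "1X" or "4X" to its lane count."""
    return int(width.strip().upper().rstrip("X"))


def is_link_degraded(active_speed_gbps: float, enabled_speed_gbps: float,
                     active_width: str, enabled_width: str) -> int:
    """Return 1 if the active speed or width is below the enabled value, else 0."""
    slower = active_speed_gbps < enabled_speed_gbps
    narrower = parse_link_width(active_width) < parse_link_width(enabled_width)
    return 1 if (slower or narrower) else 0


# Hypothetical port: enabled for 40 Gbps at 4X, but the link came up at 1X.
print(is_link_degraded(40, 40, "1X", "4X"))  # prints 1
```

A result of 1 corresponds to the default critical threshold listed for this metric.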

HCA Port State (For Alerts)

The metrics in this category describe the host channel adapter (HCA) port state.

Target Version: All versions

Collection Frequency: Every 15 Minutes

Metric Description
Is port disabled? This metric indicates whether the HCA port is disabled.

Default Warning Threshold: Not Defined

Default Critical Threshold: 1

Alert Text: Port %PortNumber%(%ca_disp_name%) is disabled.

Is port in 'polling' state? This metric indicates whether the HCA port is checking or polling for a peer port.

Default Warning Threshold: Not Defined

Default Critical Threshold: 1

Alert Text: Port %PortNumber%(%ca_disp_name%) is polling for peer port. This could happen when the cable is unplugged from one of the ends or the other end port is disabled.

Host Interconnect Statistics

This metric category contains the Host Interconnect Statistics metrics.

Target Version: All versions

Collection Frequency: Every 15 Minutes

Metric Description
Cell Name This is the short name of the Exadata Storage Server without domain suffix.
Host MB Dropped Per Sec This metric reports the number of megabytes dropped during transmission to a particular host in the interval.
Host MB Received Per Sec This metric reports the number of megabytes received from a particular host in the interval.
Host MB Resent Per Sec This metric reports the number of megabytes retransmitted to a particular host in the interval.
Host MB sent Per Sec This metric reports the number of megabytes transmitted to a particular host in the interval.
Host RDMA MB Dropped Per Sec This metric reports the number of megabytes dropped during RDMA transmission to a particular host in the interval.
Host RDMA Retry Latency (msec) This metric reports the latency of the retry action during RDMA transmission to a particular host in the interval.

Response

This metric category contains the metric used to detect whether the Management Server on the cell is running. This metric is checked at 5-minute intervals. A value of 1 in the status column indicates that the cell is up; otherwise, the cell is down.

Target Version: All versions

Collection Frequency: Every 5 Minutes

Metric Description
Response Status This metric is checked at 5-minute intervals. A value of 1 in the status column indicates that the cell is up; otherwise, the cell is down.

Default Warning Threshold: Not Defined

Default Critical Threshold: 0

Alert Text: %target% is down. MS Status is %MSStatus% and Ping Status is %MgmtNetworkPingStatus%.

Top CPU Activity

This metric category contains the Top CPU metrics.

Target Version: All versions

Collection Frequency: Every 15 Minutes

Metric Description
Activity(%) This metric reports the percentage of total samples from a specific database.
Begin Sequence This metric reports the begin sequence number for collection.
Database Name This metric reports the database unique name ("other" represents unnamed database requests).
End Sequence This metric reports the end sequence number for collection.
Incarnation This metric reports the cellsrv incarnation number.
SQL ID This metric reports the SQL unique ID ("0000000000000" represents requests without a SQL ID).
Samples This metric reports the total samples collected for a specific database in this interval.
Total Samples This metric reports the total samples collected in this interval.
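
Activity(%) relates the Samples and Total Samples columns: it is the share of all samples in the interval that belong to one database. A minimal Python sketch of that calculation follows. The function name and the handling of an interval with no samples are assumptions for illustration.

```python
def activity_percent(samples: int, total_samples: int) -> float:
    """Return the percentage of total samples attributed to one database."""
    if total_samples <= 0:
        # Assumption: an interval with no samples reports 0% activity.
        return 0.0
    return 100.0 * samples / total_samples


# Hypothetical interval: 30 of 120 samples came from one database.
print(activity_percent(30, 120))  # prints 25.0
```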

Network Port

This metric category contains the metrics used to monitor the performance, traffic statistics and the error statistics of the network ports.

Network Ports InfiniBand Error Statistics

This metric details the error statistics of the InfiniBand ports. The data is collected every 15 minutes.

Metric Summary

Each of the following metric columns has metric data such as Port ID, Average Value, Low Value, High Value, Last Known Value, Current Severity, Alert Triggered, and Last Collection Timestamp.

Metric Column Description

Execution Buffer Overrun Errors

Number of buffer overruns since last collection

Link Downed

Number of failed link errors recovered and link down errors since the last collection

Link Integrity Errors

Number of link integrity errors since last collection

Link Recovers

Number of link error recovers since last collection

Number of packets with the EBP delimiter received on the port since the last collection

Total number of packets with the EBP delimiter received on the port

Received Constraint Errors

Number of received constraint errors since last collection

Received Errors

Number of error packets received on the port since the last collection

Received Switch Relay Errors

Number of received switch relay errors since last collection

Sent Constraint Errors

Number of transmitted constraint errors since last collection

Sent Discards

Number of outbound packets discarded because of down/congested port since last collection

Symbol Errors

Number of minor link errors since the last collection. Usually an 8b/10b error due to a bit error

Total Errors

Total number of errors

Virtual Lane 15 Packets Dropped

Number of incoming Virtual Lane 15 packets dropped due to resource limitations

Network Ports InfiniBand Performance

This metric provides the performance statistics of the InfiniBand ports. The metric is collected every 15 minutes.

Metric Summary

Metric Column Description Metric Data

Active link width of port based on cable connectivity (e.g., 1X)

The active width of the InfiniBand port

Port ID, Average Value, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Cable State

The state of the cable connected to the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Enabled link speed (Gbps)

The enabled speed for the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Enabled link width (e.g., 1X or 4X)

The enabled width of the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Gateway Port Link Mode

The mode of the gateway port, if applicable

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Is port disabled?

Indicates that the cable is present but the port is disabled

Port ID, Average Value, Low Value, High Value, Last Known Value, Current Severity, Alert Triggered, Last Collection Timestamp

Is port in 'polling' state?

Indicates that the cable is present but the port is polling

Port ID, Average Value, Low Value, High Value, Last Known Value, Current Severity, Alert Triggered, Last Collection Timestamp

Is the link degraded? (active speed or width less than enabled)

Indicates whether the link is degraded on the InfiniBand port

Port ID, Average Value, Low Value, High Value, Last Known Value, Current Severity, Alert Triggered, Last Collection Timestamp

Link state (0 = Down, 1 = Active)

The link state associated with the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Local Port LID

The LID (local identifier) associated with the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Physical link state (0 = Disabled/Polling, 1 = LinkUp)

The physical link state associated with the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Port State

The state of the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Remote Port LID

The LID (local identifier) associated with the remote InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Supported link speed (Gbps)

The supported speed for the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

Supported link width (e.g., 1X or 4X)

The supported width of the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp

The active link speed (Gbps)

The active speed of the InfiniBand port

Port ID, Last Collected Value, Current Severity, Alert Triggered, Last Collection Timestamp
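
The degraded-link condition named in the table above (active speed or width less than enabled) can be illustrated with a short sketch. This is not the plug-in's implementation: the `PortLink` class, its field names, and the parsing of width labels such as 4X are hypothetical and are introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class PortLink:
    """Hypothetical container for the link attributes of one InfiniBand port."""
    enabled_speed_gbps: float
    active_speed_gbps: float
    enabled_width: str  # width label, for example "4X"
    active_width: str   # width label, for example "1X"


def width_lanes(width: str) -> int:
    """Convert a width label such as '4X' into a lane count (assumed format)."""
    return int(width.strip().upper().rstrip("X"))


def is_link_degraded(link: PortLink) -> bool:
    """Return True when the active speed or width is less than the enabled one."""
    return (
        link.active_speed_gbps < link.enabled_speed_gbps
        or width_lanes(link.active_width) < width_lanes(link.enabled_width)
    )


# Illustrative values only; they are not taken from a real port.
healthy = PortLink(10.0, 10.0, "4X", "4X")
narrow = PortLink(10.0, 10.0, "4X", "1X")
print(is_link_degraded(healthy), is_link_degraded(narrow))
```

Comparing lane counts rather than the raw labels avoids string-ordering surprises if a width label has more than one digit.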

Network Ports InfiniBand Traffic Statistics

This metric provides the traffic statistics of the InfiniBand ports. The metric is collected every 15 minutes.

Metric Summary

Each metric column contains the metric data such as Port ID, Average Value, Last Known Value, Current Severity, Alert Triggered, and Last Collection Timestamp.

Metric Column Description

Received Bytes

Total number of incoming octets

Received Multicast Packets

Total number of incoming multicast packets

Received Packets

Total number of incoming packets

Received Unicast Packets

Total number of incoming unicast packets

Sent Bytes

Total number of outgoing octets

Sent Multicast Packets

Total number of outgoing multicast packets

Sent Packets

Total number of outgoing packets

Sent Unicast Packets

Total number of outgoing unicast packets

Network Ports Performance

This metric provides the performance statistics of the network ports. The metric is collected every 15 minutes.

Metric Summary

In the following metric columns, if the metric provides a status, the available metric data are Name, Average Value, Last Collected Value, Current Severity, Alert Triggered, and Last Collection Timestamp. If the metric provides numerical values, the metric data for Average Value, Low Value, High Value, and Last Known Value are also available.

Metric Column Description

Admin State

Administrative state. For example, UP, DOWN, TESTING.

Discarded Packets

Number of discarded packets

Duplex Mode

Actual mode of the port: Full or Half.

Inbound Errors

Number of incoming errors

Inbound Multicast Packets

Number of incoming non-unicast packets

Inbound Octets

Number of incoming octets

Inbound Octets Rate

Total incoming octets rate

Inbound Unicast Packets

Number of incoming unicast packets

Inbound Unknown Protocol

Number of incoming unknown protocol errors

MTU

Actual physical MTU

Operational Status

Operational status of the port. For example, UNKNOWN, UP, DOWN, TESTING, UNCONNECTED.

Outbound Discards

Number of outgoing discards

Outbound Errors

Number of outgoing errors

Outbound Multicast Packets

Number of outbound non-unicast packets

Outbound Octets

Number of outbound octets

Outbound Octets Rate

Total outgoing octets rate

Outbound Unicast Packets

Number of outbound unicast packets

Partition Keys

List of partition keys to which this port belongs

Port is down

Port status became down

Speed

Actual speed of the port

Speed Units

The unit of speed. For example, bytes per second, kilobytes per second, megabytes per second, gigabytes per second.

Total Octets Rate

Total octets rate for incoming and outgoing data

vLAN IDs

List of vLAN IDs to which this port belongs
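
The relationship between the octet counters and the rate columns above can be sketched as follows. This is a minimal sketch under stated assumptions: it assumes the rate columns are derived from two consecutive counter samples taken one collection interval (15 minutes) apart, and that Total Octets Rate is the sum of the inbound and outbound rates. The function names are hypothetical, and counter resets or wraps are not handled.

```python
COLLECTION_INTERVAL_SECONDS = 15 * 60  # the category is collected every 15 minutes


def octets_rate(previous: int, current: int,
                interval_seconds: float = COLLECTION_INTERVAL_SECONDS) -> float:
    """Octets per second between two counter samples (assumed derivation)."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    if current < previous:
        # A reset or wrap of the counter is outside the scope of this sketch.
        raise ValueError("counter decreased between samples")
    return (current - previous) / interval_seconds


def total_octets_rate(inbound_rate: float, outbound_rate: float) -> float:
    """Combined rate for incoming and outgoing data (assumed to be a plain sum)."""
    return inbound_rate + outbound_rate


# Illustrative counter values only.
inbound = octets_rate(1_000_000, 1_900_000)
outbound = octets_rate(500_000, 950_000)
print(inbound, outbound, total_octets_rate(inbound, outbound))
```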

Password Expiration

This metric category reports how much time remains before the current monitoring password for the Exadata Storage Server expires. It applies to Exadata Storage Server targets that use the ExaCLI or RESTAPI monitoring mechanism.

Target Version: Exadata Storage Server target 19.1.0.0.0

Collection Frequency: Every 1 Hour

Metric Description
Days Until Password Expiration This metric shows the number of days until the password expires. The default warning threshold is 14 days and the default critical threshold is 7 days.
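
The default thresholds for Days Until Password Expiration can be read as a simple classification. The sketch below is illustrative only: the function name and the severity labels are hypothetical, and it assumes that a value at or below a threshold triggers that severity, which the table does not state explicitly.

```python
DEFAULT_WARNING_DAYS = 14   # default warning threshold from the table above
DEFAULT_CRITICAL_DAYS = 7   # default critical threshold from the table above


def password_expiration_severity(days_until_expiration: int,
                                 warning_days: int = DEFAULT_WARNING_DAYS,
                                 critical_days: int = DEFAULT_CRITICAL_DAYS) -> str:
    """Classify the remaining days (assumes 'at or below' comparison)."""
    if days_until_expiration <= critical_days:
        return "CRITICAL"
    if days_until_expiration <= warning_days:
        return "WARNING"
    return "CLEAR"


for days in (30, 14, 7):
    print(days, password_expiration_severity(days))
```

The critical check comes first because any value that satisfies the critical threshold also satisfies the warning threshold.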

Oracle Database Exadata Storage Server System

The Oracle Database Exadata Storage Server System target type is a system target that contains all the Oracle Exadata targets that provide storage for a single database.

Agg_Exadata_System_Celldisk_Metric

This metric category provides the metrics collected for a group of Exadata targets that are the storage for one database.

Target Version: All versions

Collection Frequency: Every 15 Minutes

Metric Description
Average Flash Disk IO Load This metric indicates the average input/output load to the Flash disk.
Average Flash Disk Read IOPS This metric indicates the average number of read input/output operations per second from the Flash disk.
Average Flash Disk Read Throughput This metric indicates the average number of bytes read from the Flash disk.
Average Flash Disk Write IOPS This metric indicates the average number of input/output operations written to the Flash disk.
Average Flash Disk Write Throughput This metric indicates the average number of bytes written to the Flash disk.
Average Hard Disk IO Load This metric indicates the average I/O load to the hard disk.
Average Hard Disk Read IOPS This metric indicates the average number of read input/output operations from the hard disk.
Average Hard Disk Read Throughput This metric indicates the average number of bytes read from the hard disk.
Average Hard Disk Write IOPS This metric indicates the average number of input/output operations written to the hard disk.
Average Hard Disk Write Throughput This metric indicates the average number of bytes written to the hard disk.
Maximum Flash Disk IO Load This metric indicates the maximum I/O load to the Flash disk.
Maximum Flash Disk Read IOPS This metric indicates the maximum number of read input/output operations per second to the Flash disk.
Maximum Flash Disk Read Throughput This metric indicates the maximum number of bytes read from the Flash disk.
Maximum Flash Disk Write IOPS This metric indicates the maximum number of input/output operations written to the Flash disk.
Maximum Flash Disk Write Throughput This metric indicates the maximum number of bytes written to the Flash disk.
Maximum Hard Disk IO Load This metric indicates the maximum I/O load to the hard disk.
Maximum Hard Disk Read IOPS This metric indicates the maximum number of input/output operations read from the hard disk.
Maximum Hard Disk Read Throughput This metric indicates the maximum number of bytes read from the hard disk.
Maximum Hard Disk Write IOPS This metric indicates the maximum number of input/output operations written to the hard disk.
Maximum Hard Disk Write Throughput This metric indicates the maximum number of bytes written to the hard disk.
Minimum Flash Disk IO Load This metric indicates the minimum I/O load to the Flash disk.
Minimum Flash Disk Read IOPS This metric indicates the minimum number of read input/output operations from the Flash disk.
Minimum Flash Disk Read Throughput This metric indicates the minimum number of bytes read from the Flash disk.
Minimum Flash Disk Write IOPS This metric indicates the minimum number of input/output operations written to the Flash disk.
Minimum Flash Disk Write Throughput This metric indicates the minimum number of bytes written to the Flash disk.
Minimum Hard Disk IO Load This metric indicates the minimum I/O load to the hard disk.
Minimum Hard Disk Read IOPS This metric indicates the minimum number of read input/output operations per second to the hard disk.
Minimum Hard Disk Read Throughput This metric indicates the minimum number of bytes read from the hard disk.
Minimum Hard Disk Write IOPS This metric indicates the minimum number of input/output operations written to the hard disk.
Minimum Hard Disk Write Throughput This metric indicates the minimum number of bytes written to the hard disk.
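
The Average, Maximum, and Minimum variants listed above can be pictured as three summaries of the same per-target values. The sketch below is an assumption about how such a summary could be computed, not a description of the plug-in's internals; the function name, the dictionary layout, and the cell names are hypothetical.

```python
from statistics import mean
from typing import Dict


def aggregate_across_targets(values_by_target: Dict[str, float]) -> Dict[str, float]:
    """Summarize one metric (for example, hard disk read IOPS) across member targets."""
    if not values_by_target:
        raise ValueError("at least one member target is required")
    values = list(values_by_target.values())
    return {
        "average": mean(values),
        "maximum": max(values),
        "minimum": min(values),
    }


# Illustrative values only; the target names are made up.
hard_disk_read_iops = {"cell01": 100.0, "cell02": 300.0}
print(aggregate_across_targets(hard_disk_read_iops))
```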

Response

This metric category contains the metric used to detect the response of the Oracle Database Exadata Storage Server System.

Target Version: All versions

Collection Frequency: Event-driven

Metric Description
Status This metric's collection frequency is event-driven. A value of 1 in the Status column indicates that the target is up; any other value indicates that it is down.

Oracle Exadata Storage Server Grid

The Oracle Exadata Storage Server Grid target type is a system target that contains all the Oracle Exadata targets from the same Exadata Database Machine system.

Exadata Key Performance Indicators

The following key performance indicator metrics are displayed for the Exadata Storage Server Grid:

Metric Description Alert Message Clear Message

Exadata Key Performance Indicators

Key performance indicators for the Exadata Storage Server Grid.

-

-

Total Flash Disk IOPS

Aggregated total read and write IOPS of all flash disks on the Exadata Storage Server Grid.

Total flash disk IOPS for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total flash disk IOPS for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total Hard Disk IOPS

Aggregated total read and write IOPS of all hard disks on the Exadata Storage Server Grid.

Total hard disk IOPS for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total hard disk IOPS for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total Flash Disk Throughput

Aggregated total read and write throughput of all flash disks on the Exadata Storage Server Grid.

Total flash disk throughput for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total flash disk throughput for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total Hard Disk Throughput

Aggregated total read and write throughput of all hard disks on the Exadata Storage Server Grid.

Total hard disk throughput for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Total hard disk throughput for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Flash Disk IO Load

Average IO load across all flash disks on the Exadata Storage Server Grid.

Average flash disk IO load for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average flash disk IO load for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Hard Disk IO Load

Average IO load across all hard disks on the Exadata Storage Server Grid.

Average hard disk IO load for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average hard disk IO load for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Flash Disk Response Time

Average read and write latency across all flash disks on the Exadata Storage Server Grid.

Average flash disk response time for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average flash disk response time for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average Hard Disk Response Time

Average read and write latency across all hard disks on the Exadata Storage Server Grid.

Average hard disk response time for %target% is %value%, crossed warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Average hard disk response time for %target% is %value%, fallen below warning (%warning_threshold%) or critical (%critical_threshold%) threshold.

Flash Disk IO Health Exceptions

Number of members whose Exadata Key Performance Indicator Flash Disk IO Health Exceptions exceed their critical thresholds.

%target% has %value% member Exadata Storage Servers whose Key Performance Indicator Flash Disk IO Health Exceptions exceed their critical thresholds.

%target% has %value% member Exadata Storage Servers whose Key Performance Indicator Flash Disk IO Health Exceptions exceed their critical thresholds.

Hard Disk IO Health Exceptions

Number of members whose Exadata Key Performance Indicator Hard Disk IO Health Exceptions exceed their critical thresholds.

%target% has %value% member Exadata Storage Servers whose Key Performance Indicator Hard Disk IO Health Exceptions exceed their critical thresholds.

%target% has %value% member Exadata Storage Servers whose Key Performance Indicator Hard Disk IO Health Exceptions exceed their critical thresholds.
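
The alert and clear messages above use placeholders such as %target% and %value%. The following sketch shows how such a template could be filled in. It is illustrative only: the `render_message` function and the sample values are hypothetical, and Enterprise Manager performs its own substitution internally.

```python
import re

# Alert message text copied from the Total Flash Disk IOPS row above.
ALERT_TEMPLATE = (
    "Total flash disk IOPS for %target% is %value%, crossed warning "
    "(%warning_threshold%) or critical (%critical_threshold%) threshold."
)


def render_message(template: str, values: dict) -> str:
    """Replace each %name% placeholder with its value; leave unknown names as-is."""
    def substitute(match: "re.Match") -> str:
        key = match.group(1)
        return str(values[key]) if key in values else match.group(0)

    return re.sub(r"%(\w+)%", substitute, template)


# Illustrative values only; the target name and thresholds are made up.
print(render_message(ALERT_TEMPLATE, {
    "target": "grid01",
    "value": 52000,
    "warning_threshold": 40000,
    "critical_threshold": 50000,
}))
```

Leaving unknown placeholders untouched makes a missing value visible in the rendered text instead of silently dropping it.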

Response

This metric category contains the metric used to detect the response of the Oracle Exadata Storage Server Grid target.

Target Version: All versions

Collection Frequency: Event-driven

Metric Description
Status This metric's collection frequency is event-driven. A value of 1 in the Status column indicates that the target is up; any other value indicates that it is down.