8.1 Source
The Extract process is configured to run against the source technology and captures the data generated in the true source technology, which may be located elsewhere. This process is the extraction, or data capture, mechanism of GG for DAA.
- Initial Load Extract: When you set up GG for DAA for initial loads, the Extract process captures the current, static set of data directly from the source objects. In this configuration, Extract reads the data directly from the source.
- Change Data Capture Extract: When you set up GG for DAA to keep the source data synchronized with another set of data, the Extract process captures the DML and (if supported) DDL operations performed on the configured objects after the initial synchronization has taken place. It stores these operations until it receives commit records or rollbacks for the transactions that contain them. If it receives a rollback, it discards the operations for that transaction. If it receives a commit, it persists the transaction to disk in a series of files called a trail, where it is queued for propagation to the target system. All the operations in each transaction are written to the trail and are in the order in which they were committed to the source technology. This design ensures both speed and data integrity. The format of the data written to trail files depends on the source technology.
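The commit and rollback handling described for Change Data Capture can be illustrated with a minimal Python sketch. It is a toy model only, not GG for DAA's implementation: the `TransactionBuffer` class, its method names, and the in-memory `trail` list are hypothetical names introduced for illustration, and a real trail is a series of files on disk.

```python
from collections import defaultdict


class TransactionBuffer:
    """Toy model of commit-ordered capture (hypothetical, not the product's code)."""

    def __init__(self):
        # Operations seen so far for each open (uncommitted) transaction.
        self._pending = defaultdict(list)
        # Stand-in for the trail: committed operations, in commit order.
        self.trail = []

    def record(self, txn_id, operation):
        """Hold an operation until its transaction commits or rolls back."""
        self._pending[txn_id].append(operation)

    def rollback(self, txn_id):
        """Discard every buffered operation of a rolled-back transaction."""
        self._pending.pop(txn_id, None)

    def commit(self, txn_id):
        """Append the whole transaction to the trail, keeping operation order."""
        for operation in self._pending.pop(txn_id, []):
            self.trail.append((txn_id, operation))


buffer = TransactionBuffer()
buffer.record("t1", "INSERT a")
buffer.record("t2", "INSERT b")
buffer.record("t1", "UPDATE a")
buffer.rollback("t2")   # t2 never reaches the trail
buffer.record("t3", "DELETE c")
buffer.commit("t3")     # t3 commits first, so it is written first
buffer.commit("t1")
print(buffer.trail)
```

In this sketch, transaction `t1` starts before `t3` but commits after it, so its operations follow those of `t3` in the trail, while the rolled-back `t2` leaves no trace.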
Source technologies covered in this section:
- Amazon MSK
- Apache Cassandra
  The Oracle GoldenGate for Distributed Applications and Analytics (GG for DAA) capture (Extract) for Cassandra is used to get changes from Apache Cassandra databases.
- Apache Kafka
  The Oracle GoldenGate capture (Extract) for Kafka is used to read messages from one or more Kafka topics and convert the data into logical change records written to GoldenGate trail files. This section explains how to use Oracle GoldenGate capture for Kafka.
- Azure Event Hubs
- Confluent Kafka
- DataStax
- Java Message Service (JMS)
- MongoDB
  The Oracle GoldenGate capture (Extract) for MongoDB is used to get changes from MongoDB databases.
- OCI Streaming