Oracle Data Source

DataInLong provides Oracle read and write capabilities. This article introduces the pre-environment configuration and current capability support for real-time data synchronization using Oracle.
Supported Versions
Currently, DataInLong supports Oracle single table and database-level real-time reading and single table writing. To use real-time synchronization capabilities, the following version limitations must be observed:
Data Source Type
Edition
Oracle
11,12,19
Use Limits
Oracle read-only replicas only support logical standby, physical standby is not supported.
Tables without primary keys may have data duplication since exactly once cannot be guaranteed. Therefore, it is best to ensure the table has a primary key for real-time sync tasks.
To monitor table field changes on the Oracle side, do not configure the data source with the system/sys accounts, otherwise, all tables (including newly added tables) must enable log tracking for synchronization.
Preparing the Database Environment
Authorize database permissions 
The main tasks are to create basic synchronization users and grant permissions to pull existing data. Grant read permissions for views that need to be used and for tables to be imported.
The main purpose of the ANALYZE ANY privilege is to read the table structure and unique index. When a table does not have a primary key, a unique index can be used instead of a primary key.
sqlplus sys/password@host:port/SID AS SYSDBA;
CREATE USER flinkuser IDENTIFIED BY flinkpw DEFAULT TABLESPACE LOGMINER_TBS QUOTA UNLIMITED ON LOGMINER_TBS;
-- Allow users to connect to the database
  GRANT CREATE SESSION TO flinkuser; 
-- To support scenarios using cdb/pdb, grant users access to pdb/cdb
  GRANT SET CONTAINER TO flinkuser;
-- Grant users SELECT privileges on all tables in the database to read data
  GRANT SELECT ANY TABLE TO flinkuser;
--SELECT_CATALOG_ROLE is a predefined database role that provides permissions to view database system tables. Additionally, during the incremental phase, the logminer session relies on this privilege to query the table structure.
  GRANT SELECT_CATALOG_ROLE TO flinkuser;
﻿
--GRANT ANALYZE ANY is a grant statement used to authorize users to analyze all tables, indexes, partitions, and table functions owned by any user. When a table does not have a primary key, this privilege is used to obtain the unique index and replace the primary key with the unique index.
  GRANT ANALYZE ANY TO flinkuser;
Enable log archiving
Enable the generation of logical logs, otherwise Oracle cannot correctly generate part of the logs, making incremental data synchronization impossible.
1. Connect using the DBA account.
sqlplus sys/password@host:port/SID AS SYSDBA
2. Enable log archivingdisabling it will prevent reading incremental data.
alter system set db_recovery_file_dest_size = 10G;
alter system set db_recovery_file_dest = '/opt/oracle/oradata/recovery_area' scope=spfile;
shutdown immediate;
startup mount;
alter database archivelog;
alter database open;
﻿
Notes: Enabling log archiving requires a database restart and will occupy disk space
3. Check whether it is enabled.
-- Should now "Database log mode: Archive Mode"
archive log list;
4. Enable supplemental log
If not enabled, the non-updated fields in an Update will all be empty.
-- Enable supplemental logging for a specific table:
ALTER TABLE inventory.customers ADD SUPPLEMENTAL LOG DATA (ALL) COLUMNS;
Alternatively, you can enable it globally:
-- Enable supplemental logging for database
ALTER DATABASE ADD SUPPLEMENTAL LOG DATA;
5. Create Tablespace
      It is recommended to create a separate table space for real-time synchronization users, although an existing table space can also be reused.
 CREATE TABLESPACE logminer_tbs DATAFILE '/opt/oracle/oradata/SID/logminer_tbs.dbf' SIZE 25M REUSE AUTOEXTEND ON MAXSIZE 2048M;
﻿
Grant the permissions to read logs via logminer
 --A special grant statement that grants the user the SELECT privilege on the V$DATABASE view, mainly used to check whether the database's archived log is enabled (LOG_MODE) and to get the current SCN Point of the database.
  GRANT SELECT ON V_$DATABASE to flinkuser;
﻿
--EXECUTE_CATALOG_ROLE is a predefined role that mainly provides the capability for logminer to write the dictionary to the binlog, thus recording changes in the table structure into the logical log.
  GRANT EXECUTE_CATALOG_ROLE TO flinkuser;
﻿
--Allow the user to read data through the LOGMINING component
  GRANT LOGMINING TO flinkuser;
﻿
--The EXECUTE ON DBMS_LOGMNR privilege is used to grant the user the ability to execute programs within the DBMS_LOGMNR package. The DBMS_LOGMNR package is used to analyze and extract changed data from database log files.
  GRANT EXECUTE ON DBMS_LOGMNR TO flinkuser;
﻿
--GRANT EXECUTE ON DBMS_LOGMNR_D is a grant statement that gives the user execution privileges on the DBMS_LOGMNR_D package. DBMS_LOGMNR_D is an Oracle-provided package used to analyze online and archived redo log files at the database level
  GRANT EXECUTE ON DBMS_LOGMNR_D TO flinkuser;
﻿
--Logminer's data query interface is composed of a set of system-provided views. To read data from logminer, you need permissions to access the following views.
  GRANT SELECT ON V_$LOG TO flinkuser;
  GRANT SELECT ON V_$LOG_HISTORY TO flinkuser;
  GRANT SELECT ON V_$LOGMNR_LOGS TO flinkuser;
  GRANT SELECT ON V_$LOGMNR_CONTENTS TO flinkuser;
  GRANT SELECT ON V_$LOGMNR_PARAMETERS TO flinkuser;
  GRANT SELECT ON V_$LOGFILE TO flinkuser;
  GRANT SELECT ON V_$ARCHIVED_LOG TO flinkuser;
  GRANT SELECT ON V_$ARCHIVE_DEST_STATUS TO flinkuser;
﻿
  exit;
Database source configuration
Data source settings
﻿
﻿
﻿
Parameter
Description
Data Source
Select the Oracle data source to be synchronized
Source Table
According to business needs, choose "All Database Tables", "Specified Table", or "Specified Database"
﻿
All Database Tables: Monitor all databases under the data source. New databases and tables added during the task run will be synchronized to the target by default
Specified Table: Only synchronize the specified table
Specified Database: Monitor the specified database and schema. Synchronize all or matching tables under the schema
﻿
﻿
﻿
﻿
Read Mode
Full + Increment: Data synchronization is divided into full and increment phases. After the full phase is completed, the task enters the increment phase. The full phase will synchronize historical data in the database, and the incremental phase starts synchronizing from the binlog cdc location after the task starts.
Increment: Only synchronize data from the binlog cdc position after the task starts.
Consistency Semantics
Only represents the consistency semantics of the reading end. Supports At-least-once and Exactly-once.
At-least-once: Data may be read more than once, relying on the destination end's deduplication to ensure data consistency. Suitable for scenarios with large data volumes in the full phase and using non-numeric primary keys, with high synchronization performance requirements. 
Exactly-once: Data is read strictly once, with some performance losses. Tables without a primary key and unique index column are not supported. Suitable for general scenarios where the source table has a numeric primary key or unique index column.
The current version's two modes are state-incompatible. If the mode is changed after task submission, stateful restart is not supported.
Advanced Settings (optional)
You can configure parameters according to business needs
Note:
To monitor table field changes on the Oracle side, do not configure the data source with the system/sys accounts, otherwise, all tables (including newly added tables) must enable log tracking for synchronization.
Activate Command:
 ALTER TABLE SCHEMA_NAME.TBALE_NAME ADD SUPPLEMENTAL LOG DATA (ALL) COLUMNS;
Single Table Read Node Configuration
1. In the DataInLong page, click Real-time synchronization on the left directory bar.
2. In the real-time synchronization page, select Single Table Synchronization at the top to create a new one (you can choose either form or canvas mode) and enter the configuration page.
﻿
﻿
﻿
Parameter
Description
Data Source
Select the data source where the table to be synchronized is located.
Database
Supports selection or manual input of the library name to read from.
By default, the database bound to the data source is used as the default database. Other databases need to be manually entered.
If the data source network is not connected and the database information cannot be fetched directly, you can manually enter the database name. Data synchronization can still be performed when the Data Integration network is connected.
Schema
Support selection or manual input of available schema under this data source.
Table
Supports selecting or manually entering the table name to be read.
Read Mode
Full + Increment: Data synchronization is divided into full and increment phases. After the full phase is completed, the task enters the increment phase. The full phase will synchronize historical data in the database, and the incremental phase starts synchronizing from the binlog cdc location after the task starts.
Increment Only: Only synchronize data from the binlog cdc position after the task starts.
Consistency Semantics
Only represents the consistency semantics of the reading end. Supports At-least-once and Exactly-once.
At-least-once: Data may be read more than once, relying on the destination end's deduplication to ensure data consistency. Suitable for scenarios with large data volumes in the full phase and using non-numeric primary keys, with high synchronization performance requirements. 
Exactly-once: Data is read strictly once, with some performance losses. Tables without a primary key and unique index column are not supported. Suitable for general scenarios where the source table has a numeric primary key or unique index column.
The current version's two modes are state-incompatible. If the mode is changed after task submission, stateful restart is not supported.
Advanced Settings
(Optional)
You can configure parameters according to business needs.
Single Table Writing Node Configuration
1. In the DataInLong page, click Real-time synchronization on the left directory bar.
2. In the real-time synchronization page, select Single Table Synchronization at the top to create a new one (you can choose either form or canvas mode) and enter the configuration page.
﻿
﻿
﻿
Parameter
Description
Data Source
The Oracle data source to be written to.
Database
Supports selection or manual input of the database name to write to
By default, the database bound to the data source is used as the default database. Other databases need to be manually entered.
If the data source network is not connected and the database information cannot be fetched directly, you can manually enter the database name. Data synchronization can still be performed when the Data Integration network is connected.
Schema
Supports selecting or manually entering the Oracle data mode to be written to.
Table
Support selection or manual entry of the table name to be written to.
If the data source network is not connected and the table information cannot be fetched directly, you can manually enter the table name. Data synchronization can still be performed when the Data Integration network is connected.
Primary key
Select a field as the primary key for the target table.
Advanced Settings (Optional)
You can configure parameters according to business needs.
Log collection write node
Parameter
Description
Data Source
Select the available Oracle data source in the current project.
Database/Table
Select the corresponding database table from this data source.
Primary key
Select a field as the primary key for the data table.
Advanced Settings (optional)
You can configure parameters according to business needs.
Data type conversion support
Read 
The supported data types for reading from Oracle and their conversion relationships are as follows (when processing Oracle, the data types from the Oracle data source will first be mapped to the data types of the data processing engine):
Oracle Type
Internal Types
NUMBER(p, s <= 0), p - s < 3
TINYINT
NUMBER(p, s <= 0), p - s < 5
SMALLINT
NUMBER(p, s <= 0), p - s < 10
INT
NUMBER(p, s <= 0), p - s < 19
BIGINT
NUMBER(p, s <= 0), 19 <= p - s <= 38
DECIMAL(p - s, 0)
NUMBER(p, s > 0)
DECIMAL(p, s)
NUMBER(p, s <= 0), p - s > 38
STRING
FLOAT,BINARY_FLOAT
FLOAT
DOUBLE PRECISION,BINARY_DOUBLE
DOUBLE
NUMBER(1)
BOOLEAN
DATE,TIMESTAMP [(p)]
TIMESTAMP [(p)] [WITHOUT TIMEZONE]
TIMESTAMP [(p)] WITH TIME ZONE
TIMESTAMP [(p)] WITH TIME ZONE
TIMESTAMP [(p)] WITH LOCAL TIME ZONE
TIMESTAMP_LTZ [(p)]
CHAR(n), NCHAR(n), NVARCHAR2(n), VARCHAR(n), VARCHAR2(n), CLOB, NCLOB, XML types
STRING
BLOB,ROWID
BYTES
INTERVAL DAY TO SECOND,INTERVAL YEAR TO MONTH
BIGINT
Write
The supported data types for writing to Oracle and their conversion relationships are as follows:
Internal Types
Oracle Type
FLOAT
BINARY_FLOAT
DOUBLE
BINARY_DOUBLE
DECIMAL(p, s)
SMALLINT,FLOAT(s),DOUBLE PRECISION,REAL,NUMBER(p, s)
DATE
DATE
DECIMAL(20, 0)
-
FLOAT
REAL,FLOAT4
DOUBLE
FLOAT8,DOUBLE PRECISION
DECIMAL(p, s)
NUMERIC(p, s),DECIMAL(p, s)
BOOLEAN
BOOLEAN
DATE
DATE
TIMESTAMP [(p)][WITHOUT TIMEZONE]
TIMESTAMP [(p)]WITHOUT TIMEZONE
STRING
CHAR(n),VARCHAR(n),CLOB(n)
BYTES
RAW(s),BLOB
ARRAY
-

Was this page helpful?

You can also Contact Sales or Submit a Ticket for help.

Yes

Parameter	Description
Data Source	Select the Oracle data source to be synchronized
Source Table	According to business needs, choose "All Database Tables", "Specified Table", or "Specified Database"
		All Database Tables: Monitor all databases under the data source. New databases and tables added during the task run will be synchronized to the target by default Specified Table: Only synchronize the specified table Specified Database: Monitor the specified database and schema. Synchronize all or matching tables under the schema


Read Mode	Full + Increment: Data synchronization is divided into full and increment phases. After the full phase is completed, the task enters the increment phase. The full phase will synchronize historical data in the database, and the incremental phase starts synchronizing from the binlog cdc location after the task starts. Increment: Only synchronize data from the binlog cdc position after the task starts.
Consistency Semantics	Only represents the consistency semantics of the reading end. Supports At-least-once and Exactly-once. At-least-once: Data may be read more than once, relying on the destination end's deduplication to ensure data consistency. Suitable for scenarios with large data volumes in the full phase and using non-numeric primary keys, with high synchronization performance requirements. Exactly-once: Data is read strictly once, with some performance losses. Tables without a primary key and unique index column are not supported. Suitable for general scenarios where the source table has a numeric primary key or unique index column. The current version's two modes are state-incompatible. If the mode is changed after task submission, stateful restart is not supported.
Advanced Settings (optional)	You can configure parameters according to business needs

Oracle Type	Internal Types
NUMBER(p, s <= 0), p - s < 3	TINYINT
NUMBER(p, s <= 0), p - s < 5	SMALLINT
NUMBER(p, s <= 0), p - s < 10	INT
NUMBER(p, s <= 0), p - s < 19	BIGINT
NUMBER(p, s <= 0), 19 <= p - s <= 38	DECIMAL(p - s, 0)
NUMBER(p, s > 0)	DECIMAL(p, s)
NUMBER(p, s <= 0), p - s > 38	STRING
FLOAT,BINARY_FLOAT	FLOAT
DOUBLE PRECISION,BINARY_DOUBLE	DOUBLE
NUMBER(1)	BOOLEAN
DATE,TIMESTAMP [(p)]	TIMESTAMP [(p)] [WITHOUT TIMEZONE]
TIMESTAMP [(p)] WITH TIME ZONE	TIMESTAMP [(p)] WITH TIME ZONE
TIMESTAMP [(p)] WITH LOCAL TIME ZONE	TIMESTAMP_LTZ [(p)]
CHAR(n), NCHAR(n), NVARCHAR2(n), VARCHAR(n), VARCHAR2(n), CLOB, NCLOB, XML types	STRING
BLOB,ROWID	BYTES
INTERVAL DAY TO SECOND,INTERVAL YEAR TO MONTH	BIGINT

tencent cloud

New User Offers

Next-Generation CDN：EdgeOne

Elasticsearch Service free trial

Free Tier

Tencent Cloud Startup Program

Special Offers

Lighthouse Special Offers

Cloud Object Storage Special Offers

Featured Products

New Products

Education

Tencent Cloud Online Education Solutions

Gaming

Gaming Solution

Game Media Solutions

E-commerce

E-commerce retail solutions

Audio & Video

Audio/Video Solution

LVB Recording Solution

Interactive Classroom Solution

Interactive Live Streaming Solution

Audio Chat Social Networking Solution

Financial Services

Financial Services Solution

Compute

Cloud Virtual Machine

Auto Scaling

Batch Compute

CVM Dedicated Host

Database

TencentDB for MySQL

TencentDB for Redis®

TencentDB for CTSDB

TDSQL for MySQL

Data Transfer Service

TencentDB for MongoDB

TencentDB for PostgreSQL

TencentDB for SQL Server

Video Service

Cloud Streaming Services

Video on Demand

Media Processing Service

Cloud Application Rendering

Cloud Contact Center

Game Multimedia Engine

Chat

Real-time Communication

Tencent Effect SDK

AI and Machine Learning

Image Creation Large Model

Face Fusion

eKYC

Optical Character Recognition

Video Creation Large Model

Industry Applications

Tencent HealthCare Omics Platform

Container and Middleware

TDMQ for CKafka

Serverless Cloud Function

Tencent Kubernetes Engine

Tencent Kubernetes Engine for Serverless

Networking

Cloud Load Balancer

Virtual Private Cloud

Direct Connect

Cloud Connect Network

NAT Gateway

VPN Connection

Bandwidth Package

Anycast Internet Acceleration

Elastic Network Interface

Flow Logs

Global Application Acceleration Platform

Security

Captcha

Cloud Workload Protection Platform

Data Security Governance Center

Key Management Service