Parameters | Description |
Data Source | Select an available Iceberg data source. |
Database | Select the database to read from, or enter its name manually. The database bound to the data source is used by default; any other database must be entered manually. If the data source network is not connected and the database information cannot be fetched directly, enter the database name manually. Data synchronization can still be performed as long as the Data Integration network is connected. |
Table | Select the table to read from, or enter its name manually. |
Split Key | Specifies the field used to shard the data. Once a split key is specified, concurrent tasks are launched for data synchronization. Any column of the source table can serve as the split key; the primary key or an indexed column is recommended. Note: Concurrent synchronization tasks cannot be started unless a split key is specified. |
Filter Condition (Optional) | In typical business scenarios, only the current day's data is synchronized, with the where condition set to gmt_create>$bizdate. The where condition is an effective way to implement incremental synchronization. If no where condition is provided (the where key or its value is missing), a full data synchronization is performed. |
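
As a rough illustration of how a split key and a filter condition can work together, the sketch below divides an integer key range into contiguous shards and attaches the where condition to each one. It is not the product's actual splitting algorithm, which this page does not describe. The helper names `split_ranges` and `build_shard_predicates` are hypothetical, and an integer split key is assumed.

```python
def split_ranges(min_key, max_key, shards):
    """Divide the inclusive range [min_key, max_key] into contiguous shards."""
    if shards < 1:
        raise ValueError("shards must be at least 1")
    if min_key > max_key:
        raise ValueError("min_key must not exceed max_key")
    total = max_key - min_key + 1
    shards = min(shards, total)          # never create empty shards
    base, extra = divmod(total, shards)  # the first `extra` shards get one more key
    ranges, start = [], min_key
    for i in range(shards):
        end = start + base + (1 if i < extra else 0) - 1
        ranges.append((start, end))
        start = end + 1
    return ranges


def build_shard_predicates(split_key, min_key, max_key, shards, where=None):
    """Return one predicate per shard, combined with the optional where condition."""
    predicates = []
    for low, high in split_ranges(min_key, max_key, shards):
        clause = f"{split_key} >= {low} AND {split_key} <= {high}"
        if where and where.strip():
            # Assumption: an incremental filter narrows every shard equally.
            clause = f"({where.strip()}) AND {clause}"
        predicates.append(clause)
    return predicates
```

Calling `build_shard_predicates("id", 1, 10, 2, "gmt_create>$bizdate")` yields one predicate per concurrent task. Omitting the where argument leaves only the key ranges, which corresponds to a full synchronization.
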
Parameters | Description |
Data Destination | The Iceberg data source to write to. |
Database | Select the database to write to, or enter its name manually. The database bound to the data source is used by default; any other database must be entered manually. If the data source network is not connected and the database information cannot be fetched directly, enter the database name manually. Data synchronization can still be performed as long as the Data Integration network is connected. |
Table | Select the table to write to, or enter its name manually. If the data source network is not connected and the table information cannot be fetched directly, enter the table name manually. Data synchronization can still be performed as long as the Data Integration network is connected. |
Write Mode | Iceberg writes support three modes: overwrite (replaces the existing data), append (adds to the existing data), and upsert (updates or inserts rows based on the primary key field). |
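
To make the difference between the three write modes concrete, here is a minimal in-memory sketch of their expected effect on a table. It models rows as dictionaries and is only an illustration of the semantics; it does not use the Iceberg API, and `apply_write` is a hypothetical helper name.

```python
def apply_write(existing, incoming, mode, primary_key=None):
    """Return the table contents after writing `incoming` rows in the given mode."""
    if mode == "overwrite":
        # Existing rows are discarded; only the incoming rows remain.
        return [dict(row) for row in incoming]
    if mode == "append":
        # Incoming rows are added after the existing ones, duplicates included.
        return [dict(row) for row in existing] + [dict(row) for row in incoming]
    if mode == "upsert":
        if primary_key is None:
            raise ValueError("upsert requires a primary key field")
        # Rows with a matching key are replaced; new keys are inserted.
        merged = {row[primary_key]: dict(row) for row in existing}
        for row in incoming:
            merged[row[primary_key]] = dict(row)
        return list(merged.values())
    raise ValueError(f"unknown write mode: {mode}")
```

With existing rows for ids 1 and 2 and incoming rows for ids 2 and 3, overwrite leaves two rows, append leaves four, and upsert on `id` leaves three, with the row for id 2 updated.
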
Iceberg Data Type | Internal Types |
int, long | Long |
float, double, decimal | Double |
string, fixed, binary, struct, list, map | String |
date, time, timestamp, timestamptz | Date |
boolean | Boolean |
Internal Types | Iceberg Data Type |
Long | int, long (bigint) |
Double | float, double, decimal |
String | string, struct, list, map |
Date | date, time, timestamp, timestamptz |
Bytes | binary |
Boolean | boolean |
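
The two type-mapping tables above can be expressed as lookup structures, which makes it easy to check a column type before configuring a task. The sketch below transcribes the tables as given. The function names are hypothetical, and the handling of parameterized spellings such as `decimal(10,2)` or `list<string>` is an assumption, since the tables list only base type names.

```python
import re

# Read direction: Iceberg type -> internal type (transcribed from the read table).
READ_MAPPING = {
    "int": "Long", "long": "Long",
    "float": "Double", "double": "Double", "decimal": "Double",
    "string": "String", "fixed": "String", "binary": "String",
    "struct": "String", "list": "String", "map": "String",
    "date": "Date", "time": "Date", "timestamp": "Date", "timestamptz": "Date",
    "boolean": "Boolean",
}

# Write direction: internal type -> Iceberg types (transcribed from the write table).
WRITE_MAPPING = {
    "Long": ("int", "long"),
    "Double": ("float", "double", "decimal"),
    "String": ("string", "struct", "list", "map"),
    "Date": ("date", "time", "timestamp", "timestamptz"),
    "Bytes": ("binary",),
    "Boolean": ("boolean",),
}


def base_type(iceberg_type):
    """Reduce a type such as decimal(10,2) or list<string> to its base name."""
    match = re.match(r"[a-z]+", iceberg_type.strip().lower())
    if match is None:
        raise ValueError(f"not a type name: {iceberg_type!r}")
    return match.group(0)


def internal_type_for_read(iceberg_type):
    """Look up the internal type used when reading a column of this Iceberg type."""
    name = base_type(iceberg_type)
    if name not in READ_MAPPING:
        raise ValueError(f"unsupported Iceberg type for reading: {iceberg_type!r}")
    return READ_MAPPING[name]


def can_write(internal_type, iceberg_type):
    """Check whether the write table lists this Iceberg type for the internal type."""
    return base_type(iceberg_type) in WRITE_MAPPING.get(internal_type, ())
```

Taking the tables at face value, the two directions are not symmetric: `binary` is read as String but is listed as a write target only for Bytes, and `fixed` appears only in the read table.
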