Overview
DataHub supports accessing different types of data generated by various data sources for unified management and distribution to downstream offline/online processing systems, forming a clear data flow channel.
This document takes MongoDB as an example to describe how to create an async data pull task and modify the task configuration in the CKafka console.
Directions
Creating data access task
2. Click Data Access on the left sidebar, select the region, and click Create Task.
3. In the pop-up window, select Asynchronously pulled data > MongoDB for Data Source Type.
4. Click Next and enter the task details.
Task Name: It can only contain letters, digits, underscores, or symbols ("-" and ".").
Target CKafka Instance: Select a CKafka instance.
Target Topic: Select the target CKafka topic for data access.
Source Database Type:
TencentDB for MongoDB: Select a database instance.
Self-built MongoDB: Select your CLB instance and specify the port.
Username: Source MongoDB database username.
Password: Source MongoDB database password.
Database: Source MongoDB database name. You cannot select the default MongoDB database for data import.
Collection: Source MongoDB collection. You can keep the default setting, i.e., "", to listen on all collections, or specify a collection.
Copy Existing Data: Specify whether to replicate the existing data in the source MongoDB database.
5. Click Submit.
Changing data source and data target
2. Click Data Access on the left sidebar and click the ID of the target task to enter its basic information page.
3. Click Change Data Source in the top-right corner of the Data Source module to modify the data source information.
4. Click Change Data Target in the top-right corner of the Data Target module to modify the data target information.
Viewing monitoring data
2. Click Data Access on the left sidebar and click the ID of the target task to enter its basic information page.
3. Select the Monitoring tab to view the monitoring data of the target topic.
Pausing task
On the Data Access page, click Pause in the Operation column of the target task to pause the task. Note:
If you find that the data access task affects the normal use of CKafka, you can pause it.
Resuming task
On the Data Access page, click Resume in the Operation column of the target task to resume the paused task. Note:
A paused task can be resumed to continue dumping data.
Deleting task
On the Data Access page, click Delete in the Operation column of the target task and click OK in the pop-up window to delete the task. Note:
Once the task is deleted, data access will be stopped and the task record will be deleted, but the previously dumped data and CKafka instance involved will not be affected.
A task cannot be recovered once deleted. Proceed with caution.
Was this page helpful?