Note:
Due to varying degrees of support for different data source types, not all types include the following features. Please refer to the actual results displayed on the page.
On the corresponding pages of various features in Data Management, click the table name you need to view to enter the Table Details page. Depending on the data source, the Table Details page includes business information, technical information, asset rating, basic information, data preview, outputs and changes, data lineage, data temperature, partition information, data quality, and access log.
Business Information
Displays the asset catalog, tag, asset status, importance level, release time, project to which the table belongs, and table owner of the current data table.
Business information can be modified and adjusted by opening a pop-up window in the top-right corner of the interface.
Technical Information
Displays the data source type, data source, database, table type, storage size, storage path, lifecycle, recent data, and DDL changes of the current data table.
Asset Score
The comprehensive average score of each indicator, with a maximum score of 100, is updated daily.
Completeness: The current completeness of technical and business information.
Assurance: The current quality monitoring and access control status.
Timeliness: The data production on time in the past 30 days.
Stability: The table structure changes in the past 30 days.
Standardization: Not Online.
Basic Information
Provides the 'View DDL' and 'View Select Statements' features, displaying Field Name, English Field Name, Chinese Field Name, Field Type, and Description.
Partition Information
Partition information contains partition field information and detailed content of the partitions.
Data preview
Preview the contents of this data table, supporting up to the display of the first 5 records. Data is updated T+1.
Output and Changes
Output: Filter by output task/instance time, displaying task ID, execution count, scheduled time, start time, output time, execution duration, and production time consumption.
Changes: Displays the change records of the table in the past 30 days, including change time, change type, change log, operator, and number of affected tables. Supports downloading change information to a local Excel sheet.
Data Lineage
WeData lineage relationship displays full-link data flow within projects under the root account, including data sources, destinations, and associated tasks. The lineage relationship feature provides table/field-level lineage, involving both formal data tables used in tasks and temporary table inter-lineage. The current version supports lineage parsing for synchronization tasks and Hive SQL tasks, mainly covering MySQL and Hive tables.
Basic Operations
Data Lineage feature mainly displays upstream and downstream data flows of central tables/fields, defaulting to direct first-level upstream and downstream table lineage of central tables. Users can trace lineage relationships on the canvas, switch display object granularity, and more. Main features and operations are as follows:
|
Table View/Field View | Supports switching between table/field dimensions to display lineage relationships Table Lineage: Displays inter-table upstream and downstream relationships with tables as the granularity. Each node on the canvas represents a table. By default, it shows the direct first-level upstream and downstream formal table lineage of central tables. Field Lineage: Displays inter-table upstream and downstream associated fields with fields as the granularity. Each node on the canvas represents a field. The default field lineage centers on the table's first field. |
Show Temporary Tables/Compact Pattern | Temporary Table Switch: By default, the temporary table lineage is hidden and only the lineage between formal tables involved in the task is expanded. You can click the icon to toggle settings, which will initialize the canvas to display the first-level upstream and downstream formal tables and temporary tables of the central table. Note: A temporary table refers to a table temporarily generated during the task's SQL calculation process. Compact mode only displays the table name |
Import/Export | Supports importing and exporting data lineage using Excel tables. |
Map/Hierarchy Mode | Table Lineage supports Full Table Traceability/Single-link Traceability modes. By default, it follows map mode. |
Search | Allows searching for existing tables/fields on the canvas. After searching, the object will be displayed centrally on the canvas. |
Canvas Tool: Zoom In/Zoom Out/Restore/Fullscreen | Adjust the lineage canvas and node size. |
Lineage Canvas |
Table/Field: A node on the canvas represents a table/field. By default, the table that enters the table details page is the central table of the canvas. The left and right sides of the central table represent its associated upstream and downstream tables/fields. Name: Table/Field name. For non-central tables/fields, you can click the link above the node to quickly enter the table details page. Number of Upstream Objects: The number of first-level upstream tables/fields. If it is 0, there are no upstream objects. Number of Downstream Objects: The number of first-level downstream tables/fields. If it is 0, there are no downstream objects. Data Flow Direction: The arrow direction represents the data flow direction. The left side is the source data, and the right side is the destination data. Associated Task: Click the arrow to view the synchronization/SQL task information associated with generating this data lineage. Expand/Collapse: Click the upstream/downstream number on the node in the canvas to expand/collapse the upstream/downstream objects. If the table/field is located downstream of the central node, clicking will only expand its downstream objects; vice versa, it will only expand upstream objects. Quick Expand: Table Lineage > Map Mode allows you to right-click the target object to quickly expand the multi-level lineage upstream/downstream. |
Table View
The table lineage by default displays the number of directly associated first-level upstream and downstream tables of the central table. Upstream and downstream association tables and tasks support selecting the target table for upstream/downstream lineage tracing. One-time expansion of all directly first-level upstream/downstream tables of the selected table, while the lineage relationships of other tables at the same level remain unchanged.
Field View
Field lineage uses the first field of the central table as the initialization object, and by default expands the direct first-level upstream/downstream association quantity, upstream/downstream association table, and tasks. It supports selecting a target field for upstream/downstream lineage tracing. You can click the field selector in the upper left side of the canvas to switch display fields.
Data Temperature
Data Temperature provides temperature trends and frequent access task information.
Temperature Trend: The number of data accesses and table details views in the past seven days.
Frequent Access: The most frequently accessed tasks in the past 30 days (task ID, access type, task status, affiliated project, workflow, responsible person, and number of accesses).
Data Quality
Data Quality provides the quality control rules configured for the data tables, as well as an overview of the data quality inspection results output by these rules.
Access Log
Access Log provides a statistical overview of the data table access status, including visit date, access account, task ID, access type, number of executions, and other information.
Was this page helpful?