tencent cloud

All product documents
Data Lake Compute
Engine Local Cache
Last updated: 2024-07-31 17:48:05
Engine Local Cache
Last updated: 2024-07-31 17:48:05
To ensure stable operation of Spark engine query analysis when network bandwidth is limited (e.g. during storage system throttling), the DLC Spark engine provides a local cache capability. When you need to cache table data, you can quickly enable caching by adding engine configuration.

Directions

1. Create a Spark Engine: For details, see Purchase Exclusive Data Engine.
2. Add Cache Configuration: Go to the DLC Console > Data Engine. Select the engine created in Step 1, click Parameter Configuration, and add the configuration items from Cache Configuration Item Explanation.
Spark SQL Engine Configuration:

Note:
After the configuration is added, the engine cluster will restart. It is recommended to enable the cache when no tasks are running to avoid affecting ongoing tasks.
3. To use the engine cache, go to Data Exploration, write the query SQL in the SQL interface, select the engine with the cache enabled, and execute the SQL. Once executed, the engine will cache the DLC external table data locally. When the SQL is executed again, the data will be fetched from the local cache, improving query efficiency.
Spark SQL Engine Query:

Spark Batch Engine Query:


Cache Description

Cache Configuration Items Description

Configuration Items
Configuration Values
Configuration Items Description
spark.hadoop.fs.cosn.impl
alluxio.hadoop.ShimFileSystem
Fixed value; the configuration value is the cache implementation class. Configure this value to enable the cache feature. If the cache feature is enabled, configuring a value other than this will result in the engine not being able to access COS data. Please follow the instructions carefully.
If you need to disable the cache after enabling it, please delete this configuration item.

Cache Usage Instructions

1. Engine Type Description
SparkSQL Engine: When the engine restarts, the cached data becomes invalid because it is a local cache.
SparkBatch Engine: The SparkBatch engine runs tasks at the session level. Once the task execution is complete, the cached data becomes invalid.
2. Table Type Description
Currently, only DLC external tables are cached.

Was this page helpful?
You can also Contact Sales or Submit a Ticket for help.
Yes
No

Feedback

Contact Us

Contact our sales team or business advisors to help your business.

Technical Support

Open a ticket if you're looking for further assistance. Our Ticket is 7x24 available.

7x24 Phone Support
Hong Kong, China
+852 800 906 020 (Toll Free)
United States
+1 844 606 0804 (Toll Free)
United Kingdom
+44 808 196 4551 (Toll Free)
Canada
+1 888 605 7930 (Toll Free)
Australia
+61 1300 986 386 (Toll Free)
EdgeOne hotline
+852 300 80699
More local hotlines coming soon