In Hive, partitions are represented as directories. In Iceberg, partition information is recorded in table metadata for each underlying data file rather than in the directory layout, which makes Iceberg partitions more flexible and lets the partitioning strategy evolve as data volume changes. In DLC, you can create Iceberg tables to use features such as hidden partitions.
Note:
By default, native tables are Iceberg tables. External tables can be created as either Hive or Iceberg tables, depending on the file format. For detailed syntax, see the CREATE TABLE document. With hidden partitions, you do not need to specify partition information separately when inserting or querying data, as Hive requires.
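The idea behind hidden partitions can be sketched with a toy in-memory model. This is only an illustration, not the DLC or Iceberg API: the class `HiddenPartitionTable`, its methods, and the `id` and `ts` columns are hypothetical names, and the table is assumed to be partitioned by the day of `ts`. The writer supplies only the row, and the reader filters only on `ts`; the partition value is derived and used for pruning behind the scenes.

```python
from collections import defaultdict
from datetime import datetime


class HiddenPartitionTable:
    """Toy in-memory table partitioned by day(ts); not the DLC or Iceberg API."""

    def __init__(self):
        # partition value (a date) -> rows stored in that partition
        self.partitions = defaultdict(list)

    def insert(self, row):
        # The partition value is derived from the ts column itself,
        # so the writer never supplies a separate partition column.
        self.partitions[row["ts"].date()].append(row)

    def scan(self, start, end):
        """Return (matching rows, partitions read) for start <= ts < end."""
        rows, partitions_read = [], 0
        for day, stored in self.partitions.items():
            # Skip partitions whose day cannot overlap the requested range.
            if day < start.date() or day > end.date():
                continue
            partitions_read += 1
            rows.extend(r for r in stored if start <= r["ts"] < end)
        return rows, partitions_read


table = HiddenPartitionTable()
table.insert({"id": 1, "ts": datetime(2024, 3, 1, 9, 30)})
table.insert({"id": 2, "ts": datetime(2024, 3, 1, 18, 0)})
table.insert({"id": 3, "ts": datetime(2024, 3, 2, 7, 15)})

# The query filters on ts only; the day partition is never mentioned.
rows, partitions_read = table.scan(datetime(2024, 3, 1), datetime(2024, 3, 1, 23, 59))
print([r["id"] for r in rows], partitions_read)
```

In this sketch the scan reads one of the two partitions, even though the caller never refers to a partition column. In a Hive-style layout the caller would have to add an explicit partition predicate to get the same pruning.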
Iceberg partitioning supports the following transform functions. The table lists the field types each transform accepts and the type of the resulting partition value:
| Transform | Supported field types | Result type |
| --- | --- | --- |
| identity | Any | Same as source type |
| bucket | int, long, decimal, date, time, timestamp, timestamptz, string, uuid, fixed, binary | int |
| truncate | int, long, decimal, string | Same as source type |
| year | date, timestamp, timestamptz | int |
| month | date, timestamp, timestamptz | int |
| day | date, timestamp, timestamptz | date |
| hour | timestamp, timestamptz | int |
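As a rough illustration of how these transforms map a source value to a partition value, the sketch below reimplements a few of them in plain Python. It assumes the definitions in the Apache Iceberg table specification, where `year`, `month`, and `hour` are counted from the 1970-01-01 epoch; the function names are illustrative and not part of any DLC API. `day` returns a date here to match the result type in the table, `truncate` covers only int and string inputs, and `bucket` is omitted because it relies on a 32-bit Murmur3 hash that is not in the Python standard library.

```python
from datetime import datetime, timedelta

EPOCH = datetime(1970, 1, 1)


def identity(value):
    # The partition value is the source value, unchanged.
    return value


def truncate(width, value):
    # Strings keep their first `width` characters; integers are rounded
    # down to a multiple of `width` (Python's % is non-negative here).
    if isinstance(value, str):
        return value[:width]
    return value - (value % width)


def year(ts):
    # Years since 1970.
    return ts.year - 1970


def month(ts):
    # Months since 1970-01.
    return (ts.year - 1970) * 12 + (ts.month - 1)


def day(ts):
    # Calendar day of a date or timestamp, returned as a date.
    return ts.date() if isinstance(ts, datetime) else ts


def hour(ts):
    # Whole hours since 1970-01-01 00:00 (timestamps only).
    return (ts - EPOCH) // timedelta(hours=1)


ts = datetime(2024, 3, 1, 9, 30)
print(year(ts), month(ts), day(ts), hour(ts))
print(truncate(10, 47), truncate(10, -3), truncate(3, "iceberg"))
```

Because the year and month values are epoch-based ordinals, a timestamp in 2024 falls into year partition 54 rather than 2024. Query engines usually display these values in a human-readable form.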