Test Scheme Introduction

Last updated: 2024-07-31 09:19:12
    This document describes how to use the Star Schema Benchmark (SSB) data set to test the performance of Tencent Cloud TCHouse-D, and provides a reference scheme for data import and performance testing.

    About SSB Performance Test

    SSB (Star Schema Benchmark) is a lightweight performance test set for data warehouse scenarios. Based on TPC-H, SSB provides a simplified star schema data set that is mainly used to test the performance of multi-table join queries under the star model. It is also common industry practice to flatten SSB into a wide table model (hereinafter referred to as SSB FLAT) to test the performance of the query engine.

    Test Scheme Introduction

    Test Environment Preparation

    Hardware Environment

    In the reference scheme given in this document, the tested cluster consists of 1 FE and 3 BEs, with the FE and BE node processes deployed separately. The specifications are listed below. Note that the test itself does not consume all of these hardware resources.
    Node Type          CPU         Memory    Hard Disk
    1 FE, standard     4 cores     16 GB     Enhanced SSD Cloud Disk, 200 GB
    3 BEs, standard    16 cores    64 GB     Enhanced SSD Cloud Disk, 1000 GB

    Software Version

    Tencent Cloud TCHouse-D 1.2.7

    Test Script Preparation

    Download the SSB-H toolkit from Toolkit Address and compile it.

    Test SSB data set.

    Generate data set.

    sh bin/gen-ssb-data.sh -s 100 -c 100
    
    277M /data/ssb-tools/bin/ssb-data//customer.tbl
    228K /data/ssb-tools/bin/ssb-data//date.tbl
    5.9G /data/ssb-tools/bin/ssb-data//lineorder.tbl.1
    7.7G /data/ssb-tools/bin/ssb-data//lineorder.tbl.10
    7.6G /data/ssb-tools/bin/ssb-data//lineorder.tbl.2
    7.2G /data/ssb-tools/bin/ssb-data//lineorder.tbl.3
    6.0G /data/ssb-tools/bin/ssb-data//lineorder.tbl.4
    6.0G /data/ssb-tools/bin/ssb-data//lineorder.tbl.5
    6.0G /data/ssb-tools/bin/ssb-data//lineorder.tbl.6
    6.0G /data/ssb-tools/bin/ssb-data//lineorder.tbl.7
    7.4G /data/ssb-tools/bin/ssb-data//lineorder.tbl.8
    7.5G /data/ssb-tools/bin/ssb-data//lineorder.tbl.9
    116M /data/ssb-tools/bin/ssb-data//part.tbl
    17M /data/ssb-tools/bin/ssb-data//supplier.tbl
    Under the -s 100 parameter, the size of the data set generated is:
    SSB Table Name    Number of Rows    Remarks
    LINEORDER         600,037,902       Product Order Details Table
    CUSTOMER          3,000,000         Customer Information Table
    PART              1,400,000         Parts Information Table
    SUPPLIER          200,000           Supplier Information Table
    DATE              2,556             Date Table
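
    Before importing, the generated files can be checked against the row counts listed above. The following is a minimal sketch, not part of the toolkit: it assumes each .tbl file holds one row per line and that the lineorder shards (lineorder.tbl.1 to lineorder.tbl.10) together make up a single table. The helper names table_of and count_rows are hypothetical.

```python
from collections import Counter
from pathlib import Path


def table_of(file_name: str) -> str:
    """Map a file name such as 'lineorder.tbl.10' or 'part.tbl' to its table name."""
    return file_name.split(".tbl")[0]


def count_rows(data_dir: str) -> Counter:
    """Count lines per table across all *.tbl and *.tbl.N files in data_dir.

    Assumption: one row per line in every generated file.
    """
    rows = Counter()
    for path in Path(data_dir).glob("*.tbl*"):
        with path.open("rb") as f:
            rows[table_of(path.name)] += sum(1 for _ in f)
    return rows


# Example call, using the output directory shown in the listing above:
#   for table, n in sorted(count_rows("/data/ssb-tools/bin/ssb-data").items()):
#       print(f"{table}: {n:,}")
```

    The printed totals can then be compared by eye with the Number of Rows column. Counting lines in roughly 67 GB of lineorder shards takes a while, so this check is optional.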

    Create a table

    Edit the configuration file conf/doris-cluster.conf (for example, with vim conf/doris-cluster.conf).
    Set the following items to match your cluster: FE_HOST, PASSWORD, DB.
    # cat doris-cluster.conf
    
    # Any of FE host
    export FE_HOST='127.0.0.1'
    # http_port in fe.conf
    export FE_HTTP_PORT=8030
    # query_port in fe.conf
    export FE_QUERY_PORT=9030
    # Doris username
    export USER='root'
    # Doris password
    export PASSWORD=''
    # The database where SSB tables located
    export DB='ssb_100g'
    # The scale of testing data
    export SCALE='100g' # only support '100g' or '1t'
    Create a Table:
    sh bin/create-ssb-tables.sh
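
    A typo in doris-cluster.conf usually only shows up when a script fails, so it can help to read the file back before running anything. The sketch below is illustrative only and assumes the file consists of simple export KEY=value lines as shown above. The function name parse_cluster_conf is hypothetical and not part of the toolkit.

```python
import re
import shlex

# Matches lines of the form: export KEY=value
EXPORT_RE = re.compile(r"^\s*export\s+([A-Za-z_][A-Za-z0-9_]*)=(.*)$")


def parse_cluster_conf(text: str) -> dict:
    """Return the exported settings as a dict of strings."""
    settings = {}
    for line in text.splitlines():
        m = EXPORT_RE.match(line)
        if not m:
            continue  # skip comment lines and blank lines
        key, raw = m.groups()
        # shlex strips the quotes and drops a trailing '# ...' comment.
        tokens = shlex.split(raw, comments=True)
        settings[key] = tokens[0] if tokens else ""
    return settings


# Example call:
#   with open("conf/doris-cluster.conf") as f:
#       conf = parse_cluster_conf(f.read())
#   print(conf["FE_HOST"], conf["FE_QUERY_PORT"], conf["DB"])
```

    All values come back as strings, so a port such as FE_QUERY_PORT needs an int() conversion before use.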

    Import Data

    sh bin/load-ssb-data.sh

    Check Imported Data

    select count(*) from part;
    select count(*) from customer;
    select count(*) from supplier;
    select count(*) from date;
    select count(*) from lineorder;
    select count(*) from lineorder_flat;
    The row count of each table should match the number of rows in the generated data.
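
    The six checks can also be issued as one statement and compared programmatically. The sketch below is only an illustration: it builds the SQL text and compares a dict of returned counts with the row counts listed earlier in this document. The expected value for lineorder_flat is an assumption (one row per lineorder row), since the row-count table does not list it. The names count_query and failed_tables are hypothetical.

```python
# Expected counts from the row-count table in this document (-s 100).
EXPECTED = {
    "part": 1_400_000,
    "customer": 3_000_000,
    "supplier": 200_000,
    "date": 2_556,
    "lineorder": 600_037_902,
}
# Assumption: the flat table holds one row per lineorder row.
EXPECTED["lineorder_flat"] = EXPECTED["lineorder"]


def count_query(tables=tuple(EXPECTED)) -> str:
    """Build one UNION ALL statement that returns (table_name, row_count) pairs."""
    parts = [
        f"SELECT '{t}' AS table_name, COUNT(*) AS row_count FROM `{t}`"
        for t in tables
    ]
    return "\nUNION ALL\n".join(parts) + ";"


def failed_tables(actual: dict) -> list:
    """Return the tables whose actual count is missing or differs from EXPECTED."""
    return sorted(t for t, n in EXPECTED.items() if actual.get(t) != n)


# Example: print(count_query()) and run the statement in a SQL client,
# then pass the result rows as a dict to failed_tables().
```

    An empty list from failed_tables means every table matched.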

    Query

    Query the SSB tables. Each line shows the elapsed time of one query in milliseconds.
    # sh bin/run-ssb-queries.sh
    q1.1: 46
    q1.2: 29
    q1.3: 26
    q2.1: 340
    q2.2: 273
    q2.3: 257
    q3.1: 542
    q3.2: 237
    q3.3: 297
    q3.4: 57
    q4.1: 732
    q4.2: 372
    q4.3: 483
    total time: 3691 ms
    Query FLAT tables.
    # sh bin/run-ssb-flat-queries.sh
    q1.1: 26
    q1.2: 10
    q1.3: 35
    q2.1: 85
    q2.2: 83
    q2.3: 60
    q3.1: 157
    q3.2: 78
    q3.3: 75
    q3.4: 13
    q4.1: 131
    q4.2: 49
    q4.3: 33
    total time: 835 ms
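
    To compare the two runs query by query, the script output can be parsed and the totals recomputed. This is a minimal sketch that uses the figures printed above. The parsing assumes every result line has the form qN.M: <milliseconds>, and the helper names are hypothetical.

```python
import re

# Output copied from the two runs above; per-query times are in milliseconds.
SSB_OUTPUT = """
q1.1: 46
q1.2: 29
q1.3: 26
q2.1: 340
q2.2: 273
q2.3: 257
q3.1: 542
q3.2: 237
q3.3: 297
q3.4: 57
q4.1: 732
q4.2: 372
q4.3: 483
total time: 3691 ms
"""

FLAT_OUTPUT = """
q1.1: 26
q1.2: 10
q1.3: 35
q2.1: 85
q2.2: 83
q2.3: 60
q3.1: 157
q3.2: 78
q3.3: 75
q3.4: 13
q4.1: 131
q4.2: 49
q4.3: 33
total time: 835 ms
"""

LINE_RE = re.compile(r"^\s*(q\d+\.\d+):\s*(\d+)\s*$")


def parse_timings(output: str) -> dict:
    """Extract {query_name: milliseconds}, ignoring the 'total time' line."""
    timings = {}
    for line in output.splitlines():
        m = LINE_RE.match(line)
        if m:
            timings[m.group(1)] = int(m.group(2))
    return timings


def slower_on_flat(star: dict, flat: dict) -> list:
    """Return the queries that took longer on the flat table than on the star schema."""
    return sorted(q for q in star if q in flat and flat[q] > star[q])


if __name__ == "__main__":
    star, flat = parse_timings(SSB_OUTPUT), parse_timings(FLAT_OUTPUT)
    print("star total (ms):", sum(star.values()))
    print("flat total (ms):", sum(flat.values()))
    print("overall ratio:", round(sum(star.values()) / sum(flat.values()), 2))
    print("slower on flat:", slower_on_flat(star, flat))
```

    On these figures the per-query times add up to the reported totals (3691 ms and 835 ms), so the flat run is about 4.4 times faster overall, with q1.3 as the only query that ran slower on the flat table.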
    This completes SSB data generation, table creation, data import, and querying.