Estimation Function

Last updated: 2024-01-22 10:52:48
    This document introduces the basic syntax and examples of estimation functions.
    | Function | Syntax | Description |
    |---|---|---|
    | approx_distinct | approx_distinct(x) | Returns the approximate number of distinct input values (column x). |
    | approx_percentile | approx_percentile(x,percentage) | Sorts the values in the x column in ascending order and returns the value approximately at the given `percentage` position. |
    | approx_percentile | approx_percentile(x,array[percentage01, percentage02...]) | Sorts the values in the x column in ascending order and returns the values approximately at the given `percentage` positions (percentage01, percentage02...). |

    approx_distinct

    The approx_distinct function returns the approximate number of distinct input values of a field. The standard error of the result is 2.3%.
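To get a feel for what a 2.3% standard error means in practice, the following Python sketch computes the range that lies within one or two standard errors of an estimate. The helper name `error_band` and the sample value are hypothetical and used only for illustration; this is plain arithmetic, not part of the query syntax.

```python
from typing import Tuple


def error_band(estimate: int, std_error: float = 0.023, k: float = 1.0) -> Tuple[int, int]:
    """Return the (low, high) range within k standard errors of an estimate.

    std_error=0.023 is the 2.3% figure quoted for approx_distinct.
    """
    delta = estimate * std_error * k
    return round(estimate - delta), round(estimate + delta)


# Hypothetical UV estimate returned by approx_distinct.
uv = 100_000
print(error_band(uv))       # (97700, 102300)
print(error_band(uv, k=2))  # (95400, 104600)
```

If a result outside such a band would be unacceptable for your use case, an exact distinct count may be the better choice, at a higher computation cost.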

    Syntax

    approx_distinct(x)

    Field description

    | Parameter | Description |
    |---|---|
    | x | The parameter value can be of any data type. |

    Return value type

    Bigint

    Sample

    Use the count function to calculate the PV value. Use the approx_distinct function to get the approximate number of distinct values of the ip field and use it as the UV value.
    * | SELECT count(*) AS PV, approx_distinct(ip) AS UV

    approx_percentile

    The approx_percentile function sorts the values of the target field in ascending order and returns the value approximately at the given percentage position. It uses the T-Digest algorithm for estimation, which has a low deviation and meets most statistical analysis requirements. To verify the deviation, you can run * | select count_if(x<(select approx_percentile(x,percentage))),count(*), which returns the exact number of field values below the estimated percentile value and the total number of field values. Dividing the first count by the second and comparing the ratio with percentage gives the statistical deviation.
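The deviation check described above can be sketched in Python as follows. The function name `check_percentile`, the sample values, and the estimate of 31.0 are hypothetical; the sketch only mirrors the arithmetic of the verification query (count below the estimate, total count, ratio compared with the target percentage) and does not call any log service API.

```python
def check_percentile(values, estimate, percentage):
    """Compare the share of values below an estimate with the target percentage."""
    if not values:
        raise ValueError("values must not be empty")
    below = sum(1 for v in values if v < estimate)  # count_if(x < estimate)
    total = len(values)                             # count(*)
    observed = below / total
    # A deviation near 0 means the estimate sits close to the requested position.
    return below, total, observed - percentage


# Hypothetical field values and a hypothetical result of approx_percentile(x, 0.5).
res_total_time = [12, 15, 20, 22, 30, 35, 41, 50, 64, 90]
below, total, deviation = check_percentile(res_total_time, estimate=31.0, percentage=0.5)
print(below, total, round(deviation, 3))
```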

    Syntax

    Return the value (double) approximately at the given percentage position
    approx_percentile(x, percentage)
    Return the value (array) approximately at the given percentage positions (percentage01,percentage02...)
    approx_percentile(x, array[percentage01,percentage02...])

    Field description

    | Parameter | Description |
    |---|---|
    | x | Value type: double |
    | percentage | Value range: [0,1] |

    Return value type

    double when a single percentage is passed, or array when an array of percentages is passed.

    Sample

    Sample 1

    Sort the values of the resTotalTime column and return the value of resTotalTime approximately at the 50% position.
    * | select approx_percentile(resTotalTime,0.5)

    Sample 2

    Sort the values of the resTotalTime column and return the values of resTotalTime approximately at the 20%, 40%, and 60% positions.
    * | select approx_percentile(resTotalTime, array[0.2,0.4,0.6])
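As a point of comparison for Sample 2, the following Python sketch computes exact percentiles for the same positions (0.2, 0.4, 0.6) on a small in-memory list. It assumes the nearest-rank definition of a percentile, which is one common convention and not necessarily the one the service applies; it is not the T-Digest algorithm. The function name `exact_percentiles` and the data are hypothetical.

```python
import math


def exact_percentiles(values, percentages):
    """Return the exact nearest-rank percentile value for each percentage in [0, 1]."""
    ordered = sorted(values)  # ascending order, as in the function description
    n = len(ordered)
    result = []
    for p in percentages:
        rank = max(1, math.ceil(p * n))  # 1-based nearest rank
        result.append(ordered[rank - 1])
    return result


# Hypothetical resTotalTime values, deliberately unsorted.
res_total_time = [90, 12, 50, 15, 41, 20, 35, 22, 30, 64]
print(exact_percentiles(res_total_time, [0.2, 0.4, 0.6]))
```

On large data sets an exact computation like this requires sorting every value, which is the cost that an approximate function avoids.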
    