# Uninstall the existing enhanced monitoring software packagerpm -e rdma_monitor-1.0-1.tl2.x86_64# Download and install the millisecond-level monitoring component.# Once the software package is installed, a system service will be automatically registered to start and keep the enhanced monitoring alive without manual startup.wget http://mirrors.tencentyun.com/install/GPU/rdma_monitor-1.0-1.tl2.x86_64.rpm && rpm -ivh rdma_monitor-1.0-1.tl2.x86_64.rpm
ps -aux | grep monitor_server
English Metric Name | Chinese Metric Name | Metric Description (optional) | Unit | Dimension | Statistical Granularity |
RxHpbwAvg | Millisecond-level_average of RDMA network interface received bandwidth | The millisecond-level statistical granularity average of the RDMA network interface received bandwidth within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
RxHpbwMax | Millisecond-level_maximum value of RDMA network interface received bandwidth | The millisecond-level statistical granularity maximum value of the RDMA network interface received bandwidth within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
RxHpbwMin | Millisecond-level_minimum value of RDMA network interface received bandwidth | The millisecond-level statistical granularity minimum value of the RDMA network interface received bandwidth within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
RxHpbwP50 | Millisecond-level_the 50th percentile of RDMA network interface received bandwidth | The millisecond-level statistical granularity 50th percentile of the RDMA network interface received bandwidth from lowest to highest within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s, 86,400s |
RxHpbwP90 | Millisecond-level_the 90th percentile of RDMA network interface received bandwidth | The millisecond-level statistical granularity 90th percentile of the RDMA network interface received bandwidth from lowest to highest within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
TxHpbwAvg | Millisecond-level_average of RDMA network interface transmitted bandwidth | The millisecond-level statistical granularity average of the RDMA network interface transmitted bandwidth within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
TxHpbwMax | Millisecond-level_maximum value of RDMA network interface transmitted bandwidth | The millisecond-level statistical granularity maximum value of the RDMA network interface transmitted bandwidth within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
TxHpbwMin | Millisecond-level_minimum value of RDMA network interface transmitted bandwidth | The millisecond-level statistical granularity minimum value of the RDMA network interface transmitted bandwidth within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
TxHpbwP50 | Millisecond-level_the 50th percentile of RDMA network interface transmitted bandwidth | The millisecond-level statistical granularity 50th percentile of the RDMA network interface transmitted bandwidth from lowest to highest within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
TxHpbwP90 | Millisecond-level_the 90th percentile of RDMA network interface transmitted bandwidth | The millisecond-level statistical granularity 90th percentile of the RDMA network interface transmitted bandwidth from lowest to highest within 10 seconds | Mbps | InstanceId | 10s, 60s, 300s, 3,600s |
# monitor_client With the enhanced monitoring automatically installed, /tmp/monitor.log is the customized data storage path. The file size will continue to grow, so be mindful of managing storage space.monitor_client -r -p raw > /tmp/monitor.log# -r continuously obtain data from the last 10s# -p; print selection# -p summary; default value; print statistical information# -p raw; print original data points# -p all; print both statistical information and original data points# You can use monitor_client -h to view more parameter descriptions.
Was this page helpful?