Title | Metric | Unit | Description |
Cluster regions in RIT status | ritCount | - | Number of regions in transition |
| ritCountOverThreshold | - | Number of regions that have been in transition for more than the threshold time |
Cluster RIT time | ritOldestAge | ms | Age of the longest region in transition |
Average number of regions per RegionServer | averageLoad | - | Average number of regions per RegionServer |
Cluster RegionServers | numRegionServers | - | Number of live RegionServers |
| numDeadRegionServers | - | Number of dead RegionServers |
Data read/written from/to HMaster | receivedBytes | bytes/s | Amount of data received by cluster |
| sentBytes | bytes/s | Amount of data sent by cluster |
Total cluster API requests | clusterRequests | count/s | Total number of cluster requests |
Cluster assignment manager operation | Assign_num_ops | - | Number of region assignments |
| BulkAssign_num_ops | - | Number of bulk region assignments |
Cluster load balancing operations | BalancerCluster_num_ops | - | Number of cluster load balancing operations |
Title | Metric | Unit | Description |
GC count | YGC | - | Young GC count |
| FGC | - | Full GC count |
GC time | FGCT | s | Full GC time |
| GCT | s | Garbage collection time |
| YGCT | s | Young GC time |
Memory zone proportion | S0 | % | Percentage of used Survivor 0 memory |
| E | % | Percentage of used Eden memory |
| CCS | % | Percentage of used compressed class space memory |
| S1 | % | Percentage of used Survivor 1 memory |
| O | % | Percentage of used Old memory |
| M | % | Percentage of used Metaspace memory |
JVM logs | LogFatal | - | Number of Fatal logs |
| LogError | - | Number of Error logs |
| LogWarn | - | Number of Warn logs |
| LogInfo | - | Number of Info logs |
JVM memory | MemNonHeapUsedM | MB | Non-heap memory size used by process |
| MemNonHeapCommittedM | MB | Non-heap memory size committed to process |
| MemHeapUsedM | MB | Heap memory size used by process |
| MemHeapCommittedM | MB | Heap memory size committed to process |
| MemHeapMaxM | MB | Maximum heap memory size available to process |
| MemMaxM | MB | Maximum memory size available to process |
JVM threads | ThreadsNew | - | Number of threads in NEW status |
| ThreadsRunnable | - | Number of threads in RUNNABLE status |
| ThreadsBlocked | - | Number of threads in BLOCKED status |
| ThreadsWaiting | - | Number of threads in WAITING status |
| ThreadsTimedWaiting | - | Number of threads in TIMED WAITING status |
| ThreadsTerminated | - | Number of threads currently in TERMINATED status |
RPC connections | numOpenConnections | - | Number of RPC connections |
RPC exceptions | FailedSanityCheckException | - | Number of FailedSanityCheckException exceptions |
| NotServingRegionException | - | Number of NotServingRegionException exceptions |
| OutOfOrderScannerNextException | - | Number of OutOfOrderScannerNextException exceptions |
| RegionMovedException | - | Number of RegionMovedException exceptions |
| RegionTooBusyException | - | Number of RegionTooBusyException exceptions |
| UnknownScannerException | - | Number of UnknownScannerException exceptions |
RPC queue requests | numCallsInPriorityQueue | - | Number of requests in the general queue |
| numCallsInReplicationQueue | - | Number of RPC requests in the replication queue |
Process start time | masterActiveTime | s | Master active time |
| masterStartTime | s | Master process start time |
Title | Metric | Unit | Description |
GC count | YGC | - | Young GC count |
| FGC | - | Full GC count |
GC time | FGCT | s | Full GC time |
| GCT | s | Garbage collection time |
| YGCT | s | Young GC time |
Memory zone proportion | S0 | % | Percentage of used Survivor 0 memory |
| E | % | Percentage of used Eden memory |
| CCS | % | Percentage of used compressed class space memory |
| S1 | % | Percentage of used Survivor 1 memory |
| O | % | Percentage of used Old memory |
| M | % | Percentage of used Metaspace memory |
JVM logs | LogFatal | - | Number of Fatal logs |
| LogError | - | Number of Error logs |
| LogWarn | - | Number of Warn logs |
| LogInfo | - | Number of Info logs |
JVM memory | MemNonHeapUsedM | MB | Non-heap memory size used by process |
| MemNonHeapCommittedM | MB | Non-heap memory size committed to process |
| MemHeapUsedM | MB | Heap memory size used by process |
| MemHeapCommittedM | MB | Heap memory size committed to process |
| MemHeapMaxM | MB | Maximum heap memory size available to process |
| MemMaxM | MB | Maximum memory size available to process |
JVM threads | ThreadsNew | - | Number of threads in NEW status |
| ThreadsRunnable | - | Number of threads in RUNNABLE status |
| ThreadsBlocked | - | Number of threads in BLOCKED status |
| ThreadsWaiting | - | Number of threads in WAITING status |
| ThreadsTimedWaiting | - | Number of threads in TIMED WAITING status |
| ThreadsTerminated | - | Number of threads currently in TERMINATED status |
Regions | regionCount | - | Number of regions |
Region localization | percentFilesLocal | % | Percentage of HFiles on the local HDFS data node in the region |
Region replica localization | percentFilesLocalSecondaryRegions | % | Percentage of HFiles on the local HDFS data node in the region replica |
RPC authentications | authenticationFailures | - | Number of RPC authentication failures |
| authenticationSuccesses | - | Number of RPC authentication successes |
RPC connections | numOpenConnections | - | Number of RPC connections |
RPC exceptions | FailedSanityCheckException | - | Number of FailedSanityCheckException exceptions |
| NotServingRegionException | - | Number of NotServingRegionException exceptions |
| OutOfOrderScannerNextException | - | Number of OutOfOrderScannerNextException exceptions |
| RegionMovedException | - | Number of RegionMovedException exceptions |
| RegionTooBusyException | - | Number of RegionTooBusyException exceptions |
| UnknownScannerException | - | Number of UnknownScannerException exceptions |
RPC handlers | numActiveHandler | - | Number of active RPC handlers |
| numActiveWriteHandler | - | Number of active read RPC handlers |
| numActiveReadHandler | - | Number of active write RPC handlers |
| numActiveScanHandler | - | Number of active scan RPC handlers |
| | | |
RPC queue requests | numCallsInPriorityQueue | - | Number of requests in the priority queue |
| numCallsInReplicationQueue | - | Number of RPC requests in the replication queue |
| numCallsInPriorityQueue | - | Number of requests in the general queue |
| numCallsInWriteQueue | - | Number of RPC calls in the write call queue |
| numCallsInReadQueue | - | Number of RPC calls in the read call queue |
| numCallsInScanQueue | - | Number of RPC calls in the scan call queue |
WAL files | hlogFileCount | - | Number of WAL files |
WAL file size | hlogFileSize | Byte | WAL file size |
MemStore size | memStoreSize | MB | MemStore size |
Stores | storeCount | - | Number of stores |
StoreFiles | storeFileCount | - | Number of StoreFiles |
StoreFile size | storeFileSize | MB | StoreFile size |
Disk write rate | flushedCellsSize | bytes/s | Disk write rate |
Average latency | Append_mean | ms | Average Append latency |
| Replay_mean | ms | Average Replay latency |
| Get_mean | ms | Average GET latency |
| updatesBlockedTime | ms | Number of milliseconds updates have been blocked so the memstore can be flushed |
RegionServer disk writes | FlushTime_num_ops | - | Number of MemStore flushes |
Requests in operation queue | splitQueueLength | - | Length of the split queue |
| compactionQueueLength | - | Length of the compaction queue |
| flushQueueLength | - | Length of the region flush queue |
Replay operations | Replay_num_ops | - | Number of Replay operations |
Slow operations | slowAppendCount | - | Number of Append requests that took over 1s to complete |
| slowDeleteCount | - | Number of Delete requests that took over 1s to complete |
| slowGetCount | - | Number of Get requests that took over 1s to complete |
| slowIncrementCount | - | Number of Increment requests that took over 1s to complete |
| slowPutCount | - | Number of Put requests that took over 1s to complete |
Split request | splitRequestCount | - | Number of split requested |
| splitSuccessCount | - | Number of successfully executed splits |
Cache blocks | blockCacheCount | - | Number of blocks in the block cache |
| blockCacheHitCount | - | Number of block cache hits |
| blockCacheMissCount | - | Number of block cache misses |
Cache read hit rate | blockCacheExpressHitPercent | % | Cache read hit rate |
Memory size used by the cache block | blockCacheSize | Byte | Memory size used by the cache block |
Index size | staticBloomSize | Byte | Uncompressed size of static bloom filters |
| staticIndexSize | Byte | Uncompressed size of static indexes |
| storeFileIndexSize | Byte | Size of indexes in StoreFiles on disk |
Received bytes | receivedBytes | bytes/s | Received bytes |
| sentBytes | bytes/s | Sent bytes |
Read and write requests | Total | count/s | Total number of requests. When there are scan requests, this value will be smaller than the sum of read and write requests |
| Read | count/s | Number of read requests |
| Write | count/s | Number of write requests |
| Append_num_ops | count/s | Number of Append requests |
| Mutate_num_ops | count/s | Number of Mutate requests |
| Delete_num_ops | count/s | Number of Delete requests |
| Increment_num_ops | count/s | Number of Increment requests |
| Get_num_ops | count/s | Number of Get requests |
| Put_num_ops | count/s | Number of Put requests |
| ScanTime_num_ops | count/s | Scan requests (time) |
| ScanSize_num_ops | count/s | Scan requests (size) |
Mutations | mutationsWithoutWALCount | - | Number of mutations |
Mutation size | mutationsWithoutWALSize | Byte | Mutation size |
Process start time | regionServerStartTime | s | Process start time |
Log sync | source.sizeOfLogQueue | - | Total length of synced logs |
Sync duration | source.ageOfLastShippedOp | ms | Sync duration |
Requests | ReadRequestCount | count/s | Read requests/s |
| WriteRequestCount | count/s | Write requests/s |
Requests | Read | count/s | Read requests/s |
| Write | count/s | Write requests/s |
Store size | memstoreSize | Byte | MemStore size |
| storeFileSize | Byte | StoreFile size |
Table-level request latency | getTime_99th_percentile | ms | 99th percentile of request processing latency |
| scanTime_99th_percentile | ms | 99th percentile of request processing latency |
| putTime_99th_percentile | ms | 99th percentile of request processing latency |
| incrementTime_99th_percentile | ms | 99th percentile of request processing latency |
| appendTime_99th_percentile | ms | 99th percentile of request processing latency |
| deleteTime_99th_percentile | ms | 99th percentile of request processing latency |
Request processing latency | 99th_percentile | ms | 99th percentile of request processing latency |
| 99.9th_percentile | ms | 99.9% request processing latency |
Request queueing latency | 99th_percentile | ms | 99th percentile of request queueing latency |
| 99.9th_percentile | ms | 99.9% request queueing latency |
Scan size | max | bytes | Maximum scan size |
| mean | bytes | Average scan size |
| min | bytes | Minimum scan size |
Scan time | max | s | Maximum scan time |
| mean | s | Average scan time |
| min | s | Minimum scan time |
Was this page helpful?