Tencent Cloud

Recent Pages

Custom Monitoring

Last updated: 2024-01-29 15:55:07
Overview
You can use TMP to customize the reported metric monitoring data so as to monitor internal status of applications or services, such as the number of processed requests and the number of orders. You can also monitor the processing duration of some core logic, such as requesting external services.
This document uses Go as an example to describe how to use TMP to customize reported metrics, visualization, and alerting.
Supported Programming Languages
Official SDKs from the native Prometheus community:
Go
Java or Scala
Python
Ruby
Third-Party SDKs for other programming languages:
Bash
C
C++
Common Lisp
Dart
Elixir
Erlang
Haskell
Lua for NGINX
Lua for Tarantool
.NET/C#
Node.js
Perl
PHP
R
Rust
For more information, please see CLIENT LIBRARIES.
Data Model
Prometheus has multidimensional analysis capabilities. A data model consists of the following parts:
Metric Name + Labels + Timestamp + Value/Sample
Metric Name: monitoring object (for example, http_request_total indicates the current total number of HTTP requests received by the system).
Labels: characteristics dimensions of the current sample, which are in K/V structure. Through such dimensions, Prometheus can filter, aggregate, and perform other operations on the sample data.
Timestamp: a timestamp accurate down to the millisecond
Value: a float64 value, which indicates the current sample value.
Metric Name/Labels can contain only ASCII characters, digits, underscores, and colons and must comply with the regular expression [a-zA-Z_:][a-zA-Z0-9_:]*. 
For more information on a data model, please see DATA MODEL.
For the best practice of metric and label naming, please see METRIC AND LABEL NAMING.
Metric Tracking Method
Prometheus provides four metric types for different monitoring scenarios: Counter, Gauge, Histogram, and Summary, as described below. For more information, please see METRIC TYPES.
The Prometheus community provides SDKs for multiple programing languages, all of which are basically similar in usage but differ mostly in syntax. This document uses Go as an example to describe how to report custom monitoring metrics.
Counter
A metric in Counter type increases monotonically and will be reset after service restart. You can use counters to monitor the numbers of requests, exceptions, user logins, orders, etc.
You can use a counter to monitor the number of orders as follows:
package order
﻿
import (
    "github.com/prometheus/client_golang/prometheus"
    "github.com/prometheus/client_golang/prometheus/promauto"
)
﻿
// Define the counter object to be monitored
var (
    opsProcessed = promauto.NewCounterVec(prometheus.CounterOpts{
        Name: "order_service_processed_orders_total",
        Help: "The total number of processed orders",
    }, []string{"status"}) // Processing status
)
﻿
// Process the order
func makeOrder() {
    opsProcessed.WithLabelValues("success").Inc() // Success
    // opsProcessed.WithLabelValues("fail").Inc() // Failure
﻿
    // Order placement business logic
}
For example, you can use the rate() function to get the order increase rate:
rate(order_service_processed_orders_total[5m])
Gauge
A gauge is a current value, which can be increased or reduced during metric timestamping. You can use gauges to monitor the current memory utilization, CPU utilization, current number of threads, queue size, etc.
You can use a gauge to monitor the size of an order queue as follows:
package order
﻿
import (
    "github.com/prometheus/client_golang/prometheus"
    "github.com/prometheus/client_golang/prometheus/promauto"
)
﻿
// Define the gauge object to be monitored
var (
    queueSize = promauto.NewGaugeVec(prometheus.GaugeOpts{
        Name: "order_service_order_queue_size",
        Help: "The size of order queue",
    }, []string{"type"})
)
﻿
type OrderQueue struct {
    queue chan string
}
﻿
func newOrderQueue() *OrderQueue  {
    return &OrderQueue{
        queue: make(chan string,100),
    }
}
﻿
// Produce an order message 
func (q *OrderQueue)produceOrder() {
    // Produce an order message
﻿
    // Increase the queue size by 1
    queueSize.WithLabelValues("make_order").Inc() // Order placement queue
    // queueSize.WithLabelValues("cancel_order").Inc() // Order cancellation queue
}
﻿
// Consume an order message 
func (q *OrderQueue)consumeOrder() {
    // Consume an order message
﻿
    // Reduce the queue size by 1
    queueSize.WithLabelValues("make_order").Dec()
}
You can use the gauge metric to directly view the current size of each type of queue of an order:
order_service_order_queue_size
Histogram
Prometheus calculates the sample distribution based on the configured Bucket to generate a histogram, which can be processed subsequently and is generally used for duration monitoring. For example, you can use a histogram to calculate the latencies of P99, P95, and P50 and monitor the numbers of processed items. With histograms, you don't need to use counters to count items. In addition, you can use histograms to monitor metrics such as API response time and database access time.
A histogram can be used in a similar way to a summary, so you can directly refer to the summary usage.
Summary
A summary is similar to a histogram, as it also calculates the sample distribution, but their differences lie in that a summary calculates the distribution (P99/P95/Sum/Count) on the client and therefore uses more client resources, and the data cannot be calculated and processed in an aggregated manner subsequently. You can use summaries to monitor metrics such as API response time and database access duration.
You can use a summary to monitor the order processing duration as follows:
package order
﻿
import (
    "net/http"
    "time"
﻿
    "github.com/prometheus/client_golang/prometheus"
    "github.com/prometheus/client_golang/prometheus/promauto"
    "github.com/prometheus/client_golang/prometheus/promhttp"
)
﻿
// Define the summary object to be monitored
var (
    opsProcessCost = promauto.NewSummaryVec(prometheus.SummaryOpts{
        Name: "order_service_process_order_duration",
        Help: "The order process duration",
    }, []string{"status"})
)
﻿
func makeOrder() {
    start := time.Now().UnixNano()
    // The order placement logic processing is completed, and the processing duration is recorded
    defer opsProcessCost.WithLabelValues("success").Observe((float64)(time.Now().UnixNano() - start))
﻿
    // Order placement business logic
    time.Sleep(time.Second) // Simulate the processing duration
}
You can use a summary metric to directly view the average order placement processing duration:
order_service_processed_order_duration_sum / order_service_processed_order_duration_count
Exposing Prometheus metrics
Use promhttp.Handler() to expose the metric tracking data to the HTTP service.
package main
﻿
import (
    "net/http"
﻿
    "github.com/prometheus/client_golang/prometheus/promhttp"
)
﻿
func main() {
        // Business code
﻿
        // Expose Prometheus metrics in the HTTP service
        http.Handle("/metrics", promhttp.Handler())
﻿
        // Business code
}
﻿
Collecting Data
After the tracking of custom metrics for your business is completed and the application is released, you can use Prometheus to collect the monitoring metric data. For more information, please see Go Integration.
Viewing Monitoring Data and Alerts
Open the Grafana service that comes with TMP and use Explore to view the monitoring metric data as shown below. You can also customize Grafana monitoring dashboards.
﻿
﻿
You can use Prometheus together with the alarming capabilities of Cloud Monitor to trigger alerts for custom monitoring metrics in real time. For more information, please see Alert Overview and Usage.
Contact Us
Contact our sales team or business advisors to help your business.
Technical Support
Open a ticket if you're looking for further assistance. Our Ticket is 7x24 avaliable.
7x24 Phone Support
tencent cloud

Recent Pages

Custom Monitoring

Overview

Supported Programming Languages

Data Model

Metric Tracking Method

Counter

Gauge

Histogram

Summary

Exposing Prometheus metrics

Collecting Data

Viewing Monitoring Data and Alerts

Was this page helpful?

Was this page helpful?

tencent cloud

Sign Up

Log in

Recent Pages

Custom Monitoring

Overview

Supported Programming Languages

Data Model

Metric Tracking Method

Counter

Gauge

Histogram

Summary

Exposing Prometheus metrics

Collecting Data

Viewing Monitoring Data and Alerts

Was this page helpful?

Was this page helpful?