A container format is a file format used to store digital video data in a computer system. Here is a list of popular container formats used in internet services:
Codec is short for coder-decoder and refers to a program or device that can encode and decode digital videos. Below are common codecs:
Bitrate refers to the number of bits required per unit of time to play back continuous media (e.g., compressed audio or video). It is measured in bit/s (bps).
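Because bitrate is bits per unit of time, the approximate size of a stream follows from bitrate and duration. The sketch below is only an illustration: the function name `estimated_size_bytes` is hypothetical, and it assumes a constant bitrate and ignores container overhead.

```python
def estimated_size_bytes(bitrate_bps: int, duration_s: float) -> float:
    """Approximate stream size: bits per second times seconds, divided by 8 bits per byte."""
    return bitrate_bps * duration_s / 8


# A 60-second stream at 2,000,000 bit/s (2 Mbps)
size = estimated_size_bytes(2_000_000, 60)
print(f"{size / 1_000_000:.1f} MB")  # 15.0 MB
```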
Frame rate refers to the number of frames in a video per unit time. It is measured in FPS (frames per second) or Hz.
Resolution indicates how much detail a video can show and is expressed as the number of pixels in each dimension, e.g., 640 × 480.
A GOP (group of pictures) refers to a set of continuous pictures within an encoded video stream and specifies the order in which the contained frames are arranged. When a new GOP appears, the decoder does not need any previous frames in order to decode the next frame. A GOP can contain the following types of pictures:
The number of pictures within a GOP is called the GOP length.
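GOP length and frame rate together determine how much playback time one GOP covers. The following is a minimal sketch, assuming a constant frame rate and a fixed GOP length; the function name `gop_duration_seconds` is hypothetical.

```python
from fractions import Fraction


def gop_duration_seconds(gop_length: int, frame_rate: Fraction) -> Fraction:
    """Playback time covered by one GOP: pictures per GOP divided by pictures per second."""
    return Fraction(gop_length) / frame_rate


print(gop_duration_seconds(50, Fraction(25)))                  # 2
print(float(gop_duration_seconds(60, Fraction(30000, 1001))))  # 2.002
```

Using `Fraction` keeps non-integer frame rates such as 30000/1001 exact instead of accumulating floating-point error.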
An IDR (instantaneous decoding refresh) picture is a type of I picture. It differs from a regular I picture in that no picture after an IDR picture can reference any picture before it.
If IDR frame alignment is specified when multi-bitrate transcoding is performed on a video, IDR pictures of all the output videos will be precisely aligned by time point and picture content, so that video players can smoothly switch among videos at different bitrates without obvious lagging.
If you enable IDR frame alignment on the Tencent Cloud transcoding platform, the [frame rate](#frame rate), GOP length, codec, and [container format](#container format) of all the outputs must be identical.
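The constraint above can be checked before a multi-bitrate job is submitted. The sketch below is not a Tencent Cloud API: `OutputSpec`, its fields, and `idr_alignment_conflicts` are hypothetical names used only to illustrate the rule that the four parameters must match across outputs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OutputSpec:
    """Hypothetical description of one transcoding output."""
    name: str
    bitrate_kbps: int
    frame_rate: float
    gop_length: int
    codec: str
    container: str


def idr_alignment_conflicts(outputs):
    """Return the names of the parameters that differ across the given outputs."""
    fields = ("frame_rate", "gop_length", "codec", "container")
    return [f for f in fields if len({getattr(o, f) for o in outputs}) > 1]


ladder = [
    OutputSpec("sd", 800, 25.0, 50, "h264", "mp4"),
    OutputSpec("hd", 2500, 25.0, 50, "h264", "mp4"),
    OutputSpec("fhd", 5000, 30.0, 60, "h264", "mp4"),
]
print(idr_alignment_conflicts(ladder[:2]))  # []
print(idr_alignment_conflicts(ladder))      # ['frame_rate', 'gop_length']
```

An empty list means the outputs differ only in parameters that are allowed to vary, such as bitrate.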
A profile is a collection of specific encoding parameters. A codec standard has a large number of features, so supporting the complete standard takes considerable hardware and software effort. Therefore, to make a codec standard easier to use, some common parameters are selected and combined to form different profiles. H.264 specifies three profiles as shown below:
A color space is a specific organization of colors. Combining a color model such as RGB or HSB with the profiling of physical devices yields fixed analog and digital representations of colors. A color model is a mathematical model describing the way colors in a color space can be represented as tuples of numbers (e.g., triples in RGB or quadruples in CMYK).
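As an illustration of colors represented as tuples in different models, the sketch below converts an 8-bit RGB triple to HSB using the `colorsys` module from the Python standard library (which calls the model HSV). The wrapper name `rgb8_to_hsb` and the choice of degrees and percentages for the result are assumptions made for this example.

```python
import colorsys


def rgb8_to_hsb(r: int, g: int, b: int):
    """Convert 8-bit RGB to (hue in degrees, saturation %, brightness %)."""
    h, s, v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
    return round(h * 360, 1), round(s * 100, 1), round(v * 100, 1)


print(rgb8_to_hsb(255, 0, 0))  # (0.0, 100.0, 100.0)
print(rgb8_to_hsb(0, 0, 255))  # (240.0, 100.0, 100.0)
```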
Video noise is a random variation of brightness or color in an image produced by a sensor, scanner circuit, or digital camera. It can also originate in film grain and the shot noise of a photon detector. It is generally viewed as an undesirable by-product of image capture. Video noise reduction removes unwanted noise from a video while retaining useful information such as important details.
In the era of analog television, the processing speed and network bandwidth of playback devices were limited. In this context, interlacing was developed to deliver videos at lower bitrates without reducing the source frame rates. It can reduce the video transmission bandwidth by 50% while basically retaining source image quality. However, it has noticeable negative effects such as low definition, flickering, and jaggies along image edges.
Nowadays, video playback devices and network bandwidth have improved greatly, and interlacing has gradually become obsolete and is no longer supported by some new device models. Therefore, old videos that were processed with interlacing need to be "deinterlaced".
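The idea behind interlacing and its reversal can be sketched with a frame modeled as a list of pixel rows. This is a simplified illustration, not how a production deinterlacer works: the function names are hypothetical, the frame is assumed to have an even number of rows, and the "weave" method shown here is only lossless when both fields come from the same instant in time.

```python
def split_fields(frame):
    """Split a frame into a top field (even rows) and a bottom field (odd rows)."""
    return frame[0::2], frame[1::2]


def weave(top, bottom):
    """Rebuild a full frame by interleaving the rows of the two fields."""
    frame = []
    for top_row, bottom_row in zip(top, bottom):
        frame.extend([top_row, bottom_row])
    return frame


# A tiny 4x4 "frame" whose pixel values encode row and column.
frame = [[row * 10 + col for col in range(4)] for row in range(4)]
top, bottom = split_fields(frame)

print(len(top), len(bottom))         # 2 2
print(weave(top, bottom) == frame)   # True
```

Each field carries half of the rows, which is where the 50% bandwidth reduction comes from. When the two fields were captured at different moments, weaving them produces the jagged edges mentioned above, which is why practical deinterlacing uses more elaborate methods.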
An audio codec is a method of converting analog audio signals to digital signals (or vice versa); encoding is broadly either lossless or lossy. According to sampling theory, an encoded audio signal can only approximate the natural signal ever more closely; therefore, strictly speaking, all audio codecs are lossy. In computing, pulse code modulation (PCM), which achieves the highest fidelity, is generally regarded as lossless encoding. The popular audio codecs used in internet services, such as MP3 and AAC, are all lossy.
Sample rate refers to the number of samples taken from a continuous signal per second. It is measured in Hz.
Please see the description of bitrate in the Video Encoding Terms section.
A sound channel refers to an independent audio signal that is captured or played back at a distinct spatial position when sound is recorded or played back. The number of sound channels is the number of sound sources during recording or the number of speakers during playback.
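For uncompressed PCM audio, sample rate and channel count relate directly to bitrate. The sketch below assumes a fixed bit depth (bits per sample), a parameter that is not defined in this glossary, and the function name `pcm_bitrate_bps` is hypothetical.

```python
def pcm_bitrate_bps(sample_rate_hz: int, bit_depth: int, channels: int) -> int:
    """Uncompressed PCM bitrate: samples per second x bits per sample x channels."""
    return sample_rate_hz * bit_depth * channels


# 44.1 kHz, 16 bits per sample (assumed), stereo
print(pcm_bitrate_bps(44_100, 16, 2))  # 1411200
```

Lossy codecs such as MP3 and AAC deliver the same sample rate and channel count at a much lower bitrate than this uncompressed figure.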
ISO date format is the time format specified in ISO 8601. In Tencent Cloud MPS, unless otherwise specified, all time-related parameters use UTC time in the ISO 8601 format YYYY-MM-DDThh:mm:ssZ. For example, 2018-10-01T10:00:00Z represents 18:00:00 on October 1, 2018 Beijing Time (UTC+8).
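The conversion in the example above can be reproduced with the Python standard library. This is a minimal sketch that accepts only the exact `YYYY-MM-DDThh:mm:ssZ` form; the helper name `parse_utc` is hypothetical.

```python
from datetime import datetime, timedelta, timezone

BEIJING = timezone(timedelta(hours=8))  # UTC+8


def parse_utc(timestamp: str) -> datetime:
    """Parse a YYYY-MM-DDThh:mm:ssZ string into a timezone-aware UTC datetime."""
    return datetime.strptime(timestamp, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


utc_time = parse_utc("2018-10-01T10:00:00Z")
print(utc_time.astimezone(BEIJING).isoformat())  # 2018-10-01T18:00:00+08:00
```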