Smart Subtitling

StreamLive

Introduction

Purchase Guide

Billing Overview

Overdue Payments and Service Suspension

Console Guide

Input Management

Channel Management

Overview

Creating a Channel

Step 1. Set the Basic Channel Information

Step 2. Add Inputs

Step 3. Configure Inputs

Step 4. Configure Output Groups

Modifying and Deleting a Channel

Exporting, Importing, Cloning a Channel

Monitoring Channel Quality

Watermark Management

Plan Management

Feature Guide

API Documentation

Making API Requests

Input Management APIs

ModifyStreamLiveInput

DescribeStreamLiveInputs

DescribeStreamLiveInput

DeleteStreamLiveInput

CreateStreamLiveInput

Input Security Group Management APIs

ModifyStreamLiveInputSecurityGroup

DescribeStreamLiveInputSecurityGroups

DescribeStreamLiveInputSecurityGroup

DeleteStreamLiveInputSecurityGroup

CreateStreamLiveInputSecurityGroup

Channel Management APIs

StopStreamLiveChannel

StartStreamLiveChannel

ModifyStreamLiveChannel

DescribeStreamLiveChannel

DeleteStreamLiveChannel

CreateStreamLiveChannel

DescribeStreamLiveChannels

Plan APIs

DescribeStreamLivePlans

CreateStreamLivePlan

DeleteStreamLivePlan

Watermark Management APIs

ModifyStreamLiveWatermark

DescribeStreamLiveWatermarks

DescribeStreamLiveWatermark

DeleteStreamLiveWatermark

CreateStreamLiveWatermark

Statistics APIs

DescribeStreamLiveChannelOutputStatistics

DescribeStreamLiveChannelInputStatistics

DescribeStreamLiveChannelAlerts

DescribeStreamLiveChannelLogs

DescribeStreamLiveRegions

DescribeStreamLiveTranscodeDetail

QueryInputStreamState

Data Types

Error Codes

StreamLive Policy

Data Processing and Security Agreement

Related Agreement

Service Level Agreement

DocumentationStreamLive Feature GuideCaptionSmart Subtitling

Smart Subtitling

Download PDF

Last updated: 2024-07-22 10:38:01

Smart Subtitling

Last updated: 2024-07-22 10:38:01

Download PDF

The Smart Subtitling function converts voice information into subtitles by real-time Automatic Speech Recognition (ASR) during live streaming, and subsequently translates them into the target language. Currently, this feature supports four languages: Chinese, English, Japanese, and Korean. In practical applications, please select the appropriate language according to your business needs and audience.
Points of Attention
Currently, the Smart Subtitling feature can only be used in joint transcoding.
The ability to generate the intelligent captions is provided by Media Processing Service (MPS) to StreamLive. When using it for the first time, you need to authorize MPS to access StreamLive's data to generate captions.
When using the intelligent caption feature in StreamLive, in addition to StreamLive's live transcoding fees, there will also be speech recognition fees from MPS. Translating across languages will incur speech translation fees from MPS. For specific billing information, please refer to the MPS Billing Document.
Prerequisites for Use
You have activated the StreamLive service.
Configuring Smart Subtitles
1. Log in to StreamLive Console, navigate to Channel Configuration and Configure Output Groups. For the Output you want to configure, click Setting to enter Transcoding Settings.
﻿
﻿
﻿
2. Select Joint Transcoding, and click Add Caption.
﻿
﻿
﻿
3. Scroll to the bottom of the page, and in the Caption module, choose the caption source as Analysis.
﻿
﻿
﻿
Note：
Since the feature is provided by MPS to StreamLive, role authorization is required to support MPS in obtaining StreamLive data and generating captions.
When in use, the system will verify if you have authorized it before. If you have, you can directly configure captions.
If you have not authorized before, the system will guide you through role authorization. Once you agree, you can continue configuring captions.
4. Configure caption
﻿
﻿
﻿
Configuration Item
Description
Format
Currently, only burn in is supported, which directly merges subtitles into the video.
Source language
It supports recognition of four source languages: Chinese, English, Japanese, and Korean.
Content type
Source: display only source language.
Source + Target: display source language and translation language.
Target: display only translation language.
Target Language
Currently, the source language can be translated into three target languages.
Dynamic/Steady State Effect
Default setting: Delayed Steady State Subtitles. The system will delay the live streaming according to the set time, but the experience of watching complete sentence is better. The default delay time is 10 seconds, with other options including 20 seconds, 30 seconds, and 60 seconds.
You can also choose: Real-time Dynamic Subtitles, which has a shorter delay, but the captions will dynamically correct the content word-by-word according to the spoken content.
Line Spacing
The vertical distance percentage represents the ratio of the vertical distance to the vertical width of the subtitles from the bottom of the screen. The vertical position of the subtitles is configured by adjusting the vertical distance percentage. Value range: 0 - 100%. 
Margins
The horizontal distance percentage represents the ratio of the horizontal distance to the horizontal width of the subtitles and the side of the screen. The horizontal position of the subtitles is configured by adjusting the horizontal distance percentage. Value range: 0 - 50%.
Lines
Options are 1 or 2, with 1 selected by default.
When lines exceed the displayed range, only the latest content will be displayed.
Characters Per Line
Value range: 1-100. Default value: 45. One Chinese character count as one word. One English character or number counts as half a word. The fewer the characters per line are, the larger the font size is.
Font
Options include Heiti, SimSun, Dynacw Diamond Black, and Helvetica.
The font selection will vary depending on the Source language and Target language.
Color
The font color is white by default. The color can be customized.
The background color is black by default. The customization of level of transparency is supported.
5. Preview
Turn on the preview switch, enter test text, and the preview effect will be displayed according to your previous configuration. Additionally, you can readjust the resolution of the preview screen.
﻿
﻿
﻿
6. Click Confirm to save your current Transcoding Configuration and Caption Configuration.
﻿

﻿

﻿

﻿



﻿

Was this page helpful?

You can also Contact Sales or Submit a Ticket for help.

Yes

Configuration Item	Description
Format	Currently, only burn in is supported, which directly merges subtitles into the video.
Source language	It supports recognition of four source languages: Chinese, English, Japanese, and Korean.
Content type	Source: display only source language. Source + Target: display source language and translation language. Target: display only translation language.
Target Language	Currently, the source language can be translated into three target languages.
Dynamic/Steady State Effect	Default setting: Delayed Steady State Subtitles. The system will delay the live streaming according to the set time, but the experience of watching complete sentence is better. The default delay time is 10 seconds, with other options including 20 seconds, 30 seconds, and 60 seconds. You can also choose: Real-time Dynamic Subtitles, which has a shorter delay, but the captions will dynamically correct the content word-by-word according to the spoken content.
Line Spacing	The vertical distance percentage represents the ratio of the vertical distance to the vertical width of the subtitles from the bottom of the screen. The vertical position of the subtitles is configured by adjusting the vertical distance percentage. Value range: 0 - 100%.
Margins	The horizontal distance percentage represents the ratio of the horizontal distance to the horizontal width of the subtitles and the side of the screen. The horizontal position of the subtitles is configured by adjusting the horizontal distance percentage. Value range: 0 - 50%.
Lines	Options are 1 or 2, with 1 selected by default. When lines exceed the displayed range, only the latest content will be displayed.
Characters Per Line	Value range: 1-100. Default value: 45. One Chinese character count as one word. One English character or number counts as half a word. The fewer the characters per line are, the larger the font size is.
Font	Options include Heiti, SimSun, Dynacw Diamond Black, and Helvetica. The font selection will vary depending on the Source language and Target language.
Color	The font color is white by default. The color can be customized. The background color is black by default. The customization of level of transparency is supported.

tencent cloud

New User Offers

Next-Generation CDN：EdgeOne

Elasticsearch Service Special Offers

Free Tier

Tencent Cloud Startup Program

Special Offers

Lighthouse Special Offers

Cloud Object Storage Special Offers

Featured Products

New Products

Education

Tencent Cloud Online Education Solutions

Gaming

Gaming Solution

Game Media Solutions

Financial Services

Financial Services Solution

Audio & Video

Audio/Video Solution

LVB Recording Solution

Interactive Classroom Solution

Interactive Live Streaming Solution

Audio Chat Social Networking Solution

Real Estate

Tencent Cloud LinkBase(Weiling)

E-commerce

E-commerce retail solutions

Compute

Cloud Virtual Machine

Auto Scaling

Batch Compute

CVM Dedicated Host

Database

TencentDB for MySQL

TencentDB for Redis®

TencentDB for CTSDB

TDSQL for MySQL

Data Transfer Service

TencentDB for MongoDB

TencentDB for PostgreSQL

TencentDB for SQL Server

TencentDB for TcaplusDB

Video Service

Cloud Streaming Services

Video on Demand

Media Processing Service

Cloud Application Rendering

Cloud Contact Center

Game Multimedia Engine

Chat

Real-time Communication

Tencent Effect SDK

AI and Machine Learning

Image Creation Large Model

Face Fusion

eKYC

Optical Character Recognition

Video Creation Large Model

Industry Applications

Tencent HealthCare Omics Platform

Container and Middleware

TDMQ for CKafka

Serverless Cloud Function

Tencent Kubernetes Engine

Tencent Kubernetes Engine for Serverless

Networking

Cloud Load Balancer

Virtual Private Cloud

Direct Connect

Cloud Connect Network

NAT Gateway

VPN Connection

Bandwidth Package

Anycast Internet Acceleration

Elastic Network Interface

Flow Logs

Global Application Acceleration Platform

Security

Captcha