Smart Subtitles

Last updated: 2024-11-04 10:46:28

    Overview

    With VOD AI capabilities, you can automatically generate multilingual subtitle files for video content. This is suitable for short dramas aimed at international markets, multilingual online education courses, and minutes of multinational audio and video conferences.
    The smart subtitling feature includes two capabilities: Speech Full Text Recognition and Voice Translation. The former uses advanced ASR algorithms trained on massive data sets and maintains high accuracy even with ambient noise and background sounds. The latter combines the strengths of Neural Machine Translation (NMT) and Statistical Machine Translation (SMT), learning from large multilingual corpora to translate accurately from the source language to the target languages. It currently supports dozens of languages, including Chinese, English, Japanese, Korean, French, Spanish, Italian, German, Turkish, Russian, Portuguese, Vietnamese, Indonesian, Thai, Malay, Arabic, and Hindi.

    Suitable Scenarios

    Voice Translation Scenarios

    Scenario | Description
    Short dramas going global | For short dramas targeting international audiences, smart subtitling efficiently outputs translated subtitles in multiple languages, speeding up the production of global versions.
    Movie/TV series | When movies and TV series are released in different countries or regions, local-language subtitles help them reach a wider audience. Smart subtitling can automatically generate subtitles in the language of the target country or region, which is much more efficient than manual subtitling.
    Multinational audio/video meetings | For key multinational meetings, subtitles in the participants' native languages can be generated automatically for the recordings, making it easier to organize meeting minutes and review the recordings.
    Multilingual recorded courses | When online education courses are released in different countries or regions, smart subtitling can automatically generate local-language subtitles, making the courses easier to understand and learn from.

    Speech Full Text Recognition Scenarios

    Scenario | Description
    Online classes | Smart subtitling can automatically generate subtitles for online course recordings to make it easier for viewers to learn the content of the course.
    Social media platforms | General users can add subtitles to their vlogs and share them with others.
    Movie/TV series | Old videos are often not subtitled, which makes it difficult for some viewers to enjoy them. The smart subtitling feature can automatically generate subtitles for such videos, which is much more efficient than manual subtitling.
    Audio/Video meetings | Subtitles can be automatically generated for recordings of important meetings to facilitate minutes generation. Subtitles also make it easier for viewers to navigate meeting recordings.
    Speech quality control | Subtitles can be generated for call recordings to help call centers quickly evaluate the quality of customer service, improving the efficiency of speech quality assurance.

    How to Use

    Method 1: Launching tasks via the console

    You can launch Speech Recognition tasks via the VOD console. Use templates to specify the recognition items as either "Speech Full Text Recognition" or "Voice Translation" for the tasks. Once the task is complete, refresh the subtitle list to view the recognized or translated subtitle files. For detailed instructions, please refer to How to Add Subtitles to Videos.

    Method 2: Launching tasks via API

    Launching Voice Translation Tasks

    Smart subtitles are generated through the voice translation capability of audio/video content recognition. The steps are as follows:
    1. Create an audio/video content recognition template. In the voice translation configuration (AsrTranslateConfigure), set the switch (Switch) to ON, set the media source language (SrcLanguage) and the translation target language (DstLanguage), specify the subtitle file formats (SubtitleFormats), and set the subtitle name (SubtitleName):
    {
        "AsrTranslateConfigure": {
            "Switch": "ON",
            "SrcLanguage": "zh",
            "DstLanguage": "en",
            "SubtitleFormats": ["vtt", "srt"],
            "SubtitleName": "test"
        }
    }
    Get the template ID from the response.
    2. With the template ID obtained in step 1, initiate a smart subtitling task as instructed in Task Initiation.
    3. Get the task result as instructed in Getting Result.
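
    The template body from step 1 can also be assembled in code before it is sent. The following is a minimal Python sketch that only builds and prints the JSON fragment. The helper name build_translate_template is hypothetical, and the sketch does not call any API; it assumes the field names and values are exactly those shown in step 1.

```python
import json


def build_translate_template(src_language, dst_language, subtitle_formats, subtitle_name):
    """Build the AsrTranslateConfigure fragment shown in step 1.

    Hypothetical helper for illustration only; it does not contact any service.
    """
    return {
        "AsrTranslateConfigure": {
            "Switch": "ON",                             # enable voice translation
            "SrcLanguage": src_language,                # language spoken in the media
            "DstLanguage": dst_language,                # language of the translated subtitles
            "SubtitleFormats": list(subtitle_formats),  # subtitle file formats to generate
            "SubtitleName": subtitle_name,              # name given to the subtitle
        }
    }


# Reproduce the example from step 1: Chinese source, English subtitles.
template = build_translate_template("zh", "en", ["vtt", "srt"], "test")
print(json.dumps(template, indent=4))
```

    Building the fragment from a function keeps the source and target languages as explicit parameters, which may help when the same media needs subtitles in several target languages.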

    Launching Speech Full Text Recognition Tasks

    You can generate subtitles through the speech full text recognition capability of audio/video content recognition as follows:
    1. Create an audio/video content recognition template. In the speech full text recognition configuration (AsrFullTextConfigure), set the switch (Switch) to ON, set the media source language (SrcLanguage), specify the subtitle file formats (SubtitleFormats), and set the subtitle name (SubtitleName):
    {
        "AsrFullTextConfigure": {
            "Switch": "ON",
            "SrcLanguage": "zh",
            "SubtitleFormats": ["vtt", "srt"],
            "SubtitleName": "test"
        }
    }
    Get the template ID from the response.
    2. With the template ID obtained in step 1, initiate a smart subtitling task as instructed in Initiating a Task.
    3. Get the task result as instructed in Getting the Result.