Adding captions and tracks to HTML video and audio elements.
Jul 02, 2025 pm 04:05 PMTo embed video or audio with subtitles and audio tracks into a web page, it can be achieved through HTML native functionality. 1. Use the <track> tag to add a WebVTT format subtitle file and set the kind, srclang and label attributes; 2. Support multilingual subtitles through multiple <track> elements, and use the default attribute to set the default language; 3. Multi-track can control multiple <audio> elements to switch through JavaScript, or use more complex media extension solutions; 4. Pay attention to browser compatibility, path configuration and format verification to ensure normal operation on different devices and provide backup solutions.
When embedding video or audio in a web page, adding subtitles and audio tracks can significantly improve accessibility and user experience. HTML provides native support, which can be achieved in just a few steps.

Add subtitles using the <track></track>
tag
The <track></track>
element of HTML allows you to add subtitles, chapters, descriptions and other text tracks to videos or audios. The most common use is to add subtitles.

- Subtitle files are usually in WebVTT (.vtt) format.
-
<track></track>
needs to be placed inside<video></video>
or<audio></audio>
. - Set
kind
attribute to specify track types, such assubtitles
,captions
,descriptions
, etc. -
srclang
is used to specify the language, and the browser selects the appropriate subtitles accordingly.
For example:
<video controls> <source src="movie.mp4" type="video/mp4"> <track src="en.vtt" kind="subtitles" srclang="en" label="English"> </video>
This way the user can choose to enable English subtitles in the player.

Supports multilingual subtitles
If your website is for international users, it is a good idea to provide subtitles in multiple languages. Just add multiple <track>
elements and set different srclang
and label
.
<video controls> <source src="movie.mp4" type="video/mp4"> <track src="en.vtt" kind="subtitles" srclang="en" label="English" default> <track src="zh.vtt" kind="subtitles" srclang="zh" label="Chinese"> </video>
A few points to note:
- The browser will automatically match the appropriate subtitles according to the user's system language.
- Adding the
default
attribute can enable a certain language by default. - Subtitle files in different languages ??need to accurately correspond to the content, otherwise it will cause confusion.
Add multiple tracks (such as comment tracks)
Although HTML native support for multi-tracks is not as direct as subtitles, it can be used to switch different audio tracks through JavaScript.
A common practice is to use multiple <audio>
elements to switch by controlling their playback status:
<audio id="mainAudio" src="music.mp3" controls></audio> <button onclick="switchTrack('music.mp3')">Main track</button> <button onclick="switchTrack('commentary.mp3')">Comment track</button> <script> function switchTrack(src) { const audio = document.getElementById('mainAudio'); audio.src = src; audio.play(); } </script>
Of course, if more complex track management is required, you can consider using Media Source Extensions or third-party libraries.
Notes and compatibility issues
Although HTML's multimedia functions are becoming more and more powerful, there are still some things to pay attention to in practical applications:
- Not all browsers fully support all features of WebVTT.
- Safari on iOS has limited support for certain
<track></track>
features. - The subtitle file path must be correct, and the server configuration must also allow cross-domain loading (if there is a cross-domain case).
- The multi-track switching experience may vary by device or browser.
To ensure the best results:
- Test your implementation on multiple browsers and devices.
- Provide fallback (alternative solution), such as displaying static subtitles when prompted not supported.
- Use the tool to verify that the WebVTT file format is correct.
Basically that's it. It is not complicated to implement, but it is easy to ignore details, especially file format and path issues.
The above is the detailed content of Adding captions and tracks to HTML video and audio elements.. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undress AI Tool
Undress images for free

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

HTMLhasevolvedsignificantlysinceitscreationtomeetthegrowingdemandsofwebdevelopersandusers.Initiallyasimplemarkuplanguageforsharingdocuments,ithasundergonemajorupdates,includingHTML2.0,whichintroducedforms;HTML3.x,whichaddedvisualenhancementsandlayout

It is a semantic tag used in HTML5 to define the bottom of the page or content block, usually including copyright information, contact information or navigation links; it can be placed at the bottom of the page or nested in, etc. tags as the end of the block; when using it, you should pay attention to avoid repeated abuse and irrelevant content.

ThetabindexattributecontrolshowelementsreceivefocusviatheTabkey,withthreemainvalues:tabindex="0"addsanelementtothenaturaltaborder,tabindex="-1"allowsprogrammaticfocusonly,andtabindex="n"(positivenumber)setsacustomtabbing

Adeclarationisaformalstatementthatsomethingistrue,official,orrequired,usedtoclearlydefineorannounceanintent,fact,orrule.Itplaysakeyroleinprogrammingbydefiningvariablesandfunctions,inlegalcontextsbyreportingfactsunderoath,andindailylifebymakingintenti

loading="lazy" is an HTML attribute for and which enables the browser's native lazy loading function to improve page performance. 1. It delays loading non-first-screen resources, reduces initial loading time, saves bandwidth and server requests; 2. It is suitable for large amounts of pictures or embedded content in long pages; 3. It is not suitable for first-screen images, small icons, or lazy loading using JavaScript; 4. It is necessary to cooperate with optimization measures such as setting sizes and compressing files to avoid layout offsets and ensure compatibility. When using it, you should test the scrolling experience and weigh the user experience.

The key to using elements to represent navigation link areas is semantics and clear structure, usually in conjunction with organizational links. 1. The basic structure is to put the parallel links in and wrap them inside, which is friendly to auxiliary tools and is conducive to style control and SEO; 2. Commonly used in or, for placing main navigation or footer link collections; 3. A page can contain multiple areas, such as main menu, sidebar or footer independent navigation.

When writing legal and neat HTML, you need to pay attention to clear structure, correct semantics and standardized format. 1. Use the correct document type declaration to ensure that the browser parses according to the HTML5 standard; 2. Keep the tag closed and reasonably nested to avoid forgetting closed or wrong nesting elements; 3. Use semantic tags such as, etc. to improve accessibility and SEO; 4. The attribute value is always wrapped in quotes, and single or double quotes are used uniformly. Boolean attributes only need to exist, and the class name should be meaningful and avoid redundant attributes.

The web page structure needs to be supported by core HTML elements. 1. The overall structure of the page is composed of , , which is the root element, which stores meta information and displays the content; 2. The content organization relies on title (-), paragraph () and block tags (such as ,) to improve organizational structure and SEO; 3. Navigation is implemented through and implemented, commonly used organizations are linked and supplemented with aria-current attribute to enhance accessibility; 4. Form interaction involves , , and , to ensure the complete user input and submission functions. Proper use of these elements can improve page clarity, maintenance and search engine optimization.
