Convert Text Files to UTF‑8

Upload one or more text or subtitle files. We convert them to UTF‑8 encoding.

Supported: any plain text file. Upload multiple files or a ZIP. Maximum 100 files.
Convert subtitles to SRT

Fix Garbled Subtitle Characters with UTF-8 Encoding

Seeing strange symbols, question marks, or boxes instead of proper text in your subtitles? This is a character encoding problem. Our UTF-8 converter fixes subtitle files that display garbled characters by converting them to the universal UTF-8 encoding standard.

UTF-8 is the modern encoding standard that supports all languages—from Chinese and Arabic to Russian and emoji. Converting your subtitles to UTF-8 ensures they display correctly on all modern media players, devices, and platforms.

Quick Start Guide

  • Upload garbled subtitle file - Supports SRT, ASS, SSA, SMI, VTT
  • Automatic conversion to UTF-8 - Detects original encoding and fixes it
  • Download fixed file - Characters now display correctly
  • Batch processing - Fix multiple files at once (max 100 files)

Common Encoding Problems We Fix

Garbled Chinese, Japanese, or Korean Text

Downloaded Asian language subtitles showing random symbols or boxes? The file is likely encoded in GB2312, GBK, Shift-JIS, or EUC-KR instead of UTF-8. These older encodings don't work properly on modern systems.

Example: æ±è¯­ or 中æ instead of proper Chinese characters.

Broken Cyrillic (Russian, Bulgarian, Serbian)

Russian or other Cyrillic subtitles appearing as ïðèâåò or привет? The file was created using Windows-1251 or ISO-8859-5 encoding. Converting to UTF-8 restores proper Cyrillic display.

Arabic or Hebrew Text Issues

Right-to-left languages like Arabic and Hebrew require proper UTF-8 encoding. Files in Windows-1256 (Arabic) or Windows-1255 (Hebrew) show corrupted text on modern devices.

Western European Special Characters

Seeing é instead of é, or ü instead of ü? Your subtitle file uses ISO-8859-1 or Windows-1252 encoding. French, German, Spanish, and other European language subtitles need UTF-8 conversion for proper character display.

Vietnamese, Thai, and Other Languages

If Vietnamese tonal marks, Thai characters, or other special language symbols appear broken, the encoding is incompatible. UTF-8 supports every modern language including emoji and special symbols.

Why Use UTF-8 Encoding?

Universal Language Support

UTF-8 is the universal encoding standard that supports over 140,000 characters from virtually every writing system: Latin, Cyrillic, Chinese, Japanese, Korean, Arabic, Hebrew, Thai, emoji, and more. One encoding for all languages.

Modern Device Compatibility

All modern devices, operating systems, and media players default to UTF-8. Files encoded in older standards (ISO-8859-1, Windows-1252, GB2312, etc.) cause display problems on smartphones, smart TVs, and modern computers.

Web and Streaming Ready

Uploading subtitles to YouTube, Netflix, or other streaming platforms? They require UTF-8 encoding. Converting ensures your subtitles work correctly when uploaded to web platforms.

Future-Proof

Legacy encodings are deprecated and will eventually stop working. UTF-8 is the industry standard and will remain compatible for decades. Converting now prevents future problems.

How Encoding Problems Happen

Old Subtitle Creation Tools

Older subtitle editing software (especially from the Windows XP era) defaulted to legacy encodings like Windows-1252 or ISO-8859-1. Subtitles created with these tools display incorrectly on modern systems.

Incorrect File Saving

When editing subtitle files in Notepad or basic text editors, if you "Save As" without specifying UTF-8, Windows may save in ANSI (Windows-1252) encoding, breaking special characters.

Downloaded from Foreign Websites

Subtitle sites in non-English countries often use their local encoding (Chinese sites use GB2312, Russian sites use Windows-1251, etc.). These files work fine in their original location but break when used elsewhere.

Format Conversion Errors

Some subtitle format converters don't preserve encoding properly. Converting from ASS to SRT or VTT to SRT can accidentally change the encoding if the converter isn't encoding-aware.

Testing Your Fixed Subtitles

VLC Media Player

Open VLC and load your converted subtitle file. Go to Subtitle → Sub Track → Track 1. If characters now display correctly, the encoding is fixed. If you still see boxes, you may need to change VLC's subtitle font (see our VLC font guide).

Windows Media Player

Drag your video file and the converted subtitle file (with matching names) into the same folder. Play the video in Windows Media Player—subtitles should load automatically with proper characters.

Text Editors

Open the converted file in Notepad++ or Visual Studio Code. Look at the encoding indicator in the status bar—it should say "UTF-8" or "UTF-8 with BOM". Special characters should display correctly in the editor.

Advanced Tips

UTF-8 with BOM vs Without BOM

UTF-8 can have a BOM (Byte Order Mark) or not. For subtitles, UTF-8 without BOM is preferred because some media players misinterpret the BOM as visible characters. Our tool outputs UTF-8 without BOM for maximum compatibility.

Fixing Manually in Notepad++

If you want to manually check encoding: Open the file in Notepad++, go to Encoding → Convert to UTF-8 (not "Encode in UTF-8"—that just marks it without converting). Save and test.

Batch Converting Entire Subtitle Collections

Have hundreds of subtitle files with encoding issues? Our tool supports batch processing—upload up to 100 files at once or use a ZIP archive. All files are converted to UTF-8 in one operation.

Troubleshooting

Characters Still Garbled After Conversion

If characters are still broken after UTF-8 conversion, the original file may be double-encoded or corrupted. Try these solutions:

  • Re-download from source: Get a fresh copy of the subtitle file
  • Try manual conversion: Open in Notepad++, select correct source encoding (e.g., Chinese Simplified → GB2312), then convert to UTF-8
  • Check font issues: Some symbols require specific fonts. See our VLC font troubleshooting guide

VLC Still Shows Boxes or Question Marks

If encoding is correct but VLC shows □ boxes, this is a font problem, not an encoding problem. VLC's default font doesn't include Chinese, Arabic, or other complex scripts. Solution: Change VLC's subtitle font to Arial Unicode MS or Noto Sans. See our detailed guide.

File Won't Upload or Convert

Our UTF-8 converter only works with text-based subtitle files (SRT, ASS, SSA, SMI, VTT). If you're trying to convert image-based formats like SUP, SUB/IDX, you need OCR conversion instead:

Real-World Example

Korean Drama Subtitle Crisis Averted

"Downloaded Korean drama subtitles from a fan site. On my Smart TV, all Korean characters showed as ïØº or ?? making the subtitles completely unreadable."

Solution: Uploaded the SRT file to Subconverter's UTF-8 tool. Downloaded the converted file in 3 seconds. Copied to my TV's USB drive. All Korean characters now display perfectly—watched the entire series without issues!

Related Tools

Frequently Asked Questions

Will converting to UTF-8 fix all my subtitle problems?

UTF-8 conversion fixes encoding issues (garbled characters, wrong symbols). It does NOT fix timing problems, formatting issues, or font display problems. For timing, use our Subtitle Shifter. For VLC font issues, see our font troubleshooting guide.

Can I convert files that are already in UTF-8?

Yes, it's safe to convert UTF-8 files through the tool. If a file is already UTF-8, the conversion will not harm it—you'll get the same file back. However, if it's already UTF-8 and characters are still broken, the problem is likely with fonts, not encoding.

Will this work for embedded subtitles in video files?

No. This tool only works with external subtitle files (.srt, .ass, .vtt, etc.). If your subtitles are embedded in an MKV or MP4 video file, you need to extract them first using tools like MKVToolNix or FFmpeg, then convert the extracted file.

Why do some characters show as boxes even after conversion?

If you still see □ boxes after UTF-8 conversion, it's a font issue, not an encoding issue. Your media player's font doesn't include those characters. In VLC, change the subtitle font to "Arial Unicode MS" or download Noto Sans CJK for Chinese/Japanese/Korean support.

Can I use this for languages like Chinese, Arabic, or Thai?

Absolutely! UTF-8 supports every language including Chinese (Simplified and Traditional), Arabic, Hebrew, Thai, Vietnamese, Hindi, and over 100 other scripts. This tool is specifically designed to fix multi-language encoding problems.