UTF-8, or 8-bit Unicode Transformation Format, was developed as an extension of ASCII, or American Standard Code for Information Interchange, with the view to expand its scope from Western European languages to a broader range of world languages with different character sets. ASCII is a 7-bit system, but is now generally expressed in 8-bit code, or octets. UTF-8 characters are referenced by between one and four octets. Thankfully, this means ASCII's single octet is compatible with UTF-8, and so conversion from ASCII to UTF-8 is seamless. However, it is possible for UTF-8 text to use characters not included in ASCII's 127 character set, so conversion from UTF-8 to ASCII can result in a loss of information.
- Skill level:
Other People Are Reading
Go to the character set converter tool on Bisqwit's language conversion website (http://kanjidict.stc.cx/recode.php) in a web browser. This online tool allows you to paste the text you want to convert from UTF-8 to ASCII into a box.
Select the UTF-8 text you want to convert, from your text editor or whatever program you are using to view the UTF-8 text.
Copy the selected UTF-8 text using the Edit > Copy command, or CTRL+C on your keyboard if using Windows.
Paste the UTF-8 text into the box on the online character set converter page.
Select "UTF-8" from the "Input encoding" drop-down menu, and select "ASCII" from the "Output encoding" drop-down menu.
Click the "Convert" button. The results page will be blank except for the ASCII output. Usually you will see at least some text displayed here, but if all the characters in your UTF-8 input were non-ASCII characters, the page will be blank. From this page you can return to the converter tool by clicking your browser's "back" button.
- 20 of the funniest online reviews ever
- 14 Biggest lies people tell in online dating sites
- Hilarious things Google thinks you're trying to search for