Difference between revisions of "UTF-8"

Revision as of 17:07, 12 May 2021

UTF-8 refers to Unicode Transformation Format 8-bit, which is a variable-width encoding that can represent every character in the Unicode character set that was designed for backward compatibility with ASCII.

Overview

UTF-8 encodes each Unicode character as a variable number of 1 to 4 octets. The number of octets depends on the integer value assigned to the character. UTF-8 is the default encoding for XML and has been the dominant character encoding on the web since 2010.^[1]

W3C has offered several reasons for the popularity of UTF-8:

An HTML page can only be in one encoding, and UTF-8 can support many languages and accommodate many pages and forms.
Barriers to using Unicode are very low; by January 2012, Google reported that over 60% of the Web in their sample used UTF-8.
ASCII is a subset of UTF-8; all ASCII characters in UTF-8 use the same bytes as an ASCII encoding, helping with interoperability.
The HTML5 specification says "Authoring tools should default to using UTF-8 for newly-created documents."^[2]

References

[1] About utf-8

[2] Why choose UTF-8, W3C

[1]

[2]

Revision as of 17:07, 12 May 2021 (view source) Jessica (talk \| contribs) (Created page with "'''UTF-8''' refers to Unicode Transformation Format 8-bit is a variable-width encoding that can represent every character in the Unicode character set that was designed for ba...")		Revision as of 17:07, 12 May 2021 (view source) Jessica (talk \| contribs) Newer edit →
Line 1:		Line 1:
−	'''UTF-8''' refers to Unicode Transformation Format 8-bit is a variable-width encoding that can represent every character in the Unicode character set that was designed for backward compatibility with [[ASCII]].	+	'''UTF-8''' refers to Unicode Transformation Format 8-bit, which is a variable-width encoding that can represent every character in the Unicode character set that was designed for backward compatibility with [[ASCII]].

	==Overview==		==Overview==

Difference between revisions of "UTF-8"

Revision as of 17:07, 12 May 2021

Overview

References

Navigation menu

Search