Home > Terms > English, UK (UE) > UTF-8 encoding scheme

UTF-8 encoding scheme

The Unicode encoding scheme that serialises a UTF-8 code unit sequence in exactly the same order as the code unit sequence itself.

  • In the UTF-8 encoding scheme, the UTF-8 code unit sequence <4D D0 B0 E4 BA 8C F0 90 8C 82> is serialised as <4D D0 B0 E4 BA 8C F0 90 8C 82>.
  • Because the UTF-8 encoding form already deals in ordered byte sequences, the UTF-8 encoding scheme is trivial. The byte ordering is already obvious and completely defined by the UTF-8 code unit sequence itself. The UTF-8 encoding scheme is defined merely for completeness of the Unicode character encoding model.
  • While there is obviously no need for a byte order signature when using UTF-8, there are occasions when processes convert UTF-16 or UTF-32 data containing a byte order mark into UTF-8. When represented in UTF-8, the byte order mark turns into the byte sequence . Its usage at the beginning of a UTF-8 data stream is neither required nor recommended by the Unicode Standard, but its presence does not affect conformance to the UTF-8 encoding scheme. Identification of the byte sequence at the beginning of a data stream can, however, be taken as a near-certain indication that the data stream is using the UTF-8 encoding scheme.
This is auto-generated content. You can help to improve it.
0
Collect to Blossary

Member comments

You have to log in to post to discussions.

Terms in the News

Featured Terms

Harry8L
  • 0

    Terms

  • 0

    Blossaries

  • 1

    Followers

Industry/Domain: People Category: Singers

Tata Young

Amita Marie Young (born Su Min Ta Marie Young) better known under her stage name Tata Young, is a Thai singer, model actress and dancer born on ...

Contributor

Featured blossaries

Theater Arts

Category: Entertainment   1 20 Terms

Basketball

Category: Sports   1 20 Terms