Home > Terms > English, UK (UE) > Extended grapheme cluster (EGC)

Extended grapheme cluster (EGC)

The text between extended grapheme cluster boundaries as specified by Unicode Standard Annex #29, "Unicode Text Segmentation."

  • Extended grapheme clusters are defined in a parallel manner to legacy grapheme clusters, but also include sequences of spacing marks.
  • Grapheme clusters and extended grapheme clusters may not have any particular linguistic significance, but are used to break up a string of text into units for processing.
  • Grapheme clusters and extended grapheme clusters may be adjusted for particular processing requirements, by tailoring the rules for grapheme cluster segmentation.
  • The associated base character is the base character in the combining character sequence that a combining mark is part of.
  • A combining mark in a defective combining character sequence has no associated base character and thus cannot be said to depend on any particular base character. This is one of the reasons why fallback processing is required for defective combining character sequences.
  • Dependence concerns all combining marks, including spacing marks and combining marks that have no visible display.
This is auto-generated content. You can help to improve it.
0
Collect to Blossary

Member comments

You have to log in to post to discussions.

Terms in the News

Featured Terms

Sysop02
  • 0

    Terms

  • 0

    Blossaries

  • 1

    Followers

Industry/Domain: Construction Category: Windows

mortise and tenon

A strong joint wood made by the lace of a mortise in a table and one matching outgoing member (Tenon) on the other.

Contributor

Featured blossaries

Highest Paid Soccer Player

Category: Sports   1 11 Terms

BPMN

Category: Business   1 10 Terms