Theword joiner(WJ) is aUnicodeformatcharacterwhich is used to indicate thatline breakingshould not occur at its position.[1]It does not affect the formation ofligaturesorcursivejoining and is ignored for the purpose of text segmentation.[1]It is encoded since Unicode version 3.2 (released in 2002) asU+2060WORD JOINER(⁠).
The word joiner replaces thezero-width no-break space(ZWNBSP,U+FEFF), as a usage of the no-break space of zero width. TheZWNBSPis originally and currently used as thebyte order mark(BOM) at the start of a file. However, if encountered elsewhere, it should, according to Unicode, be treated as a word joiner, ano-break spaceof zero width.
The deliberate use of U+FEFF for this purpose is deprecated as of Unicode 3.2, with theword joinerstrongly preferred.[1][2]
See also
edit- Byte order mark,which usesU+FEFFZERO WIDTH NO-BREAK SPACE(ZWNBSP) character
- Zero-width space
- Zero-width joiner,which in scripts such as Arabic or Indic causes two characters to be shown in a connected form, even if they would otherwise not.
References
edit- ^abc"Layout Controls"(PDF).The Unicode Standard, Version 12.0.0.The Unicode Consortium. p. 871.
- ^FAQ - UTF-8, UTF-16, UTF-32 & BOM,”What should I do with U+FEFF in the middle of a file? “.