mlterm
Tested software version 3.9.3 on Linux. Full results are available in the ucs-detect repository at data/linux-mlterm-3.9.3.yaml.
Wide character support
The best wide Unicode table version for mlterm appears to be 15.0.0. This is based on a summary of the following results:
version | n_errors | n_total | pct_success |
---|---|---|---|
‘5.1.0’ | 0 | 26 | 100.0% |
‘5.2.0’ | 78 | 269 | 71.0% |
‘6.0.0’ | 0 | 13 | 100.0% |
‘9.0.0’ | 0 | 5000 | 100.0% |
‘10.0.0’ | 73 | 735 | 90.1% |
‘11.0.0’ | 6 | 62 | 90.3% |
‘12.0.0’ | 6 | 62 | 90.3% |
‘12.1.0’ | 0 | 1 | 100.0% |
‘13.0.0’ | 55 | 541 | 89.8% |
‘14.0.0’ | 4 | 41 | 90.2% |
‘15.0.0’ | 1 | 15 | 93.3% |
‘15.1.0’ | 5 | 5 | 0.0% |
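The pct_success column appears to be the share of tested codepoints that produced no alignment error, rounded to one decimal place. A minimal sketch of that arithmetic, assuming this formula; the helper name `pct_success` is hypothetical and is not taken from ucs-detect:

```python
def pct_success(n_errors: int, n_total: int) -> float:
    """Percentage of tested codepoints without an error, to one decimal place."""
    if n_total == 0:
        return 0.0  # assumption: report an empty test set as 0% instead of failing
    return round(100.0 * (n_total - n_errors) / n_total, 1)


# (version, n_errors, n_total) rows copied from the table above
rows = [("5.2.0", 78, 269), ("10.0.0", 73, 735), ("15.0.0", 1, 15), ("15.1.0", 5, 5)]
for version, n_errors, n_total in rows:
    print(f"{version}: {pct_success(n_errors, n_total)}%")
```

Under this assumption the four sample rows come out as 71.0, 90.1, 93.3 and 0.0, which agrees with the table.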
Sequence of a WIDE character from Unicode Version 15.0.0, from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\U0001fabc’ | So | 2 | JELLYFISH |
Total codepoints: 1
Shell test using printf(1), '|' should align in output:

    $ printf "\xf0\x9f\xaa\xbc|\\n12|\\n"
    🪼|
    12|
python wcwidth.wcswidth() measures width 2, while mlterm measures width 0.
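The printf(1) test consists of the UTF-8 bytes of the sequence written as `\x` escapes, followed by a ruler of digits as long as the expected width, so that the two `|` characters line up when the terminal agrees with that width. A small sketch of how such a command line could be generated; the function name `printf_test` is hypothetical and this is not ucs-detect's own code:

```python
def printf_test(sequence: str, expected_width: int) -> str:
    """Build a printf(1) command line in the style used on this page."""
    # Each UTF-8 byte becomes a \xNN escape that printf(1) expands.
    escaped = "".join(f"\\x{byte:02x}" for byte in sequence.encode("utf-8"))
    # Ruler "1234567890123..." with one digit per expected terminal cell.
    ruler = "".join(str(i % 10) for i in range(1, expected_width + 1))
    # The doubled backslash before n mirrors the commands shown on this page.
    return f'printf "{escaped}|\\\\n{ruler}|\\\\n"'


print(printf_test("\U0001fabc", 2))
```

For the JELLYFISH codepoint with an expected width of 2, this yields the same command as the shell test above.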
Emoji ZWJ support
The best Emoji ZWJ table version for mlterm appears to be None. This is based on a summary of the following results:
version | n_errors | n_total | pct_success |
---|---|---|---|
‘2.0’ | 22 | 22 | 0.0% |
‘4.0’ | 500 | 500 | 0.0% |
‘5.0’ | 100 | 100 | 0.0% |
‘11.0’ | 73 | 73 | 0.0% |
‘12.0’ | 112 | 112 | 0.0% |
‘12.1’ | 165 | 165 | 0.0% |
‘13.0’ | 51 | 51 | 0.0% |
‘13.1’ | 83 | 83 | 0.0% |
‘14.0’ | 20 | 20 | 0.0% |
‘15.0’ | 1 | 1 | 0.0% |
‘15.1’ | 109 | 109 | 0.0% |
Sequence of an Emoji ZWJ Sequence from Emoji Version 15.1, from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\U0001f9d1’ | So | 2 | ADULT |
‘\u200d’ | Cf | 0 | ZERO WIDTH JOINER |
‘\U0001f9bc’ | So | 2 | MOTORIZED WHEELCHAIR |
‘\u200d’ | Cf | 0 | ZERO WIDTH JOINER |
‘\u27a1’ | So | 1 | BLACK RIGHTWARDS ARROW |
‘\ufe0f’ | Mn | 0 | VARIATION SELECTOR-16 |
Total codepoints: 6
Shell test using printf(1), '|' should align in output:

    $ printf "\xf0\x9f\xa7\x91\xe2\x80\x8d\xf0\x9f\xa6\xbc\xe2\x80\x8d\xe2\x9e\xa1\xef\xb8\x8f|\\n12|\\n"
    🧑🦼➡️|
    12|
python wcwidth.wcswidth() measures width 2, while mlterm measures width 7.
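The Category and Name columns in the codepoint tables on this page match what Python's standard `unicodedata` module reports for each character. A brief sketch using only the standard library; the helper `describe` is hypothetical, and the names available for very recent codepoints depend on the Unicode version bundled with the Python interpreter:

```python
import unicodedata


def describe(sequence):
    """Return (codepoint, general category, name) for each character."""
    return [
        (
            f"U+{ord(ch):04X}",
            unicodedata.category(ch),
            # Fall back to a placeholder if this Python build lacks the name.
            unicodedata.name(ch, "<unknown>"),
        )
        for ch in sequence
    ]


# The Emoji ZWJ sequence from the table above.
for row in describe("\U0001f9d1\u200d\U0001f9bc\u200d\u27a1\ufe0f"):
    print(" | ".join(row))
```

The wcwidth column is not part of `unicodedata`; it comes from the separate wcwidth package named in the measurements above.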
Variation Selector-16 support
Emoji VS-16 results for mlterm are 100 errors out of 100 total codepoints tested, 0.0% success.
Sequence of a NARROW Emoji made WIDE by Variation Selector-16, from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\U0001f325’ | So | 1 | WHITE SUN BEHIND CLOUD |
‘\ufe0f’ | Mn | 0 | VARIATION SELECTOR-16 |
Total codepoints: 2
Shell test using printf(1), '|' should align in output:

    $ printf "\xf0\x9f\x8c\xa5\xef\xb8\x8f|\\n12|\\n"
    🌥️|
    12|
python wcwidth.wcswidth() measures width 2, while mlterm measures width 1.
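The two codepoints in this sequence have individual widths of 1 and 0, yet the sequence as a whole is expected to occupy 2 cells: Variation Selector-16 requests emoji presentation, which widens the preceding narrow character. The sketch below illustrates that rule with the standard library only. It is a rough approximation and not the wcwidth library's algorithm: it widens any narrow character followed by VS-16 instead of consulting the emoji variation sequence data, and it does not handle ZWJ sequences or control characters. The name `approx_width` is hypothetical.

```python
import unicodedata

VS16 = "\ufe0f"  # VARIATION SELECTOR-16


def approx_width(text: str) -> int:
    """Approximate the number of terminal cells occupied by text."""
    total = 0
    prev = 0  # width of the most recent non-zero-width character
    for ch in text:
        if ch == VS16:
            # Emoji presentation: widen a preceding narrow character to 2 cells.
            if prev == 1:
                total += 1
                prev = 2
            continue
        if unicodedata.category(ch) in ("Mn", "Me", "Mc", "Cf"):
            continue  # combining marks and format characters take no cell here
        prev = 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
        total += prev
    return total


print(approx_width("\U0001f325"))        # base character alone
print(approx_width("\U0001f325\ufe0f"))  # base character followed by VS-16
```

The base character alone counts as 1 cell and the pair as 2, the expected value reported above; the measured width of 1 is what would result if the terminal ignored the selector.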
Language Support
The following 82 languages were tested with 100% success:
Adyghe; Aja; Amarakaeri; Arabic, Standard; Assyrian Neo-Aramaic; Baatonum; Bamun; Bhojpuri; Bora; Burmese; Chakma; Cherokee (cased); Chickasaw; Chinantec, Chiltepec; Dagaare, Southern; Dangme; Dendi; Dinka, Northeastern; Ditammari; Dzongkha; Evenki; Fon; Fur; Ga; Gen; Gilyak; Gujarati; Gumuz; Hindi; Idoma; Kabardian; Khmer, Central; Khün; Lamnso’; Lao; Lingala (tones); Magahi; Maithili; Maldivian; Mazahua Central; Mixtec, Metlatónoc; Mon; Mòoré; Nanai; Navajo; Nuosu; Orok; Otomi, Mezquital; Panjabi, Eastern; Pashto, Northern; Picard; Pular (Adlam); Sanskrit; Sanskrit (Grantha); Secoya; Seraiki; Serer-Sine; Shan; Siona; South Azerbaijani; Tagalog (Tagalog); Tai Dam; Tamang, Eastern; Tamazight, Central Atlas; Tamazight, Central Atlas (Tifinagh); Tamazight, Standard Morocan; Tamil; Tamil (Sri Lanka); Telugu; Tem; Thai; Thai (2); Ticuna; Uduk; Vai; Veps; Vietnamese; Vietnamese (Han nom); Waama; Yoruba; Yukaghir, Northern; Éwé.
The following 16 languages are not fully supported:
lang | n_errors | n_total | pct_success |
---|---|---|---|
Malayalam | 357 | 1630 | 78.1% |
Javanese (Javanese) | 242 | 1453 | 83.3% |
Mongolian, Halh (Mongolian) | 3 | 33 | 90.9% |
Sinhala | 107 | 1655 | 93.5% |
Bengali | 80 | 1413 | 94.3% |
Farsi, Western | 39 | 1822 | 97.9% |
Dari | 36 | 1872 | 98.1% |
Tibetan, Central | 2 | 260 | 99.2% |
Marathi | 9 | 1614 | 99.4% |
Yaneshaʼ | 6 | 2536 | 99.8% |
Nepali | 3 | 1385 | 99.8% |
Kannada | 1 | 1080 | 99.9% |
Panjabi, Western | 2 | 2419 | 99.9% |
Yiddish, Eastern | 1 | 1775 | 99.9% |
Urdu | 1 | 2237 | 100.0% |
Urdu (2) | 1 | 2251 | 100.0% |
Malayalam
Sequence of language Malayalam from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0d38’ | Lo | 1 | MALAYALAM LETTER SA |
‘\u0d4d’ | Mn | 0 | MALAYALAM SIGN VIRAMA |
‘\u0d25’ | Lo | 1 | MALAYALAM LETTER THA |
‘\u0d3e’ | Mc | 0 | MALAYALAM VOWEL SIGN AA |
‘\u0d2a’ | Lo | 1 | MALAYALAM LETTER PA |
‘\u0d28’ | Lo | 1 | MALAYALAM LETTER NA |
‘\u0d2e’ | Lo | 1 | MALAYALAM LETTER MA |
‘\u0d3e’ | Mc | 0 | MALAYALAM VOWEL SIGN AA |
‘\u0d23’ | Lo | 1 | MALAYALAM LETTER NNA |
‘\u0d4d’ | Mn | 0 | MALAYALAM SIGN VIRAMA |
‘\u200c’ | Cf | 0 | ZERO WIDTH NON-JOINER |
Total codepoints: 11
Shell test using printf(1), '|' should align in output:

    $ printf "\xe0\xb4\xb8\xe0\xb5\x8d\xe0\xb4\xa5\xe0\xb4\xbe\xe0\xb4\xaa\xe0\xb4\xa8\xe0\xb4\xae\xe0\xb4\xbe\xe0\xb4\xa3\xe0\xb5\x8d\xe2\x80\x8c|\\n123456|\\n"
    സ്ഥാപനമാണ്|
    123456|
python wcwidth.wcswidth() measures width 6, while mlterm measures width 7.
Javanese (Javanese)
Sequence of language Javanese (Javanese) from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\ua9a5’ | Lo | 1 | JAVANESE LETTER PA |
‘\ua9b1’ | Lo | 1 | JAVANESE LETTER SA |
‘\ua9ab’ | Lo | 1 | JAVANESE LETTER RA |
‘\ua9ba’ | Mc | 0 | JAVANESE VOWEL SIGN TALING |
‘\ua98f’ | Lo | 1 | JAVANESE LETTER KA |
‘\ua9a0’ | Lo | 1 | JAVANESE LETTER TA |
Total codepoints: 6
Shell test using printf(1), '|' should align in output:

    $ printf "\xea\xa6\xa5\xea\xa6\xb1\xea\xa6\xab\xea\xa6\xba\xea\xa6\x8f\xea\xa6\xa0|\\n12345|\\n"
    ꦥꦱꦫꦺꦏꦠ|
    12345|
python wcwidth.wcswidth() measures width 5, while mlterm measures width 6.
Mongolian, Halh (Mongolian)
Sequence of language Mongolian, Halh (Mongolian) from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u1828’ | Lo | 1 | MONGOLIAN LETTER NA |
‘\u1821’ | Lo | 1 | MONGOLIAN LETTER E |
‘\u1837’ | Lo | 1 | MONGOLIAN LETTER RA |
‘\u180e’ | Cf | 0 | MONGOLIAN VOWEL SEPARATOR |
‘\u1821’ | Lo | 1 | MONGOLIAN LETTER E |
Total codepoints: 5
Shell test using printf(1), '|' should align in output:

    $ printf "\xe1\xa0\xa8\xe1\xa0\xa1\xe1\xa0\xb7\xe1\xa0\x8e\xe1\xa0\xa1|\\n1234|\\n"
    ᠨᠡᠷᠡ|
    1234|
python wcwidth.wcswidth() measures width 4, while mlterm measures width 5.
Sinhala
Sequence of language Sinhala from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0db4’ | Lo | 1 | SINHALA LETTER ALPAPRAANA PAYANNA |
‘\u0dca’ | Mn | 0 | SINHALA SIGN AL-LAKUNA |
‘\u200d’ | Cf | 0 | ZERO WIDTH JOINER |
‘\u0dbb’ | Lo | 1 | SINHALA LETTER RAYANNA |
‘\u0d9a’ | Lo | 1 | SINHALA LETTER ALPAPRAANA KAYANNA |
‘\u0dcf’ | Mc | 0 | SINHALA VOWEL SIGN AELA-PILLA |
‘\u0dc1’ | Lo | 1 | SINHALA LETTER TAALUJA SAYANNA |
‘\u0db1’ | Lo | 1 | SINHALA LETTER DANTAJA NAYANNA |
‘\u0dba’ | Lo | 1 | SINHALA LETTER YAYANNA |
Total codepoints: 9
Shell test using printf(1), '|' should align in output:

    $ printf "\xe0\xb6\xb4\xe0\xb7\x8a\xe2\x80\x8d\xe0\xb6\xbb\xe0\xb6\x9a\xe0\xb7\x8f\xe0\xb7\x81\xe0\xb6\xb1\xe0\xb6\xba|\\n12345|\\n"
    ප්රකාශනය|
    12345|
python wcwidth.wcswidth() measures width 5, while mlterm measures width 7.
Bengali
Sequence of language Bengali from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u09b8’ | Lo | 1 | BENGALI LETTER SA |
‘\u09cd’ | Mn | 0 | BENGALI SIGN VIRAMA |
‘\u09ac’ | Lo | 1 | BENGALI LETTER BA |
‘\u09c0’ | Mc | 0 | BENGALI VOWEL SIGN II |
‘\u0995’ | Lo | 1 | BENGALI LETTER KA |
‘\u09c3’ | Mn | 0 | BENGALI VOWEL SIGN VOCALIC R |
‘\u09a4’ | Lo | 1 | BENGALI LETTER TA |
‘\u09bf’ | Mc | 0 | BENGALI VOWEL SIGN I |
‘\u200c’ | Cf | 0 | ZERO WIDTH NON-JOINER |
‘\u0987’ | Lo | 1 | BENGALI LETTER I |
Total codepoints: 10
Shell test using printf(1), '|' should align in output:

    $ printf "\xe0\xa6\xb8\xe0\xa7\x8d\xe0\xa6\xac\xe0\xa7\x80\xe0\xa6\x95\xe0\xa7\x83\xe0\xa6\xa4\xe0\xa6\xbf\xe2\x80\x8c\xe0\xa6\x87|\\n12345|\\n"
    স্বীকৃতিই|
    12345|
python wcwidth.wcswidth() measures width 5, while mlterm measures width 6.
Farsi, Western
Sequence of language Farsi, Western from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0648’ | Lo | 1 | ARABIC LETTER WAW |
‘\u062d’ | Lo | 1 | ARABIC LETTER HAH |
‘\u0634’ | Lo | 1 | ARABIC LETTER SHEEN |
‘\u06cc’ | Lo | 1 | ARABIC LETTER FARSI YEH |
‘\u0627’ | Lo | 1 | ARABIC LETTER ALEF |
‘\u0646’ | Lo | 1 | ARABIC LETTER NOON |
‘\u0647’ | Lo | 1 | ARABIC LETTER HEH |
‘\u200c’ | Cf | 0 | ZERO WIDTH NON-JOINER |
‘\u0627’ | Lo | 1 | ARABIC LETTER ALEF |
‘\u06cc’ | Lo | 1 | ARABIC LETTER FARSI YEH |
Total codepoints: 10
Shell test using printf(1), '|' should align in output:

    $ printf "\xd9\x88\xd8\xad\xd8\xb4\xdb\x8c\xd8\xa7\xd9\x86\xd9\x87\xe2\x80\x8c\xd8\xa7\xdb\x8c|\\n123456789|\\n"
    وحشیانهای|
    123456789|
python wcwidth.wcswidth() measures width 9, while mlterm measures width 10.
Dari
Sequence of language Dari from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0648’ | Lo | 1 | ARABIC LETTER WAW |
‘\u062d’ | Lo | 1 | ARABIC LETTER HAH |
‘\u0634’ | Lo | 1 | ARABIC LETTER SHEEN |
‘\u06cc’ | Lo | 1 | ARABIC LETTER FARSI YEH |
‘\u0627’ | Lo | 1 | ARABIC LETTER ALEF |
‘\u0646’ | Lo | 1 | ARABIC LETTER NOON |
‘\u0647’ | Lo | 1 | ARABIC LETTER HEH |
‘\u200c’ | Cf | 0 | ZERO WIDTH NON-JOINER |
‘\u06cc’ | Lo | 1 | ARABIC LETTER FARSI YEH |
‘\u06cc’ | Lo | 1 | ARABIC LETTER FARSI YEH |
Total codepoints: 10
Shell test using printf(1), '|' should align in output:

    $ printf "\xd9\x88\xd8\xad\xd8\xb4\xdb\x8c\xd8\xa7\xd9\x86\xd9\x87\xe2\x80\x8c\xdb\x8c\xdb\x8c|\\n123456789|\\n"
    وحشیانهیی|
    123456789|
python wcwidth.wcswidth() measures width 9, while mlterm measures width 10.
Tibetan, Central
Sequence of language Tibetan, Central from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0f7c’ | Mn | 0 | TIBETAN VOWEL SIGN O |
‘\u0f66’ | Lo | 1 | TIBETAN LETTER SA |
‘\u0f0b’ | Po | 1 | TIBETAN MARK INTERSYLLABIC TSHEG |
‘\u0f54’ | Lo | 1 | TIBETAN LETTER PA |
‘\u0f60’ | Lo | 1 | TIBETAN LETTER -A |
‘\u0f72’ | Mn | 0 | TIBETAN VOWEL SIGN I |
‘\u0f0b’ | Po | 1 | TIBETAN MARK INTERSYLLABIC TSHEG |
‘\u0f50’ | Lo | 1 | TIBETAN LETTER THA |
‘\u0f7c’ | Mn | 0 | TIBETAN VOWEL SIGN O |
‘\u0f56’ | Lo | 1 | TIBETAN LETTER BA |
‘\u0f0b’ | Po | 1 | TIBETAN MARK INTERSYLLABIC TSHEG |
‘\u0f51’ | Lo | 1 | TIBETAN LETTER DA |
‘\u0f56’ | Lo | 1 | TIBETAN LETTER BA |
‘\u0f44’ | Lo | 1 | TIBETAN LETTER NGA |
‘\u0f0b’ | Po | 1 | TIBETAN MARK INTERSYLLABIC TSHEG |
‘\u0f61’ | Lo | 1 | TIBETAN LETTER YA |
‘\u0f7c’ | Mn | 0 | TIBETAN VOWEL SIGN O |
‘\u0f51’ | Lo | 1 | TIBETAN LETTER DA |
‘\u0f0d’ | Po | 1 | TIBETAN MARK SHAD |
Total codepoints: 19
Shell test using printf(1), '|' should align in output:

    $ printf "\xe0\xbd\xbc\xe0\xbd\xa6\xe0\xbc\x8b\xe0\xbd\x94\xe0\xbd\xa0\xe0\xbd\xb2\xe0\xbc\x8b\xe0\xbd\x90\xe0\xbd\xbc\xe0\xbd\x96\xe0\xbc\x8b\xe0\xbd\x91\xe0\xbd\x96\xe0\xbd\x84\xe0\xbc\x8b\xe0\xbd\xa1\xe0\xbd\xbc\xe0\xbd\x91\xe0\xbc\x8d|\\n123456789012345|\\n"
    ོས་པའི་ཐོབ་དབང་ཡོད།|
    123456789012345|
python wcwidth.wcswidth() measures width 15, while mlterm measures width 16.
Marathi
Sequence of language Marathi from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u091c’ | Lo | 1 | DEVANAGARI LETTER JA |
‘\u094d’ | Mn | 0 | DEVANAGARI SIGN VIRAMA |
‘\u092f’ | Lo | 1 | DEVANAGARI LETTER YA |
‘\u093e’ | Mc | 0 | DEVANAGARI VOWEL SIGN AA |
‘\u200c’ | Cf | 0 | ZERO WIDTH NON-JOINER |
‘\u0905’ | Lo | 1 | DEVANAGARI LETTER A |
‘\u0930’ | Lo | 1 | DEVANAGARI LETTER RA |
‘\u094d’ | Mn | 0 | DEVANAGARI SIGN VIRAMA |
‘\u0925’ | Lo | 1 | DEVANAGARI LETTER THA |
‘\u0940’ | Mc | 0 | DEVANAGARI VOWEL SIGN II |
Total codepoints: 10
Shell test using printf(1), '|' should align in output:

    $ printf "\xe0\xa4\x9c\xe0\xa5\x8d\xe0\xa4\xaf\xe0\xa4\xbe\xe2\x80\x8c\xe0\xa4\x85\xe0\xa4\xb0\xe0\xa5\x8d\xe0\xa4\xa5\xe0\xa5\x80|\\n12345|\\n"
    ज्याअर्थी|
    12345|
python wcwidth.wcswidth() measures width 5, while mlterm measures width 6.
Yaneshaʼ
Sequence of language Yaneshaʼ from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0303’ | Mn | 0 | COMBINING TILDE |
‘a’ | Ll | 1 | LATIN SMALL LETTER A |
‘n’ | Ll | 1 | LATIN SMALL LETTER N |
‘a’ | Ll | 1 | LATIN SMALL LETTER A |
‘r’ | Ll | 1 | LATIN SMALL LETTER R |
‘e’ | Ll | 1 | LATIN SMALL LETTER E |
‘t’ | Ll | 1 | LATIN SMALL LETTER T |
Total codepoints: 7
Shell test using printf(1), '|' should align in output:

    $ printf "\xcc\x83anaret|\\n123456|\\n"
    ̃anaret|
    123456|
python wcwidth.wcswidth() measures width 6, while mlterm measures width 7.
Nepali
Sequence of language Nepali from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u092a’ | Lo | 1 | DEVANAGARI LETTER PA |
‘\u0941’ | Mn | 0 | DEVANAGARI VOWEL SIGN U |
‘\u0930’ | Lo | 1 | DEVANAGARI LETTER RA |
‘\u094d’ | Mn | 0 | DEVANAGARI SIGN VIRAMA |
‘\u200d’ | Cf | 0 | ZERO WIDTH JOINER |
‘\u092f’ | Lo | 1 | DEVANAGARI LETTER YA |
‘\u093e’ | Mc | 0 | DEVANAGARI VOWEL SIGN AA |
‘\u0907’ | Lo | 1 | DEVANAGARI LETTER I |
‘\u090f’ | Lo | 1 | DEVANAGARI LETTER E |
‘\u0915’ | Lo | 1 | DEVANAGARI LETTER KA |
‘\u094b’ | Mc | 0 | DEVANAGARI VOWEL SIGN O |
Total codepoints: 11
Shell test using printf(1), '|' should align in output:

    $ printf "\xe0\xa4\xaa\xe0\xa5\x81\xe0\xa4\xb0\xe0\xa5\x8d\xe2\x80\x8d\xe0\xa4\xaf\xe0\xa4\xbe\xe0\xa4\x87\xe0\xa4\x8f\xe0\xa4\x95\xe0\xa5\x8b|\\n12345|\\n"
    पुर्याइएको|
    12345|
python wcwidth.wcswidth() measures width 5, while mlterm measures width 7.
Kannada
Sequence of language Kannada from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0cb5’ | Lo | 1 | KANNADA LETTER VA |
‘\u0cbe’ | Mc | 0 | KANNADA VOWEL SIGN AA |
‘\u0c95’ | Lo | 1 | KANNADA LETTER KA |
‘\u0ccd’ | Mn | 0 | KANNADA SIGN VIRAMA |
‘\u200c’ | Cf | 0 | ZERO WIDTH NON-JOINER |
‘\u0cb8’ | Lo | 1 | KANNADA LETTER SA |
‘\u0ccd’ | Mn | 0 | KANNADA SIGN VIRAMA |
‘\u0cb5’ | Lo | 1 | KANNADA LETTER VA |
‘\u0cbe’ | Mc | 0 | KANNADA VOWEL SIGN AA |
‘\u0ca4’ | Lo | 1 | KANNADA LETTER TA |
‘\u0c82’ | Mc | 0 | KANNADA SIGN ANUSVARA |
‘\u0ca4’ | Lo | 1 | KANNADA LETTER TA |
‘\u0ccd’ | Mn | 0 | KANNADA SIGN VIRAMA |
‘\u0cb0’ | Lo | 1 | KANNADA LETTER RA |
‘\u0ccd’ | Mn | 0 | KANNADA SIGN VIRAMA |
‘\u0caf’ | Lo | 1 | KANNADA LETTER YA |
Total codepoints: 16
Shell test using printf(1), '|' should align in output:

    $ printf "\xe0\xb2\xb5\xe0\xb2\xbe\xe0\xb2\x95\xe0\xb3\x8d\xe2\x80\x8c\xe0\xb2\xb8\xe0\xb3\x8d\xe0\xb2\xb5\xe0\xb2\xbe\xe0\xb2\xa4\xe0\xb2\x82\xe0\xb2\xa4\xe0\xb3\x8d\xe0\xb2\xb0\xe0\xb3\x8d\xe0\xb2\xaf|\\n12345678|\\n"
    ವಾಕ್ಸ್ವಾತಂತ್ರ್ಯ|
    12345678|
python wcwidth.wcswidth() measures width 8, while mlterm measures width 9.
Panjabi, Western
Sequence of language Panjabi, Western from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0628’ | Lo | 1 | ARABIC LETTER BEH |
‘\u06d2’ | Lo | 1 | ARABIC LETTER YEH BARREE |
‘\u200c’ | Cf | 0 | ZERO WIDTH NON-JOINER |
‘\u0631’ | Lo | 1 | ARABIC LETTER REH |
‘\u0648’ | Lo | 1 | ARABIC LETTER WAW |
‘\u0632’ | Lo | 1 | ARABIC LETTER ZAIN |
‘\u06af’ | Lo | 1 | ARABIC LETTER GAF |
‘\u0627’ | Lo | 1 | ARABIC LETTER ALEF |
‘\u0631’ | Lo | 1 | ARABIC LETTER REH |
‘\u06cc’ | Lo | 1 | ARABIC LETTER FARSI YEH |
‘\u060c’ | Po | 1 | ARABIC COMMA |
Total codepoints: 11
Shell test using printf(1), '|' should align in output:

    $ printf "\xd8\xa8\xdb\x92\xe2\x80\x8c\xd8\xb1\xd9\x88\xd8\xb2\xda\xaf\xd8\xa7\xd8\xb1\xdb\x8c\xd8\x8c|\\n1234567890|\\n"
    بےروزگاری،|
    1234567890|
python wcwidth.wcswidth() measures width 10, while mlterm measures width 11.
Yiddish, Eastern
Sequence of language Yiddish, Eastern from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u202e’ | Cf | 0 | RIGHT-TO-LEFT OVERRIDE |
‘A’ | Lu | 1 | LATIN CAPITAL LETTER A |
‘\u202c’ | Cf | 0 | POP DIRECTIONAL FORMATTING |
Total codepoints: 3
Shell test using printf(1), '|' should align in output:

    $ printf "\xe2\x80\xaeA\xe2\x80\xac|\\n1|\\n"
    A|
    1|
python wcwidth.wcswidth() measures width 1, while mlterm measures width 3.
Urdu
Sequence of language Urdu from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0601’ | Cf | 0 | ARABIC SIGN SANAH |
‘\u06f1’ | Nd | 1 | EXTENDED ARABIC-INDIC DIGIT ONE |
‘\u06f9’ | Nd | 1 | EXTENDED ARABIC-INDIC DIGIT NINE |
‘\u06f4’ | Nd | 1 | EXTENDED ARABIC-INDIC DIGIT FOUR |
‘\u06f8’ | Nd | 1 | EXTENDED ARABIC-INDIC DIGIT EIGHT |
‘\u0621’ | Lo | 1 | ARABIC LETTER HAMZA |
Total codepoints: 6
Shell test using printf(1), '|' should align in output:

    $ printf "\xd8\x81\xdb\xb1\xdb\xb9\xdb\xb4\xdb\xb8\xd8\xa1|\\n12345|\\n"
    ۱۹۴۸ء|
    12345|
python wcwidth.wcswidth() measures width 5, while mlterm measures width 6.
Urdu (2)
Sequence of language Urdu (2) from midpoint of alignment failure records:
Codepoint (Python escape) | Category | wcwidth | Name |
---|---|---|---|
‘\u0601’ | Cf | 0 | ARABIC SIGN SANAH |
‘\u06f1’ | Nd | 1 | EXTENDED ARABIC-INDIC DIGIT ONE |
‘\u06f9’ | Nd | 1 | EXTENDED ARABIC-INDIC DIGIT NINE |
‘\u06f4’ | Nd | 1 | EXTENDED ARABIC-INDIC DIGIT FOUR |
‘\u06f8’ | Nd | 1 | EXTENDED ARABIC-INDIC DIGIT EIGHT |
‘\u0621’ | Lo | 1 | ARABIC LETTER HAMZA |
Total codepoints: 6
Shell test using printf(1), '|' should align in output:

    $ printf "\xd8\x81\xdb\xb1\xdb\xb9\xdb\xb4\xdb\xb8\xd8\xa1|\\n12345|\\n"
    ۱۹۴۸ء|
    12345|
python wcwidth.wcswidth() measures width 5, while mlterm measures width 6.