From: <Saved by Microsoft Internet Explorer 5>
Subject: Unicode Technical Report Number 1
Date: Sat, 5 Jul 2003 19:40:44 +0630
MIME-Version: 1.0
Content-Type: text/html;
	charset="Windows-1252"
Content-Transfer-Encoding: quoted-printable
Content-Location: http://www.unicode.org/reports/tr1.html
X-MimeOLE: Produced By Microsoft MimeOLE V5.50.4910.0300

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD><TITLE>Unicode Technical Report Number 1</TITLE>
<META content=3D"text/html; charset=3Dwindows-1252" =
http-equiv=3DContent-Type>
<META content=3D"MSHTML 5.00.3502.5390" name=3DGENERATOR></HEAD>
<BODY aLink=3D#0000ff bgColor=3D#ffffff link=3D#ff0000 text=3D#000000 =
vLink=3D#808080>
<H1>Unicode Technical Report #1</H1>
<H2>Burmese <BR>Khmer <BR>Ethiopian </H2>
<TABLE border=3D1 width=3D"100%">
  <TBODY>
  <TR>
    <TD>Revision</TD>
    <TD>03 Nov 1992; rev 92/11/25 </TD></TR>
  <TR>
    <TD>Authors </TD>
    <TD>Burmese proposal was written by Andy Daniels, with contributions =
by=20
      Lloyd Anderson, Glenn Adams, and Lee Collins.<BR>Khmer proposal =
was=20
      written by Andy Daniels.<BR>Ethiopian proposal was written by Joe =
Becker.=20
    </TD></TR>
  <TR>
    <TD>Date </TD>
    <TD>1992</TD></TR>
  <TR>
    <TD>This Version</TD>
    <TD><A=20
      =
href=3D"http://www.unicode.org/unicode/reports/tr1.html">http://www.unico=
de.org/unicode/reports/tr1.html</A></TD></TR>
  <TR>
    <TD>Previous Version</TD>
    <TD>&nbsp;</TD></TR>
  <TR>
    <TD>Latest Version</TD>
    <TD><A=20
      =
href=3D"http://www.unicode.org/unicode/reports/tr1.html">http://www.unico=
de.org/unicode/reports/tr1.html</A></TD></TR></TBODY></TABLE>
<P>Technical Reports contain material that has been approved by the =
Unicode=20
Consortium for publication, but that is not necessarily considered part =
of the=20
Unicode Standard. Often, technical reports are superseded by later=20
standardization, or the informative material that they contain is =
incorporated=20
into explanatory chapters in subsequent editions of The Unicode =
Standard. &nbsp;=20
Sometimes minor updates to the Unicode Standard itself are published as=20
Technical Reports.&nbsp; Some Technical Reports are not available for=20
downloading.</P>
<HR>

<P align=3Dcenter><FONT size=3D4><STRONG>Technical Report =
#1</STRONG></FONT> <FONT=20
size=3D4><STRONG>- Burmese, Khmer, and Ethiopia</STRONG></FONT></P>
<P align=3Dleft><BR>This Technical Report is comprised of three concrete =
proposals=20
to which the Unicode Technical Committee is strongly committed in their =
current=20
form.</P>
<P><EM><STRONG>Status of this document</STRONG></EM></P>
<P><EM>This document has been considered and approved by the Unicode =
Technical=20
Committee for publication as a Technical Report. At the current time, =
the=20
specifications in this technical report are provided as information and =
guidance=20
to implementers of the Unicode Standard, but do not form part of the =
standard=20
itself. The Unicode Technical Committee may decide to incorporate all or =
part of=20
the material of this technical report into a future version of the =
Unicode=20
Standard, either as informative or as normative specification. Please =
mail=20
corrigenda and other comments to <A=20
href=3D"mailto:unicore@unicode.org">mailto:unicore@unicode.org</A>.</EM><=
/P>
<HR>

<H1>Document</H1>
<P>This Technical Report is comprised of three concrete proposals to =
which the=20
Unicode Technical Committee is strongly committed in their current form. =
These=20
are: Ethiopian, Burmese, and Khmer. These proposals have been reviewed=20
internally and have been relatively stable over a period of time. The =
committee=20
believes they represent good technical solutions for the proposed =
scripts, and=20
therefore also recommends that specific codepoints within the body of =
Unicode be=20
allocated for them, as follows: </P>
<P>Burmese U+0F00 U+0F7F<BR>Khmer U+0F80 U+0FFF<BR>Ethiopian U+1200 =
U+125F </P>
<P>Specific open issues for each of these are addressed in the =
respective draft=20
block introductions. These open issues do not detract substantially from =
the=20
solidity of the proposals. </P>
<HR>

<H2>Burmese Proposal Description</H2>
<P>Burmese U+0F00 -&gt; 0F7F </P>
<P>The Burmese script is used to write Burmese, the majority language of =
Myanmar=20
(formerly Burma) and Pali. Variations and extentions of the script are =
used to=20
write other languages of the region, such as Shan and Mon, and also to =
write=20
Sanskrit. </P>
<P>The Burmese script derives from 11th century Mon. The Mon script =
itself is=20
probably borrowed directly from South India. The earliest Mon =
inscription, found=20
at Lopburi in Thailand, dates from the eight century and is written in =
the=20
Pallava script used at the Hinayana Buddhist center of Conjeeveram in =
the area=20
of Madras on the east coast of India. In A.D. 1057 one of the first =
Burmese=20
kings, Aniruddha, conquered Thaton, a major Mon center, and brought back =
with=20
him to Pagan the most learned monks, artists and artisans of the Mon. =
The first=20
inscription in Burmese dates from the following year and is written in =
an=20
alphabet almost identical with that of the Mon inscriptions. Aside from =
rounding=20
of the originally square characters, this script has remained largely =
unchanged=20
to the present. </P>
<P>The Burmese script therfore ultimately derives from Brahmi, and so =
shares the=20
structural features of its relatives: Consonant symbols include an =
inherent=20
vowel; various signs are placed before, above, below and after a =
consonant to=20
indicate a vowel other than the inherent one; ligatures and conjuncts =
are used=20
to indicate consonant clusters. </P>
<P>In the course of its adaptation to non-Indo-Aryan languages, =
the<BR>Burmese=20
script has acquired some features that distinguish it from<BR>other =
Indic=20
scripts. The killer, or virama, participates in some<BR>common =
constructions=20
that would be clumsy to handle the way they<BR>would be in the other =
Indic=20
scripts, so the control function of<BR>the virama is separated from the=20
diacritic function of the killer.<BR>The virama, 0F4D is used to form =
conjunct=20
consonants, while the<BR>killer, 0F52, is a simple diacritic and has no =
effect=20
on character<BR>shaping. The killer is also combined with the VOWEL SIGN =
O=20
(0F4B)<BR>to form the low level tone vowel "o." When used this way,=20
this<BR>symbol is known as hyei hto, or "thrust forward." </P>
<P>Burmese distinguishes as set of "medial" consonants. =
Originally<BR>conjunct=20
forms of YA PALE, YA GAU, WA and HA, they are used in<BR>modern Burmese =
to form=20
new letters and to spell certain vowel and<BR>consonant combinations. =
They are=20
treated here as no different from<BR>any other conjunct and should be =
coded=20
using the virama. </P>
<P>ISSUE: There's no reason from the point of view of the =
rendering<BR>engine to=20
have separate codes for the medials. Some implementors<BR>feel that the =
medials=20
should nevertheless have separate codes.<BR>Including them introduces =
alternate=20
spellings for the same syllables,<BR>something that should be avoided. =
If there=20
are compelling reasons<BR>for including the medials, there is certainly =
room to=20
add them. </P>
<P>When a syllable has more than one medial, it is recommended =
that<BR>they=20
appear in the order that such syllables are traditionally<BR>spelled. =
That is,=20
HA HTOU, before YA PIN or YA YI, before WA HSWE.<BR>Note that YA PIN and =
YA YI=20
cannot appear in the same syllable in<BR>Burmese. For example, "cwei" =
("to drop=20
off") is coded as<BR>0F15+0F5C+0F5E+0F47. "Hmyu" ("to delight, allure") =
is coded=20
as<BR>0F2E+0F5F+0F5D+0F42. This differs from the order in which =
medials<BR>are=20
normally written. </P>
<P>ISSUE: This rule is not strictly necessary, but regularizes =
the<BR>spelling=20
and simplifies rendering, string comparison and other<BR>functions. </P>
<P>Burmese has several glyphs that are used with varying =
semantics<BR>which are=20
here given separate code points for each different usage.<BR>The =
following pairs=20
of letters look the same, but must be distinguished<BR>in the text =
stream: </P>
<P>EHKAYA U (0F09) and NYA GALEI (0F5B)<BR>GA NGE (0F17) and DIGIT HYI=20
(0F6E)<BR>WA (0F35) and DIGIT THOUN NYA (0F66)<BR>YA GAU (0F30) and =
DIGIT HKUN=20
NI (0F6D)<BR>DIGIT LEI (0F6A) and SYMBOL LAGAUN (0F73) </P>
<P>The last two pairs are distinguished in some fonts but not =
in<BR>others. </P>
<P>Also, the LETTER 0 (0F13) is distinguished from the =
sequence<BR>0F48+0F5D,=20
and the ZA MYINZWE is distinguished from 0F1A+0F5C. </P>
<P>Symbols not found as single characters are formed from =
sequences<BR>of the=20
basic characters given here. For example,tha ji ("great<BR>tha") is =
coded by the=20
sequence 0F38+0F4D+0F38, i.e., it is a conjunct<BR>formed from two THAs. =
Kinzi=20
is a conjunct formed from LETTER NGA<BR>followed by some other =
consonant, that=20
is, the sequence<BR>0F19+0F4D+Consonant. Low level tone "o" has already =
been=20
noted.<BR>Level tone "ou" is to be coded as 0F41+0F3F. Other=20
combinations<BR>follow similarly. </P>
<P>The LETTER A, though classified here are a vowel, is actually =
a<BR>consonant.=20
Thus it can combine with any of the vowel symbols. </P>
<P>The tone mark AUKA MYI is often written to the left of a =
subscript<BR>vowel=20
sign or medial consonant. It should, nevertheless come after<BR>the =
vowel or=20
medial in the text stream. It is also used with killed<BR>consonants in =
writing=20
closed syllables. In this case, too, the AUKA<BR>MYI should come after =
the ATHA=20
in the text stream. For example,<BR>the word /hyun./ (short, high =
falling tone)=20
should be represented<BR>as 0F30+0F5F+0F5E+0F02+0F51. </P>
<P>The SYMBOL HNAI is only used in the literary=20
combination<BR>0F73+0F19+0F52+0F03, meaning "the aforementioned." </P>
<P>Burmese does not use any whitespace between words. If word=20
boundary<BR>indications are desired, for example for the use of =
automatic=20
line<BR>layout algorithms, U+200B, ZERO WIDTH SPACE, is to be used. </P>
<P>Block Structure: Burmese characters are mapped to their=20
corresponding<BR>ISCII slots whenever possible. Gaps in the block result =
mainly=20
from<BR>this mapping. Several ranges of code points are reserved for=20
future<BR>expansion. A notable exception is the pair NYA GALEI and NYA=20
JI.<BR>Historically, NYA GALEI is a simple palatal nasal, while NYA JI =
is<BR>a=20
ligature representing a double NYA GALEI. NYA JI, however, has<BR>come =
to be=20
regarded as the primary form of the letter in Burmese,<BR>so it is =
assigned to=20
the "preferred" ISCII slot for the palatal<BR>nasal (U+0F1E), and NYA =
GALEI is=20
placed at U+0F5F. </P>
<P>U+0F00 to U+0F01 Unassigned<BR>U+0F02 to U+0F03 Various =
signs<BR>U+0F04=20
Unassigned<BR>U+0F05 to U+0F14 Independent vowels<BR>U+0F15 to U+0F39=20
Consonants<BR>U+0F3A to U+0F3E Unassigned<BR>U+0F3F to U+0F4C Dependent =
vowel=20
signs<BR>U+0F4D Virama<BR>U+0F4E to U+0F50 Unassigned<BR>U+0F51 to =
U+0F52 Tone=20
marks<BR>U+0F53 to U+0F5F Unassigned, reserved for extensions<BR>U+0F60 =
to=20
U+0F63 Additional dependent vowel signs<BR>U+0F64 to U+0F65 =
Unassigned<BR>U+0F66=20
to U+0F6F Digits<BR>U+0F70 to U+0F73 Special symbols<BR>U+0F74 to U+0F77 =

Unassigned, reserved for additional symbols </P>
<P>Note: The transliteration used here follows D. Haigh Roop, An =
Introduction to=20
the Burmese Writing System (1972). Tone indications are left out of the=20
character names. </P>
<P>ISSUE: As with Khmer, if there is a more standard =
transliteration,<BR>it=20
should be used. </P>
<P>ISSUE: Old Burmese has a small subscript LETTER A, which is =
the<BR>precursor=20
of the tone mark AUKA MYA and appears exactly where modern<BR>Burmese =
would use=20
the latter. This can probably be treated as a<BR>font difference. There =
is also=20
a superscript form of YA GAU, similar<BR>in use to the Indic repha. This =
can=20
probably be accommodated in<BR>the shaping rules. This is not a major =
issue as=20
there is plenty of<BR>room to add these characters. Further =
investigation is=20
required. </P>
<P>&nbsp;</P>
<P><STRONG>DRAFT 03 Nov 1992; rev 92/11/25<BR></STRONG><BR><STRONG>DRAFT =
BURMESE=20
CHARACTER NAMES</STRONG><BR><BR>0F00<BR>0F01<BR><BR>@ Various =
Signs<BR>0F02=20
BURMESE THEIDHEI TIN<BR>=3D little thing put on<BR>anusvara, =
niggahita<BR>0F03=20
BURMESE HYEIGA PAU<BR>=3D dots ahead<BR>visarga<BR>0F04<BR><BR>@ =
Independent=20
Vowels<BR>0F05 BURMESE LETTER A<BR>0F06<BR>0F07 BURMESE PALI EHKAYA =
I<BR>=3D=20
letter pali I<BR>0F08 BURMESE EHKAYA I<BR>=3D letter I<BR>0F09 BURMESE =
EHKAYA=20
U<BR>=3D letter U<BR>x Burmese nya galei -&gt; 0F5B<BR>0F0A<BR>0F0B =
BURMESE LETTER=20
VOCALIC R<BR>Sanskrit<BR>0F0C BURMESE LETTER VOCALIC=20
L<BR>Sanskrit<BR>0F0D<BR>0F0E<BR>0F0F BURMESE EHKAYA EI<BR>=3D letter=20
EI<BR>0F10<BR>0F11<BR>0F12<BR>0F13 BURMESE LETTER O<BR>x =
sra<BR>0F14<BR><BR>@=20
Consonants<BR>0F15 BURMESE KA JI<BR>=3D great ka<BR>0F16 BURMESE HKA =
GWEI<BR>=3D=20
curved hka<BR>0F17 BURMESE GA NGE<BR>=3D small ga<BR>x Burmese digit hyi =
-&gt;=20
0F6E<BR>0F18 BURMESE GA JI<BR>=3D great ga<BR>0F19 BURMESE LETTER =
NGA<BR>0F1A=20
BURMESE SA LOUN<BR>=3D round sa<BR>0F1B BURMESE HSA LEIN<BR>=3D twisted =
hsa<BR>0F1C=20
BURMESE ZA GWE<BR>=3D split za<BR>0F1D BURMESE ZA MYINZWE<BR>=3D bridle =
za<BR>x=20
cya<BR>0F1E BURMESE NYA JI<BR>=3D great nya<BR>0F1F BURMESE TA =
TALINJEI<BR>=3D=20
bier-hook ta<BR>0F20 BURMESE HTA WUNBE<BR>=3D duck hta<BR>0F21 BURMESE =
DA=20
YINGAU<BR>=3D crooked-breasted da<BR>0F22 BURMESE DA YEIHMOU<BR>=3D =
water-dipper=20
da<BR>0F23 BURMESE NA JI<BR>=3D great na<BR>0F24 BURMESE TA WUNBU<BR>=3D =
pot-bellied=20
ta<BR>0F25 BURMESE HTA HSINDU<BR>=3D elephant-fetter hta<BR>0F26 BURMESE =
DA=20
DWEI<BR>=3D twisted da<BR>0F27 BURMESE DA AUHCAI<BR>=3D bottom-indented =
da<BR>0F28=20
BURMESE NA NGE<BR>=3D small na<BR>0F29<BR>0F2A BURMESE PA ZAU<BR>=3D =
steep-sided=20
pa<BR>0F2B BURMESE HPA OUHTOU<BR>=3D capped hpa<BR>0F2C BURMESE BA =
LAHCAI<BR>=3D=20
top-indented ba<BR>0F2D BURMESE BA GOUN<BR>=3D hump-backed ba<BR>0F2E =
BURMESE=20
LETTER MA<BR>0F2F BURMESE YA PALE<BR>=3D supine ya<BR>0F30 BURMESE YA =
GAU<BR>=3D=20
crooked ya<BR>x Burmese digit hkun ni -&gt; 0F6D<BR>0F31<BR>0F32 BURMESE =
LETTER=20
LA<BR>0F33 BURMESE LA JI<BR>=3D great la<BR>0F34<BR>0F35 BURMESE LETTER =
WA<BR>x=20
Burmese digit thoun nya -&gt; 0F66<BR>0F36 BURMESE LETTER SANSKRIT=20
SHA<BR>Sanskrit<BR>0F37 BURMESE LETTER SANSKRIT SSA<BR>Sanskrit<BR>0F38 =
BURMESE=20
LETTER THA<BR>0F39 BURMESE LETTER=20
HA<BR><BR>0F3A<BR>0F3B<BR>0F3C<BR>0F3D<BR><BR>@ Vowel Signs<BR>0F3E =
BURMESE YEI=20
HCA<BR>=3D line drawn down<BR>0F3F BURMESE LOUNJI TIN<BR>=3D big circle =
put=20
on<BR>0F40 BURMESE LOUNJI TIN HSAN HKA<BR>=3D big circle put on with a =
grain of=20
rice<BR>0F41 BURMESE TAHCAUN NGIN<BR>=3D one stroke drawn out<BR>0F42 =
BURMESE=20
HNAHCAUN NGIN<BR>=3D two strokes drawn out<BR>0F43 BURMESE VOWEL SIGN =
VOCALIC=20
R<BR>Sanskrit<BR>0F44 BURMESE VOWEL SIGN VOCALIC=20
RR<BR>Sanskrit<BR>0F45<BR>0F46<BR>0F47 BURMESE THAWEI HTOU<BR>=3D thrust =
in=20
front<BR>0F48 BURMESE NAU PYI<BR>=3D thrown =
backwards<BR>0F49<BR>0F4A<BR>0F4B=20
BURMESE VOWEL SIGN O<BR>0F4C<BR><BR>@ Virama<BR>0F4D BURMESE VIRAMA<BR>x =
Burmese=20
atha -&gt; 0F52<BR>0F4E<BR>0F4F<BR>0F50<BR><BR>@ Tone Marks<BR>0F51 =
BURMESE AUKA=20
MYI<BR>=3D stopped below<BR>0F52 BURMESE ATHA<BR>=3D killer<BR>=3D hyei =
htou, "thrust=20
forward"<BR>x Burmese virama -&gt;=20
0F4D<BR>0F53<BR>0F54<BR>0F55<BR>0F56<BR>0F57<BR>0F58<BR>0F59<BR>0F5A<BR>0=
F5B<BR>0F5C<BR>0F5D<BR>0F5E<BR><BR>@=20
Consonants<BR>0F5F BURMESE NYA GALEI<BR>=3D little nya<BR>x Burmese =
ehkaya u -&gt;=20
0F09<BR><BR>@ Vowel Signs<BR>0F60 BURMESE LETTER VOCALIC =
RR<BR>Sanskrit<BR>0F61=20
BURMESE LETTER VOCALIC LL<BR>Sanskrit<BR>0F62 BURMESE VOWEL SIGN VOCALIC =

L<BR>Sanskrit<BR>0F63 BURMESE VOWEL SIGN VOCALIC=20
LL<BR>Sanskrit<BR>0F64<BR>0F65<BR><BR>@ Digits<BR>0F66 BURMESE DIGIT =
THOUN=20
NYA<BR>=3D digit zero<BR>x Burmese wa -&gt; 0F35<BR>0F67 BURMESE DIGIT =
TI<BR>=3D=20
digit one<BR>0F68 BURMESE DIGIT HNI<BR>=3D digit two<BR>0F69 BURMESE =
DIGIT=20
THOUN<BR>=3D digit three<BR>0F6A BURMESE DIGIT LEI<BR>=3D digit =
four<BR>x Burmese=20
symbol lagaun -&gt; 0F73<BR>0F6B BURMESE DIGIT NGA<BR>=3D digit =
five<BR>0F6C=20
BURMESE DIGIT HCAU<BR>=3D digit six<BR>0F6D BURMESE DIGIT HKUN NI<BR>=3D =
digit=20
seven<BR>x Burmese ya gau -&gt; 0F30<BR>0F6E BURMESE DIGIT HYI<BR>=3D =
digit=20
eight<BR>x Burmese ga nge -&gt; 0F17<BR>0F6F BURMESE DIGIT KOU<BR>=3D =
digit=20
nine<BR><BR>@ Various symbols<BR>0F70 BURMESE SYMBOL YWEI<BR>0F71 =
BURMESE SYMBOL=20
EHKAYA I<BR>0F72 BURMESE SYMBOL HNAI<BR>0F73 BURMESE SYMBOL LAGAUN<BR>x =
Burmese=20
digit lei -&gt;=20
0F6A<BR>0F74<BR>0F75<BR>0F76<BR>0F77<BR>0F78<BR>0F79<BR>0F7A<BR>0F7B<BR>0=
F7C<BR>0F7D<BR>0F7E<BR>0F7F<BR></P>
<HR>

<P><FONT size=3D5><STRONG>Khmer Proposal =
Description</STRONG></FONT><BR></P>
<P><FONT size=3D3>Khmer U+0F80 -&gt; 0FDF</FONT><BR><BR>Cambodian, also =
known as=20
Khmer, is the official language of Cambodia.<BR>Mutually intelligible =
dialects=20
are also spoken in northeastern<BR>Thailand and the Mekong Delta region =
of=20
Vietnam. While not itself<BR>an Indo-European language, much of the=20
administrative, military<BR>and literary vocabulary of Khmer is borrowed =
from=20
Sanskrit. With<BR>the advent of Theravada Buddhism at the beginning of =
the=20
fifteenth<BR>century, Khmer began to borrow Pali words, and continues to =

use<BR>Pali as a major source of neologisms today. There is also=20
much<BR>cross-borrowing between Thai and Khmer, as well as a=20
relatively<BR>recent infusion of French words and a smattering of =
Chinese=20
and<BR>Vietnamese loanwords in colloquial speech.<BR><BR>The Khmer =
script,=20
called a'saa kmae ("Khmer letters"), as well as<BR>Thai, Lao, Burmese, =
Old Mon=20
and others, are all descended from the<BR>Brahmi script of South India. =
The=20
exact geographical source, or<BR>possibly sources, has not been =
determined, but=20
there is a great<BR>similarity between the earliest inscriptions in the =
region=20
and the<BR>Pallawa script of the Coromandel coast of =
India.<BR><BR>Structurally,=20
the Khmer script stays very close to its southern<BR>Brahmi origins. =
There is a=20
set of 35 consonants, each with an<BR>inherent vowel sound. Additional =
signs are=20
placed before, above,<BR>below and after the consonants to indicate =
vowels other=20
than the<BR>inherent one. Consonant clusters are represented by=20
conjunct<BR>consonants, where the first consonant of the cluster =
maintains=20
its<BR>full form and succeeding consonants are written as =
subscripts.<BR><BR>The=20
Khmer language has a much richer set of vowels than the =
Indo-Aryan<BR>languages=20
for which the ancestral script was used. By the same<BR>token, there is =
a much=20
smaller set of consonant sounds. The Khmer<BR>script is adapted to the =
language=20
by adding extra vowel signs and<BR>various diacritic marks, and by using =
the=20
choice of consonant as<BR>well as of vowel signs to determine the =
particular=20
vowel sound<BR>represented. Thus most vowel signs do not have a single =
value=20
but<BR>must be interpreted in the context of the associated =
consonant.<BR>This=20
is very similar to the situation in Thai and Lao, where<BR>different =
consonant=20
symbols have the same sound but encode different<BR>tones.<BR><BR>There =
are two=20
basic styles of script in modern Khmer, each with<BR>two major =
variations. They=20
are the a'saa criang ("slanted script")<BR>and the a'saa muul ("round =
script").=20
There is no fundamental<BR>structural difference between them, however, =
so the=20
"standing"<BR>variant of the slanted script is chosen here as=20
representative.<BR><BR>Representation:<BR><BR>The Khmer script follows =
the model=20
of Devanagari and other Indic<BR>scripts. The basic unit is the syllabic =
cluster=20
consisting of a<BR>series of consonants separated by WIRIAM (0FC5), =
followed by=20
one<BR>or both of the pronounciation shifters MUSEKATOAN (0FCA) and=20
TRUYSAP<BR>(0FCB), followed by an optional vowel, followed by diacritics =

and<BR>quality marks. For example, the word /knyom/, "I," is coded as=20
the<BR>string 0F81+0FC5+0F89+0FB5+0FC2.<BR><BR>In cases where there is =
already=20
some other superscript in the<BR>cluster, the two pronounciation =
shifters are=20
written as the subscript<BR>symbol kbiah kraom, which looks much like =
VOWEL SIGN=20
O. This vowel<BR>sign is not to be used for this purpose. It is the=20
responsibility<BR>of the presentation software to select the correct =
appearance=20
of<BR>the shifter. For example, /sii/, "to eat," should be coded=20
as<BR>0F9F+0FCB+0FB4, not as 0F9F+0FB7+0FB4.<BR><BR>RAWBAT (0FCC) =
historically=20
corresponds to the Devanagari repha,<BR>that is, to an initial /r/. It =
has lost=20
this function in Khmer and<BR>instead is considered a simple diacritic =
similar=20
to TOANDAKHIAT in<BR>both reading and sorting. There are also many cases =
of=20
consonant<BR>clusters with initial /r/ that should be written with a =
full=20
RAW<BR>and not a RAWBAT, so a separate character is provided for=20
it.<BR><BR>Khmer writing does not normally separate words with white =
space=20
as<BR>European languages do. If it is desirable to represent word=20
boundaries<BR>in the text stream, for example, for use by automatic line =

layout<BR>algorithms, U+200B, ZERO WIDTH SPACE, should be =
used.<BR><BR>Two=20
relatively rare symbols in modern usage are not included here.<BR>They =
are pnek=20
moan, the "cock's eye," and "komout." They are<BR>identical in form and =
function=20
to the Thai characters FONGMAN and<BR>KHOMUT, respectively, so the =
latter two=20
should be used when these<BR>symbols are needed.<BR><BR>Block=20
Structure:<BR><BR>U+0F80 to U+0FA2 Consonants<BR>U+0FA3 to U+0FB1 =
Independent=20
Vowels<BR>U+0FB2 to U+0FC1 Vowel Signs<BR>U+0FC2 to U+0FC4 Quality=20
Marks<BR>U+0FC5 Virama<BR>U+0FC6 to U+0FC7 Unassigned<BR>U+0FC8 to =
U+0FCF=20
Diacritics<BR>U+0FD0 to U+0FD9 Digits<BR>U+0FDA to U+0FDE Symbols and=20
Punctuation<BR>U+0FDF Unassigned<BR><BR>ISSUES: The independent vowels =
LETTER AO=20
TYPE 2 and LETTER AW TYPE<BR>2 are variant forms of LETTER AO TYPE 1 and =
LETTER=20
AW TYPE 1,<BR>respectively. It is not believed that they are in free=20
variation:<BR>LETTER AO TYPE 2 occurs only in the combination "aoy," =
while=20
LETTER<BR>AW TYPE 2 is only cited in a few references, but not used. =
There<BR>is=20
an opportunity to unify these pairs. Note that LETTER UW and<BR>LETTER =
OU are=20
also listed as variants, but they are actually not<BR>in free variation, =
so both=20
must be provided.<BR><BR>It may be desirable to add the vowel sign AM =
instead of=20
using the<BR>combination AA+NIKAHAT. This would simplify a common =
special=20
case<BR>in sorting.<BR><BR>The punctuation marks KHAN and BARIYAOSAN may =
be=20
unified with some<BR>other characters, just as Indic dandas have been. A =
likely=20
candidate<BR>for the former is Thai PAI YAN NOI. Such a unification, as =
well=20
as<BR>that of the "cock's eye" and "cow piss" characters presents=20
an<BR>interesting challenge to the font mechanism of a Unicode=20
rendering<BR>engine: Different glyphs may be required for the same=20
character<BR>when used in conjunction with different scripts. This seems =

like<BR>a needless complication for what are otherwise simple,=20
non-combining<BR>characters.<BR><BR>It may be more desirable from a =
political=20
standpoint to follow<BR>either the Thai or the ISCII coding schemes. =
Sample=20
charts have<BR>been produced showing how this may be done. If this is =
indeed=20
the<BR>path taken, those charts should be expanded to include all=20
characters<BR>in this proposal.<BR><BR>The vowel encoding takes an =
ISCII-like=20
approach, coding as single<BR>characters vowels that consist of two or =
more=20
disjoint glyphs. If<BR>vowel symbols are instead decomposed into their=20
constituent glyphs<BR>and those coded separately, there is then no =
advantage to=20
the code<BR>point assignments made here. In such a case, the assignments =

should<BR>be made according to the Thai pattern.<BR><BR>The romanization =
scheme=20
here is rather ad-hoc. If a more commonly<BR>accepted one exists, the =
character=20
names should be changed accordingly.<BR><BR><STRONG>Draft 03 October =
1992; rev=20
92/11/25<BR><BR>DRAFT KHMER CHARACTER NAMES</STRONG><BR><BR>@ =
Consonants<BR>0F80=20
KHMER LETTER KAA<BR>0F81 KHMER LETTER KHAA<BR>0F82 KHMER LETTER =
KAW<BR>0F83=20
KHMER LETTER KHAW<BR>0F84 KHMER LETTER NGAW<BR>0F85 KHMER LETTER =
CAA<BR>0F86=20
KHMER LETTER CHAA<BR>0F87 KHMER LETTER CAW<BR>0F88 KHMER LETTER =
CHAW<BR>0F89=20
KHMER LETTER NYAW<BR>0F8A KHMER LETTER DAA<BR>0F8B KHMER LETTER =
RETROFLEX=20
THAA<BR>0F8C KHMER LETTER DAW<BR>0F8D KHMER LETTER RETROFLEX =
THAW<BR>0F8E KHMER=20
LETTER NAA<BR>0F8F KHMER LETTER TAA<BR>0F90 KHMER LETTER THAA<BR>0F91 =
KHMER=20
LETTER TAW<BR>0F92 KHMER LETTER THAW<BR>0F93 KHMER LETTER NAW<BR>0F94 =
KHMER=20
LETTER BAA<BR>0F95 KHMER LETTER PHAA<BR>0F96 KHMER LETTER PAW<BR>0F97 =
KHMER=20
LETTER PHAW<BR>0F98 KHMER LETTER MAW<BR>0F99 KHMER LETTER YAW<BR>0F9A =
KHMER=20
LETTER RAW<BR>0F9B KHMER LETTER LAW<BR>0F9C KHMER LETTER WAW<BR>0F9D =
KHMER=20
LETTER SHAA<BR>Sanskrit<BR>0F9E KHMER LETTER SSAA<BR>Sanskrit<BR>0F9F =
KHMER=20
LETTER SAA<BR>0FA0 KHMER LETTER HAA<BR>0FA1 KHMER LETTER LAA<BR>0FA2 =
KHMER=20
LETTER QAA<BR>glottal stop<BR><BR>@ Independent Vowels<BR>0FA3 KHMER =
LETTER=20
E<BR>0FA4 KHMER LETTER EY<BR>0FA5 KHMER LETTER O<BR>0FA6 KHMER LETTER =
UW<BR>0FA7=20
KHMER LETTER OU<BR>0FA8 KHMER LETTER AE<BR>0FA9 KHMER LETTER AY<BR>0FAA =
KHMER=20
LETTER AO TYPE 1<BR>0FAB KHMER LETTER AO TYPE 2<BR>0FAC KHMER LETTER AW =
TYPE=20
1<BR>0FAD KHMER LETTER AW TYPE 2<BR>0FAE KHMER LETTER RIK<BR>0FAF KHMER =
LETTER=20
RII<BR>0FB0 KHMER LETTER LIK<BR>0FB1 KHMER LETTER LII<BR><BR>@ Vowel=20
Signs<BR>0FB2 KHMER VOWEL SIGN AA<BR>0FB3 KHMER VOWEL SIGN E<BR>0FB4 =
KHMER VOWEL=20
SIGN EY<BR>0FB5 KHMER VOWEL SIGN U<BR>0FB6 KHMER VOWEL SIGN UI<BR>0FB7 =
KHMER=20
VOWEL SIGN O<BR>x kbiah kraom<BR>0FB8 KHMER VOWEL SIGN OU<BR>0FB9 KHMER =
VOWEL=20
SIGN UA<BR>0FBA KHMER VOWEL SIGN AU<BR>0FBB KHMER VOWEL SIGN IE<BR>0FBC =
KHMER=20
VOWEL SIGN IU<BR>0FBD KHMER VOWEL SIGN EI<BR>0FBE KHMER VOWEL SIGN =
AE<BR>0FBF=20
KHMER VOWEL SIGN AY<BR>0FC0 KHMER VOWEL SIGN AO<BR>0FC1 KHMER VOWEL SIGN =

AW<BR><BR>@ Quality Marks<BR>0FC2 KHMER SIGN NIKAHAT<BR>=3D sra =
am<BR>=3D=20
damla<BR>0FC3 KHMER SIGN REAHMUK<BR>=3D wihsakea<BR>=3D =
wihsancani<BR>0FC4 KHMER=20
SIGN YUKALEAPINTU<BR>=3D coc pi<BR><BR>@ Virama<BR>0FC5 KHMER SIGN=20
WIRIAM<BR>virama<BR>0FC6<BR>0FC7<BR><BR>@ Diacritics<BR>0FC8 KHMER VOWEL =
SIGN=20
BANTA<BR>=3D sangkat<BR>=3D reahsannya<BR>0FC9 KHMER VOWEL SIGN SANYOK=20
SANNYA<BR>0FCA KHMER SIGN MUSEKATOAN<BR>=3D tmin kandao<BR>vowel =
pronounciation=20
shifter<BR>0FCB KHMER SIGN TRUYSAP<BR>vowel pronounciation =
shifter<BR>0FCC KHMER=20
SIGN RAWBAT<BR>=3D rephea<BR>0FCD KHMER SIGN TOANDAKHIAT<BR>=3D =
samlap<BR>=3D=20
patdesaet<BR>0FCE KHMER SIGN KAKABAT<BR>=3D caung kaek<BR>0FCF KHMER =
SIGN=20
AHSDA<BR>=3D leik prabuy<BR><BR>@ Digits<BR>0FD0 KHMER DIGIT =
ZERO<BR>0FD1 KHMER=20
DIGIT ONE<BR>0FD2 KHMER DIGIT TWO<BR>0FD3 KHMER DIGIT THREE<BR>0FD4 =
KHMER DIGIT=20
FOUR<BR>0FD5 KHMER DIGIT FIVE<BR>0FD6 KHMER DIGIT SIX<BR>0FD7 KHMER =
DIGIT=20
SEVEN<BR>0FD8 KHMER DIGIT EIGHT<BR>0FD9 KHMER DIGIT NINE<BR><BR>@ =
Symbols and=20
Punctuation<BR>0FDA KHMER CURRENCY SYMBOL RIAL<BR>0FDB KHMER LEIK =
TO<BR>=3D=20
amendit sannya<BR>repetition sign<BR>0FDC KHMER CAMNOC PI KUH<BR>x =
(division=20
sign -&gt; 00F7)<BR>x (tibetan comma -&gt; 1038)<BR>colon, =
semicolon<BR>0FDD=20
KHMER KHAN<BR>full stop, ellipsis, abbreviation<BR>0FDE KHMER =
BARIYAOSAN<BR>end=20
of section<BR>0FDF</P>
<HR>

<P><FONT size=3D5><STRONG>Proposal for Ethiopian=20
Encoding</STRONG></FONT><BR><BR>The Ethiopian proposal consists of a =
list of=20
questions/issues, a<BR>chart, a character names list, and a block =
introduction.=20
The<BR>content is based on UTC/1991-026 On the Extended Ethiopic =
Alphabet<BR>of=20
February 26, 1991 and its later adjustments by Lloyd =
Anderson,<BR>unioned with=20
features of the Xerox Amharic implementation by Joe<BR>Becker. The =
character=20
names are based on those in DP 10646, which<BR>came from WG2/N459 =
"Ethiopian=20
character sets" by Michael Mann.<BR><BR>QUESTIONS FOR =
REVIEWERS:<BR><BR>1. Is=20
this collection missing any important, well-established<BR>"extension" =
letters=20
for writing less-common languages?<BR><BR>2. Are the glyphs in the =
charts=20
appropriate?<BR><BR>3. Can you supply documentation to support the =
specification=20
of<BR>the following two characters?<BR>121D ETHIOPIAN CONSONANT GG=20
1237<BR>ETHIOPIAN VOWEL PHONETIC AE In particular, does U+1237<BR>occur =
(as a=20
vowel, not as a mark of "w" rounding) on any consonant<BR>other than =
U+1211?=20
Should the combination of U+1237 with U+1211<BR>simply be encoded as a =
distinct=20
consonant (to be added between<BR>current U+1211 and U+1212)?<BR><BR>4. =
Are the=20
following characters specified correctly?<BR>1256 ETHIOPIAN =
COMMA<BR>modern=20
usage like colon<BR>1257 ETHIOPIAN COLON<BR>modern usage like =
semicolon<BR>1259=20
ETHIOPIAN NEW COMMA<BR>modern usage<BR><BR>5. Do syllable glyph variants =
ever=20
occur distinctively within the<BR>same text, or are they merely font =
design=20
choices like the glyph<BR>variants of Latin "a" or =
"g"?<BR><BR>ISSUES:<BR><BR>*=20
In this design, no provision is made for coding the syllable<BR>glyphs; =
it is=20
intended that they be excluded from Unicode/10646<BR>BMP. If we learn =
that glyph=20
variants may occur distinctively, then<BR>we may need to define some =
additional=20
means for specifying glyph<BR>variants within plain text.<BR><BR>* =
Should we=20
define an Ethiopian White Space character which can be<BR>easily =
guaranteed to=20
have the same (minimum) width as U+1255<BR>ETHIOPIAN WORDSPACE? =
Currently=20
opinion is that this is unnecessary.<BR><BR>Ethiopian (U+1200 -&gt;=20
U+125F)<BR><BR>The Ethiopian script, which originally evolved for the=20
archaic<BR>language Ge'ez, is currently used to write several languages=20
of<BR>Eastern Africa, including Amharic, Tigre, and Oromo. The=20
script<BR>continues to be extended for writing languages that have=20
little<BR>tradition of printed typography; new characters to cover=20
such<BR>extensions may added to the standard later as definitive=20
information<BR>about them becomes available.<BR><BR>Encoding Principles. =
The=20
visible glyphs of the Ethiopian script<BR>are not the objects shown in =
the=20
encoding chart. The elements of<BR>the encoding are the alphabet =
underlying the=20
script, thus the<BR>encoding is (roughly) phonetic rather than glyphic. =
These=20
alphabetic<BR>letters are expected to be the units of keyboard input and =
all=20
text<BR>representation short of rendering.<BR><BR>Rendering. Each =
visible glyph=20
of the Ethiopian script represents<BR>a syllable rather than a single =
letter.=20
The syllables can all be<BR>treated as simple (consonant + vowel) pairs, =
so that=20
each glyph<BR>can be thought of as a ligature of two underlying letters. =

Thus<BR>the syllable "MA" would be represented in the encoding as=20
U+1203<BR>ETHIOPIAN CONSONANT M plus U+1233 ETHIOPIAN VOWEL A. The=20
syllable<BR>glyphs themselves are not intended to be incorporated in=20
this<BR>encoding. The individual consonant or vowel codes should not=20
be<BR>isolated (i.e. unpaired) in normal final text, and their =
rendering<BR>in=20
such circumstances is an option of the implementation. =
One<BR>possibility is to=20
use special symbols for the individual letters,<BR>as is done in the =
code charts=20
here.<BR><BR>Chart Symbols Representing Individual Letters. Since the=20
Ethiopian<BR>glyphs are normally syllabic, the script provides no=20
unambiguous<BR>way of representing the underlying individual letters.=20
Therefore<BR>in the code charts and names list, a convention has been=20
adopted<BR>in which consonant letters are represented by their "first"=20
form<BR>surrounded by a dotted circle, and vowel letters are =
represented<BR>by a=20
typical glyph fragment attached to a dotted circle. This is<BR>not =
intended to=20
imply direct glyphic composition of those forms,<BR>but merely to =
signify the=20
underlying letters.<BR><BR>Encoding/Rendering of "First Form" Syllables. =
The=20
circled consonants<BR>in the charts U+1200 -&gt; U+1224 are underlying =
letters,=20
they should<BR>not be confused with rendered full first form syllable =
glyphs.=20
As<BR>with all glyphs in the script, the first form syllables are =
encoded<BR>as=20
simple (consonant + vowel) pairs. Thus the glyph "MAE" would<BR>be =
represented=20
in the encoding as U+1203 ETHIOPIAN CONSONANT M plus<BR>U+1230 ETHIOPIAN =
VOWEL=20
AE. This pair would then be rendered via<BR>a "ligature" MAE whose =
appearance=20
would resemble the chart symbol<BR>for U+1203 ETHIOPIAN CONSONANT M =
without the=20
circle.<BR><BR>Encoding/Rendering of Lone Consonants ("Sixth Form"=20
Syllables).<BR>The sixth form syllable glyphs are sometimes pronounced =
as=20
though<BR>they were lone consonants (i.e. the vowel is dropped in=20
speech),<BR>but this does not change their encoding. As with all glyphs =
in=20
the<BR>script, the sixth form syllables are encoded as simple =
(consonant<BR>+=20
vowel) pairs. Thus the spoken lone consonant "M" would be<BR>represented =
in the=20
encoding as U+1203 ETHIOPIAN CONSONANT M plus<BR>U+1235 ETHIOPIAN VOWEL=20
SCHWA.<BR><BR>Variant Glyph Forms. The script sometimes provides =
different=20
glyph<BR>forms to represent the same syllables. It is assumed that=20
these<BR>alternatives do not vary freely, in other words that is=20
appropriate<BR>for a given font to contain only one selected glyph form =
for=20
each<BR>syllable. Therefore no mechanism is provided for specifying=20
glyph<BR>variants within a plain text stream of characters. The =
situation<BR>is=20
analogous to that of the glyph variants of Latin "a" or =
"g".<BR><BR>Letter=20
Names. The Ethiopian script often has multiple letters<BR>corresponding =
to the=20
same Latin letter, making it difficult to<BR>assign unique Latin names.=20
Therefore the names list makes use of<BR>certain devices (such as =
doubling a=20
Latin letter in the name) merely<BR>to create uniqueness; this has no =
relation=20
to the phonetics of the<BR>Ethiopian letters.<BR><BR>Encoding Order and =
Sorting.=20
The order of the letters in the encoding<BR>is based on the traditional=20
alphabetical order. This order differs<BR>from the sort order used for =
one or=20
another language, if only<BR>because in many languages various pairs or =
triplets=20
of letters are<BR>treated as equivalent in the first sorting pass. For =
example,=20
an<BR>Amharic dictionary is likely to start out with a section headed=20
by<BR>three letters:<BR><BR>U+1200 ETHIOPIAN CONSONANT H<BR>U+1202 =
ETHIOPIAN=20
CONSONANT HH<BR>U+120E ETHIOPIAN CONSONANT X<BR><BR>Thus the encoding =
order=20
cannot and does not implement a collation<BR>procedure for any =
particular=20
language using this script.<BR><BR>Space Characters. The traditional =
word=20
separator is U+1255 ETHIOPIAN<BR>WORDSPACE ( : ), but in modern usage a =
plain=20
white wordspace is<BR>becoming common. The ASCII character U+0020 SPACE =
is=20
suitable for<BR>the latter usage, although its (minimum) width is not=20
guaranteed<BR>to be the same as that of the traditional=20
wordspace.<BR><BR>Diacritical Marks. The mark U+030E NON-SPACING DOUBLE=20
VERTICAL<BR>LINE ABOVE may occasionally be used to indicate emphasis=20
or<BR>gemination. If this or other diacritical marks are used, =
they<BR>follow=20
the vowel letter of the syllable to which they apply.<BR><BR>Encoding =
Structure.=20
The Unicode block for the Ethiopian script is<BR>divided into the =
following=20
ranges:<BR><BR>U+1200 to U+1224 Consonant phonetic letters<BR>U+1225 to =
U+122F=20
Currently unassigned<BR>U+1230 to U+123D Vowel phonetic letters (U+1239 =
is an=20
intentional gap)<BR>U+123E to U+123F Currently unassigned<BR>U+1240 to =
U+1254=20
Numbers (U+1240 is an intentional gap)<BR>U+1255 to U+125B =
Punctuation<BR>U+125C=20
to U+125F Currently unassigned<BR><BR><STRONG>Draft October 30, 1992; =
rev=20
93/01/08<BR><BR>ETHIOPIAN CHARACTER NAMES LIST<BR></STRONG><BR>@ =
Consonant=20
phonetic letters<BR>1200 ETHIOPIAN CONSONANT H<BR>1201 ETHIOPIAN =
CONSONANT=20
L<BR>1202 ETHIOPIAN CONSONANT HH<BR>1203 ETHIOPIAN CONSONANT M<BR>1204 =
ETHIOPIAN=20
CONSONANT SZ<BR>1205 ETHIOPIAN CONSONANT R<BR>1206 ETHIOPIAN CONSONANT =
S<BR>1207=20
ETHIOPIAN CONSONANT SH<BR>1208 ETHIOPIAN CONSONANT Q<BR>1209 ETHIOPIAN =
CONSONANT=20
QH<BR>120A ETHIOPIAN CONSONANT B<BR>120B ETHIOPIAN CONSONANT V<BR>120C =
ETHIOPIAN=20
CONSONANT T<BR>120D ETHIOPIAN CONSONANT C<BR>120E ETHIOPIAN CONSONANT =
X<BR>120F=20
ETHIOPIAN CONSONANT N<BR>1210 ETHIOPIAN CONSONANT NY<BR>1211 ETHIOPIAN =
CONSONANT=20
GLOTTAL<BR>1212 ETHIOPIAN CONSONANT K<BR>1213 ETHIOPIAN CONSONANT =
XX<BR>1214=20
ETHIOPIAN CONSONANT W<BR>1215 ETHIOPIAN CONSONANT NULL<BR>1216 ETHIOPIAN =

CONSONANT Z<BR>1217 ETHIOPIAN CONSONANT ZH<BR>1218 ETHIOPIAN CONSONANT =
Y<BR>1219=20
ETHIOPIAN CONSONANT D<BR>121A ETHIOPIAN CONSONANT DD<BR>Oromo<BR>121B =
ETHIOPIAN=20
CONSONANT J<BR>121C ETHIOPIAN CONSONANT G<BR>121D ETHIOPIAN CONSONANT=20
GG<BR>Bilen<BR>121E ETHIOPIAN CONSONANT TH<BR>121F ETHIOPIAN CONSONANT=20
CH<BR>1220 ETHIOPIAN CONSONANT PH<BR>1221 ETHIOPIAN CONSONANT TS<BR>1222 =

ETHIOPIAN CONSONANT TZ<BR>1223 ETHIOPIAN CONSONANT F<BR>1224 ETHIOPIAN =
CONSONANT=20
P<BR>1225<BR>1226<BR>1227<BR>1228<BR>1229<BR>122A<BR>122B<BR>122C<BR>122D=
<BR>122E<BR>122F<BR><BR>@=20
Vowel phonetic letters<BR>1230 ETHIOPIAN VOWEL AE<BR>1231 ETHIOPIAN =
VOWEL=20
U<BR>1232 ETHIOPIAN VOWEL I<BR>1233 ETHIOPIAN VOWEL A<BR>1234 ETHIOPIAN =
VOWEL=20
E<BR>1235 ETHIOPIAN VOWEL SCHWA<BR>1236 ETHIOPIAN VOWEL O<BR>1237 =
ETHIOPIAN=20
VOWEL PHONETIC AE<BR>used primarily with U+1211 ETHIOPIAN CONSONANT=20
GLOTTAL<BR>1238 ETHIOPIAN VOWEL WAE<BR>1239<BR>123A ETHIOPIAN VOWEL =
WI<BR>123B=20
ETHIOPIAN VOWEL WA<BR>123C ETHIOPIAN VOWEL WE<BR>123D ETHIOPIAN VOWEL=20
W<BR>123E<BR>123F<BR><BR>@ Numbers<BR>1240<BR>1241 ETHIOPIAN NUMBER =
ONE<BR>1242=20
ETHIOPIAN NUMBER TWO<BR>1243 ETHIOPIAN NUMBER THREE<BR>1244 ETHIOPIAN =
NUMBER=20
FOUR<BR>1245 ETHIOPIAN NUMBER FIVE<BR>1246 ETHIOPIAN NUMBER SIX<BR>1247=20
ETHIOPIAN NUMBER SEVEN<BR>1248 ETHIOPIAN NUMBER EIGHT<BR>1249 ETHIOPIAN =
NUMBER=20
NINE<BR>124A ETHIOPIAN NUMBER TEN<BR>124B ETHIOPIAN NUMBER =
TWENTY<BR>124C=20
ETHIOPIAN NUMBER THIRTY<BR>124D ETHIOPIAN NUMBER FORTY<BR>124E ETHIOPIAN =
NUMBER=20
FIFTY<BR>124F ETHIOPIAN NUMBER SIXTY<BR>1250 ETHIOPIAN NUMBER =
SEVENTY<BR>1251=20
ETHIOPIAN NUMBER EIGHTY<BR>1252 ETHIOPIAN NUMBER NINETY<BR>1253 =
ETHIOPIAN NUMBER=20
HUNDRED<BR>1254 ETHIOPIAN NUMBER TEN THOUSAND<BR><BR>@ =
Punctuation<BR>1255=20
ETHIOPIAN WORDSPACE<BR>1256 ETHIOPIAN COMMA<BR>modern usage like =
colon<BR>1257=20
ETHIOPIAN COLON<BR>modern usage like semicolon<BR>1258 ETHIOPIAN =
PERIOD<BR>1259=20
ETHIOPIAN NEW COMMA<BR>modern usage<BR>125A ETHIOPIAN QUESTION=20
MARK<BR>archaic<BR>125B ETHIOPIAN PARAGRAPH SEPARATOR<BR>archaic<BR></P>
<HR>

<H2>Copyright</H2>
<P>Copyright =A9 1992-1998 Unicode, Inc. All Rights Reserved. The =
Unicode=20
Consortium makes no expressed or implied warranty of any kind, and =
assumes no=20
liability for errors or omissions. No liability is assumed for =
incidental and=20
consequential damages in connection with or arising out of the use of =
the=20
information or programs contained or accompanying this technical report. =
</P>
<P>Unicode and the Unicode logo are trademarks of Unicode, Inc., and are =

registered in some jurisdictions.</P>
<HR>

<P>&nbsp;</P></BODY></HTML>

