[ocaml-i18n] Native UTF-8 strings

Yamagata Yoriyuki yoriyuki at mbg.ocn.ne.jp
Tue Nov 25 17:17:44 PST 2003


It looks like most people has moved to Orcaware, so let continue the
discussion.  This message contains a Japanese character.  If this
causes a serious problem for someone, please let me know.

From: Jun.Furuse at inria.fr
Subject: Re: [ocaml-i18n] Native UTF-8 strings
Date: Mon, 24 Nov 2003 16:05:29 +0100

> Asian identifiers inside Caml language? Funny, but we would carefully
> determine how to treat Caml's case sensitiveness in such Asian
> languages without capital/non-capital differences of letters. (In
> perl, there is no such a problem, I think.) If we go to this direction,
> we should start from Caml programming in Greece and Russian, 
> which seem to be easier :-).

Mame proposed to make the prefix "激" (geki) equivalent to
capitalization.  See 2003/05/08 entry of
http://dame.dyndns.org/memo.cgi (Japanese)

But seriously, we should follow Unicode character properties here.
Unicode standard defines all ideographs are "neutral" (no casing).
all neutrals in ocaml identifiers could be treated as lowercases.  If
someone want an ideograph module name, we can use some
convention like prefixing ideographs by "M".  Same for variants.
--
Yamagata Yoriyuki



More information about the Ocaml-i18n mailing list