2009/9/3 Ceki Gulcu <ceki@qos.ch>
<snipped>
I am actually considering removing the parent-child relationship
between language and language+country. It just does not make sense to
do a different translation for en_UK and en_US. You would do one
translation for English "en" serving all English speaking
countries. Similarly, it does not make sense to do a French
translation for France and another for Switzerland as the language is
the same (French). Doing a Swiss version of a German web-site could
make sense because Swiss-German is considerably different than the
German spoken in Germany. Actually, the Swiss-German spoken in
different regions of Switzerland have notable differences as well. But
given that Swiss-German is *not* a written language, a web-site for
Swiss-German customers would be written in German. As far as I can
tell, for most practical purposes, it's the language that matters, not
the country and (almost) certainly not the region.

For translation purposes, "en_UK" and "en_US" can just map to
"en". This does not mean that en_UK and en_US should be reduced to
"en" because the respective currency or date format conventions are
likely to be different. In CAI10 the emphasis is on translations.
...  Translating applications to different languages is an expensive and
cumbersome process.  

I understand and agree with your emphasis on translation, and the difference between translation and other localisation issues.  But I don't agree that this leads us away from Java's Locale as the basis for Cal10n.  I like things to be kept simple - however, on the principle of least surprise, maintaining the existing well-known model keeps Cal10n more approachable to potential users.  In the case of English, UK and US share almost all locale settings except for date format, but the grammar and spellings are a little different so the translation is a little different.

It is clear that organisations do feel they need to support finer-grained translation than just "English" or "French".  For example, Mozilla and Open Office are both available with en-UK language packs as well as en-US.  Cal10n must not cut off such usage, IMHO.

Therefore I recommend you keep the parent-child relationship between language and language+country.  As I said before, I think it would be wise to retain the region level as well, although I have a less-strong view on that (I can't remember ever seeing a serious use of it).
 
Hopefully, CAL10N can alleviate some of the pain
with a more streamline process. My next goal is to eliminate
native2ascii converter from the translation process.


Wahey!!!  One UTF-8 to rule them all!  (Or maybe UTF-8 or UTF-16 using the Unicode BOM to indicate which, with UTF-8 being the default.)   Native2ascii was one of Sun's early short-sighted mistakes.

I hope you can appreciate my well-intended frank views.  :)
Rick
--
Big Bee Consultants Limited : Registered in England & Wales No. 6397941
Registered Office: 71 The Hundred, Romsey, Hampshire, SO51 8BZ