Summary of i18n ad hoc group meeting - 21st March 2011

Phillips, Addison addison at lab126.com
Fri Mar 25 13:10:28 PDT 2011


Those would be BCP 47 language tags too, though. I guess the question is what level of support the spec should contain (MUST/SHOULD/MAY support RFC 6067).

Addison

From: Shawn Steele [mailto:Shawn.Steele at microsoft.com]
Sent: Friday, March 25, 2011 12:56 PM
To: Phillips, Addison; Nebojša Ćirić; es-discuss at mozilla.org
Subject: RE: Summary of i18n ad hoc group meeting - 21st March 2011

I think we meant allowing the -u extensions for, at least, collation variants.


- Shawn

 
http://blogs.msdn.com/shawnste
Selfhost a custom locale from \\scratch2\scratch\shawnste\customlocaledrop\install.bat<file:///\\scratch2\scratch\shawnste\customlocaledrop\install.bat>
(Selfhost 7929)

From: es-discuss-bounces at mozilla.org [mailto:es-discuss-bounces at mozilla.org] On Behalf Of Phillips, Addison
Sent: Poʻalima, Malaki 25, 2011 10:53 AM
To: Nebojša Ćirić; es-discuss at mozilla.org
Subject: RE: Summary of i18n ad hoc group meeting - 21st March 2011

Hi Nebojša,

Some comments follow.


1.       “Use Unicode identifier vs. BCP47 in the API” It isn’t clear what you mean by this. I would strongly prefer that we use BCP 47 identifiers. If you mean “allow the Unicode locales extension to BCP 47”, I’m fine, but I don’t see why we would want to use the older LDML identifiers at all.

2.       For Collator, I’d like an option “ignoreSpaces”. I know that the default implementation of ICU’s collator ignores spaces, but I have a number of real-world cases in which I want the opposite behavior (in which I tailor my collator using rules).

Particularly important to me is the sorting of book titles. If I have two strings: “The New Fishing Bible” and “The Newer Fishing Bible”, I want to sort them in the order given, but most collators ignore spaces (comparing the second ‘e’ in Newer to the ‘F’ in Fishing). So I want the option of making whitespace meaningful for collation.

Since most implementations are similar to ICU, though, maybe this would be the one “strange” option (“compareWhitespace”???)

3.       “Start with ICU patterns and see if they work for everybody” I support establishing the pattern language early on. I need to be able to use Message, Choice, and Plural formatters and it’s all too common to need to specify particular patterns in those cases (for substitutions), not to mention direct usage in number and date formatters. The symbols used by ICU/CLDR are common enough and, if incomplete for a given implementers needs (Microsoft?), could be appropriately extended. Since these symbols are also used by Java, PHP, and some other languages, it would be unsurprising to use them in ECMAScript.

4.       “Allow short/long dateType to get value from the system or cloud” I’m not sure I understand this one. Was this the part of the conversation in which we allowed the pattern to be garnered either from the system (e.g. the Regional Options control panel on Windows) or from the implementation??

Addison

Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)

Internationalization is not a feature.
It is an architecture.



From: Nebojša Ćirić [mailto:cira at google.com]
Sent: Monday, March 21, 2011 5:29 PM
To: es-discuss at mozilla.org
Subject: Summary of i18n ad hoc group meeting - 21st March 2011

Started implementation of collator in Chrome and hit a problem:

var coll = locale.collator();
array.sort(coll.compare);

Compare method gets bound to the undefined or global object at the call site. Erik mentioned that this problem will be solved in Harmony by passing additional "this" parameter to for-each and likes. We would like to propose extending this syntax to the sort method too.

We discussed each part of the API in order to get detailed parameters of each constructor and method.

General:

 *   Add options property to each class that would give you actual value for the user parameters. For example, if user asked for islamic calendar, and we only have islamic-civil, we set calendar property to islamic-civil. Allows developer to iterate until satisfied with the result.
 *   Use Unicode identifier vs. BCP47 in the API
Collator:

 *   numeric - specifies numeric sort (9 comes before 12)
 *   ignoreVariants - ignore all of case, width and kana
 *   ignoreWidth, ignoreCase and ignoreKana - subvariants we may implement to fine tune the behavior
 *   ignoreAccents - ignore accents
 *   ignoreSymbols - ignore punctuation and symbols
 *   variant - phonebook, ... - string
NumberFormat:

 *   Allow patterns to specify grouping, currency symbol position and sign location
 *   Start with ICU patterns and see if they work for everybody
 *   Don't support overrides for grouping separator and decimal point for now
 *   Specify both currencySymbol and currencyCode as override
DateTimeFormatSymbols:

 *   Added Era and day period methods (AM/PM)
 *   Moved all methods to DateTimeFormat class
 *   Remove DTFSymbols class
DateTimeFormat:

 *   Specify calendar names better (move work to Unicode/LDML and point to their document).
 *   Allow short/long dateType to get value from the system or cloud.
 *   .options[skeleton] should contain best match for the given skeleton
--
Nebojša Ćirić
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.mozilla.org/pipermail/es-discuss/attachments/20110325/50d97802/attachment-0001.html>


More information about the es-discuss mailing list