Summary of i18n ad hoc group meeting - 21st March 2011
addison at lab126.com
Fri Mar 25 10:52:36 PDT 2011
Some comments follow.
1. “Use Unicode identifier vs. BCP47 in the API” It isn’t clear what you mean by this. I would strongly prefer that we use BCP 47 identifiers. If you mean “allow the Unicode locales extension to BCP 47”, I’m fine, but I don’t see why we would want to use the older LDML identifiers at all.
2. For Collator, I’d like an option “ignoreSpaces”. I know that the default implementation of ICU’s collator ignores spaces, but I have a number of real-world cases in which I want the opposite behavior (in which I tailor my collator using rules).
Particularly important to me is the sorting of book titles. If I have two strings: “The New Fishing Bible” and “The Newer Fishing Bible”, I want to sort them in the order given, but most collators ignore spaces (comparing the second ‘e’ in Newer to the ‘F’ in Fishing). So I want the option of making whitespace meaningful for collation.
Since most implementations are similar to ICU, though, maybe this would be the one “strange” option (“compareWhitespace”???)
3. “Start with ICU patterns and see if they work for everybody” I support establishing the pattern language early on. I need to be able to use Message, Choice, and Plural formatters and it’s all too common to need to specify particular patterns in those cases (for substitutions), not to mention direct usage in number and date formatters. The symbols used by ICU/CLDR are common enough and, if incomplete for a given implementers needs (Microsoft?), could be appropriately extended. Since these symbols are also used by Java, PHP, and some other languages, it would be unsurprising to use them in ECMAScript.
4. “Allow short/long dateType to get value from the system or cloud” I’m not sure I understand this one. Was this the part of the conversation in which we allowed the pattern to be garnered either from the system (e.g. the Regional Options control panel on Windows) or from the implementation??
Globalization Architect (Lab126)
Chair (W3C I18N WG)
Internationalization is not a feature.
It is an architecture.
From: Nebojša Ćirić [mailto:cira at google.com]
Sent: Monday, March 21, 2011 5:29 PM
To: es-discuss at mozilla.org
Subject: Summary of i18n ad hoc group meeting - 21st March 2011
Started implementation of collator in Chrome and hit a problem:
var coll = locale.collator();
Compare method gets bound to the undefined or global object at the call site. Erik mentioned that this problem will be solved in Harmony by passing additional "this" parameter to for-each and likes. We would like to propose extending this syntax to the sort method too.
We discussed each part of the API in order to get detailed parameters of each constructor and method.
* Add options property to each class that would give you actual value for the user parameters. For example, if user asked for islamic calendar, and we only have islamic-civil, we set calendar property to islamic-civil. Allows developer to iterate until satisfied with the result.
* Use Unicode identifier vs. BCP47 in the API
* numeric - specifies numeric sort (9 comes before 12)
* ignoreVariants - ignore all of case, width and kana
* ignoreWidth, ignoreCase and ignoreKana - subvariants we may implement to fine tune the behavior
* ignoreAccents - ignore accents
* ignoreSymbols - ignore punctuation and symbols
* variant - phonebook, ... - string
* Allow patterns to specify grouping, currency symbol position and sign location
* Start with ICU patterns and see if they work for everybody
* Don't support overrides for grouping separator and decimal point for now
* Specify both currencySymbol and currencyCode as override
* Added Era and day period methods (AM/PM)
* Moved all methods to DateTimeFormat class
* Remove DTFSymbols class
* Specify calendar names better (move work to Unicode/LDML and point to their document).
* Allow short/long dateType to get value from the system or cloud.
* .options[skeleton] should contain best match for the given skeleton
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the es-discuss