Help from people experienced in Arabic, Hebrew, and CJK languages
Hi all,

What started as a simple problem, "Text::Unaccent doesn't work on RH/CentOS, how do we replace it?", has turned into a good discussion about support for complex UTF-8 languages. The discussion needs help from people with experience in Arabic, Hebrew, and CJK languages.

The link: http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=14759

Please join the discussion even if you don't know Perl, as long as you know one of these well:
- Arabic alphabet and language
- Hebrew alphabet and language
- CJK writing systems and languages

Bye
Zeno Tajoli

--
Zeno Tajoli
/Dipartimento Sviluppi Innovativi/ - Automazione Biblioteche
Email: z.tajoli@cineca.it Fax: 051/6132198
*CINECA* Consorzio Interuniversitario - Sede operativa di Segrate (MI)
Hi,

On Thu, Dec 10, 2015 at 6:01 AM, Tajoli Zeno <z.tajoli@cineca.it> wrote:
The link: http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=14759
Please try to join the discussion also if you don't know perl, but if you know well one of those: - Arabic alphabet and language - Hebrew alphabet and language - CJK writing systems and languages
In addition, it would be great to have input from Koha users using *any* language that contains diacritics, as there is a functionality question underlying the discussion.

At the moment, the module in question is used to remove diacritics when automatically generating a public catalog username -- e.g., if a patron whose surname is Müller is registered with a Koha database, would they expect, based on their use of other websites, that their username could include the diacritic (e.g., "müller")? Or would they expect that it would never include the diacritics (e.g., "muller")?

Regards,
Galen

--
Galen Charlton
Infrastructure and Added Services Manager
Equinox Software, Inc. / The Open Source Experts
email: gmc@esilibrary.com
direct: +1 770-709-5581
cell: +1 404-984-4366
skype: gmcharlt
web: http://www.esilibrary.com/
Supporting Koha and Evergreen: http://koha-community.org & http://evergreen-ils.org
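To make the two options in the question concrete, here is a minimal sketch of diacritic removal for a generated username. It is not Koha's code and not the algorithm used by Text::Unaccent; it assumes Python's standard `unicodedata` module, and the function names `strip_diacritics` and `suggest_username` are hypothetical, chosen only for illustration.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Drop combining marks after canonical decomposition (NFD)."""
    decomposed = unicodedata.normalize("NFD", text)
    # unicodedata.combining() returns a non-zero class for combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def suggest_username(surname: str) -> str:
    """Hypothetical helper: lower-cased surname without diacritics."""
    return strip_diacritics(surname).lower()


print(suggest_username("M\u00fcller"))  # "Müller" becomes "muller"
```

Letters that have no canonical decomposition, such as "ø" or "ß", pass through this sketch unchanged, so a real replacement would need an explicit policy for them.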
On 12/10/2015 07:35 AM, Galen Charlton wrote:
At the moment, the module in question is used to remove diacritics when automatically generating a public catalog username -- e.g., if a patron whose surname is Müller is registered with a Koha database, would they expect, based on their use of other websites, that their username could include the diacritic (e.g., "müller")? Or would they expect that it would never include the diacritics (e.g., "muller")?
I don't know about umlauts, but at least in Hebrew it would be fine to remove the diacritics. They are mostly unused in adult daily life, and I can't imagine anyone trying to add diacritics to their name (most people don't even know how to type them). And if the odd person comes along who tries that, removing them would be totally fine.

I'll jump over to the other discussion and see if I can help. Although I'm a Koha user/admin and I know Hebrew, I don't mix the two :)
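As an illustration of the Hebrew case, the sketch below removes vowel points (niqqud) by dropping every character in Unicode general category Mn (nonspacing mark). It assumes Python's standard `unicodedata` module; `strip_marks` is a hypothetical name, and nothing here is taken from Koha. The sample word is written with escape sequences so the code points are unambiguous.

```python
import unicodedata


def strip_marks(text: str) -> str:
    """Remove nonspacing marks (category Mn), which include Hebrew points."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


# "shalom" with points: shin + shin dot + qamats, lamed, vav + holam, final mem
pointed = "\u05e9\u05c1\u05b8\u05dc\u05d5\u05b9\u05dd"
unpointed = "\u05e9\u05dc\u05d5\u05dd"

print(strip_marks(pointed) == unpointed)  # True
```

The base consonants are left untouched, which matches the expectation above that a name typed with points would simply lose them.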
2015-12-10 23:35 GMT+08:00 Galen Charlton <gmc@esilibrary.com>:
At the moment, the module in question is used to remove diacritics when automatically generating a public catalog username -- e.g., if a patron whose surname is Müller is registered with a Koha database, would they expect, based on their use of other websites, that their username could include the diacritic (e.g., "müller")? Or would they expect that it would never include the diacritics (e.g., "muller")?
I am not sure what happens in Koha, but Koha developers may find some clues in this old document: "The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)" by Joel Spolsky, Wednesday, October 08, 2003, http://www.joelonsoftware.com/articles/Unicode.html

There is a Chinese version: http://www.csie.ntu.edu.tw/~p92005/Joel/Unicode.html

Did Koha deal with Unicode properly from the very beginning?

--
Wishing you all the best. . . .
Anthony Mao 毛慶禎
+886 2 29052334 (voice) +886 2 29017405 (FAX)
participants (4)
- Anthony Mao
- Galen Charlton
- Tajoli Zeno
- Yuval Hager