[Koha] Multilingual Searching and ICU Chains

Ere Maijala ere.maijala at helsinki.fi
Sat Nov 23 02:06:05 NZDT 2019


Hi Katrin,

Well, sort of. In the wiki
(https://wiki.koha-community.org/wiki/Elasticsearch) there's a whole
sentence on this:

"Note that all data field mappings will automatically handle alternate
script fields (880) for MARC 21 records."

Implementation details in bug 20244.

--Ere

Katrin Fischer kirjoitti 22.11.2019 klo 14.23:
> Hi Ere,
> 
> love to hear this! :) Do we have documentation about this?
> 
> Katrin
> 
> On 22.11.19 12:12, Ere Maijala wrote:
>> Hi,
>>
>> I might start to sound like a salesman, but the Elasticsearch indexing
>> code in Koha handles and indexes alternate script fields (880)
>> automatically in the correct search fields. :)
>>
>> Regards,
>> Ere
>>
>> Katrin Fischer kirjoitti 22.11.2019 klo 11.44:
>>> Hi Charles,
>>>
>>> if ICU with Zebra is set up correctly, you should be able to search for
>>> the records using the original script (as cataloged) without any further
>>> setup. If the original script is in 880 only the keyword search will
>>> include them by default. But this is not related to ICU - the problem is
>>> that indexing is not set up to include the 880 xxx in the corresponding
>>> indexes.
>>>
>>> If you want to search transliterated forms or need some other specific
>>> things, adjustments to the chains might be needed.
>>>
>>> Hope this helps,
>>>
>>> Katrin
>>>
>>> On 20.11.19 20:39, Charles Kelley wrote:
>>>> Hello, all!
>>>>
>>>>       I am developing a catalog of multilingual, multiscript materials:
>>>> Arabic, Bengali, Burmese, Chinese, English, French, German, Japanese,
>>>> Korean, Mongolian, Russian, Vietnamese, to name but a few. Since both
>>>> Koha
>>>> and MarcEdit are UTF8, I can enter the cataloging records and edit
>>>> them in
>>>> their appropriate script.
>>>>
>>>>       What I cannot do is to search them. I have seen on the Koha
>>>> Community
>>>> wiki is that ICU chains have to be configured. Here's the link:
>>>> https://wiki.koha-community.org/wiki/ICU_Chains_Library. I also
>>>> found an
>>>> exchange about this:
>>>> https://lists.katipo.co.nz/public/koha/2012-January/031714.html.
>>>>
>>>>       Is one to infer that the ICU chain has to be configured for each
>>>> writing system? Are there other multilingual libraries that have
>>>> implemented multilingual, multiscript searching in Koha and how did
>>>> they do
>>>> so?
>>>>
>>>>       Many thanks for your help in this matter.
>>>>
>>> _______________________________________________
>>> Koha mailing list  http://koha-community.org
>>> Koha at lists.katipo.co.nz
>>> https://lists.katipo.co.nz/mailman/listinfo/koha
> _______________________________________________
> Koha mailing list  http://koha-community.org
> Koha at lists.katipo.co.nz
> https://lists.katipo.co.nz/mailman/listinfo/koha

-- 
Ere Maijala
Kansalliskirjasto / The National Library of Finland


More information about the Koha mailing list