[Koha] Elastic search for Arabic
David Cook
dcook at prosentient.com.au
Fri Aug 30 12:40:50 NZST 2024
Hi Mohamad,
Does your collection only contain Arabic or does it contain multiple languages?
I've ben considering moving an Arabic/French/English collection to Elasticsearch, so I'd love to hear more about your experience.
David Cook
Senior Software Engineer
Prosentient Systems
Suite 7.03
6a Glen St
Milsons Point NSW 2061
Australia
Office: 02 9212 0899
Online: 02 8005 0595
-----Original Message-----
Date: Thu, 29 Aug 2024 06:27:57 +0000
From: Mohamad F Barham <mbarham at birzeit.edu>
To: Fridolin SOMERS <fridolin.somers at biblibre.com>,
"koha at lists.katipo.co.nz" <koha at lists.katipo.co.nz>
Subject: Re: [Koha] Elastic search for Arabic
Message-ID:
<TYZPR01MB38233C5A86A7B129CF7D4ECFA3962 at TYZPR01MB3823.apcprd01.prod.exchangelabs.com>
Content-Type: text/plain; charset="utf-8"
Dears,
I just need to update you regarding elastic search in Arabic,
SOLVED
Solution was so simple, using elastic search built-in arabic analyzer (REF https://www.elastic.co/guide/en/elasticsearch/reference/7.17/analysis-lang-analyzer.html#arabic-analyzer )
Using kibana opened biblio index settings, added
# this will remove the specified words from the stemmer
"index.analysis.filter.arabic_keywords.keywords": [
"الله"
],
"index.analysis.filter.arabic_keywords.type": "keyword_marker",
# this for Arabic stemmer filter
"index.analysis.filter.arabic_stemmer.type": "stemmer",
"index.analysis.filter.arabic_stemmer.language": "arabic",
-------------
Then add the filters to the current analyzer (order is important)
"index.analysis.analyzer.analyzer_standard.filter": [
"icu_folding",
"arabic_keywords",
"arabic_stemmer"
],
----------------
Then reindex from terminal
koha-elasticsearch --rebuild -b -c 2000 -p 8 koha
Mohamad Barham
System Engineer | Information Technology Department
Birzeit University
P.O.Box. 14, Birzeit, Palestine
Tel: + 970 22982012 | Mob: +970 597 861929 | Ext: 5616
mbarham at birzeit.edu | www.birzeit.edu<http://www.birzeit.edu/>
More information about the Koha
mailing list