Hi Kathrin
We looked into this ages ago and it didn't seem possible to achieve both: treating the hyphen (-) as a space and not as a space at the same time. Maybe we missed something. If there is a solution, I'd be interested in a how-to! :)
Just to clarify: We don't want to treat the hyphen as a space. We just want to ignore/omit it while searching. (Well, maybe the "vice versa" part of my e-mail was not all that sophisticated...) So "Sinti-Swing" would NOT become "Sinti Swing" (as is currently the case) but just "SintiSwing". Of course, a search term like "Sinti Swing" should stay unaltered, as there is no hyphen in it anyway.
Best wishes: Michael
--
Geschäftsführer · Diplombibliothekar BBS, Informatiker eidg. Fachausweis
Admin Kuhn GmbH · Pappelstrasse 20 · 4123 Allschwil · Schweiz
T 0041 (0)61 261 55 61 · E mik@adminkuhn.ch · W www.adminkuhn.ch
On 24.09.19 20:47, Michael Kuhn wrote:
Hi David
Many thanks for your reply and the hints!
After a standard installation of Koha 18.11, CHR indexing is used, so the configuration is done in the file "word-phrase-utf.chr".
A catalog search
* for "Sintiswing" shows 1 hit
* for "Sinti-Swing" shows 18 hits; the hyphen is used as a breaking character, so any record containing "Sinti-Swing" or "Sinti" and "Swing" is found, but not "Sintiswing"
I changed the following line, omitting the hyphen (between comma and dot):
space {\001-\040}!"#$%&'\()*+,./:;<=>?@\[\\]^_`\{|}~’{\x88-\x89}{\x98-\x9C}¡¿«»
After a Zebra reindexing, a catalog search
* for "Sintiswing" shows 1 hit
* for "Sinti-Swing" now shows only 8 hits; the hyphen is no longer used as a breaking character, so any record containing "Sinti Swing" or "Sinti-Swing" is found, but not "Sintiswing"
I also tried adding "map (-) @", but this led back to the original results.
In short: My change of configuration didn't lead to the desired result... A search for "Sintiswing" should also find "Sinti-Swing", and vice versa. This is not the case.
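To make the desired behaviour concrete, here is a minimal Python sketch. It is only an illustration of the two normalization ideas, not Zebra code, and it does not model how Zebra actually processes a CHR file. The function names (tokens_hyphen_breaks, tokens_hyphen_dropped, matches) are hypothetical and chosen just for this example.

```python
def tokens_hyphen_breaks(text: str) -> list[str]:
    """Treat the hyphen as a breaking character (like a space)."""
    return text.replace("-", " ").lower().split()


def tokens_hyphen_dropped(text: str) -> list[str]:
    """Omit the hyphen entirely, joining the parts on either side of it."""
    return text.replace("-", "").lower().split()


def matches(query: str, record: str, tokenize) -> bool:
    """True if every token of the query occurs among the record's tokens.

    Assumption for this sketch: the same tokenizer is applied to both
    the search term and the indexed record text.
    """
    record_tokens = set(tokenize(record))
    return all(token in record_tokens for token in tokenize(query))


if __name__ == "__main__":
    pairs = [
        ("Sintiswing", "Sinti-Swing"),
        ("Sinti-Swing", "Sintiswing"),
        ("Sinti Swing", "Sinti-Swing"),
    ]
    for tokenize in (tokens_hyphen_breaks, tokens_hyphen_dropped):
        for query, record in pairs:
            found = matches(query, record, tokenize)
            print(f"{tokenize.__name__}: {query!r} vs {record!r}: {found}")
```

In this toy model, dropping the hyphen makes "Sintiswing" and "Sinti-Swing" find each other, while a search for "Sinti Swing" (with a space) no longer finds "Sinti-Swing". Whether that trade-off is acceptable, and whether Zebra can be configured to behave this way, is exactly the open question.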
Since I couldn't find any documentation about CHR indexing - does anyone know where to find out more about the CHR way of indexing?
Best wishes: Michael
Koha mailing list http://koha-community.org
Koha@lists.katipo.co.nz
https://lists.katipo.co.nz/mailman/listinfo/koha