[Koha] normalization rule - defining a matching rule
Archives and Collections Society
paul.a at aandc.org
Sat Jan 15 05:55:39 NZDT 2011
An expansion of Pete's email below; we have a major problem using
"Normalization Rules" and "Normalization Checks" at
<http://koha-admin/cgi-bin/koha/admin/matching-rules.pl?op=edit_matching_rule&matcher_id=1>
to compare our biblio records with those of LoC (Library of Congress) for
staged records.
Please note that we have searched the Koha docs; if this is a bug, please
advise.
Take the following examples:
LoC ... 020 // $a 9004158634 (hbk. : alk. paper)
Ours ... 020 // $a 9004158634
LoC ... 020 // $a 0870213601 : $c $22.95
Ours ... 020 // $a 0870213601
Koha does *not* match these. (the paranthetical comment in the first
example appears "legal" under MARC; the colon in the second appears to be
contrary to MARC definitions.)
The Koha admin page above gives two possibilities: (a) defining the
"Normalization rule:" for the matchpoint (this appears to be matcher.pm
which looks for ISBN and the date field in 008), or (b) adding a "match
check" which might add flexibility. We have played unsuccessfully with
these for a few days now.
Questions: Is there some documentation on this anywhere? If not, has
anyone here got a solution?
Thanks in advance,
Paul
Tired old sys-admin (now getting slightly frustrated)
At 01:57 PM 1/10/2011 -0800, you wrote:
>Can someone please point me to more info about normalization rules in the
>context of defining a matching rule in Koha?
>
>Scenario:
>Our set of books currently in Koha was imported from Marc that we created
>using MarcEdit (mapping from an Excel spreadsheet). As a result our Marc
>information is likely "not as good" as that of LOC for any given book. So
>now we are trying to "overlay", or "overwrite" as many of the biblio records
>in our Koha database with (hopefully) "better" MARC records from LOC (or
>wherever).
>
>Problem:
>When I do a bulk z39.50 lookup on ISBN numbers (using MarcEdit), the ISBN
>numbers I receive from LOC often contain trailing text (like "paperback", or
>":"). When I stage these records and try to match the staged records on
>ISBN in Koha this trailing text prevents many matches.
>
>I am assuming that it is the job of some sort of normalization routine to
>force a match, however I am not able to find any documentation or examples
>describing how one would define his/her own normalization routine.
>
>Can someone please provide an example?
>
>Thanks,
>Pete.
>--
>View this message in context:
>http://koha.1045719.n5.nabble.com/normalization-rule-defining-a-matching-rule-tp3335536p3335536.html
>Sent from the Koha - Discuss mailing list archive at Nabble.com.
>_______________________________________________
>Koha mailing list http://koha-community.org
>Koha at lists.katipo.co.nz
>http://lists.katipo.co.nz/mailman/listinfo/koha
---
Archives and Collections (ACS) Society
205, Main Street, Picton, Ontario, K0K 2T0, Canada
http://www.AandC.org
Canadian Charitable Organization 88721 9921 RR0001
Dedicated to maritime conservation and education.
More information about the Koha
mailing list