Trying to match and overlay ISBN - (pbk) after ISBN
Hi, I'm trying to match and overlay a couple of thousand bibs. I have already loaded approx 9,000 bibs on the koha database. All have ISBNs. But when I try to match and overlay with better records, the incoming records have the same ISBN but with (pbk) appended after the ISBN in the 020 $a - e.g. a190446730X (pbk). This is valid in Marc21. But koha refuses to match on any of these incoming records. I could strip out the (pbk) but that will not fix the problem for new bibs in the future. Any ideas?
Thanks,
Marty McGovern
Marty --
For your purposes, stripping the "(pbk)" is probably the quickest solution for the existing data. The basic ISBN ("190446730X") is also valid MARC21. I consider this multiplicity unfortunate, since it provides an opportunity for mismatching what should be a reliable identifier.
The longer-term fix would be to alter the matching rules, but this should be done very deterministically, with clear tests. False positives would be a dealbreaker. Something like:

    sub isbn_compare ($$) {
        my ($isbn1, $isbn2) = @_;
        foreach ($isbn1, $isbn2) {
            $_ =~ s/\D//g;    # kill everything but digits (this also drops an "X" check digit)
        }
        return $isbn1 eq $isbn2;
    }

--Joe
On Thu, Mar 5, 2009 at 11:55 AM, Martin McGovern <mmg@interleaf.ie> wrote:
Hi,
I’m trying to match and overlay a couple of thousand bibs.
I have already loaded approx 9,000 bibs on the koha database. All have ISBNs. But when I try to match and overlay with better records, the incoming records have the same ISBN but with (pbk) appended after the ISBN in the 020 $a - e.g. a190446730X (pbk).
This is valid in Marc21. But koha refuses to match on any of these incoming records.
I could strip out the (pbk) but that will not fix the problem for new bibs in the future.
Any ideas?
Thanks,
Marty McGovern
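A stricter variant of the comparison Joe suggests could look like the sketch below. It is only an illustration, not Koha's matching code: `normalize_isbn` and `isbn_match` are hypothetical names, and the assumption is that the 020 $a value starts with the ISBN and that any qualifier such as "(pbk)" follows it. Unlike a digits-only substitution, it keeps the "X" check digit and refuses to match when either side does not look like a 10- or 13-character ISBN.

```perl
use strict;
use warnings;

# Hypothetical helper: pull the bare ISBN out of an 020 $a value such as
# "190446730X (pbk)". Returns undef when no plausible ISBN is found.
sub normalize_isbn {
    my ($raw) = @_;
    return unless defined $raw;

    # Leading run of digits, X, hyphens and spaces; stops at the qualifier.
    my ($candidate) = $raw =~ /^\s*([0-9Xx][0-9Xx\- ]*)/;
    return unless defined $candidate;

    $candidate =~ s/[\- ]//g;    # drop hyphens and spaces
    $candidate = uc $candidate;  # treat "x" and "X" alike

    return $candidate if $candidate =~ /^\d{9}[\dX]$/;    # ISBN-10 shape
    return $candidate if $candidate =~ /^\d{13}$/;        # ISBN-13 shape
    return;
}

# Hypothetical matcher: true only when both values normalize to the same ISBN.
sub isbn_match {
    my ($first, $second) = @_;
    my $left  = normalize_isbn($first);
    my $right = normalize_isbn($second);
    return 0 unless defined $left && defined $right;
    return $left eq $right ? 1 : 0;
}
```

For example, `isbn_match('190446730X (pbk)', '190446730X')` is true, while a value with no recognizable ISBN never matches anything. Check digits are not validated here; that would be a reasonable extra test to add.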
For your purposes, stripping the "(pbk)" is probably the quickest solution for the existing data.
As far as I know, there's no easy way of doing batch edits in Koha. Is this correct? If so, all the MARC records would have to be exported, edited, & re-imported.
I think Biblios may have this functionality, but I've not found Biblios to be particularly user-friendly.
Cheers,
Cab Vinton, Director
Sanbornton Public Library
Sanbornton, NH
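The export, edit, re-import route Cab describes could be scripted outside Koha. The sketch below is one possible approach, not a tested Koha workflow: `strip_isbn_qualifier` and `clean_marc_file` are hypothetical names, the MARC::Batch module from the CPAN MARC::Record distribution is assumed to be installed, and the file paths are whatever the export produced. Character-encoding handling is left out, so it should be tried on a copy of the data first.

```perl
use strict;
use warnings;

# Hypothetical helper: drop a trailing parenthesised qualifier,
# e.g. "190446730X (pbk)" becomes "190446730X".
sub strip_isbn_qualifier {
    my ($value) = @_;
    $value =~ s/\s*\([^)]*\)\s*$//;
    return $value;
}

# Hypothetical driver: rewrite the 020 $a of every record in an exported file.
# MARC::Batch is loaded lazily so the helper above works without it.
sub clean_marc_file {
    my ($in_path, $out_path) = @_;
    require MARC::Batch;

    my $batch = MARC::Batch->new('USMARC', $in_path);
    # Assumption: raw output is adequate; UTF-8 records may need another layer.
    open my $out, '>:raw', $out_path or die "Cannot write $out_path: $!";

    while (my $record = $batch->next) {
        for my $field ($record->field('020')) {
            my $isbn = $field->subfield('a');
            next unless defined $isbn;
            $field->update(a => strip_isbn_qualifier($isbn));
        }
        print {$out} $record->as_usmarc;
    }

    close $out or die "Cannot close $out_path: $!";
    return;
}
```

A call such as `clean_marc_file('export.mrc', 'cleaned.mrc')` (file names are placeholders) would produce a file to re-import. Note that this discards the qualifier entirely rather than moving it to another subfield.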
I suspect in 3.2 there may be some better matching for ISBN 10/13 and also the ending character problem - I say that because I am sponsoring some work in that area. I saw that as a huge problem for me and my 67 sites and thousands of records we load monthly.
David Schuster
Cab Vinton wrote:
For your purposes, stripping the "(pbk)" is probably the quickest solution for the existing data.
As far as I know, there's no easy way of doing batch edits in Koha. Is this correct? If so, all the MARC records would have to be exported, edited, & re-imported.
I think Biblios may have this functionality, but I've not found Biblios to be particularly user-friendly.
Cheers,
Cab Vinton, Director Sanbornton Public Library Sanbornton, NH
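On the ISBN 10/13 matching David mentions: one common way to let the two forms match is to convert every ISBN-10 to its ISBN-13 equivalent before comparing. The sketch below shows the standard conversion (prefix 978, recompute the check digit with alternating weights 1 and 3). It is an illustration only and says nothing about how Koha 3.2 implements matching; `isbn10_to_13` and `isbn_key` are hypothetical names, and the input is assumed to be a bare ISBN with qualifiers and hyphens already removed.

```perl
use strict;
use warnings;

# Hypothetical helper: convert a bare ISBN-10 to ISBN-13.
# Returns undef if the input does not have the ISBN-10 shape.
sub isbn10_to_13 {
    my ($isbn10) = @_;
    return unless defined $isbn10 && $isbn10 =~ /^(\d{9})[\dXx]$/;

    # The old check digit is discarded; 978 plus the first nine digits is kept.
    my @digits = split //, "978$1";

    # ISBN-13 check digit: weights alternate 1, 3, 1, 3, ... from the left.
    my $sum = 0;
    for my $i (0 .. $#digits) {
        $sum += $digits[$i] * ($i % 2 ? 3 : 1);
    }
    my $check = (10 - $sum % 10) % 10;

    return join('', @digits) . $check;
}

# Hypothetical helper: a comparison key that is the same for both forms.
sub isbn_key {
    my ($isbn) = @_;
    return unless defined $isbn;
    return $isbn if $isbn =~ /^\d{13}$/;
    return isbn10_to_13($isbn);
}
```

With this, `isbn_key('190446730X')` and `isbn_key('9781904467304')` give the same key, so records carrying either form would compare as equal. The check digit of the incoming ISBN-10 is not validated in this sketch.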
participants (4): Cab Vinton, David Schuster, Joe Atzberger, Martin McGovern