<span style="font-family: arial,helvetica,sans-serif;">Maty --</span><br style="font-family: arial,helvetica,sans-serif;"><br style="font-family: arial,helvetica,sans-serif;"><span style="font-family: arial,helvetica,sans-serif;">For your purposes, stripping the "(pbk)" is probably the quickest solution for the existing data. The basic ISBN ("</span><span style="font-size: 10pt; font-family: arial,helvetica,sans-serif; color: black;">190446730X") is also valid MARC21. I consider this multiplicity unfortunate since it provides an opportunity for mismatching what should be a reliable identifier. <br>
<br>The longer term fix would be to alter the matching rules, but this should be done very deterministically including clear tests. False positives would be a dealbreaker. <br><br>Something like:<br><br>sub isbn_compare ($$) {<br>
my ($isbn1, $isbn2) = @_;<br> foreach (</span><span style="font-size: 10pt; font-family: arial,helvetica,sans-serif; color: black;">$isbn1, $isbn2) {</span><span style="font-size: 10pt; font-family: arial,helvetica,sans-serif; color: black;"><br>
</span><span style="font-size: 10pt; font-family: arial,helvetica,sans-serif; color: black;"> $_ =~ s/\D//g; # kill everything but digits<br></span><span style="font-size: 10pt; font-family: arial,helvetica,sans-serif; color: black;"></span> }<br>
<span style="font-size: 10pt; font-family: arial,helvetica,sans-serif; color: black;"> return $isbn1 eq $isbn2;<br>}<br><br>--Joe<br><br></span><div style="font-family: arial,helvetica,sans-serif;" class="gmail_quote">On Thu, Mar 5, 2009 at 11:55 AM, Martin McGovern <span dir="ltr"><<a href="mailto:mmg@interleaf.ie">mmg@interleaf.ie</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div bgcolor="white" link="#1F86FF" vlink="#005DC9" lang="EN-US">
<div>
<p>Hi,</p>
<p>I’m trying to match and overlay a couple of thousand bibs.<br></p>
<p>I have already loaded approx 9,000 bibs on the koha
database. All have ISBNs. But when I try to match and overlay with better
records, the incoming records have the same ISBN but with (pbk) appended after
the ISBN in the 020 $a - e.g. <span style="font-size: 10pt; color: black;"></span><span style="font-size: 8pt; color: black;"></span><b><span style="font-size: 10pt; color: black;">a</span></b><span style="font-size: 10pt; color: black;">190446730X (pbk)</span>.<br>
<br>This is valid in Marc21. But koha refuses to match on any of
these incoming records.<br><br>I could strip out the (pbk) but that will not fix the
problem for new bibs in the future.</p>
<p>Any ideas?<br><br>Thanks,</p><p><span style="color: navy;">Marty
McGovern <br></span></p></div></div></blockquote></div>