When I do a "rebuild_zebra.pl -b" I allways get the following error:
problem with :577 : :38: parser error : Premature end of data in tag subfield line 38 <subfield code="6">SCH ^ :38: parser error : Premature end of data in tag datafield line 33 <subfield code="6">SCH ^ :38: parser error : Premature end of data in tag record line 2 <subfield code="6">SCH ^ at /usr/lib/perl5/XML/LibXML/SAX/Parser.pm line 31
<?xml version="1.0" encoding="UTF-8"?> <record xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/ standards/marcxml/schema/MARC21slim.xsd" xmlns="http://www.loc.gov/MARC21/slim">
<leader>00521nam a2200133 4500</leader> <controlfield tag="008"> 960105s1970||||||||||||||||||||||||||||ger</controlfield> <datafield tag="245" ind1="0" ind2="4"> <subfield code="a">Die schönsten Helden- und Rittersagen des Mittelalters /</subfield> <subfield code="c">[Für die Jugend bearb. von] Gerhard Aick ; Mit
It looks like you biblio records are not encoded in UTF8 but in ISO-8859-1. For whatever reason, utf8 records are transcoded into iso-8859-1. In this case, a properly encoded character like ö (schönsten) is transformed in several octets ö Consequently, you get a new line in the middle of a tag, which is not accepted by SAX parser. You have to determine if your biblio records are properly encoded in Koha DB. If this is not the case, something get wrong when you imported them. You have to tell more about your source of biblio records and send an sample. If your records are correct in Koha DB, it means that rebuild_zebra is responsible to transform them. Hope this help.