[Koha] Corrupted record

Frederic Demians frederic at tamil.fr
Fri Jun 20 05:36:04 NZST 2008


> When I do a "rebuild_zebra.pl -b" I allways get the following error:
>
> problem with :577 : :38: parser error : Premature end of data in tag
> subfield line 38
> <subfield code="6">SCH
> ^
> :38: parser error : Premature end of data in tag datafield line 33
> <subfield code="6">SCH
> ^
> :38: parser error : Premature end of data in tag record line 2
> <subfield code="6">SCH
> ^
> at /usr/lib/perl5/XML/LibXML/SAX/Parser.pm line 31
>
> <?xml version="1.0" encoding="UTF-8"?>
> <record
> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
> xsi:schemaLocation="http://www.loc.gov/MARC21/slim
> http://www.loc.gov/ standards/marcxml/schema/MARC21slim.xsd"
> xmlns="http://www.loc.gov/MARC21/slim">
>
> <leader>00521nam a2200133 4500</leader>
> <controlfield tag="008">
> 960105s1970||||||||||||||||||||||||||||ger</controlfield>
> <datafield tag="245" ind1="0" ind2="4">
> <subfield code="a">Die schönsten Helden- und Rittersagen des
> Mittelalters /</subfield>
> <subfield code="c">[Für die Jugend bearb. von] Gerhard Aick ; Mit

It looks like you biblio records are not encoded in UTF8 but in
ISO-8859-1. For whatever reason, utf8 records are transcoded into
iso-8859-1. In this case, a properly encoded character like ö
(schönsten) is transformed in several octets ö Consequently, you get a
new line in the middle of a tag, which is not accepted by SAX parser.

You have to determine if your biblio records are properly encoded in
Koha DB. If this is not the case, something get wrong when you imported
them. You have to tell more about your source of biblio records and send
an sample.

If your records are correct in Koha DB, it means that rebuild_zebra is
responsible to transform them.

Hope this help.


More information about the Koha mailing list