Help with documentation errors
Hello all, I noticed that the 3.8 manual wasn't updating using the cron job we run nightly and so Larry found the following errors (note that it's not just the 3.8 English manual): 3.10, French: fr/manual.xml:2181: parser error : CharRef: invalid decimal value les amendes est de 5 € (c'est | dire qu'un adhérent ayant accumulé ^ fr/manual.xml:2181: parser error : xmlParseCharRef: invalid xmlChar value 0 les amendes est de 5 € (c'est | dire qu'un adhérent ayant accumulé ^ fr/manual.xml:2186: parser error : CharRef: invalid decimal value qui porterait la somme totale due par l'adhérent | 5 €. Selon ^ fr/manual.xml:2186: parser error : xmlParseCharRef: invalid xmlChar value 0 qui porterait la somme totale due par l'adhérent | 5 €. Selon ^ fr/manual.xml:5968: parser error : Opening and ending tag mismatch: chapter line 85 and section </section> ^ fr/manual.xml:15191: parser error : Opening and ending tag mismatch: book line 4 and section </section> ^ fr/manual.xml:15192: parser error : Extra content at the end of the document </chapter> ^ unable to parse fr/manual.xml 3.8, English: en/manual.xml:41551: parser error : Premature end of data in tag para line 41550 information. ^ en/manual.xml:41551: parser error : Premature end of data in tag section line 41527 information. ^ en/manual.xml:41551: parser error : Premature end of data in tag appendix line 41521 information. ^ en/manual.xml:41551: parser error : Premature end of data in tag book line 4 information. ^ unable to parse en/manual.xml 3.6, French: fr/manual.xml:18780: parser error : CharRef: invalid decimal value Ne pas rentrer de signes monétaires ($, € )</para> ^ fr/manual.xml:18780: parser error : xmlParseCharRef: invalid xmlChar value 0 Ne pas rentrer de signes monétaires ($, € )</para> ^ fr/manual.xml:18818: parser error : CharRef: invalid decimal value Ne pas rentrer de signes monétaires ($, € )</para> ^ fr/manual.xml:18818: parser error : xmlParseCharRef: invalid xmlChar value 0 Ne pas rentrer de signes monétaires ($, € )</para> ^ unable to parse fr/manual.xml -- So, it looks like the 3.10 French, 3.8 English, and 3.6 French manuals are not updating. However, the 3.6 English one should be. My question is, how do encoding errors like the ones on the English manual happen? I looked at that line in the code and see nothing, I even deleted that line and still can't save the file because it tells me that there are encoding errors in the file. Any help would be greatly appreciated so that I can get the 3.8 manual back on track. Nicole PS. If you know how to fix it you're free to submit a patch of course!!
Interestingly enough Jenkins has been telling us that for ages. http://jenkins.koha-community.org/job/Koha_Docs/ (red is bad, if you click on a job then console output you will see the error messages) Jenkins tests it after every push so it is well worth paying attention to. I don't see any encoding errors just invalid xml, caused by missing tags or badly formed entities. Chris Nicole Engard <nengard@gmail.com> wrote:
Hello all,
I noticed that the 3.8 manual wasn't updating using the cron job we run nightly and so Larry found the following errors (note that it's not just the 3.8 English manual):
3.10, French: fr/manual.xml:2181: parser error : CharRef: invalid decimal value les amendes est de 5 € (c'est | dire qu'un adhérent ayant accumulé ^ fr/manual.xml:2181: parser error : xmlParseCharRef: invalid xmlChar value 0 les amendes est de 5 € (c'est | dire qu'un adhérent ayant accumulé ^ fr/manual.xml:2186: parser error : CharRef: invalid decimal value qui porterait la somme totale due par l'adhérent | 5 €. Selon ^ fr/manual.xml:2186: parser error : xmlParseCharRef: invalid xmlChar value 0 qui porterait la somme totale due par l'adhérent | 5 €. Selon ^ fr/manual.xml:5968: parser error : Opening and ending tag mismatch: chapter line 85 and section </section> ^ fr/manual.xml:15191: parser error : Opening and ending tag mismatch: book line 4 and section </section> ^ fr/manual.xml:15192: parser error : Extra content at the end of the document </chapter> ^ unable to parse fr/manual.xml
3.8, English: en/manual.xml:41551: parser error : Premature end of data in tag para line 41550 information. ^ en/manual.xml:41551: parser error : Premature end of data in tag section line 41527 information. ^ en/manual.xml:41551: parser error : Premature end of data in tag appendix line 41521 information. ^ en/manual.xml:41551: parser error : Premature end of data in tag book line 4 information. ^ unable to parse en/manual.xml
3.6, French: fr/manual.xml:18780: parser error : CharRef: invalid decimal value Ne pas rentrer de signes monétaires ($, € )</para> ^ fr/manual.xml:18780: parser error : xmlParseCharRef: invalid xmlChar value 0 Ne pas rentrer de signes monétaires ($, € )</para> ^ fr/manual.xml:18818: parser error : CharRef: invalid decimal value Ne pas rentrer de signes monétaires ($, € )</para> ^ fr/manual.xml:18818: parser error : xmlParseCharRef: invalid xmlChar value 0 Ne pas rentrer de signes monétaires ($, € )</para> ^ unable to parse fr/manual.xml --
So, it looks like the 3.10 French, 3.8 English, and 3.6 French manuals are not updating. However, the 3.6 English one should be.
My question is, how do encoding errors like the ones on the English manual happen? I looked at that line in the code and see nothing, I even deleted that line and still can't save the file because it tells me that there are encoding errors in the file. Any help would be greatly appreciated so that I can get the 3.8 manual back on track.
Nicole
PS. If you know how to fix it you're free to submit a patch of course!! _______________________________________________ Koha mailing list http://koha-community.org Koha@lists.katipo.co.nz http://lists.katipo.co.nz/mailman/listinfo/koha
On Tue, Nov 20, 2012 at 1:29 PM, Chris Cormack <chrisc@catalyst.net.nz>wrote:
I don't see any encoding errors just invalid xml, caused by missing tags or badly formed entities.
And can you tell me what those missing tags or badly formed entities are? I see properly nested tags in the 3.8 english manual - in fact they're identical to the 3.10 manual at that line and it's not throwing errors. Nicole
Nicole, xmllint report a similar error: xmllint --valid --noout manual.xml I/O error : encoder error manual.xml:41554: parser error : Premature end of data in tag programlisting line 41554 system:<programlisting>host: ^ manual.xml:41554: parser error : Premature end of data in tag para line 41553 system:<programlisting>host: ^ manual.xml:41554: parser error : Premature end of data in tag section line 41527 system:<programlisting>host: ^ manual.xml:41554: parser error : Premature end of data in tag appendix line 41521 system:<programlisting>host: ^ manual.xml:41554: parser error : Premature end of data in tag book line 4 system:<programlisting>host: ^ but, if you change the encoding to "UTF-8", diff manual.xml manual.utf8.xml 1c1 < <?xml version="1.0" encoding="US-ASCII"?> ---
<?xml version="1.0" encoding="UTF-8"?>
then there is no more error, There must be a non obvious char that is doing the mess. Regards, Bernardo -- Bernardo Gonzalez Kriegel bgkriegel@gmail.com On Tue, Nov 20, 2012 at 3:41 PM, Nicole Engard <nengard@gmail.com> wrote:
On Tue, Nov 20, 2012 at 1:29 PM, Chris Cormack <chrisc@catalyst.net.nz
wrote:
I don't see any encoding errors just invalid xml, caused by missing tags or badly formed entities.
And can you tell me what those missing tags or badly formed entities are? I see properly nested tags in the 3.8 english manual - in fact they're identical to the 3.10 manual at that line and it's not throwing errors.
Nicole _______________________________________________ Koha mailing list http://koha-community.org Koha@lists.katipo.co.nz http://lists.katipo.co.nz/mailman/listinfo/koha
Nicole, the problem seems to be with some "spaces". I attach a diff that fixes the problem without changing the encoding. Regards, Bernardo -- Bernardo Gonzalez Kriegel bgkriegel@gmail.com On Tue, Nov 20, 2012 at 6:44 PM, Bernardo Gonzalez Kriegel < bgkriegel@gmail.com> wrote:
Nicole, xmllint report a similar error: xmllint --valid --noout manual.xml I/O error : encoder error manual.xml:41554: parser error : Premature end of data in tag programlisting line 41554 system:<programlisting>host: ^ manual.xml:41554: parser error : Premature end of data in tag para line 41553 system:<programlisting>host: ^ manual.xml:41554: parser error : Premature end of data in tag section line 41527 system:<programlisting>host: ^ manual.xml:41554: parser error : Premature end of data in tag appendix line 41521 system:<programlisting>host: ^ manual.xml:41554: parser error : Premature end of data in tag book line 4 system:<programlisting>host: ^ but, if you change the encoding to "UTF-8", diff manual.xml manual.utf8.xml 1c1 < <?xml version="1.0" encoding="US-ASCII"?> ---
<?xml version="1.0" encoding="UTF-8"?>
then there is no more error, There must be a non obvious char that is doing the mess.
Regards, Bernardo
-- Bernardo Gonzalez Kriegel bgkriegel@gmail.com
On Tue, Nov 20, 2012 at 3:41 PM, Nicole Engard <nengard@gmail.com> wrote:
On Tue, Nov 20, 2012 at 1:29 PM, Chris Cormack <chrisc@catalyst.net.nz
wrote:
I don't see any encoding errors just invalid xml, caused by missing tags or badly formed entities.
And can you tell me what those missing tags or badly formed entities are? I see properly nested tags in the 3.8 english manual - in fact they're identical to the 3.10 manual at that line and it's not throwing errors.
Nicole _______________________________________________ Koha mailing list http://koha-community.org Koha@lists.katipo.co.nz http://lists.katipo.co.nz/mailman/listinfo/koha
participants (3)
-
Bernardo Gonzalez Kriegel -
Chris Cormack -
Nicole Engard