Magnus,<div><br></div><div>You probably tried this already, but what about using MarcEdit to convert the records from MARC-8 to UTF-8? \xFC sounds like a MARC-8 accent if I&#39;ve ever seen one...</div><div><br></div><div>
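On the clipped fields after iconv: a MARC (ISO 2709) record stores its total length in the leader and each field's byte length and offset in the directory, so re-encoding the raw file changes the bytes without updating those numbers. Here is a minimal Python sketch of that effect and of a per-record rebuild. It assumes the export is Latin-1 (where \xE4, \xFC and \xE9 are ä, ü and é) and that every field holds a single $a subfield; build_record, read_fields and reencode are hypothetical helpers written for this illustration, not part of Koha or MarcEdit.<br>
<pre>
```python
FT, RT, SF = b"\x1e", b"\x1d", b"\x1f"  # field terminator, record terminator, subfield delimiter


def build_record(fields, encoding):
    """Serialize (tag, text) pairs as a minimal ISO 2709 record (one $a per field)."""
    directory, data = b"", b""
    for tag, text in fields:
        body = b"  " + SF + b"a" + text.encode(encoding) + FT
        # Directory entry: tag (3), field length in bytes (4), start offset (5).
        directory += b"%s%04d%05d" % (tag.encode("ascii"), len(body), len(data))
        data += body
    base = 24 + len(directory) + 1           # leader + directory + its terminator
    total = base + len(data) + 1             # + record terminator
    flag = b"a" if encoding == "utf-8" else b" "  # leader position 09
    leader = b"%05dnam %s22%05d   4500" % (total, flag, base)
    return leader + directory + FT + data + RT


def read_fields(record):
    """Slice field bodies out of the record using the directory's byte offsets."""
    base = int(record[12:17])
    directory = record[24:base - 1]
    bodies = []
    for i in range(0, len(directory), 12):
        entry = directory[i:i + 12]
        length, start = int(entry[3:7]), int(entry[7:12])
        bodies.append(record[base + start:base + start + length])
    return bodies


def reencode(record, src="latin-1"):
    """Decode every field from src, then rebuild leader and directory for UTF-8."""
    base = int(record[12:17])
    tags = [record[i:i + 3].decode("ascii") for i in range(24, base - 1, 12)]
    texts = [body[4:-1].decode(src) for body in read_fields(record)]
    return build_record(list(zip(tags, texts)), "utf-8")


latin1 = build_record([("245", "Militär und Rüstung"), ("260", "Köln")], "latin-1")

# Roughly what a whole-file iconv does: the bytes grow, the stored lengths do not.
naive = latin1.decode("latin-1").encode("utf-8")
print(int(naive[:5]), len(naive))   # leader length vs. actual length
print(read_fields(naive)[0])        # first field is cut short, no terminator

# Per-record rebuild: lengths and offsets are recomputed from the UTF-8 bytes.
fixed = reencode(latin1)
print(int(fixed[:5]), len(fixed))
print(read_fields(fixed)[0])
```
</pre>
For real data a MARC library or MarcEdit should do the rebuilding; the point is only that the lengths and offsets have to be recomputed after the text is converted.<br><br>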

Regards,</div><div>Jared</div><div><br><div class="gmail_quote">On Thu, Dec 30, 2010 at 1:36 PM, Magnus Enger <span dir="ltr">&lt;<a href="mailto:magnus@enger.priv.no">magnus@enger.priv.no</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">

Dear all,<br>

<br>

I&#39;m at my wit&#39;s end here...<br>

<br>

I&#39;m trying to move some records from a 2.2.9 install to a 3.2.x<br>

install. Yep, just the records, so I exported them from 2.2.9 and have<br>

them in a file - I&#39;m not trying to convert/upgrade the whole<br>

database/installation.<br>

<br>

Now, I think the main problem is that a number of the records have<br>

characters in them that &quot;look strange&quot;, like this:<br>

aus der wirtschaftlichen AbhÃ±gigkeit von Militär und Rüstung<br>

How it got to be like that I don&#39;t know...<br>

<br>

Now when I try to run bulkmarcimport.pl in verbose mode I get lots of this:<br>

<br>

.....................Bad MARC record 94: utf8 &quot;\xE4&quot; does not map to<br>

Unicode at /usr/lib/perl/5.10/Encode.pm line 174, &lt;GEN11&gt; line 95.<br>

 skipped<br>

Bad MARC record 95: utf8 &quot;\xFC&quot; does not map to Unicode at<br>

/usr/lib/perl/5.10/Encode.pm line 174, &lt;GEN11&gt; line 96.<br>

 skipped<br>

.Bad MARC record 97: utf8 &quot;\xFC&quot; does not map to Unicode at<br>

/usr/lib/perl/5.10/Encode.pm line 174, &lt;GEN11&gt; line 98.<br>

 skipped<br>

.Bad MARC record 99: utf8 &quot;\xFC&quot; does not map to Unicode at<br>

/usr/lib/perl/5.10/Encode.pm line 174, &lt;GEN11&gt; line 100.<br>

 skipped<br>

.Bad MARC record 101: utf8 &quot;\xE4&quot; does not map to Unicode at<br>

/usr/lib/perl/5.10/Encode.pm line 174, &lt;GEN11&gt; line 102.<br>

 skipped<br>

.........Bad MARC record 111: utf8 &quot;\xE9&quot; does not map to Unicode at<br>

/usr/lib/perl/5.10/Encode.pm line 174, &lt;GEN11&gt; line 112.<br>

 skipped<br>

<br>

I tried running the file I exported from 2.2.9 through iconv to<br>

convert it to UTF-8, but of course that changes the length of some<br>

fields, resulting in &quot;clipped&quot; fields.<br>

<br>

I tried creating a script to parse the records and walk through every<br>

field and subfield, convert the subfields to UTF-8 and re-assemble the<br>

records, but this seems to only result in errors like the ones above,<br>

e.g.:<br>

utf8 &quot;\xC3&quot; does not map to Unicode at /usr/lib/perl/5.10/Encode.pm line 174.<br>

<br>

If anyone has any tips on what to do in a situation like this I would<br>

be forever grateful!<br>

<br>

Best regards,<br>

<font color="#888888">Magnus Enger<br>

<a href="http://libriotech.no" target="_blank">libriotech.no</a><br>


</font></blockquote></div><br><br clear="all"><br>-- <br>Jared Camins-Esakov<div>Freelance bibliographer, C &amp; P Bibliography Services, LLC</div><div>(phone) +1 (917) 727-3445</div><div>(e-mail) <a href="mailto:jcamins@cpbibliography.com" target="_blank">jcamins@cpbibliography.com</a></div>

<div>(web) <a href="http://www.cpbibliography.com/" target="_blank">http://www.cpbibliography.com/</a></div><br>

</div>