[Koha] Identifying Records with Non-Roman Characters

Mon Jun 7 12:53:24 NZST 2021

Hello, all!

    Our catalog has 8766 records that lack the 880 field for non-Roman
titles. Examples:

=245  10$aВеликая отецественная война 1941-1945 /$cВ. И. Чуйков
=246  11$a Velikaia otechestvennaia voina 1941-1945

=245  10$a英汉常用物理学小词典 /$cYing Li, Lian Wei
=246  11$aYing-Han changyong wulixue xiaocidian

    Of those, a significant portion is in Cyrillic, CJK, Greek, etc. Is
there a way in either Koha or MarcEdit to isolate said records? The problem
is to turn the 245 into 880, turn the 246 into 245 fields and then pair the
resulting 880 and the new 245 into something like

=880  10$6245-45/(N$aВеликая отецественная война 1941-1945 /$cВ. И. Чуйков.
=245  10$6880-45$aVelikaia otechestvennaia voina, 1941-1945.

=880  10$6245-45/[dollar]1$a英汉常用物理学小词典 /$cYing Li, Lian Wei.
=245  10$6880-45$aYing-Han changyong wulixue xiaocidian.

    The indicators are easy to fix after the 245 and 880 fields are
reconstructed. The 245 $c subfield can be copied from the 880 $c subfield.
I'm just trying to automate the process as much as possible and thereby
avoid the painstaking manual corrections if they can be avoided.

    This is an interim step. Eventually the catalog will have to be
recataloged from scratch, I think.

    Many thanks, all!

-- 

    気を付けて。 /ki wo tukete/ = Take care.

    -- Charles.

    Charles Kelley, MLS
    PSC 704 Box 1029
    APO AP 96338

    Charles Kelley
    Tsukimino 1-Chome 5-2
    Tsukimino Gaadenia #210
    Yamato-shi, Kanagawa-ken
    〒242-0002 JAPAN

    +1-301-741-7122 [US cell]
    +81-80-4356-2178 [JPN cell]

    mnogojazyk at aol.com [h]
    cmkelleymls at gmail.com [p]

    linkedin.com/in/cmkelleymls <http://www.linkedin.com/in/cmkelleymls>
    Meeting Your Information Needs. Virtually.