Scraping online web catalogues to provide Z39.50 server
I wrote a simple Z39.50 server which uses WWW::Mechanize to scrape web pages and produce MARC records, which can then be imported into Koha.

A short announcement is at: http://blog.rot13.org/2010/10/z3950_server_which_scrapes_web_catalogues_for_... and the source code is on GitHub: http://github.com/dpavlin/Biblio-Z3950

The source is important because you will have to modify it, even to use another Aleph instance: my instance uses the Croatian language, and the regexes inside include a few words of that language to find the number of results.

But I hope this will motivate someone to provide a data source or two over Z39.50 :-)

--
...2share!2flame...
http://blog.rot13.org
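A minimal sketch of why the regexes tie the scraper to one interface language, and how a table of per-language patterns could ease that. This is not code from Biblio-Z3950: the sub name `parse_hit_count`, the `%hits_regex` table, and the English and Croatian phrases are all hypothetical and would have to be replaced with whatever a given catalogue actually prints.

```perl
use strict;
use warnings;

# Hypothetical per-language patterns. The phrases are illustrative guesses,
# not the strings used in Biblio-Z3950 or printed by any particular Aleph
# instance; adjust them to match the real result page.
my %hits_regex = (
    en => qr/Records\s+\d+\s*-\s*\d+\s+of\s+(\d+)/i,
    hr => qr/Zapisi\s+\d+\s*-\s*\d+\s+od\s+(\d+)/i,
);

# Return the total number of results found in $html, or undef when no
# pattern is configured for $lang or the pattern does not match.
sub parse_hit_count {
    my ( $html, $lang ) = @_;
    my $re = $hits_regex{$lang} or return;
    return $html =~ $re ? $1 + 0 : undef;
}

# Example with a made-up fragment of a result page.
my $page = '<td>Zapisi 1 - 10 od 42</td>';
my $hits = parse_hit_count( $page, 'hr' );
print defined $hits ? "hits: $hits\n" : "no match\n";
```

With WWW::Mechanize, the text passed in as `$html` would typically come from `$mech->content` after fetching the result page. Pointing the scraper at a catalogue in another language then means adding one entry to the table rather than editing patterns scattered through the code.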
Hi,

2010/10/25 Dobrica Pavlinušić <dpavlin@gmail.com>:
I wrote simple Z39.50 server which uses WWW::Mechanize to scrape web pages and produce MARC records which can then be imported in Koha.
Short announcement is at:
http://blog.rot13.org/2010/10/z3950_server_which_scrapes_web_catalogues_for_...
Thanks, this looks interesting. Pity that the catalogs in question don't provide Z39.50 service directly, but whatever works to get the data moving...

Regards,

Galen
--
Galen Charlton
gmcharlt@gmail.com