I've been looking for a way to export the metatdata from our Omeka instance (1.3) into a form that we can use to batch ingest into another environment, Dspace. I've attempted to install the OAI PMH Repository plugin and utilize that, but I'm having trouble figuring out how to use it. I'm definitely a novice when it comes to OAI harvesting.
What methods do I have to export the metadata, and how can I test to see if the plugin is functional? If I could find a way to get the metadata in an .XML file with some reference to their associated objects (pic, video, etc.), I can write a program to manipulate it into the proper form for DSpace ingestion, but I'm not sure where to go from here.
Have you taken a look at the omeka-xml output? You should have a link to an omeka-xml document on the sidebar of the admin show page for any item, and also on the item browse page. Omeka also has other similar outputs, and you could also try the AtomOutput plugin, which adds Atom XML output.
The OAI-PMH repository, as you guessed, may also be sufficient for your needs. Any OAI harvester should handle Omeka's output just fine, all you should need to use a harvester is the repository's base URL, which the plugin includes on your admin dashboard sidebar.
It looks like these export options would be very useful, however they only export one page of items at a time. Is there a way to get them to display the export for all of the records in one file?
Many of these formats, the OAI repository in particular, purposely limit the size of each page to cut down on memory usage and rely on the client to request the next "page" of data
If you do want to use the feeds and get one big feed out of them, you may be able to work around this limitation by altering the number of items per page in Site Settings to be large enough to include all the items you want.
Note that increasing the number of items on a page affects the normal page outputs as well, and will increase Omeka's memory usage on those pages and when creating the feed. If you're dealing with lots of items, you may run up against your PHP memory limit or the script execution time limit.
Has anyone had any success reconfiguring the page output for the OAI-PMH Repository plugin to do a bulk export, such as John F. suggested here?
We too are considering bulk export options for adding our metadata to a collection management system we plan on implementing in the coming months and are wondering if there will be any further development of a bulk export plugin, or the backup plugin that John F. talked about developing back in 2009, https://groups.google.com/forum/#!searchin/omeka-dev/backup$20plugin/omeka-dev/trE6jQ-aFvg/yRyUdAMnsYwJ.
Having reviewed the current XML output formats, we see that to capture the universe of our metadata we would not only need to export each individual record, but we would need to export both the DCMES-XML and the Omeka-XML formats. Is there any way to export an entire record's metadata in a single format (qdc at the least)?
Already posted this to dev forum, https://groups.google.com/d/msg/omeka-dev/IhV9Hlyw1rE/781KTvdyqAAJ
Thanks in advance for the assistance,
Aly DesRochers &
Anne-Marie H. Viola
Metadata & Cataloguing Specialist,
Image Collections and Fieldwork Archives (ICFA)
Dumbarton Oaks Research Library and Collection
1703 32nd Street, NW Washington, DC 20007
This discussion is on the dev list, so please follow-up there for anyone else who is interested.