Bulk Export

By Daniel Berthereau. Extensible bulk export and formatter module for admin and public sides.

Bulk Export (module for Omeka S)

New versions of this module and support for Omeka S version 3.0 and above are available on GitLab, which seems to respect users and privacy better than the previous repository.

Bulk Export is a generic export module for Omeka S that provides common output formats (json, xml, spreadsheet, text) for the admin, public, and api sides. It is easily extensible by other modules.

Just append an extension to the url of a resource browse page, a resource show page, or an api page, for example /admin/item.ods, /s/fr/151.odt, /api/items.tsv, or /api-local/item/151.table.json (integration of the old feature from module Api Info).
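As a rough illustration of the url pattern, here is a minimal Python sketch that appends an export extension to an existing page url. The function name `with_export_extension` and the host `example.org` are hypothetical, and keeping the query string after the extension is an assumption about how a filtered browse page would be exported, not something stated by the module.

```python
from urllib.parse import urlsplit, urlunsplit


def with_export_extension(url: str, extension: str) -> str:
    """Append an export extension to the path of a browse, show or api url.

    The query string and fragment are kept unchanged (assumption: the
    module applies the same query to the exported output).
    """
    parts = urlsplit(url)
    path = parts.path.rstrip("/") + "." + extension.lstrip(".")
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))


# The host is a placeholder; the paths are the ones quoted in the text.
for page, ext in [
    ("https://example.org/admin/item", "ods"),
    ("https://example.org/s/fr/151", "odt"),
    ("https://example.org/api/items?page=2", "tsv"),
]:
    print(with_export_extension(page, ext))
```

The extension is added to the path only, so any pagination or search arguments stay in place after it.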

For complex requests, numerous resources, or slow outputs (for example geojson, which requires calls to the remote service https://geonames.org), you can use a dedicated page in the admin dashboard.

All outputs can be configured in admin settings and in site settings.

The list of available outputs can be added to the resource browse view and to the resource show view via a resource page block or via an event.

Internally, the module manages output formats, which are responsible for exporting metadata to a file, to a stream, or as a string.

You may use module Bibliography to add old bibliographic formats (bibtex, csl, ris).

Installation

See the general end user documentation for installing a module.

This module requires the module Common, which must be installed first.

For Omeka S v3, the module Blocks Disposition can be used to add the list of exporters to the public sites. It is not needed for Omeka S v4.

  • From the zip

Download the latest release BulkExport.zip from the list of releases, and uncompress it in the modules directory.

  • From the source and for development

If the module was installed from the source, rename the module folder to BulkExport, go to the root of the module, and run:

composer install --no-dev

  • For testing

The module includes a comprehensive test suite with unit and functional tests. Run them from the root of Omeka:

vendor/bin/phpunit -c modules/BulkExport/phpunit.xml --testdox

Quick start

Automatic list of available outputs

The list of available outputs is added automatically to the admin resource browse pages and to the resource show pages. The list of exporters is configurable in the settings. The same applies to the sites: use the site settings and, if needed, the Blocks Disposition settings (for Omeka S < v4) to display the list of exporters. For Omeka S v4, use the resource page blocks.

View helper

The view helper $this->bulkExport($resourcesOrIdsOrQuery, $options) can be used anywhere else.

Manual creation of export urls

The export is available directly as /s/my-site/item/{id}.ods, or with any other extension (csv, tsv, ods, json, jsonld, geojson, list.txt, txt, odt), including those provided by other modules, in particular Bibliography. This feature is compatible with the module Clean Url.

The export is available through the api endpoint too at /api/items/{id}.ods (or any other extension).
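For scripted use, a minimal Python sketch of fetching such an export could look as follows. The helper names `api_export_url` and `download`, the host `example.org` and the item id 151 are placeholders; only the /api/items/{id}.ods pattern comes from the text, and private resources would presumably need api credentials, which are not handled here.

```python
import shutil
from urllib.request import urlopen


def api_export_url(base_url: str, resource: str, resource_id: int, extension: str) -> str:
    """Build an api export url following the /api/items/{id}.ods pattern."""
    return f"{base_url.rstrip('/')}/api/{resource}/{resource_id}.{extension}"


def download(url: str, destination: str) -> None:
    """Stream the response body to a local file without loading it all in memory."""
    with urlopen(url) as response, open(destination, "wb") as output:
        shutil.copyfileobj(response, output)


if __name__ == "__main__":
    # Placeholder host: replace it with a real Omeka S installation.
    url = api_export_url("https://example.org", "items", 151, "ods")
    print(url)
    # download(url, "item-151.ods")  # Uncomment once the host is real.
```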

It is available with the module Api Info too (deprecated) at /api/infos/items/{id}.ods.

Routes can be created manually or with the routes of the module: site/resource-output, site/resource-output-id, admin/resource-output, admin/resource-output-id, api/default/output.

Heavy export

The maximum number of resources to output in a single call is specified in the settings. To output more resources, or for complex or slow formats, use the bulk export process, which creates the output in a file via a background job: configure a writer with default params, then use it to process the export.

Advanced spreadsheet options

Custom field labels and field merging (format_fields_labels)

You can customize column headers and merge multiple fields into a single column using the format_fields_labels option. This is useful for:

  • Renaming columns with custom labels
  • Merging multiple properties into a single column
  • Controlling the order of columns in the output

Format: one line per definition, with Label = field1 field2 ...

Example:

Identifier = dcterms:identifier o:id
Title = dcterms:title dcterms:alternative

This will create:

  • An "Identifier" column containing values from both dcterms:identifier and o:id
  • A "Title" column containing values from both dcterms:title and dcterms:alternative

Fields listed in format_fields_labels appear first in the output, in their specified order. Other fields are appended after.
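The behaviour described above can be sketched in a few lines of Python. This is only an illustration of the documented format, not the module's code: the function names, the sample item and the " | " separator used to join merged values are assumptions.

```python
def parse_fields_labels(text: str) -> dict[str, list[str]]:
    """Parse lines of the form "Label = field1 field2 ..." into {label: [fields]}."""
    labels = {}
    for line in text.splitlines():
        if "=" not in line:
            continue  # Skip blank or malformed lines.
        label, fields = line.split("=", 1)
        if label.strip() and fields.split():
            labels[label.strip()] = fields.split()
    return labels


def build_row(resource: dict[str, list[str]], labels: dict[str, list[str]],
              separator: str = " | ") -> dict[str, str]:
    """Return one output row: labelled columns first, other fields appended after."""
    row = {}
    merged = set()
    for label, fields in labels.items():
        # Merge the values of every listed field, in the order the fields are listed.
        values = [value for field in fields for value in resource.get(field, [])]
        row[label] = separator.join(values)
        merged.update(fields)
    for field, values in resource.items():
        if field not in merged:
            row[field] = separator.join(values)
    return row


config = "Identifier = dcterms:identifier o:id\nTitle = dcterms:title dcterms:alternative"
item = {  # Hypothetical item, already flattened to lists of strings.
    "o:id": ["151"],
    "dcterms:title": ["Main title"],
    "dcterms:alternative": ["Other title"],
    "dcterms:identifier": ["ark:/12345/b1"],
    "dcterms:creator": ["Someone"],
}
print(build_row(item, parse_fields_labels(config)))
```

Because Python dictionaries keep insertion order, the row lists "Identifier" and "Title" first and "dcterms:creator" last, which mirrors the ordering rule above.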

Multiple shapers per metadata (metadata_shapers)

A single metadata field can have multiple shapers applied, each creating its own column in the output. This is useful when you need the same data transformed in different ways.

Format: Each entry specifies a metadata field and a shaper to apply.

Example: If you configure dcterms:title with both "Uppercase" and "Lowercase" shapers, the output will contain:

  • dcterms:title: original value
  • dcterms:title [Uppercase]: uppercase transformation
  • dcterms:title [Lowercase]: lowercase transformation
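To make the column naming concrete, here is a small Python sketch of the idea. The `SHAPERS` registry, the function name and the " | " separator are hypothetical; the module's real shapers are configured in its settings and may behave differently.

```python
from typing import Callable

# Hypothetical registry: shaper name -> transformation applied to each value.
SHAPERS: dict[str, Callable[[str], str]] = {
    "Uppercase": str.upper,
    "Lowercase": str.lower,
}


def shaped_columns(field: str, values: list[str], shaper_names: list[str],
                   separator: str = " | ") -> dict[str, str]:
    """Return the original column plus one extra column per shaper."""
    columns = {field: separator.join(values)}
    for name in shaper_names:
        shaper = SHAPERS[name]
        columns[f"{field} [{name}]"] = separator.join(shaper(value) for value in values)
    return columns


print(shaped_columns("dcterms:title", ["My Title"], ["Uppercase", "Lowercase"]))
```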

One value per column (value_per_column)

When enabled, each value of a multi-valued property gets its own column instead of being joined with a separator. The module pre-scans all resources to determine the maximum number of values for each property.

Example: An item with 3 subjects will produce 3 columns, all named dcterms:subject:

dcterms:subject    dcterms:subject    dcterms:subject
Subject 1          Subject 2          Subject 3

For items with fewer values, the extra columns are left empty.
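The two-pass logic (pre-scan, then output) can be sketched in Python as follows. This is an assumed in-memory version with hypothetical function names; it only illustrates why a pre-scan is needed before the header can be written.

```python
def prescan_max_values(resources: list[dict[str, list[str]]]) -> dict[str, int]:
    """First pass: for each property, find the largest number of values on any resource."""
    maximums: dict[str, int] = {}
    for resource in resources:
        for prop, values in resource.items():
            maximums[prop] = max(maximums.get(prop, 0), len(values))
    return maximums


def to_table(resources: list[dict[str, list[str]]]) -> tuple[list[str], list[list[str]]]:
    """Second pass: one column per value, padded with empty cells."""
    maximums = prescan_max_values(resources)
    # The header repeats the property name once per reserved column.
    header = [prop for prop, count in maximums.items() for _ in range(count)]
    rows = []
    for resource in resources:
        row: list[str] = []
        for prop, count in maximums.items():
            values = resource.get(prop, [])
            row.extend(values + [""] * (count - len(values)))
        rows.append(row)
    return header, rows


items = [  # Hypothetical items, already flattened to lists of strings.
    {"dcterms:title": ["A"], "dcterms:subject": ["Subject 1", "Subject 2", "Subject 3"]},
    {"dcterms:title": ["B"], "dcterms:subject": ["Only one"]},
]
header, rows = to_table(items)
print(header)
for row in rows:
    print(row)
```

The header cannot be known until every resource has been seen, which is why the export needs the extra pass.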

Column metadata (column_metadata)

You can include language, datatype, and/or visibility in column headers using the column_metadata option. Available options:

  • language: Add language tag to header (e.g., dcterms:subject @fr)
  • datatype: Add datatype to header (e.g., dcterms:subject ^^uri)
  • visibility: Add visibility indicator for private values (e.g., dcterms:subject [private])

This option works in two modes:

With value_per_column enabled: Creates separate columns for each value within each metadata group.

Example with column_metadata: ['language'] and value_per_column: true:

dcterms:subject @fr    dcterms:subject @fr    dcterms:subject @en
Sujet 1                Sujet 2                English subject

Without value_per_column: Creates one column per metadata group, with values joined by the separator.

Example with column_metadata: ['language'] and value_per_column: false:

dcterms:subject @fr      dcterms:subject @en
Sujet 1 | Sujet 2        English subject | Another one

This is useful when you want to keep all French subjects in one column, all English subjects in another, etc.
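The two modes can be compared with a short Python sketch for a single property of a single resource. The function name, the (text, language) tuples and the " | " separator are assumptions made for the illustration; with many resources, the number of columns per group would come from the pre-scan described for value_per_column.

```python
from collections import defaultdict
from typing import Optional


def language_columns(prop: str, values: list[tuple[str, Optional[str]]],
                     value_per_column: bool, separator: str = " | ") -> tuple[list[str], list[str]]:
    """Return (headers, cells) for one property, grouped by language tag."""
    groups: dict[str, list[str]] = defaultdict(list)
    for text, lang in values:
        header = f"{prop} @{lang}" if lang else prop
        groups[header].append(text)
    headers: list[str] = []
    cells: list[str] = []
    for header, texts in groups.items():
        if value_per_column:
            # One column per value: the header is repeated for each value of the group.
            headers.extend([header] * len(texts))
            cells.extend(texts)
        else:
            # One column per group: the values are joined with the separator.
            headers.append(header)
            cells.append(separator.join(texts))
    return headers, cells


subjects = [("Sujet 1", "fr"), ("Sujet 2", "fr"), ("English subject", "en"), ("Another one", "en")]
print(language_columns("dcterms:subject", subjects, value_per_column=True))
print(language_columns("dcterms:subject", subjects, value_per_column=False))
```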

Notes

  • To convert an export in which linked resources are exported as url + label into importable linked resources, apply this formula in LibreOffice Calc: =REGEX($Export.B2; "(http:/api/items/)(\d+)([^|\n]*)"; "$2"; "g") (to be adapted to your output).
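The same substitution can be done outside LibreOffice. Here is a minimal Python sketch of the formula above, with the url prefix passed as a parameter since it has to be adapted to your output; the function name, the host `example.org` and the sample cell are hypothetical.

```python
import re


def linked_ids(cell: str, api_items_url: str) -> str:
    """Replace each "url + label" pair by the numeric id found in the url.

    Mirrors the Calc pattern: (prefix)(digits)(rest up to the next "|" or newline),
    replaced by the second group.
    """
    pattern = re.compile("(" + re.escape(api_items_url) + r")(\d+)([^|\n]*)")
    return pattern.sub(r"\2", cell)


cell = "https://example.org/api/items/151 First item | https://example.org/api/items/152 Second item"
print(linked_ids(cell, "https://example.org/api/items/"))
```

As in the Calc formula, the label is consumed up to the next "|", including the space before it, so the separator may need a final clean-up before import.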

TODO

  • [x] For spreadsheet, add an option to set headers with the datatype and the language (so multiple headers for the same property).
  • [x] Rights on exports.
  • [x] Deletion of old exports.
  • [x] Factorized writers into formatters
  • [ ] Extract duplicated mapping methods (mapResourceTypeToEntity, etc.) into ResourceTypeMappingTrait
  • [ ] Integrate the new feature api output for Omeka S v4.1.
  • [ ] Integrate with module Mapper for custom export mappings.
  • [ ] Enable real-time output for any size
  • [ ] Optimize pre-scan phase for value_per_column mode with database-level aggregation (SQL COUNT/GROUP BY) instead of PHP loops
  • [ ] Add progress callbacks for real-time export status updates
  • [ ] Consider async export with Server-Sent Events or WebSocket notifications
  • [ ] Simplify exporter configuration (preset templates)
  • [ ] Export preview
  • [ ] Improve progress display during background exports (percentage, end)
  • [ ] The browse view may be like the Menu view (all edits in one page)
  • [ ] Add drag-and-drop column ordering for spreadsheet exports
  • [ ] Export history with re-run capability
  • [ ] One-click re-export with same settings
  • [ ] For api, allow passing settings like in module Api Info.
  • [ ] Use request header "Accept" like .extension.
  • [ ] Select resources like in the module ebook.

Warning

Use it at your own risk.

It’s always recommended to back up your files and your databases and to check your archives regularly so you can roll back if needed.

# database dump example
mysqldump -u omeka -p omeka | gzip > "omeka.$(date +%Y%m%d_%H%M%S).sql.gz"

Troubleshooting

See online issues on the module issues page on GitLab.

License

This module is published under the CeCILL v2.1 license, compatible with GNU/GPL and approved by FSF and OSI.

This software is governed by the CeCILL license under French law and abiding by the rules of distribution of free software. You can use, modify and/ or redistribute the software under the terms of the CeCILL license as circulated by CEA, CNRS and INRIA at the following URL "http://www.cecill.info".

As a counterpart to the access to the source code and rights to copy, modify and redistribute granted by the license, users are provided only with a limited warranty and the software’s author, the holder of the economic rights, and the successive licensors have only limited liability.

In this respect, the user’s attention is drawn to the risks associated with loading, using, modifying and/or developing or reproducing the software by the user in light of its specific status of free software, that may mean that it is complicated to manipulate, and that also therefore means that it is reserved for developers and experienced professionals having in-depth computer knowledge. Users are therefore encouraged to load and test the software’s suitability as regards their requirements in conditions enabling the security of their systems and/or data to be ensured and, more generally, to use and operate it in the same conditions as regards security.

The fact that you are presently reading this means that you have had knowledge of the CeCILL license and that you accept its terms.

Copyright

  • Copyright BibLibre, 2016-2017
  • Copyright Daniel Berthereau, 2019-2026 (see Daniel-KM on GitLab)

This module was initially inspired by the Omeka Classic, built by Biblibre, but is now fully restructured and rewritten on another design pattern.

Version    Released              Minimum Omeka version
3.4.39     January 12, 2026      ^4.0.0
3.4.38     November 03, 2025     ^4.0.0
3.4.37     September 01, 2025    ^4.0.0
3.4.36     March 31, 2025        ^4.0.0
3.4.35     March 03, 2025        ^4.0.0
3.4.34     July 22, 2024         ^4.0.0
3.4.33     July 01, 2024         ^4.0.0
3.4.32     May 20, 2024          ^4.0.0
3.4.31     April 22, 2024        ^4.0.0
3.4.30     December 04, 2023     ^4.0.0
3.4.29     October 30, 2023      ^4.0.0
3.4.28     October 02, 2023      ^4.0.0
3.4.27     August 28, 2023       ^4.0.0
3.4.26     July 31, 2023         ^4.0.0
3.4.25     July 17, 2023         ^4.0.0
3.4.24     July 10, 2023         ^4.0.0
3.4.23     July 03, 2023         ^4.0.0
3.4.22     June 26, 2023         ^4.0.0
3.4.21     June 20, 2023         ^4.0.0
3.4.20     June 19, 2023         ^3.1.0 || ^4.0.0
3.4.19     June 12, 2023         ^3.1.0 || ^4.0.0
3.4.18     May 29, 2023          ^3.1.0 || ^4.0.0
3.4.17     May 22, 2023          ^3.1.0 || ^4.0.0
3.4.16     May 08, 2023          ^3.1.0 || ^4.0.0
3.4.15     January 09, 2023      ^3.1.0 || ^4.0.0
3.3.14     August 22, 2022       ^3.1.0
3.3.13.9   July 25, 2022         ^3.1.0
3.3.13.8   June 20, 2022         ^3.1.0
3.3.13.7   May 30, 2022          ^3.1.0
3.3.13.6   March 21, 2022        ^3.1.0