Adds all required Fedora objects to allow users to ingest and retrieve web archives through the Islandora interface.
This module requires the following modules/libraries:
Install as usual, see this for further information.
Set the paths for warcindex
and warcfilter
in Administration » Islandora » Solution pack configuation » Web ARChives (admin/islandora/solution_pack_config/web_archive).
Further documentation for this module is available at our wiki.
Having problems or solved a problem? Check out the Islandora google groups for a solution.
Q. Can you search the content in the web archives?
A. Yes. If you are using Solr 4+, the WARC_FILTERED
datastream will automatically be indexed via Apache Tika. You will need to add ds.WARC_FILTERED^1
to the Query fields form in Adminstration » Islandora » Solr Index » Solr Settings (admin/islandora/search/islandora_solr/settings).
If you would like to contribute to this module, please check out CONTRIBUTING.md. In addition, we have helpful Documentation for Developers info, as well as our Developers section on the Islandora.ca site.