CLOCKSS Technology

From Dryad wiki
Revision as of 09:46, 20 September 2013 by Ryan Scherle (talk | contribs) (Manifest Pages)

Jump to: navigation, search

Dryad's content will be replicated through the CLOCKSS network.

CLOCKSS will crawl the Dryad site and harvest publicly-viewable content. Data files will only be replicated in CLOCKSS once any embargo has expired.

Manifest Pages

Each quarter, Dryad will publish a set of manifest pages that list content newly available in Dryad.

There is a command line process that generates a manifest page. The manifest page will be a static page. It will contain links to all "new" data packages. A new data package is a package that meets at least one of the following conditions:

  • The package was archived since the previous manifest page was generated.
  • The package, or one of its constituent files, was modified since the previous manifest page was generated.
  • At least one file in the package has come out of embargo status since the previous manifest page was generated.
  • A new version of the package has become available since the previous manifest page was generated.

To generate a new manifest page:


A summary manifest page is available from This page contains a link to each of the individual manifest pages previously generated.

Relation to DSpace

Dryad has implemented this as a slight modification to DSpace's normal sitemap/htmlmap feature. The relevant classes are:

  • dspace/modules/api/src/main/java/org/dspace/app/sitemap/
  • dspace/modules/api/src/main/java/org/dspace/app/sitemap/