CLOCKSS Technology

From Dryad wiki
Revision as of 10:41, 20 September 2013 by Ryan Scherle (talk | contribs) (Manifest Pages)

Dryad's content will be replicated through the CLOCKSS network.

CLOCKSS will crawl the Dryad site and harvest publicly viewable content. Data files will be replicated in CLOCKSS only after any embargo has expired.

Manifest Pages

Each quarter, Dryad will publish a set of manifest pages that list content newly available in Dryad.

A command-line process generates each manifest page. The manifest page is static and contains links to all "new" data packages. A data package counts as new if it meets at least one of the following conditions:

  • The package was archived since the previous manifest page was generated.
  • The package, or one of its constituent files, was modified since the previous manifest page was generated.
  • At least one file in the package has come out of embargo status since the previous manifest page was generated.
  • A new version of the package has become available since the previous manifest page was generated.
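
The four conditions above can be sketched as a single predicate. This is only an illustration, not Dryad's actual implementation: the `DataFile` and `DataPackage` classes, their field names, and the `is_new` function are all hypothetical, and the sketch assumes each relevant event carries a timestamp that can be compared against the time the previous manifest page was generated.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional


@dataclass
class DataFile:
    # Hypothetical fields; the real Dryad data model may differ.
    modified: datetime
    embargo_end: Optional[datetime] = None  # None if the file was never embargoed


@dataclass
class DataPackage:
    # Hypothetical fields; the real Dryad data model may differ.
    archived: datetime
    modified: datetime
    latest_version: Optional[datetime] = None  # None if no newer version exists
    files: List[DataFile] = field(default_factory=list)


def is_new(pkg: DataPackage, previous_manifest: datetime, now: datetime) -> bool:
    """Return True if the package meets at least one of the four conditions."""

    def since(t: Optional[datetime]) -> bool:
        # The event happened after the previous manifest and is not in the future.
        return t is not None and previous_manifest < t <= now

    return (
        since(pkg.archived)                                 # newly archived
        or since(pkg.modified)                              # package modified
        or any(since(f.modified) for f in pkg.files)        # a file modified
        or any(since(f.embargo_end) for f in pkg.files)     # a file left embargo
        or since(pkg.latest_version)                        # new version available
    )
```

Passing `now` explicitly keeps a file whose embargo ends in the future from being counted until a later manifest run.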

To generate a new manifest page:

/opt/dryad/bin/generate-sitemaps

A summary manifest page is available at http://datadryad.org/htmlmap. It links to each of the individual manifest pages generated so far.
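
As a rough illustration of what such a summary page amounts to, the sketch below renders a static HTML list of links from a list of manifest page URLs. The function name `render_summary`, the page markup, and the example URLs are assumptions made for this sketch; the markup Dryad actually produces is not described here.

```python
from html import escape
from typing import Iterable


def render_summary(manifest_urls: Iterable[str]) -> str:
    """Render a static HTML page with one link per manifest page."""
    items = "\n".join(
        # Escape each URL for use both as an attribute value and as link text.
        f'    <li><a href="{escape(url, quote=True)}">{escape(url)}</a></li>'
        for url in manifest_urls
    )
    return (
        "<html>\n"
        "  <body>\n"
        "  <ul>\n"
        f"{items}\n"
        "  </ul>\n"
        "  </body>\n"
        "</html>\n"
    )


if __name__ == "__main__":
    # Placeholder URLs, not real Dryad manifest locations.
    print(render_summary([
        "https://example.org/manifest-1.html",
        "https://example.org/manifest-2.html",
    ]))
```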