Difference between revisions of "CLOCKSS Technology"

From Dryad wiki
Jump to: navigation, search
(Manifest Pages)
(Manifest Pages)
Line 5: Line 5:
 
== Manifest Pages ==
 
== Manifest Pages ==
  
Each quarter, Dryad will publish a manifest page that describes content newly available in Dryad. The manifest page will link to each item that CLOCKSS needs to crawl. This includes:
+
Each quarter, Dryad will publish a set of manifest pages that list content newly available in Dryad.  
* Submissions that have been archived in the repository during the quarter.
 
* Submissions for which at least one file has come out of embargo status during the quarter.
 
* Submissions for which a new version has become available during the quarter.
 
* Submissions that have been edited in other ways (e.g., a metadata change) during the quarter.
 
  
Each manifest page must contain (in the text or in a comment):
+
There should be a command line process that generates a manifest page. The manifest page will be a static page. It will contain links to all "new" data packages. A new data package is a package that meets at least one of the following conditions:
CLOCKSS system has permission to ingest, preserve, and serve this Archival Unit.
+
* The package was archived since the previous manifest page was generated.
 +
* The package, or one of its constituent files, was modified since the previous manifest page was generated.
 +
* At least one file in the package has come out of embargo status since the previous manifest page was generated.
 +
* A new version of the package has become available since the previous manifest page was generated.
 +
 
 +
Each manifest page will have a statement at the top of the page stating: CLOCKSS system has permission to ingest, preserve, and serve this Archival Unit.
 +
 
 +
A summary manifest page will be available from http://datadryad.org/clockss/. This page should It will contain a link to each manifest page previously generated.
  
 
[[Category:Technical Documentation]]
 
[[Category:Technical Documentation]]

Revision as of 11:35, 28 January 2013

Dryad's content will be replicated through the CLOCKSS network.

CLOCKSS will crawl the Dryad site and harvest publicly-viewable content. Data files will only be replicated in CLOCKSS once any embargo has expired.

Manifest Pages

Each quarter, Dryad will publish a set of manifest pages that list content newly available in Dryad.

There should be a command line process that generates a manifest page. The manifest page will be a static page. It will contain links to all "new" data packages. A new data package is a package that meets at least one of the following conditions:

  • The package was archived since the previous manifest page was generated.
  • The package, or one of its constituent files, was modified since the previous manifest page was generated.
  • At least one file in the package has come out of embargo status since the previous manifest page was generated.
  • A new version of the package has become available since the previous manifest page was generated.

Each manifest page will have a statement at the top of the page stating: CLOCKSS system has permission to ingest, preserve, and serve this Archival Unit.

A summary manifest page will be available from http://datadryad.org/clockss/. This page should It will contain a link to each manifest page previously generated.