Difference between revisions of "CLOCKSS Technology"

From Dryad wiki
Jump to: navigation, search
(Manifest Pages)
(Manifest Pages)
Line 6: Line 6:
  
 
Each quarter, Dryad will publish a set of manifest pages that list content newly available in Dryad.
 
Each quarter, Dryad will publish a set of manifest pages that list content newly available in Dryad.
<pre>
 
/opt/dryad/bin/generate-sitemaps
 
</pre>
 
  
There will be a command line process that generates a manifest page. The manifest page will be a static page. It will contain links to all "new" data packages. A new data package is a package that meets at least one of the following conditions:
+
There is a command line process that generates a manifest page. The manifest page will be a static page. It will contain links to all "new" data packages. A new data package is a package that meets at least one of the following conditions:
 
* The package was archived since the previous manifest page was generated.
 
* The package was archived since the previous manifest page was generated.
 
* The package, or one of its constituent files, was modified since the previous manifest page was generated.
 
* The package, or one of its constituent files, was modified since the previous manifest page was generated.
Line 16: Line 13:
 
* A new version of the package has become available since the previous manifest page was generated.
 
* A new version of the package has become available since the previous manifest page was generated.
  
Each manifest page will have a statement at the top of the page stating: CLOCKSS system has permission to ingest, preserve, and serve this Archival Unit.
+
To generate a new manifest page:
 +
<pre>
 +
/opt/dryad/bin/generate-sitemaps
 +
</pre>
  
A summary manifest page will be available from http://datadryad.org/clockss/. This page will contain a link to each of the individual manifest pages previously generated.
+
A summary manifest page is available from http://datadryad.org/htmlmap. This page contains a link to each of the individual manifest pages previously generated.
: NOTE: as of 2013-03-17 this page is unavailable (Page not found / We can't find the page you asked for. / Go to Dryad home)
 
  
 
[[Category:Technical Documentation]]
 
[[Category:Technical Documentation]]

Revision as of 10:41, 20 September 2013

Dryad's content will be replicated through the CLOCKSS network.

CLOCKSS will crawl the Dryad site and harvest publicly-viewable content. Data files will only be replicated in CLOCKSS once any embargo has expired.

Manifest Pages

Each quarter, Dryad will publish a set of manifest pages that list content newly available in Dryad.

There is a command line process that generates a manifest page. The manifest page will be a static page. It will contain links to all "new" data packages. A new data package is a package that meets at least one of the following conditions:

  • The package was archived since the previous manifest page was generated.
  • The package, or one of its constituent files, was modified since the previous manifest page was generated.
  • At least one file in the package has come out of embargo status since the previous manifest page was generated.
  • A new version of the package has become available since the previous manifest page was generated.

To generate a new manifest page:

/opt/dryad/bin/generate-sitemaps

A summary manifest page is available from http://datadryad.org/htmlmap. This page contains a link to each of the individual manifest pages previously generated.