Difference between revisions of "Curation"

From Dryad wiki
Jump to: navigation, search
Line 1: Line 1:
==Introduction==
+
= Introduction =
  
 
The Dryad repository has a curator who works to enforce quality control on existing content in Dryad and collaborate in development of the repository.
 
The Dryad repository has a curator who works to enforce quality control on existing content in Dryad and collaborate in development of the repository.
Line 6: Line 6:
 
* Digital curation can be defined as follows: 'The activity of managing the use of data from its point of creation to ensure it is available for discovery and re-use in the future.' Data curation can also include managing vast data sets for daily use; updating it to keep it readable, etc. Therefore the term data curator is applicable to a large range of professional backgrounds, from minimal management of digital materials, to the addition of metadata, to managing institutional repositories.
 
* Digital curation can be defined as follows: 'The activity of managing the use of data from its point of creation to ensure it is available for discovery and re-use in the future.' Data curation can also include managing vast data sets for daily use; updating it to keep it readable, etc. Therefore the term data curator is applicable to a large range of professional backgrounds, from minimal management of digital materials, to the addition of metadata, to managing institutional repositories.
  
==Professional Data Curation Tasks==
+
= Professional Data Curation Tasks =
  
 
# Name authority/authority control for authors
 
# Name authority/authority control for authors
Line 17: Line 17:
 
# Determine when files need to be migrated to new formats and supervise the migration process.
 
# Determine when files need to be migrated to new formats and supervise the migration process.
  
== Curation Time Estimates ==
+
= Curation Time Estimates =
  
 
A Google Spreadsheet outlines our current estimates for [http://spreadsheets.google.com/pub?key=tmU679YYLC4fNXVzHAFgFeg&output=html time to curate an item]. The curation times are specified for three different levels of curation. NOTE: these estimates were developed for Dryad sustainability planning. As a result, they assume the use of tools/features that are not yet available in Dryad (but which we reasonably expect to include by the end of the NSF grant).
 
A Google Spreadsheet outlines our current estimates for [http://spreadsheets.google.com/pub?key=tmU679YYLC4fNXVzHAFgFeg&output=html time to curate an item]. The curation times are specified for three different levels of curation. NOTE: these estimates were developed for Dryad sustainability planning. As a result, they assume the use of tools/features that are not yet available in Dryad (but which we reasonably expect to include by the end of the NSF grant).
Line 26: Line 26:
 
* Level 3 tasks will take an additional 1-3 hours.
 
* Level 3 tasks will take an additional 1-3 hours.
  
== November 2011 Curation Practices ==
+
= Curation Practices =
 +
 
 +
== Current Curation Practices (November 2011) ==
  
 
The files below reflect current curatorial practice in the repository as of February 6, 2012. These include some work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.
 
The files below reflect current curatorial practice in the repository as of February 6, 2012. These include some work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.
Line 33: Line 35:
 
*Also see [[Templates for Correspondence]].
 
*Also see [[Templates for Correspondence]].
  
== April 2011 Curation Practices ==
+
== Historic Curation Practices ==
 +
=== April 2011 ===
  
 
The files below reflect current curatorial practice in the repository as of May 3, 2011. These include some work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.
 
The files below reflect current curatorial practice in the repository as of May 3, 2011. These include some work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.
Line 40: Line 43:
 
*Also see [[Templates for Correspondence]].
 
*Also see [[Templates for Correspondence]].
  
== January 2011 Curation Practices  ==
+
=== January 2011 ===
  
 
The files below reflect current curatorial practice in the repository as of January 21, 2011. These include temporary work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.
 
The files below reflect current curatorial practice in the repository as of January 21, 2011. These include temporary work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.
Line 47: Line 50:
 
*Also see [[Templates for Correspondence]].
 
*Also see [[Templates for Correspondence]].
  
== Third Quarter 2010 Curation Practices ==
+
=== Third Quarter 2010 ===
  
 
The files below reflect current curatorial practice in the repository as of July, 2010. These include temporary work-arounds and many manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.
 
The files below reflect current curatorial practice in the repository as of July, 2010. These include temporary work-arounds and many manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.
 
* July 2010 Curation Checklist: [[Media:July2010CurationChecklist.pdf|PDF]], [[Media:July2010CurationChecklist.doc|Word]].
 
* July 2010 Curation Checklist: [[Media:July2010CurationChecklist.pdf|PDF]], [[Media:July2010CurationChecklist.doc|Word]].
  
== Second Quarter 2010 Curation Practices ==
+
=== Second Quarter 2010 ===
  
 
The release of DSpace 1.6.0 will allow enhanced curation tools to be implemented in Dryad. This space will be updated with second quarter curation practices (pushed to third quarter).
 
The release of DSpace 1.6.0 will allow enhanced curation tools to be implemented in Dryad. This space will be updated with second quarter curation practices (pushed to third quarter).
Line 58: Line 61:
 
* March 2010 Curation Tools Wish List: [[Media:March2010CurationWishlist.pdf|PDF]], [[Media:March2010CurationWishlist.doc|Word]]. Also see [[Curation#Summer_2009_Curation_Project|Summer 2009 Curation Project]] documentation.
 
* March 2010 Curation Tools Wish List: [[Media:March2010CurationWishlist.pdf|PDF]], [[Media:March2010CurationWishlist.doc|Word]]. Also see [[Curation#Summer_2009_Curation_Project|Summer 2009 Curation Project]] documentation.
  
== First Quarter 2010 Curation Practices ==
+
=== First Quarter 2010 ===
  
 
As of early 2010 all submissions to Dryad are undergoing Level 1 curation. A rough checklist of curation practice is available below. A fuller curation manual will be created to reflect the new release of DSpace/Dryad in spring 2010.
 
As of early 2010 all submissions to Dryad are undergoing Level 1 curation. A rough checklist of curation practice is available below. A fuller curation manual will be created to reflect the new release of DSpace/Dryad in spring 2010.
Line 64: Line 67:
 
* January 2010 Curation Checklist: [[Media:January2010CurationChecklist.pdf|PDF]], [[Media:January2010CurationChecklist.doc|Word]]
 
* January 2010 Curation Checklist: [[Media:January2010CurationChecklist.pdf|PDF]], [[Media:January2010CurationChecklist.doc|Word]]
  
== Summer 2009 Curation Project ==
+
=== Summer 2009 Curation Project ===
  
 
Sarah Carrier produced two documents during summer 2009 that detail the curatorial management of data and metadata in Dryad, and offer some ideas for overall policy and requirements. The first document is for the redesign of the Dryad interface to better accommodate curation tasks. The second is a manual that details the current (as of summer 2009) curation workflow. This manual will be used by a curator hired fall 2009.
 
Sarah Carrier produced two documents during summer 2009 that detail the curatorial management of data and metadata in Dryad, and offer some ideas for overall policy and requirements. The first document is for the redesign of the Dryad interface to better accommodate curation tasks. The second is a manual that details the current (as of summer 2009) curation workflow. This manual will be used by a curator hired fall 2009.
Line 79: Line 82:
 
* [[Cataloging Guidelines 2009]]
 
* [[Cataloging Guidelines 2009]]
  
== Other Resources ==
+
= Other Resources =
  
 
* [http://fedora-commons.org/confluence/display/FCCWG/Data+Curation Fedora Data Curation Working Group]
 
* [http://fedora-commons.org/confluence/display/FCCWG/Data+Curation Fedora Data Curation Working Group]

Revision as of 17:16, 27 July 2012

Introduction

The Dryad repository has a curator who works to enforce quality control on existing content in Dryad and collaborate in development of the repository.

From the Digital Curation Centre:

  • Digital curation can be defined as follows: 'The activity of managing the use of data from its point of creation to ensure it is available for discovery and re-use in the future.' Data curation can also include managing vast data sets for daily use; updating it to keep it readable, etc. Therefore the term data curator is applicable to a large range of professional backgrounds, from minimal management of digital materials, to the addition of metadata, to managing institutional repositories.

Professional Data Curation Tasks

  1. Name authority/authority control for authors
  2. Quality control
    • Clean up citation fields.
    • View the contents of metadata fields across the repository, and enforce consistency.
  3. Maintain documentation of cataloging/curation policies.
  4. Spot check entries to make sure they have high-quality metadata.
  5. Spot check entries to make sure the files have the data they claim to have.
  6. Determine when files need to be migrated to new formats and supervise the migration process.

Curation Time Estimates

A Google Spreadsheet outlines our current estimates for time to curate an item. The curation times are specified for three different levels of curation. NOTE: these estimates were developed for Dryad sustainability planning. As a result, they assume the use of tools/features that are not yet available in Dryad (but which we reasonably expect to include by the end of the NSF grant).

Summarizing from the spreadsheet, curation for each article will require:

  • Level 1 tasks will take about 5 minutes.
  • Level 2 tasks will take an additional 15 minutes.
  • Level 3 tasks will take an additional 1-3 hours.

Curation Practices

Current Curation Practices (November 2011)

The files below reflect current curatorial practice in the repository as of February 6, 2012. These include some work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.

Historic Curation Practices

April 2011

The files below reflect current curatorial practice in the repository as of May 3, 2011. These include some work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.

January 2011

The files below reflect current curatorial practice in the repository as of January 21, 2011. These include temporary work-arounds and manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.

Third Quarter 2010

The files below reflect current curatorial practice in the repository as of July, 2010. These include temporary work-arounds and many manual processes that will become automated or semi-automated. Implementation of the Dryad Application Profile version 3.0 will cause significant changes to these practices.

  • July 2010 Curation Checklist: PDF, Word.

Second Quarter 2010

The release of DSpace 1.6.0 will allow enhanced curation tools to be implemented in Dryad. This space will be updated with second quarter curation practices (pushed to third quarter).

First Quarter 2010

As of early 2010 all submissions to Dryad are undergoing Level 1 curation. A rough checklist of curation practice is available below. A fuller curation manual will be created to reflect the new release of DSpace/Dryad in spring 2010.

  • January 2010 Curation Checklist: PDF, Word

Summer 2009 Curation Project

Sarah Carrier produced two documents during summer 2009 that detail the curatorial management of data and metadata in Dryad, and offer some ideas for overall policy and requirements. The first document is for the redesign of the Dryad interface to better accommodate curation tasks. The second is a manual that details the current (as of summer 2009) curation workflow. This manual will be used by a curator hired fall 2009.

  • Curation Proposal: PDF, Word
  • Curation Manual: PDF, Word
  • Akio's technical evaluation of the Curation Proposal: PDF, Word

These two documents represent the latest information regarding curation. Other pages on the wiki that include curation information used during the summer 2009 curation project:

Other Resources