Workshop May 2007 Day 2 questions

Theme 1: Promoting participation

 * We'd like more ideas for promoting participation and ease of use. What would make deposition attractive for depositors?
 * What functionality could an SSR implement that would make data more valuable and usable?
 * What uses for the data in an SSR should be promoted, apart from simple download and reanalysis? (Virtual Observatory as a model)
 * Services to provide
 * Mechanisms for encouraging compliance with policies in the absence of legal force

Theme 2: Lessons learned from other projects
 * What other groups should we tap into, such as the Science Commons, SPARC OpenData, and iAssist?
 * Other role models for small science repositories
 * Who can we learn from, and what lessons do we take from them?
 * What other sites/institutions are doing something really well?
 * Can we further clarify the community-source model?
 * Pros and cons of funding models such as institutional or society subscriptions, pay-for-submission, pay-for-access, and how they can be combined.
 * It would be useful to have a list of existing repositories to pick through and evaluate.
 * What are the processes that can be ported from existing archives in their institutional context?
 * Useful links
   * Ecological Archives
   * 50 hectare plot data
   * Brain museum
   * Comparative placental anatomy
   * ICPSR
   * BioCASE
   * BioCorder
   * caBIO
   * CHRONOS
   * CODATA
   * Conservation Commons
   * Digital Library for Earth Systems Education (DLESE)
   * GEON
   * Global Biodiversity Information Facility (GBIF)
   * Journal of Applied Econometrics Data Archive
   * Knowledge Network for Biocomplexity (KNB)
   * Marine Metadata Initiative
   * NOAA paleoclimatology center
   * OCHRE
   * Odum Institute (UNC)
   * OpenContext
   * Virtual Data Center for Social Science
   * Virtual Observatory
   * NOAA World Data Center System

Theme 3: Architecture

 * Who houses stuff, both records and identifiers?
 * What needs to be identifiable?
 * Who would be the authorities?
 * Is it productive to process full-text for automated generation of 'context' metadata?
 * A key decision in CiteSeer was to store a local copy and not depend on remote copies. Does this make sense for an SSR handshaking with, say, an image database?
 * Is everyone in agreement with the 'don't compete with Google' conclusion from yesterday?
 * What are the pros and cons of DOIs, handles, other identifiers?
 * Data and applications: where does the complexity belong?
   * Simple data with complex applications?
   * Complex data with simple applications?
   * Data and applications with a complex schema in between?
 * What sort of metadata is important independent of the publications that reference it?
 * How does death fit into the metadata lifecycle?
 * How to nurture the bottom-up growth of data standards?