Workshop May 2007 day2 questions
From Dryad wiki
(Redirected from May 2007 workshop day2 questions)
Theme 1: Promoting participation
- We'd like more ideas for promoting participation and ease-of-use - what would makes deposition attractive for depositors?
- What functionality would make data more valuable, usable, that a SSR could implement?
- What uses for the data in a SSR should be promoted, apart from simple download and reanalysis? (Virtual Observatory as a model)
- Services to provide
- Mechanisms of encouraging compliance with policies in the absence of legal force.
Theme 2: Lessons learned from other projects
- Other role models for small science repositories
- Who can we learn from, and what lessons do we take from them?
- What other sites/institutions are doing something really well?
- Can we further clarify the community-source model?
- Pros and cons of funding models such as institutional or society subscriptions, pay-for-submission, pay-for-access, and how they can be combined.
- It would be useful to have a list of existing repositories to pick through and evaluate
- What are the processes that can be ported from existing archives in their institutional context?
What other groups to tap into, like the Science Commons, SPARC OpenData, iAssist?
- Useful links
- Ecological Archives
- 50 hectare plot data
- Brain museum
- Comparative placental anatomy
- Conservation Commons
- Digital Library for Earth Systems Education (DLESE)
- Global Biodiversity Information Facility (GBIF)
- Journal of Applied Econometrics Data Archive
- Knowledge Network for Biocomplexity (KNB)
- Marine Metadata Initiative
- NOAA paleoclimatology center
- Odum Institute (UNC)
- Virtual Data Center for Social Science
- Virtual Observatory
- NOAA World Data Center System
Theme 3: Architecture
- Who houses stuff, both records and identifiers?
- what needs to be identifiable?
- who would be authorities?
- Is it productive to process full-text for automated generation of 'context' metadata?
- A key decision in CiteSeer was to store a local copy and not depend on remote copies. Does this make sense for a SSR handshaking with, say, an image dB?
- Is everyone in agreement with the 'don't compete with Google' conclusion from yesterday.
- What are the pros and cons of DOIs, handles, other identifiers?
- Data and applications: where does the complexity belong?
- simple data with complex applications?
- complex data with simple applications?
- data and applications with a complex schema in between
- What sort of metadata is important independent of the publications that reference it?
- How does death fit into the metadata lifecycle?
- How to nurture the bottom-up growth of data standards?