Old:December 2006 Workshop Plans
December Meeting
- Date: Dec 5, 2006
- Time: ?
- Place: NESCent
Participants
- Kathleen Smith
- NESCent-MRC DRIADE team
- Michael Whitlock (American Naturalist)
- Don Waller (SSE)
- Mark Rausher (Evolution)
- Marcy Uyenoyama (MBE)
- Bob Peet (to discuss the experience of Ecology)
- Others?
Objective
- To inform the society and journal reps of our plans to date
- To discuss how to gather requirements.
Agenda
- (Uncertain until we have a better idea of the length of the meeting)
- Todd & Hilmar: Known and unknown requirements to date, and plans for two workshops
- Jane: The landscape of data repositories, and what will be needed from societies and journals
March Meeting
Potential participants
- NESCent-MRC people
- Matt Jones (SEEK)
- Sarah & Eric Kansa (OpenContext) & Ahrash Bissell (OpenContext, Duke)
- ? (CIESIN)
- Lagoze? (OAI)
- ? (Marine Metadata Initiative)
- ? (Odum Institute) – sustainability, heterogeneous ingest
- Tom Hammond (Conservation Commons)
- John Wilbanks (Science Commons, GBIF) – IP, metadata standards
- David Schloen (OCHRE)?
- Bruce Bauer (World Data Center for Paleoclimatology)
- Val Tannen (Data integration, TreeBASE)
- ? (GenBank)
- Greg Riccardi (Morphbank)
- Emilia Martins (EthBase)
- ? (NSF Cyberinfrastructure)
Potential floaters
- C. Lynch
- J. Kim
- R. Page
Goals
1. Obtain guidance on design features covering all aspects of the OAIS model: Ingest—receipt and verification of records; Archival Storage—secure storage of records; Data Management—secure management of records; Administration—management of internal and external relations; Preservation—management of record integrity and security over time; and Access—provision of records in response to user queries.
2. Particular areas of concern
Management issues
- How would the system be administered?
- How can depositors be incentivized?
- How would the system be sustainably funded?
- What intellectual property policies need to be put into place?
System architecture issues
- What is the role for bona fide data federation/integration technology?
- What is the (meaning of and) need for a trusted repository?
- What is the role of distributed/grid storage?
User interface and interoperability issues
- How to synchronize ingestion with journal publication and 3rd-party database deposition?
- How to enable harvesting of data by 3rd parties (e.g. OAI-PMH)?
- What should be the functionality of the interface to the centralized registry?
- What metadata is required and how to generate it (Dublin Core, DDI-lite, EML, standards imposed by specialized repositories)?
- Hilmar's notes, to be integrated as appropriate
- Raw data in repositories or processed data only? Spreadsheet data?
- Central or distributed repositories? Or virtual repository only (similar to GBIF)?
- Role of data integration?
- What types of metadata standards and metadata types do we want to capture/store?
Not sure this shouldn't be decided by us before? --Hlapp 11:40, 10 November 2006 (EST)
- Do we need to plan for metadata lifecycle management, and to what extent?
- Should the system be capable of metadata generation, and if so to what extent, with how much human review?
Provisional agenda
Day 1:
- Introductions and presentation of objectives
- Refine, as a group, tasks for the breakout sessions.
- Three concurrent breakout sessions over lunch and into early afternoon, with short chalktalks relevant to each topic followed by focused discussion on the breakout tasks.
- Break
- Late afternoon breakout group summaries
Day 2:
- Three concurrent morning breakout sessions, again with short chalktalks relevant to each topic followed by focused discussion on the breakout tasks.
- Lunch
- 1-2 hour large group discussion
- Writing of recommendations