Digital Repository of Information and Data for Evolution (DRIADE)
A joint project of and the
Project DRIADE (Digital Repository of Information and Data for Evolution) is developing a repository for scientific data objects supporting published research in the field of evolutionary biology. DRIADE will support data acquisition, ensure long-term preservation of data objects, and promote resource discovery, data sharing, and data reuse of heterogeneous digital datasets.
Apart from a few specialized databases, there has historically been little cyberinfrastructure development in the field of evolutionary biology. Existing systems for the storage and retrieval of heterogeneous scientific data either put a high burden of metadata generation on the individual researcher or do not capture sufficient metadata to enable resource discovery and reuse. DRIADE seeks to balance a need for low barriers, which invite contribution from the wide range of scientists participating in the field of evolutionary biology, with a series of sophisticated, higher-level goals supporting data analysis activities. Such activities include computational methods and are required to advance the field of evolutionary biology.
The goal of this project is to develop a Digital Repository for Information and Data in Evolution (DRIADE). It is intended to be the primary home for published data in the field of evolutionary biology. Building on existing technologies and following the OAIS functional model, we are developing software to support digital resource lifecycle management from data ingestion to curation to discovery and reuse.
- Computer-aided metadata generation and augmentation will assist the data provider in capturing metadata of sufficient richness and quality to enable advanced data discovery, reusability and data integration.
- Specialized modules will allow data submission to be coordinated with the manuscript review and publication process of participating journals, as well as with the submission process to external specialized databases (e.g. for sequence data, phylogenies, anatomical images). This will provide one-stop data submission for the user.
- Data and metadata quality control are to be overseen by a curatorial staff, supported by a separate data curation software module that employs automatic techniques to evaluate metadata quality.
- An identity, authority and data security module will implement fine-grained data access privileges for users using global user identities.
- Resource discovery, sharing, and interoperability with external repositories will be enabled by implementing the OAI-PMH metadata harvesting standard supplemented by custom web services. These services will be exposed to collaborating journals, specialized data repositories, third-party content aggregators, and the DRIADE web portal itself.
Extensive evaluations and user testing are being employed throughout the design and implementation process by
- conducting metadata generation studies and analyzing the resulting quality of metadata content,
- developing data use cases,
- and conducting information retrieval experiments and usability studies to evaluate the effectiveness and performance of the system.
A separate working group of stakeholders is charged with developing a management structure to ensure the long-term maintenance and financial sustainability of the repository.
This wiki is for the use of working group participants. If you would like to obtain access, please send a request to Hilmar Lapp.
We are looking for a highly motivated indvidual to join NESCent's Informatics team as a Digital Data Repository Architect. For details please see the full job advertisement on the NESCent employment page.