DataONE
{{Short description|International federation of data repositories}}
{{Infobox biodatabase
| title = DataONE
| organism = all
| description = DataONE, data on the Earth, Life, and the Environment
| scope = Earth science, Ecology, Environmental Science, Social Science
| url = [https://dataone.org/ DataONE]
| download = [https://search.dataone.org/ DataONE Search]
| webservice = [https://releases.dataone.org/online/api-documentation-v2.0.1/ API]
| license = Various open data licenses
| logo =
| standalone=[https://cran.r-project.org/web/packages/dataone/index.html dataone R package]
| webapp=[https://search.dataone.org DataONE]
| format = Comma-separated values, NetCDF, XML, satellite imagery
| bookmark=yes
}}
DataONE{{cite web|url=https://dataone.org/ |title=DataONE |publisher=DataONE |date= |accessdate=2016-04-21}} (Data Observation Network for Earth) is a network of interoperable data repositories that facilitates data sharing, data discovery, and open science.{{Cite journal| doi = 10.1045/january2011-michener| issn = 1082-9873| volume = 17| issue = 1/2| last1 = Michener| first1 = William| last2 = Vieglais| first2 = Dave| last3 = Vision| first3 = Todd| last4 = Kunze| first4 = John| last5 = Cruse| first5 = Patricia| last6 = Janée| first6 = Greg| title = DataONE: Data Observation Network for Earth: Preserving Data and Enabling Innovation in the Biological and Environmental Sciences| journal = D-Lib Magazine| accessdate = 2014-02-12| date = January 2011| url = http://www.dlib.org/dlib/january11/michener/01michener.html| doi-access = free}} It was originally supported by $21.2 million in funding from the US National Science Foundation as one of the initial DataNet programs in 2009;{{cite web |title= DataNet Full Proposal: DataNetONE (Observation Network for Earth) |work= Award abstract #0830944 |publisher= National Science Foundation |date= August 26, 2014 |url= http://nsf.gov/awardsearch/showAward?AWD_ID=0830944 |access-date= May 7, 2017 }} the funding was renewed in 2014, through 2020, with an additional $15 million.{{cite web |title=Award Abstract # 1430508 DataONE (Data Observation Network for Earth) |url=https://www.nsf.gov/awardsearch/showAward?AWD_ID=1430508&HistoricalAwards=false |website=National Science Foundation |publisher=National Science Foundation |access-date=2021-06-24}}
DataONE supports the preservation, access, use, and reuse of multi-discipline scientific data through the construction of primary cyberinfrastructure and an education and outreach program.
DataONE provides scientific data archiving for ecological and environmental data produced by scientists. DataONE's goal is to preserve and provide access to multi-scale, multi-discipline, and multi-national data. Users include scientists, ecosystem managers, policy makers, students, educators, librarians, and the public.
DataONE links together existing cyberinfrastructure to provide a distributed framework, management, and technologies that enable long-term preservation of multi-scale, multi-discipline, and multi-national observational data. The distributed framework is composed of member nodes and of coordinating nodes located at the Oak Ridge Campus in Tennessee, the University of California, Santa Barbara, and the University of New Mexico. DataONE also provides resources, including tools for accessing and using the network.{{cite web |url= https://www.dataone.org/investigator-toolkit |title= Investigator Toolkit |publisher= DataONE |work= Web page |access-date= May 7, 2017 }}
Coordinating nodes
The three coordinating nodes provide network-wide services to member nodes. They are geographically replicated, with mirrored content and full copies of science metadata.
William Michener of the University of New Mexico (UNM) directed the project, and UNM is one of the coordinating nodes.{{Cite news |title= DataONE (Observation Network for Earth) Project at UNM Receives $20 Million Award |date= November 18, 2009 |work= Press release |publisher=University of New Mexico |url= http://www.unm.edu/~market/cgi-bin/archives/004536.html |url-status= dead |archivedate= November 27, 2009 |archiveurl= https://web.archive.org/web/20091127033130/http://www.unm.edu/~market/cgi-bin/archives/004536.html |access-date= May 7, 2017 }}
The coordinating nodes are UNM, the Oak Ridge Campus (a partnership of Oak Ridge National Laboratory (ORNL) and the University of Tennessee), and the University of California, Santa Barbara.
Member nodes
Member nodes consist of Earth-observing institutions, projects, and networks. They provide resources for their own data and for replicated data, and focus on serving their specific constituencies. These member nodes are geographically distributed and include:
- Cornell Lab of Ornithology eBird{{cite web|url=http://ebird.org/content/ebird/ |title=Welcome to eBird |publisher=eBird.org |date= |accessdate=2016-04-21}}
- Dryad{{cite web|url=http://datadryad.org/ |title=Dryad Digital Repository - Dryad |website=Datadryad.org |date= |accessdate=2016-04-21}}
- Earth Data Analysis Center (EDAC){{cite web|url=http://edac.unm.edu/ |title=Earth Data Analysis Center | Center for Geospatial & Information Technology Services |website=Edac.unm.edu |date= |accessdate=2016-04-21}}
- Environmental Data for the Oak Ridge Area (EDORA){{cite web|url=http://mercury-ops2.ornl.gov/edora/ |title=Environmental Data for the Oak Ridge Area : Search |website=Mercury-ops2.ornl.gov |accessdate=2016-04-21}}
- Ecological Society of America (ESA) Data Registry{{cite web|url=http://data.esa.org/esa/ |title=ESA Data Registry |website=Data.esa.org |date= |accessdate=2016-04-21}}
- Europe Long-Term Ecosystem Research Network (LTER Europe){{cite web|url=http://www.lter-europe.net/ |title=Taking Europe's pulse - Research for our continent's future — LTER in Europe |website=Lter-europe.net |date= |accessdate=2016-04-21}}
- Global Lake Ecological Observatory Network (GLEON){{cite web|url=http://gleon.org/ |title=GLEON |publisher=GLEON |date= |accessdate=2016-04-21}}
- Gulf of Alaska Data Portal{{cite web|url=http://portal.aoos.org/gulf-of-alaska.php |title=Gulf of Alaska Data Portal |website=Portal.aoos.org |accessdate=2016-04-21}}
- International Arctic Research Center (IARC) Data Archive{{cite web|url=http://climate.iarc.uaf.edu/geonetwork/srv/en/main.home |title=The IARC Data Archive at UAF, an AA/EO employer and educational institution |website=Climate.iarc.uaf.edu |date=2007-08-23 |accessdate=2016-04-21}}
- Knowledge Network for Biocomplexity{{cite web |url=http://knb.ecoinformatics.org/index.jsp |format=JSP |title=Cumulative human impacts data (2008 and 2013) Halpern B, et al. 2015 |website=Knb.ecoinformatics.org |accessdate=2016-04-21 |url-status=dead |archiveurl=https://web.archive.org/web/20131113135840/http://knb.ecoinformatics.org/index.jsp |archivedate=2013-11-13 }}
- Long Term Ecological Research Network (LTER){{cite web|author=The Long Term Ecological Research Network |url=http://www.lternet.edu/ |title=The Long Term Ecological Research Network | Long-term, broad-scale research to understand our world |website=Lternet.edu |date= |accessdate=2016-04-21}}
- Merritt Repository{{cite web|url=https://merritt.cdlib.org/ |title=UC3 Merritt Home |website=Merritt.cdlib.org |date= |accessdate=2016-04-21}}
- Minnesota Population Center (MPC){{cite web|url=https://www.ipums.org/ |title=MPC Data Projects |website=Ipums.org |date= |accessdate=2016-04-21}}
- Montana IoE Data Repository{{cite web |url=https://www.dataone.org/current-member-nodes#nodes/IOE |title=Current Member Nodes |publisher=DataONE |date= |accessdate=2016-04-21 |archive-date=2016-04-19 |archive-url=https://web.archive.org/web/20160419041502/https://www.dataone.org/current-member-nodes#nodes/IOE |url-status=dead }}
- Nevada Research Data Center{{cite web |url=http://sensor.nevada.edu/NRDC/ |title=Nevada Research Data Center |website=Sensor.nevada.edu |date= |accessdate=2016-04-21 |archive-date=2016-05-11 |archive-url=https://web.archive.org/web/20160511153553/http://sensor.nevada.edu/NRDC/ |url-status=dead }}
- New Mexico Experimental Program to Stimulate Competitive Research (NM EPSCoR){{cite web |url=https://www.dataone.org/current-member-nodes#nodes/NMEPSCOR |title=Current Member Nodes |publisher=DataONE |date= |accessdate=2016-04-21 |archive-date=2016-04-19 |archive-url=https://web.archive.org/web/20160419041502/https://www.dataone.org/current-member-nodes#nodes/NMEPSCOR |url-status=dead }}
- NOAA National Centers for Environmental Information (NCEI) Oceanographic Data [https://www.dataone.org/current-member-nodes#nodes/NCEI Archive] {{Webarchive|url=https://web.archive.org/web/20160408224335/https://www.dataone.org/current-member-nodes#nodes/NCEI |date=2016-04-08 }}
- ONEShare Repository{{cite web|url=https://oneshare.cdlib.org/xtf/search |title=Dash |website=Oneshare.cdlib.org |accessdate=2016-04-21 |url-status=dead |archiveurl=https://web.archive.org/web/20160325215935/https://oneshare.cdlib.org/xtf/search |archivedate=2016-03-25 }}
- ORNL Distributed Active Archive Center{{cite journal|url=http://daac.ornl.gov/ |title=ORNL DAAC for Biogeochemical Dynamics |doi=10.1016/j.foreco.2008.11.016 |website=Daac.ornl.gov |date= |accessdate=2016-04-21|url-access=subscription }}
- Partnership for Interdisciplinary Studies of Coastal Oceans (PISCO){{cite web |url=http://data.piscoweb.org/ |title=Pisco | Pisco |website=Data.piscoweb.org |date= |accessdate=2016-04-21 |archive-url=https://web.archive.org/web/20160502003313/http://data.piscoweb.org/ |archive-date=2016-05-02 |url-status=dead }}
- Program for Research on Biodiversity (PPBio){{cite web|url=https://ppbio.inpa.gov.br/en/home |title=Welcome to the CENBAM Portal and PPBio Western Amazonia! | ppbio.inpa.gov.br/inicio |publisher=Ppbio.inpa.gov.br |date= |accessdate=2022-08-26}}
- Regional and Global Biogeochemical Dynamics Data (RGD){{cite journal |url=http://daac.ornl.gov/mercury.html |title=Regional and Global Data Available Through Mercury |doi=10.1016/j.foreco.2008.11.016 |website=Daac.ornl.gov |date=2010-03-18 |accessdate=2016-04-21 |archive-url=https://web.archive.org/web/20071210150850/http://daac.ornl.gov/mercury.html |archive-date=2007-12-10 |url-status=dead |url-access=subscription }}
- SANParks Data Repository{{cite web |url=http://sanparks.org.za/ |title=South African National Parks - SANParks - Official Website - Accommodation, Activities, Prices, Reservations |website=SANParks.org.za |date= |accessdate=2016-04-21 |archive-date=2016-01-11 |archive-url=https://web.archive.org/web/20160111193421/http://www.sanparks.org.za/ |url-status=dead }}
- SEAD Virtual Archive{{cite web|url=http://sead-data.net/ |title=SEAD | A Knowledge Network for Collaboration, Data Curation, and Discovery |website=Sead-data.net |date= |accessdate=2016-04-21}}
- Taiwan Forestry Research Institute{{cite web|url=http://metacat.tfri.gov.tw/tfri/ |title=TFRI Metacat Data Catalog |website=Metacat.tfri.gov.tw |date= |accessdate=2016-04-21}}
- Terrestrial Ecosystem Research Network (TERN){{cite web|url=http://www.tern.org.au/ |title=Terrestrial Ecosystem Research Network: Home |publisher=TERN |date= |accessdate=2016-04-21}}
- University of Kansas - Biodiversity Institute{{cite web|url=http://biodiversity.ku.edu/ |title=KU Biodiversity Institute & Natural History Museum |website=Biodiversity.ku.edu |date= |accessdate=2016-04-21}}
- USA National Phenology Network{{cite web|url=https://www.usanpn.org/ |title=USA National Phenology Network | USA National Phenology Network |website=Usanpn.org |date=2016-04-15 |accessdate=2016-04-21}}
- USGS Science Data Catalog (SDC){{cite web|url=http://data.usgs.gov/datacatalog/ |title=U.S. Geological Survey Science Data Catalog |website=Data.usgs.gov |date= |accessdate=2016-04-21}}
Investigator Tool Kit
The Investigator Tool Kit provides tools for researchers to access DataONE. It contains both general-purpose and discipline-specific tools, and developers adapt existing tools where possible. The tool kit includes Java and Python libraries, an R programming language plug-in for analysis, extensions for Excel, and support for the VisTrails and Kepler scientific workflow systems.
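As a rough illustration of programmatic access of the kind these libraries wrap, the following Python sketch builds a search URL for a coordinating node's query service. It is a minimal sketch under stated assumptions, not the toolkit's own interface: the endpoint address, the index field names (<code>text</code>, <code>formatType</code>, <code>identifier</code>, <code>title</code>, <code>datasource</code>), and the helper name <code>build_search_url</code> do not come from this article and should be checked against the current DataONE API documentation.

```python
from urllib.parse import urlencode

# Assumption: base URL of a coordinating node's Solr query service.
# It is not stated in the article; verify it in the DataONE API documentation.
CN_SOLR_ENDPOINT = "https://cn.dataone.org/cn/v2/query/solr/"


def build_search_url(keyword, rows=5, endpoint=CN_SOLR_ENDPOINT):
    """Build a search URL for science metadata records that mention `keyword`.

    Hypothetical helper: the index field names used below are assumptions.
    """
    params = {
        # Restrict the full-text match to science metadata records.
        "q": f'text:"{keyword}" AND formatType:METADATA',
        # Fields to return for each matching record.
        "fl": "identifier,title,datasource",
        # Maximum number of records to return.
        "rows": rows,
        # Ask for a JSON response.
        "wt": "json",
    }
    return endpoint + "?" + urlencode(params)


if __name__ == "__main__":
    # Only prints the URL; no network request is made.
    print(build_search_url("phenology"))
```

The sketch only constructs the request URL, so it can be inspected without network access; fetching and parsing the response is left to the caller.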
Data management
DataONE provides a place for scientists to store data and its associated metadata. The metadata makes this data searchable and accessible to other scientists. Data management practices include:
- Data management planning
- Data acquisition (techniques, protocols, methods)
- Data protection (backing up)
- Data entry and manipulation (naming files, organization), for example with MATLAB or R
- Quality control on data
- Data analysis
- Workflow tools (VisTrails, Kepler scientific workflow system)
- Data documentation (metadata)
- Data sharing, citation, and discovery
- Data preservation and curation
Additional data management planning resources include a primer on best practices, a database of best practices in data management, educational modules and tutorials, webinars, and an investigator toolkit.
These resources have been used, or adapted for use under a Creative Commons license, by organizations and institutions that seek to educate other communities about data and research management. Understanding the different audiences of users led to the development of user personas, which serve as models for users such as early-career researchers, science data librarians, citizen scientists, and K-12 educators.
Collaborations
DataONE collaborates with other institutions to bring together tools that help with data management practices. One of those tools, developed in collaboration with other organizations and hosted by the University of California Curation Center, is the DMPTool for data management planning. The DMPTool is used by many institutions, and referenced in many research data management plans, in the US and around the world. Another collaboration in this area is the shared construction of a Data Management Training Clearinghouse for Earth sciences, in partnership with USGS and the Community for Data Integration (CDI).{{cite web |title=Data Management Training Clearinghouse - ScienceBase-Catalog |url=https://www.sciencebase.gov/catalog/item/56d88012e4b015c306f6cffc |website=www.sciencebase.gov |access-date=3 April 2022}}
Community
The DataONE community includes research networks, professional societies, libraries, academic institutions, data centers, data repositories, environmental observatory networks, educators, scientists, policy makers, administrators, citizen scientists, international organizations, NGOs, ecosystem managers, students, private companies and the public.
DataONE has a users group that meets yearly to provide feedback.{{cite web |url= https://www.dataone.org/dataone-users-group |title=Users Group |publisher=DataONE |date= |accessdate=2016-04-21}}
References
{{Reflist|30em}}
External links
- [http://www.nature.com/news/specials/datasharing/index.html Nature.com]
- [https://www.nsf.gov/pubs/2007/nsf07601/nsf07601.htm Nsf.gov]
{{Authority control}}