COVID-19 datasets
{{short description|Datasets on COVID-19}}
COVID-19 datasets are public databases for sharing case data and medical information related to the COVID-19 pandemic.
Aggregate statistics
= United States =
== Volunteer/non-government ==
== [[U.S. Department of Health & Human Services]] ==
{| class="wikitable sortable mw-collapsible"
! Name !! Geographic Level !! Time series !! Testing Sites !! Testing Number !! Cases !! Hospitalizations !! Deaths !! Vaccination Sites !! Vaccination Number
|-
| [https://healthdata.gov/dataset/covid-19-diagnostic-laboratory-testing-pcr-testing-time-series COVID-19 Diagnostic Laboratory Testing (PCR Testing) Time Series] {{Webarchive|url=https://web.archive.org/web/20210312211407/https://healthdata.gov/dataset/covid-19-diagnostic-laboratory-testing-pcr-testing-time-series |date=2021-03-12 }}
| State || Yes || No || Yes || No || No || No || No || No
|-
| [https://healthdata.gov/dataset/covid-19-reported-patient-impact-and-hospital-capacity-facility COVID-19 Reported Patient Impact and Hospital Capacity by Facility] {{Webarchive|url=https://web.archive.org/web/20210312212746/https://healthdata.gov/dataset/covid-19-reported-patient-impact-and-hospital-capacity-facility |date=2021-03-12 }}
| Point (lat/long) || Yes || No || No || No || Yes || No || No || No
|-
| [https://healthdata.gov/dataset/covid-19-estimated-patient-impact-and-hospital-capacity-state COVID-19 Estimated Patient Impact and Hospital Capacity by State] {{Webarchive|url=https://web.archive.org/web/20210312212715/https://healthdata.gov/dataset/covid-19-estimated-patient-impact-and-hospital-capacity-state |date=2021-03-12 }}
| State || No || No || No || No || Yes || No || No || No
|-
| [https://healthdata.gov/dataset/covid-19-reported-patient-impact-and-hospital-capacity-state COVID-19 Reported Patient Impact and Hospital Capacity by State] {{Webarchive|url=https://web.archive.org/web/20210308011204/https://healthdata.gov/dataset/covid-19-reported-patient-impact-and-hospital-capacity-state |date=2021-03-08 }}
| State || No || No || No || No || Yes || No || No || No
|-
| [https://healthdata.gov/dataset/covid-19-reported-patient-impact-and-hospital-capacity-state-timeseries COVID-19 Reported Patient Impact and Hospital Capacity by State Timeseries] {{Webarchive|url=https://web.archive.org/web/20210310091444/https://healthdata.gov/dataset/covid-19-reported-patient-impact-and-hospital-capacity-state-timeseries |date=2021-03-10 }}
| State || Yes || No || No || No || Yes || No || No || No
|}
= Global =
- Johns Hopkins Coronavirus Resource Center: globally aggregated data, including cases, testing, contact tracing, and vaccine development.{{Cite web|title=Home|url=https://coronavirus.jhu.edu/|access-date=2020-09-18|website=Johns Hopkins Coronavirus Resource Center|language=en}}
- World Health Organization (WHO) Coronavirus Disease Dashboard: a database of confirmed cases and deaths reported globally and broken down by region.{{Cite web|title=WHO Coronavirus Disease (COVID-19) Dashboard|url=https://covid19.who.int/|access-date=2020-09-18|website=covid19.who.int|language=en}} This database is part of the WHO Health Data Platform.{{Cite web|title=World Health Data Platform - WHO|url=https://www.who.int/data|access-date=2020-09-18|website=www.who.int|language=en}}
- COVID-19 Africa Open Data Project: a volunteer-run database and dashboard reporting region-, country- and district-level case counts, deaths, healthcare worker infections, healthcare services and urgent needs.{{cite web |title=COVID-19 Africa Open Data |url=http://covid-19-africa.sen.ovh/ |access-date=16 November 2020}}
Data hubs
- [https://hdruk.ac.uk Health Data Research UK] provides a searchable registry of health data resources from the United Kingdom, including [https://web.www.healthdatagateway.org/search?search=COVID-19&tab=Datasets COVID-19 related datasets].
- NIH Open Access Datasets: The National Institutes of Health provide open-access data and computational resources related to COVID-19.{{Cite web|title=Open-Access Data and Computational Resources to Address COVID-19 {{!}} Data Science at NIH|url=https://datascience.nih.gov/covid-19-open-access-resources|access-date=2020-10-13|website=datascience.nih.gov}}
- COVID-19 Open Research Dataset (CORD-19): The Semantic Scholar project of the Allen Institute for AI hosts CORD-19, a public dataset of academic articles about COVID-19 and related research.{{Cite web |title=CORD-19 |url=https://www.semanticscholar.org/cord19|access-date=2020-10-13|website=Semantic Scholar}} The dataset is updated daily and includes both peer-reviewed articles and preprints.{{Cite web|title=Analysis of COVID-19 publications identifies research gaps|url=https://eurekalert.org/pub_releases/2020-09/cp-aoc091720.php|access-date=2020-10-13|website=EurekAlert!|language=en}} CORD-19 was originally released on March 16, 2020, by researchers and leaders from the Allen Institute for AI, the Chan Zuckerberg Initiative, Georgetown University's Center for Security and Emerging Technology, Microsoft, and the National Library of Medicine.{{Cite web|title=Call to Action to the Tech Community on New Machine Readable COVID-19 Dataset|url=https://trumpwhitehouse.archives.gov/briefings-statements/call-action-tech-community-new-machine-readable-covid-19-dataset/|access-date=2020-10-13|via=National Archives|work=whitehouse.gov|language=en-US}} The dataset is created by text mining the current research literature.{{Cite web|title=NLM Leverages Data, Text Mining to Sharpen COVID-19 Research Databases|url=https://governmentciomedia.com/nlm-leverages-data-text-mining-sharpen-covid-19-research-databases|access-date=2020-10-13|website=governmentciomedia.com|date=11 May 2020 |language=en}}
Topic-specific and special-interest resources
= Genomics =
- Consensus genome data for SARS-CoV-2 is available through GISAID for registered users{{cite web|title=GISAID|url=https://www.gisaid.org/|access-date=29 December 2020|website=www.gisaid.org}} and is included in an interactive phylogenetic tree dashboard{{cite web|title=Nextstrain SARS-CoV-2 Dashboard|url=https://nextstrain.org/ncov/global|access-date=29 December 2020|website=nextstrain.org}} on Nextstrain, an open-source pathogen genome data project.{{cite web|title=Nextstrain|url=https://docs.nextstrain.org/en/latest/learn/about-nextstrain.html|access-date=29 December 2020|website=docs.nextstrain.org}}
= Imaging (Radiology) =
- Characteristic imaging features on chest radiographs and computed tomography (CT) of people who are symptomatic include asymmetric peripheral ground-glass opacities without pleural effusions.{{cite journal |vauthors=Li Y, Xia L |title=Coronavirus Disease 2019 (COVID-19): Role of Chest CT in Diagnosis and Management |journal=American Journal of Roentgenology |pages=1280–1286 |date=March 2020|volume=214 |issue=6 |pmid=32130038 |doi=10.2214/AJR.20.22954|s2cid=212416282 |doi-access=free }} The University of Montreal and Mila created the "COVID-19 Image Data Collection", a public data repository of chest imaging, in March 2020.{{cite web |title=COVID-19 related projects |at=COVID-19 image data collection |website=Mila |access-date=12 July 2020 | url=https://mila.quebec/en/covid-19/}}{{cite web |title=COVID-19 image data collection |url=https://github.com/ieee8023/covid-chestxray-dataset |website=GitHub |access-date=12 July 2020}}{{cite arXiv |last1=Cohen |first1=Joseph |title=COVID-19 image data collection |eprint=2003.11597 |date=25 March 2020|class=eess.IV }} The Medical Imaging Databank in Valencian Region released a large dataset of chest imaging from Spain.{{cite web |title=BIMCV-COVID19, Datasets related to COVID19's pathology course |url=https://bimcv.cipf.es/bimcv-projects/bimcv-covid19/ |website=Medical Imaging Databank in Valencian Region Medical |access-date=12 July 2020}}{{cite arXiv |last1=de la Iglesia Vayá |first1=Maria |title=BIMCV COVID-19+: a large annotated dataset of RX and CT images from COVID-19 patients |date=1 June 2020 |class=eess.IV |eprint=2006.01174 }} The Italian Radiological Society is compiling an international online database of imaging findings for confirmed cases.{{cite web |title=COVID-19 Database |url= https://www.sirm.org/category/senza-categoria/covid-19/ | website=Società Italiana di Radiologia Medica e Interventistica |access-date=2020-03-11|language=it}} Online radiology case-sharing platforms such as Eurorad and Radiopaedia are also used to share COVID-19 case data and imaging.{{cite web |title=Pneumothorax and pneumomediastinum: a rare complication in the evolution of COVID-19 pneumonia. |url=https://www.eurorad.org/case/16844 |website=Eurorad |access-date=12 July 2020}}{{cite web |last1=Bell |first1=Daniel |last2=Knipe |first2=Henry |title=COVID-19 (summary) |url=https://radiopaedia.org/articles/covid-19-summary |website=Radiopaedia |access-date=12 July 2020}}