List of biological databases#Antimicrobial resistance databases
{{short description|None}}
Biological databases are stores of biological information.{{cite journal | vauthors = Wren JD, Bateman A | title = Databases, data tombs and dust in the wind | journal = Bioinformatics | volume = 24 | issue = 19 | pages = 2127–8 | date = October 2008 | pmid = 18819940 | doi = 10.1093/bioinformatics/btn464 | doi-access = free }} The journal Nucleic Acids Research regularly publishes special issues on biological databases and has a list of such databases. The 2018 issue has a list of about 180 such databases and updates to previously described databases.{{Cite web|url=https://academic.oup.com/nar/issue/46/D1|title=Volume 46 Issue D1 {{!}} Nucleic Acids Research {{!}} Oxford Academic|website=academic.oup.com|language=en|access-date=2018-09-04}} [https://www.omicsdi.org/ Omics Discovery Index] can be used to browse and search several biological databases. Furthermore, the [https://data.niaid.nih.gov/ NIAID Data Ecosystem Discovery Portal] developed by the National Institute of Allergy and Infectious Diseases (NIAID) enables searching across databases.
Meta databases
Meta databases are databases of databases that collect data about data to generate new data. They are capable of merging information from different sources and making it available in a new and more convenient form, or with an emphasis on a particular disease or organism. Originally, metadata was only a common term referring simply to data about data such as tags, keywords, and markup headers.
- ConsensusPathDB: a molecular functional interaction database, integrating information from 12 others
- Entrez (National Center for Biotechnology Information)
- Neuroscience Information Framework (University of California, San Diego): integrates hundreds of neuroscience relevant resources; many are listed below
Model organism databases
Model organism databases provide in-depth biological data for intensively studied organisms.
- PomBase: the knowledgebase for the fission yeast Schizosaccharomyces pombe{{cite journal | vauthors = Rutherford KM, Lera-Ramírez M, Wood V | title = PomBase: a Global Core Biodata Resource—growth, collaboration, and sustainability | journal = Genetics | volume = 227 | issue = 1 | date = May 2024 | pmid = 38376816 | pmc = 11075564 | doi = 10.1093/genetics/iyae007 }}
- SubtiWiki: integrated database for the model bacterium Bacillus subtilis{{cite journal | vauthors = Zhu B, Stülke J | title = SubtiWiki in 2018: from genes and proteins to functional network annotation of the model organism Bacillus subtilis | journal = Nucleic Acids Research | volume = 46 | issue = D1 | pages = D743–D748 | date = January 2018 | pmid = 29788229 | pmc = 5753275 | doi = 10.1093/nar/gkx908 }}
- TAIR: the knowledgebase for the plant Arabidopsis thaliana{{cite journal |author=Margarita Garcia-Hernandez |author2=Tanya Berardini |author3=Guanghong Chen |author4=Debbie Crist |author5=Aisling Doyle |author6=Eva Huala |author7=Emma Knee |author8=Mark Lambrecht |author9=Neil Miller |author10=Lukas A. Mueller |author11=Suparna Mundodi |author12=Leonore Reiser |author13=Seung Y. Rhee |author14=Randy Scholl |author15=Julie Tacklind |author16=Dan C. Weems |author17=Yihe Wu |author18=Iris Xu |author19=Daniel Yoo |author20=Jungwon Yoon |author21=Peifen Zhang | title = TAIR: a resource for integrated Arabidopsis data | journal = Functional & Integrative Genomics | volume = 2 | pages = 239–253 | date = November 2002 |issue=6 |doi=10.1007/s10142-002-0077-z|pmid=12444417 |s2cid=7827488 }}
Nucleic acid databases
= DNA databases =
{{hatnote|For DNA barcoding databases with a taxonomic focus, see #Taxonomic databases.}}
The primary databases make up the International Nucleotide Sequence Database (INSD). The include:
- DNA Data Bank of Japan (National Institute of Genetics)
- EMBL (European Bioinformatics Institute)
- GenBank (National Center for Biotechnology Information)
DDBJ (Japan), GenBank (USA) and European Nucleotide Archive (Europe) are repositories for nucleotide sequence data from all organisms. All three accept nucleotide sequence submissions, and then exchange new and updated data on a daily basis to achieve optimal synchronisation between them. These three databases are primary databases, as they house original sequence data. They collaborate with Sequence Read Archive (SRA), which archives raw reads from high-throughput sequencing instruments.
Secondary databases are:{{clarify|reason=What's the definition? Why are some super human-centered while others are not?|date=February 2022}}
- HapMap
- OMIM (Online Mendelian Inheritance in Man): inherited diseases
- RefSeq
- 1000 Genomes Project: launched in January 2008. The genomes of more than a thousand anonymous participants from a number of different ethnic groups were analyzed and made publicly available.
- [http://eggnog5.embl.de/#/app/home EggNOG Database:] a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. It provides multiple sequence alignments and maximum-likelihood trees, as well as broad functional annotation.{{cite journal | vauthors = Powell S, Forslund K, Szklarczyk D, Trachana K, Roth A, Huerta-Cepas J, Gabaldón T, Rattei T, Creevey C, Kuhn M, Jensen LJ, von Mering C, Bork P | display-authors = 6 | title = eggNOG v4.0: nested orthology inference across 3686 organisms | journal = Nucleic Acids Research | volume = 42 | issue = Database issue | pages = D231-9 | date = January 2014 | pmid = 24297252 | pmc = 3964997 | doi = 10.1093/nar/gkt1253 | doi-access = free }}{{cite journal | vauthors = Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, Mende DR, Letunic I, Rattei T, Jensen LJ, von Mering C, Bork P | display-authors = 6 | title = eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses | journal = Nucleic Acids Research | volume = 47 | issue = D1 | pages = D309–D314 | date = January 2019 | pmid = 30418610 | pmc = 6324079 | doi = 10.1093/nar/gky1085 | doi-access = free }}
Other databases
= Gene expression databases =
= Genome databases =
These databases collect genome sequences, annotate and analyze them, and provide public access. Some add curation of experimental literature to improve computed annotations. These databases may hold many species genomes, or a single model organism genome.
{{columns-list|colwidth=22em|
- ArrayExpress:[https://www.ebi.ac.uk/arrayexpress/ ArrayExpress] archive of functional genomics data; stores data from high-throughput functional genomics experiments from EMBL
- Bioinformatic Harvester
- Cervical cancer gene database
- Ensembl: provides automatic annotation databases for human, mouse, other vertebrate and eukaryote genomes
- Ensembl Genomes: provides genome-scale data for bacteria, protists, fungi, plants and invertebrate metazoa, through a unified set of interactive and programmatic interfaces (using the Ensembl software platform)
- FlyBase: genome of the model organism Drosophila melanogaster
- Gene Disease Database
- Gene Expression Omnibus (GEO[https://www.ncbi.nlm.nih.gov/geo/ GEO]): a public functional genomics data repository from the U.S. National Cancer Institute (NCI), which supports array- and sequence-based data. Tools for querying and downloading gene expression profiles are provided.
- Human Protein Atlas (HPA{{Cite web|url=https://www.proteinatlas.org/|title=The Human Protein Atlas|website=www.proteinatlas.org|access-date=2019-05-27}}): a public database with expression profiles of human protein coding genes both on mRNA and protein level in tissues, cells, subcellular compartments, and cancer tumors.
- Legume Information System (LIS): genomic database for the legume family{{cite journal | vauthors = Dash S, Campbell JD, Cannon EK, Cleary AM, Huang W, Kalberer SR, Karingula V, Rice AG, Singh J, Umale PE, Weeks NT, Wilkey AP, Farmer AD, Cannon SB | display-authors = 6 | title = Legume information system (LegumeInfo.org): a key component of a set of federated data resources for the legume family | journal = Nucleic Acids Research | volume = 44 | issue = D1 | pages = D1181-8 | date = January 2016 | pmid = 26546515 | pmc = 4702835 | doi = 10.1093/nar/gkv1159 }}
- Personal Genome Project: human genomes of 100,000 volunteers from around the world
- RGD (Rat Genome Database): genomic and phenotype data for Rattus norvegicus
- Saccharomyces Genome Database:{{Cite web|url=https://www.yeastgenome.org/|title=Saccharomyces Genome Database {{!}} SGD|website=www.yeastgenome.org|access-date=2018-09-04}} genome of the yeast model organism
- SNPedia
- SoyBase Database{{cite journal | vauthors = Grant D, Nelson RT, Cannon SB, Shoemaker RC | title = SoyBase, the USDA-ARS soybean genetics and genomics database | journal = Nucleic Acids Research | volume = 38 | issue = Database issue | pages = D843-6 | date = January 2010 | pmid = 20008513 | pmc = 2808871 | doi = 10.1093/nar/gkp798 }} (SoyBase): USDA soybean genetics and genomic database (Soybean)
- UCSC Malaria Genome Browser: genome of malaria causing species (Plasmodium falciparum and others)
- Wormbase: genome of the model organism Caenorhabditis elegans and WormBase ParaSite for parasitic species
- Xenbase: genome of the model organism Xenopus tropicalis and Xenopus laevis
- Zebrafish Information Network: genome of this fish model organism
}}
= Phenotype databases =
- PHI-base: pathogen-host interaction database. It links gene information to phenotypic information from microbial pathogens on their hosts. Information is manually curated from peer-reviewed literature.
- RGD Rat Genome Database: genomic and phenotype data for Rattus norvegicus
- PomBase database: manually curated phenotypic data for the yeast Schizosaccharomyces pombe
= [[RNA]] databases =
{{hatnote|For rRNA databases with a taxonomic focus, see #Taxonomic databases.}}
- miRBase: the microRNA database
- PolymiRTS: a database of DNA variations in putative microRNA target sites
- PolyQ: database of polyglutamine repeats in disease and non-disease associated proteins
- Rfam: a database of RNA families
- IRESbase: A comprehensive database of experimentally validated internal ribosome entry sites.{{cite web |url=http://reprod.njmu.edu.cn/cgi-bin/iresbase/index.php |title=IRESbase}}
Amino acid / protein databases
(See also: List of proteins in the human body)
Several publicly available data repositories and resources have been developed to support and manage protein related information, biological knowledge discovery and data-driven hypothesis generation.{{cite book | vauthors = Chen C, Huang H, Wu CH | title = Protein Bioinformatics | chapter = Protein Bioinformatics Databases and Resources | series = Methods in Molecular Biology | volume = 1558 | pages = 3–39 | date = 2017 | pmid = 28150231 | pmc = 5506686 | doi = 10.1007/978-1-4939-6783-4_1 | isbn = 978-1-4939-6781-0 | publisher = Springer New York | veditors = Wu CH, Arighi CN, Ross KE | place = New York, NY }} The databases in the table below are selected from the databases listed in the Nucleic Acids Research (NAR) databases issues and database collection and the databases cross-referenced in the UniProtKB. Most of these databases are cross-referenced with UniProt / UniProtKB so that identifiers can be mapped to each other.
Proteins in human:
There are about ~20,000 protein coding genes in the standard human genome. (Roughly ~1200 already have Wikipedia articles - the Gene Wiki - about them) if we are Including splice variants, there could be as many as 500,000 unique human proteins{{cite journal | doi=10.1093/molbev/msz147 | title=The Oxymonad Genome Displays Canonical Eukaryotic Complexity in the Absence of a Mitochondrion | date=2019 | last1=Karnkowska | first1=Anna | last2=Treitli | first2=Sebastian C. | last3=Brzoň | first3=Ondřej | last4=Novák | first4=Lukáš | last5=Vacek | first5=Vojtěch | last6=Soukal | first6=Petr | last7=Barlow | first7=Lael D. | last8=Herman | first8=Emily K. | last9=Pipaliya | first9=Shweta V. | last10=Pánek | first10=Tomáš | last11=Žihala | first11=David | last12=Petrželková | first12=Romana | last13=Butenko | first13=Anzhelika | last14=Eme | first14=Laura | last15=Stairs | first15=Courtney W. | last16=Roger | first16=Andrew J. | last17=Eliáš | first17=Marek | last18=Dacks | first18=Joel B. | last19=Hampl | first19=Vladimír | journal=Molecular Biology and Evolution | volume=36 | issue=10 | pages=2292–2312 | pmid=31387118 | pmc=6759080 }}
= Different types of Protein databases =
{{See also|Protein structure database}}
{{Expand section|date=January 2015}}
{{Static row numbers}}
{{row hover highlight}}
{{sticky header}}
{{sort under}}
class="wikitable sortable static-row-numbers sticky-header sort-under mw-datatable plainrowheaders" |
scope="col" | DB name
! scope="col" | DB website ! scope="col" | Provider ! scope="col" | Data sources ! scope="col" | Revenue/Sponsors sources ! scope="col" | Integrates ! scope="col" | Desc. ! scope="col" | Size ! scope="col" | DB type ! scope="col" | Actively maintained |
---|
scope="row" | InterPro
|http://www.ebi.ac.uk/interpro/ |ELIXIR infrastructure |European Bioinformatics Institute |EMBL, The Welcome trust, BBSRC |CATH-Gene3D, CDD, HAMAP, MobiDB, PANTHER, Pfam, SMART, SUPERFAMILY, SFLD, TIGRFAMs, |classifies proteins into families and predicts the presence of domains and sites | |Protein sequence databases |Yes |
scope="row" | NextProt
|https://www.nextprot.org/ |CALIPHO (is a group at the SIB) |Swiss Institute of Bioinformatics |https://www.sib.swiss/about/funding-sources |UniProt, Cellosaurus, Gnomad, IntAct, SRAA Atlas, Uniprot - GOA, BGEE, COSMIC, MassIVE, Peptide atlas |a human protein-centric knowledge resource | |Protein sequence databases |Yes |
scope="row" | Wiki-pi
|http://severus.dbmi.pitt.edu/wiki-pi/ |Madhavi K. Ganapathiraju | | | | |At present Wiki-Pi contains 48,419 unique interactions among 10,492 proteins. However it is not clear if this is unique proteins[13] |Protein interaction Database |?? |
scope="row" | Human Protein Reference Database
| | |Institute of Bioinformatics (IOB), Bangalore, India | | | | | |
scope="row" | Pfam
| | | | |protein families database of alignments and HMMs | |Protein sequence databases | |
scope="row" | Human Proteinpedia
| | |Institute of Bioinformatics (IOB), Bangalore and Johns Hopkins University, | | | |The human Proteinpedia is based on HPRD (Human protein reference database)which is a repository hosting over 30,000 human proteins. However it is unclear how many of these are unique proteins | | |
scope="row" | Human Protein Atlas
| | | |The Swedish Government | | |It contains roughly 10 million IHC images of a bit less than 25,000 antibodies. But once again it is unclear how many of these are unique | | |
scope="row" | PRINTS
| | | | |a compendium of protein fingerprints | |Protein sequence databases | |
scope="row" | PROSITE
| | | | | |database of protein families and domains | |Protein sequence databases | |
scope="row" | Protein Information Resource
| | |Georgetown University Medical Center [GUMC] | | | | |Protein sequence databases | |
scope="row" | SUPERFAMILY
| | | | | |library of HMMs representing superfamilies and database of (superfamily and family) annotations for all completely sequenced organisms | |Protein sequence databases | |
scope="row" | Swiss-Prot
| | |Swiss Institute of Bioinformatics | | |protein knowledgebase | |Protein sequence databases | |
scope="row" | Protein Data Bank
| | | | | |Protein DataBank in Europe (PDBe),{{cite journal |display-authors=6 |vauthors=Mir S, Alhroub Y, Anyango S, Armstrong DR, Berrisford JM, Clark AR, Conroy MJ, Dana JM, Deshpande M, Gupta D, Gutmanas A, Haslam P, Mak L, Mukhopadhyay A, Nadzirin N, Paysan-Lafosse T, Sehnal D, Sen S, Smart OS, Varadi M, Kleywegt GJ, Velankar S |date=January 2018 |title=PDBe: towards reusable data delivery infrastructure at protein data bank in Europe |journal=Nucleic Acids Research |volume=46 |issue=D1 |pages=D486–D492 |doi=10.1093/nar/gkx1070 |pmc=5753225 |pmid=29126160}} ProteinDatabank in Japan (PDBj),{{cite journal |vauthors=Kinjo AR, Bekker GJ, Suzuki H, Tsuchiya Y, Kawabata T, Ikegawa Y, Nakamura H |date=January 2017 |title=Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures |journal=Nucleic Acids Research |volume=45 |issue=D1 |pages=D282–D288 |doi=10.1093/nar/gkw962 |pmc=5210648 |pmid=27789697}} Research Collaboratory for Structural Bioinformatics (RCSB){{cite journal |display-authors=6 |vauthors=Rose PW, Prlić A, Altunkaya A, Bi C, Bradley AR, Christie CH, Costanzo LD, Duarte JM, Dutta S, Feng Z, Green RK, Goodsell DS, Hudson B, Kalro T, Lowe R, Peisach E, Randle C, Rose AS, Shao C, Tao YP, Valasatava Y, Voigt M, Westbrook JD, Woo J, Yang H, Young JY, Zardecki C, Berman HM, Burley SK |date=January 2017 |title=The RCSB protein data bank: integrative view of protein, gene and 3D structural information |journal=Nucleic Acids Research |volume=45 |issue=D1 |pages=D271–D281 |doi=10.1093/nar/gkw1000 |pmc=5210513 |pmid=27794042}} |(PDB) |Protein structure databases | |
scope="row" | Structural Classification of Proteins (SCOP)
| | | | | | | |Protein structure databases | |
scope="row" | CATH database
| | | | | | | |Protein structure databases | |
scope="row" |ModBase
| | | | | |database of comparative protein structure models |Protein model databases | |
scope="row" | SIMAP
| | | | | |database of protein similarities computed using FASTA | |Protein model databases | |
scope="row" | Swiss-model
| | | | | |server and repository for protein structure models | |Protein model databases | |
scope="row" | AAindex
| | | | | | database of amino acid indices, amino acid mutation matrices, and pair-wise contact potentials | |Protein model databases | |
scope="row" |BioGRID
| | | |Samuel Lunenfeld Research Institute | |general repository for interaction datasets | |Protein-protein and other molecular interactions | |
scope="row" |RNA-binding protein database
| | | | | | | |Protein-protein and other molecular interactions | |
scope="row" |Database of Interacting Proteins
| | | | | | |Protein-protein and other molecular interactions | |
scope="row" |IntAct{{cite journal |display-authors=6 |vauthors=Hermjakob H, Montecchi-Palazzi L, Lewington C, Mudali S, Kerrien S, Orchard S, Vingron M, Roechert B, Roepstorff P, Valencia A, Margalit H, Armstrong J, Bairoch A, Cesareni G, Sherman D, Apweiler R |date=January 2004 |title=IntAct: an open source molecular interaction database |journal=Nucleic Acids Research |volume=32 |issue=Database issue |pages=D452-5 |doi=10.1093/nar/gkh052 |pmc=308786 |pmid=14681455}}
| | | | |open-source database for molecular interactions | |Protein-protein and other molecular interactions | |
scope="row" |String
| | | | | |an open source molecular interaction database to study interactions between proteins | |Protein-protein and other molecular interactions | |
scope="row" |Human Protein Atlas
| | | | | |aims at mapping all the human proteins in cells, tissues and organs |Protein expression databases | |
scope="row" | ProteinModelPortal
|[http://www.proteinmodelportal.org/ Protein Model Portal of the PSI-Nature Structural Biology Knowledgebase] |?? | |?? | | | |3D structure protein databases | |
scope="row" | SWISS-MODEL Repository
|[http://swissmodel.expasy.org/repository/ Database of annotated 3D protein structure models] |University of Basel | |The Swiss government | | | |3D structure protein databases | |
scope="row" | DisProt
|[http://www.disprot.org/ Database of Protein Disorder] |ELIXIR infrastructure |Indiana University School of Medicine, Temple University, University of Padua |funding from the European Union's Horizon 2020 |Swiss Prot/Uni Prot, CATH, Pfam, Europe PMC, BITEM, ECO, Geneontology |database of experimental evidences of disorder in proteins | |3D structure protein databases, Protein sequence databases | |
scope="row" | MobiDB
|[http://mobidb.bio.unipd.it/ Database of intrinsically disordered and mobile proteins] |John Moult, Christine Orengo, Predrag Radivojac |Italian Government | |database of intrinsic protein disorder annotation | |3D structure protein databases, Protein sequence databases | |
scope="row" | ModBase
|[http://modbase.compbio.ucsf.edu/modbase-cgi/index.cgi Database of Comparative Protein Structure Models] |Ursula Pieper, Ben Webb, Narayanan Eswar, Andrej Sali Roberto Sanchez | |UCSF, Sali Lab | | | |3D structure protein databases | |
scope="row" | PDBsum
|[http://www.ebi.ac.uk/pdbsum/ Pictorial database of 3D structures in the Protein Data Bank] |European Bioinformatics Institute 2013 | |Wellcome Trust | | | |3D structure protein databases | |
scope="row" | CCDS
|[https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi The Consensus CDS protein set database] |NCBI | |?? | | | |Sequence databases | |
scope="row" | UniProtKB
|[http://www.uniprot.org/ Universal Protein Resource (UniProt)] |?? | |?? | | | |Sequence databases | |
scope="row" | Swiss Prot/Uni Prot
|https://www.sib.swiss/swiss-prot and https://www.uniprot.org/ | |SIB Swiss Institute of Bioinformatics |European Bioinformatics Institute (EMBL-EBI) | | |Swiss-Prot has collected over 81 000 variants in roughly 13,000 human protein sequence records from peer-reviewed literature. It is unclear how many unique proteins types are present in the database. | | |
Signal transduction pathway databases
- NCI-Nature Pathway Interaction Database
- Netpath: curated resource of signal transduction pathways in humans
- Reactome: navigable map of human biological pathways, ranging from metabolic processes to hormonal signalling (Ontario Institute for Cancer Research, European Bioinformatics Institute, NYU Langone Medical Center, Cold Spring Harbor Laboratory)
- WikiPathways
Metabolic pathway and protein function databases
{{columns-list|colwidth=22em|
- BioCyc Database Collection: includes EcoCyc and MetaCyc
- BRENDA: the comprehensive enzyme information system, including FRENDA, AMENDA, DRENDA, and KENDA
- HMDB: contains detailed information about small molecule metabolites found in the human body
- KEGG PATHWAY Database (Univ. of Kyoto)
- MANET database (University of Illinois)
- Reactome: navigable map of human biological pathways, ranging from metabolic processes to hormonal signalling (Ontario Institute for Cancer Research, European Bioinformatics Institute, NYU Langone Medical Center, Cold Spring Harbor Laboratory)
- SABIO-RK: database for biochemical reactions and their kinetic properties
- WikiPathways
}}
Taxonomic databases
{{Main|List of biodiversity databases}}
Numerous databases collect information about species and other taxonomic categories. The Catalogue of Life is a special case as it is a meta-database of about 150 specialized "global species databases" (GSDs) that have collected the names and other information on (almost) all described and thus "known" species.
- BacDive: bacterial metadatabase that provides strain-linked information about bacterial and archaeal biodiversity, including taxonomy information
- Catalogue of Life: a meta-database of all species on earth
- EzTaxon-e: database for the identification of prokaryotes based on 16S ribosomal RNA gene sequences
- NCBI Taxonomy: a taxonomic database operated by NCBI and concentrating on all taxa for which DNA sequences are available (those sequences are stored by GenBank, another database operated by NCBI).
Image databases
Images play a critical role in biomedicine, ranging from images of anthropological specimens to zoology. However, there are relatively few databases dedicated to image collection, although some projects such as iNaturalist collect photos as a main part of their data. A special case of "images" are 3-dimensional images such as protein structures or 3D-reconstructions of anatomical structures. Image databases include, among others:{{cite journal | vauthors = Ellenberg J, Swedlow JR, Barlow M, Cook CE, Sarkans U, Patwardhan A, Brazma A, Birney E | display-authors = 6 | title = A call for public archives for biological image data | journal = Nature Methods | volume = 15 | issue = 11 | pages = 849–854 | date = November 2018 | pmid = 30377375 | pmc = 6884425 | doi = 10.1038/s41592-018-0195-8 }}
- Allen Brain Atlas
- Digital Brain Bank{{cite journal | vauthors = Tendler BC, Hanayik T, Ansorge O, Bangerter-Christensen S, Berns GS, Bertelsen MF, Bryant KL, Foxley S, van den Heuvel MP, Howard AF, Huszar IN, Khrapitchev AA, Leonte A, Manger PR, Menke RA, Mollink J, Mortimer D, Pallebage-Gamarallage M, Roumazeilles L, Sallet J, Scholtens LH, Scott C, Smart A, Turner MR, Wang C, Jbabdi S, Mars RB, Miller KL | display-authors = 6 | title = The Digital Brain Bank, an open access platform for post-mortem imaging datasets | journal = eLife | volume = 11 | pages = e73153 | date = March 2022 | pmid = 35297760 | pmc = 9042233 | doi = 10.7554/eLife.73153 | doi-access = free }}
- Electron Microscopy Public Image Archive (EMPIAR){{cite journal | vauthors = Iudin A, Korir PK, Salavert-Torres J, Kleywegt GJ, Patwardhan A | title = EMPIAR: a public archive for raw electron microscopy image data | journal = Nature Methods | volume = 13 | issue = 5 | pages = 387–388 | date = May 2016 | pmid = 27067018 | doi = 10.1038/nmeth.3806 | s2cid = 38996040 }}
- Image Data Resource
- Morphobank
- Morphosource
= Radiologic databases =
Additional databases
= Exosomal databases =
- ExoCarta
- Extracellular RNA Atlas: a repository of small RNA-seq and qPCR-derived exRNA profiles from human and mouse biofluids
= Mathematical model databases =
- Biomodels Database: published mathematical models describing biological processes
- [https://morpheus.gitlab.io/model/ MorpheusML Model Repository]: published, community-contributed, and educational multi-scale and multicellular models for systems biology{{cite journal | vauthors = Starruß J, de Back W, Brusch L, Deutsch A | title = Morpheus: a user-friendly modeling environment for multiscale and multicellular systems biology. | journal = Bioinformatics | volume = 30 | issue = 9 | pages = 1331-1332 | date = January 2014 | pmid = 24443380 | pmc = 3998129 | doi = 10.1093/bioinformatics/btt772 | doi-access = free }}
= Databases on [[antimicrobial resistance]] rates and antibiotic consumption =
=Databases on [[antimicrobial resistance]] mechanisms=
{{columns-list|colwidth=22em|
- AMRFinderPlus
- Antimicrobial Drug Database (AMDD)
- ARDB (no longer maintained)
- ARGminer
- BacMet
- Beta-Lactamase Database (BLAD)
- CBMAR
- The Comprehensive Antibiotic Resistance Database
- FARME
- INTEGRALL
- LacED
- MEGARes
- MUBII-TB-DB
- Mustard Database
- MvirDB
- PathoPhenoDB
- PATRIC database
- RAC: Repository of Antibiotic resistance Cassettes
- ResFinder
- TBDReaMDB
- u-CARE
- VFDB
}}
Wiki-style databases
Specialized databases
{{columns-list|colwidth=22em|
- Barcode of Life Data Systems: database of DNA barcodes
- Bacterial Pesticidal Protein Database{{cite journal | author1 = Crickmore, N. | author2 = Berry, C. | author3 = Panneerselvam, S. | author4 = Mishra, R. | author5 = Connor, T. R. | author6 = Bonning, B. C. | title = A structure-based nomenclature for Bacillus thuringiensis and other bacteria-derived pesticidal proteins | journal = Journal of Invertebrate Pathology | volume = 186 | issue = D1 | page = 107438 | date = November 2021| doi = 10.1016/j.jip.2020.107438 | pmid = 32652083 | doi-access = free }}{{cite journal | author1 = Panneerselvam S | author2 = Mishra R | author3 = Berry C | author4 = Crickmore N | author5 = Bonning BC | title = BPPRC database: a web-based tool to access and analyse bacterial pesticidal proteins. | journal = Database (Oxford) | volume = 186 | issue = D1 | page = 107438 | date = 2022| doi = 10.1093/database/baac022 | pmid = 35396594 | pmc = 9216523 | doi-access = free }}
- The Cancer Genome Atlas (TCGA): provides data from hundreds of cancer samples obtained using high-throughput techniques such as gene expression profiling, copy number variation profiling, SNP genotyping, genome-wide DNA methylation profiling, microRNA profiling, and exon sequencing of at least 1,200 genes
- Cellosaurus: a knowledge resource on cell lines
- CTD (Comparative Toxicogenomics Database): describes chemical-gene-disease interactions
- DiProDB: a database to collect and analyse thermodynamic, structural and other dinucleotide properties
- [http://www.housekeeping.unicamp.br Housekeeping and Reference Transcript Atlas (HRT Atlas)]{{cite journal | vauthors = Hounkpe BW, Chenou F, de Lima F, De Paula EV | title = HRT Atlas v1.0 database: redefining human and mouse housekeeping genes and candidate reference transcripts by mining massive RNA-seq datasets | journal = Nucleic Acids Research | volume = 49 | issue = D1 | pages = D947–D955 | date = January 2021 | pmid = 32663312 | doi = 10.1093/nar/gkaa609| pmc=7778946 | doi-access = free }} web-based tool for searching cell specific candidate reference genes/transcripts suitable for qPCR experiment normalization. HRT Atlas also describes a complete list of human and mouse housekeeping genes and transcripts
- Dryad: repository of data underlying scientific publications in the basic and applied biosciences
- Edinburgh Mouse Atlas
- EPD Eukaryotic Promoter Database
- FINDbase (the Frequency of INherited Disorders database)
- GigaDB: repository of large scale datasets underlying scientific publications in the biological and biomedical research
- HGNC (HUGO Gene Nomenclature Committee): a resource for approved human gene nomenclature
- International Human Epigenome Consortium:[http://epigenomesportal.ca/ihec/index.html (IHEC) data portal] integrates epigenomic reference data from well-known national endeavors such as the Canadian CEEHRC,[http://www.epigenomes.ca/ CEEHRC] European Blueprint,[http://www.blueprint-epigenome.eu/ Blueprint] European Genome-phenome Archive (EGA[https://www.ebi.ac.uk/ega/ EGA]), US ENCODE and NIH Roadmap, German DEEP,[http://www.deutsches-epigenom-programm.de DEEP] Japanese CREST,[http://crest-ihec.jp/english/database/index.html CREST] Korean KNIH, Singapore's GIS and China's EpiHK{{cite journal|title=Sharing epigenomes globally|journal=Nature Methods|date=2018|volume=15|issue=3|page=151|doi=10.1038/nmeth.4630|language=En|issn=1548-7105|doi-access=free}}
- MethBase: database of DNA methylation data visualized on the UCSC Genome Browser
- Minimotif Miner: database of short contiguous functional peptide motifs
- Oncogenomic databases: a compilation of databases that serve for cancer research
- PubMed: references and abstracts on life sciences and biomedical topics
- RIKEN integrated database of mammals
- TDR Targets: a chemogenomics database focused on drug discovery in tropical diseases
- TRANSFAC: a database about eukaryotic transcription factors, their genomic binding sites and DNA-binding profiles
- JASPAR: a database of manually curated, non-redundant transcription factor binding profiles.
- [https://metosite.uma.es MetOSite]: a database about methionine sulfoxidation sites and its functional roles in proteins{{cite journal | vauthors = Valverde H, Cantón FR, Aledo JC | title = MetOSite: an integrated resource for the study of methionine residues sulfoxidation | journal = Bioinformatics | volume = 35 | issue = 22 | pages = 4849–4850 | date = November 2019 | pmid = 31197322 | pmc = 6853639 | doi = 10.1093/bioinformatics/btz462 }}
- Healthcare Cost and Utilization Project (HCUP) is the largest collection of hospital care data in the United States. It includes hundreds of millions of inpatient, outpatient, and emergency records.
- [https://lexas.f.u-tokyo.ac.jp LEXAS] curates descriptions of biological experiments from PMC articles.
- Bovine Metabolome Database is a free web database that lists known bovine metabolites }}
References
{{reflist}}
External links
- [https://web.archive.org/web/20060112045100/http://www.oxfordjournals.org/nar/database/c Nucleic Acid Research Molecular Biology Database Collection] – over 1,600 databases
- [https://www.oxfordjournals.org/nar/database/c Nucleic Acid Research (NAR) Database Summary Paper Category List]
{{Bioinformatics}}