Bioinformatic Harvester
{{primary sources |date=April 2018}}
{{Infobox software
| name = Bioinformatic Harvester
|developer = Urban Liebel, Björn Kindler
|latest release version = 4
|latest release date = {{release date and age|2011|05|24}}
|operating_system = Web based
|genre = Bioinformatics tool
|license =
|website = {{URL|http://harvester.kit.edu}} {{dead link|date=July 2017 |bot=InternetArchiveBot |fix-attempted=yes }}
}}
The Bioinformatic Harvester was a bioinformatic meta search engine created by the European Molecular Biology Laboratory{{Cite journal|title=Information retrieval on Internet using meta-search engines: A review|url=http://nopr.niscair.res.in/bitstream/123456789/2243/1/JSIR%2067(10)%20739-746.pdf?trk=right_banner&id=1400972373&ref=88d4042477e42c447b6b62f13e16f07a|author=Manoj, M|author2=Elizabeth, Jacob |date=Oct 2008|pages=739–746|issn=0022-4456|journal=Journal of Scientific & Industrial Research |volume=67 | issue = 10 }} and subsequently hosted and further developed by KIT Karlsruhe Institute of Technology for genes and protein-associated information. Harvester currently works for human, mouse, rat, zebrafish, drosophila and arabidopsis thaliana based information. Harvester cross-links >50 popular bioinformatic resources and allows cross searches. Harvester serves tens of thousands of pages every day to scientists and physicians. Since 2014 the service is down.
How Harvester works
Harvester collects information from protein and gene databases along with information from so called "prediction servers." Prediction server e.g. provide online sequence analysis for a single protein. Harvesters search index is based on the IPI and UniProt protein information collection. The collections consists of:
- ~72.000 human, ~57.000 mouse, ~41.000 rat, ~51.000 zebrafish, ~35.000 arabidopsis protein pages, which cross-link ~50 major bioinformatic resources.
Harvester crosslinks several types of information
=Text based information=
From the following databases:
- UniProt, one of the largest protein databases
- SOURCE, convenient gene information overview
- Simple Modular Architecture Research Tool (SMART)
- SOSUI, predicts transmembrane domains
- PSORT, predicts protein localisation
- HomoloGene, compares proteins from different species
- gfp-cdna, protein localisation with fluorescence microscopy
- International Protein Index (IPI)
= Databases rich in graphical elements =
These databases are not collected, but are crosslinked, being displayed via iframes. An iframe is a window within an HTML page for an embedded view of and interactive access to the linked database. Several such iframes are combined on a single Harvester protein page. This allows simultaneous, convenient comparison of information from several databases.
- NCBI-BLAST, an algorithm for comparing biological sequences from the NCBI
- Ensembl, automatic gene annotation by the EMBL-EBI and Sanger Institute
- FlyBase is a database of model organism Drosophila melanogaster
- GoPubMed is a knowledge-based search engine for biomedical texts
- iHOP, information hyperlinked over proteins via gene/protein synonyms
- Mendelian Inheritance in Man project catalogues all the known diseases
- RZPD, German resources Center for genome research in Berlin/Heidelberg
- STRING, Search Tool for the Retrieval of Interacting Genes/Proteins, developed by EMBL, SIB and UZH
- Zebrafish Information Network
- [https://web.archive.org/web/20171231015119/http://locate.imb.uq.edu.au/ LOCATE] subcellular localisation database (mouse)
= Access from external application =
- Genome browser, working draft assemblies for genomes UCSC
- Google Scholar
- Mitocheck
- PolyMeta, meta search engine for Google, Yahoo, MSN, Ask, Exalead, AllTheWeb, GigaBlast
What one can find
Harvester allows a combination of different search terms and single words.
Search Examples:
- Gene-name: "golga3"
- Gene-alias: "ADAP-S ADAS ADHAPS ADPS" (one gene name is sufficient)
- Gene-Ontologies: "Enzyme linked receptor protein signaling pathway"
- Unigene-Cluster: "Hs.449360"
- Go-annotation: "intra-Golgi transport"
- Molecular function: "protein kinase binding"
- Protein: "Q9NPD3"
- Protein domain: "SH2 sar"
- Protein Localisation: "endoplasmic reticulum"
- Chromosome: "2q31"
- Disease relevant: use the word "diseaselink"
- Combinations: "golgi diseaselink" (finds all golgi proteins associated with a disease)
- mRNA: "AL136897"
- Word: "Cancer"
- Comment: "highly expressed in heart"
- Author: "Merkel, Schmidt"
- Publication or project: "cDNA sequencing project"
See also
Literature
- {{cite journal |vauthors=Liebel U, Kindler B, Pepperkok R |title='Harvester': a fast meta search engine of human protein resources |journal=Bioinformatics |volume=20 |issue=12 |pages=1962–3 |date=August 2004 |pmid=14988114 |doi=10.1093/bioinformatics/bth146 |doi-access=free }}
- {{cite book |vauthors=Liebel U, Kindler B, Pepperkok R |chapter=Bioinformatic "Harvester": A Search Engine for Genome-Wide Human, Mouse, and Rat Protein Resources |title=GTPases Regulating Membrane Dynamics |volume=404 |pages=19–26 |year=2005 |pmid=16413254 |doi=10.1016/S0076-6879(05)04003-6 |series=Methods in Enzymology |isbn=9780121828097 }}
Notes and references
{{reflist|30em}}
External links
- {{Official website|http://harvester.kit.edu}} {{dead link|date=November 2016 |bot=InternetArchiveBot |fix-attempted=yes }} Bioinformatic Harvester V at KIT Karlsruhe Institute of Technology
- {{Cite web |url=http://harvester42.fzk.de/ |title=Harvester42 at KIT - integrating 50 general search engines |access-date=2013-01-06 |archive-url=https://archive.today/20130106060017/http://harvester42.fzk.de/ |archive-date=2013-01-06 |url-status=dead }}
Category:Bioinformatics software
Category:Internet search engines