Apache Taverna

{{Use dmy dates|date=April 2022}}

{{Infobox software

| name = Apache Taverna

| logo = Taverna-wheel-logo.png

| screenshot = Dragon-workflow.png

| caption = Taverna Workbench

| developer = Apache Software Foundation
myGrid for 2.x

| latest release version = 3.1

| latest release date = {{Start date and age|2016|07|01}}

| operating system = Linux
Mac OS X
Microsoft Windows

| programming language = Java

| genre = Scientific workflow system

| license = Apache License 2.0 (LGPL for 2.x)

| website = {{Official URL}}

}}

Apache Taverna was an open source software tool for designing and executing workflows, initially created by the myGrid project under the name Taverna Workbench, then a project under the Apache Software Foundation incubator.{{cite web|url=http://taverna.incubator.apache.org|title=Apache Taverna|website=apache.org}}{{cite web|url=http://www.taverna.org.uk|website=taverna.org.uk|archiveurl=https://web.archive.org/web/20120214024850/http://www.taverna.org.uk/|archivedate=2012-02-14|title=Taverna Workflow Management System Powerful, scalable, open source & domain independent tools for designing and executing workflows. Access to 3500+ resources}} Taverna allowed users to integrate many different software components, including web services, such as those provided by the National Center for Biotechnology Information, the European Bioinformatics Institute, the DNA Data Bank of Japan (DDBJ), SoapLab, BioMOBY and EMBOSS. The set of available services was not finite and users could import new service descriptions into the Taverna Workbench.{{Cite book | vauthors = Belhajjame K, Wolstencroft K, Corcho O, Oinn T, Tanoh F, William A, Goble C | title = 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID) | doi = 10.1109/CCGRID.2008.17 | chapter = Metadata Management in the Taverna Workflow System | pages = 651–656 | year = 2008 | isbn = 9780769531564 | s2cid = 9996653 | chapter-url = http://oa.upm.es/2631/ }}{{cite journal | vauthors = Li P, Castrillo JI, Velarde G, Wassink I, Soiland-Reyes S, Owen S, Withers D, Oinn T, Pocock MR, Goble CA, Oliver SG, Kell DB | display-authors = 6 | title = Performing statistical analyses on quantitative data in Taverna workflows: an example using R and maxdBrowse to identify differentially-expressed genes from microarray data | journal = BMC Bioinformatics | volume = 9 | pages = 334 | date = August 2008 | pmid = 18687127 | pmc = 2528018 | doi = 10.1186/1471-2105-9-334 | doi-access = free }}{{cite journal | vauthors = Oinn T, Addis M, Ferris J, Marvin D, Senger M, Greenwood M, Carver T, Glover K, Pocock MR, Wipat A, Li P | display-authors = 6 | title = Taverna: a tool for the composition and enactment of bioinformatics workflows | journal = Bioinformatics | volume = 20 | issue = 17 | pages = 3045–54 | date = November 2004 | pmid = 15201187 | doi = 10.1093/bioinformatics/bth361 | doi-access = free }}{{Cite journal | vauthors = Oinn T, Greenwood M, Addis M, Alpdemir MN, Ferris J, Glover K, Goble C, Goderis A, Hull D, Marvin D, Li P | display-authors = 6 | title = Taverna: Lessons in creating a workflow environment for the life sciences | journal = Concurrency and Computation: Practice and Experience | volume = 18 | issue = 10 | pages = 1067–1100 | year = 2006 | doi = 10.1002/cpe.993 | s2cid = 10219281 | url = https://eprints.soton.ac.uk/260908/1/taverna-ccpe-reviewed.pdf }}{{cite journal | vauthors = Hull D, Wolstencroft K, Stevens R, Goble C, Pocock MR, Li P, Oinn T | title = Taverna: a tool for building and running workflows of services | journal = Nucleic Acids Research | volume = 34 | issue = Web Server issue | pages = W729-32 | date = July 2006 | pmid = 16845108 | pmc = 1538887 | doi = 10.1093/nar/gkl320 | author-link4 = Carole Goble | author-link3 = Robert David Stevens }} {{open access}}{{cite journal | vauthors = Kawas E, Senger M, Wilkinson MD | title = BioMoby extensions to the Taverna workflow management and enactment software | journal = BMC Bioinformatics | volume = 7 | pages = 523 | date = November 2006 | pmid = 17137515 | pmc = 1693925 | doi = 10.1186/1471-2105-7-523 | doi-access = free }}{{cite journal | vauthors = Sroka J, Kaczor G, Tyszkiewicz J, Kierzek AM | title = XQTav: an XQuery processor for Taverna environment | journal = Bioinformatics | volume = 22 | issue = 10 | pages = 1280–1 | date = May 2006 | pmid = 16551662 | doi = 10.1093/bioinformatics/btl101 | doi-access = free }}{{cite journal | vauthors = Wolstencroft K, Haines R, Fellows D, Williams A, Withers D, Owen S, Soiland-Reyes S, Dunlop I, Nenadic A, Fisher P, Bhagat J, Belhajjame K, Bacall F, Hardisty A, Nieva de la Hidalga A, Balcazar Vargas MP, Sufi S, Goble C | display-authors = 6 | title = The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud | journal = Nucleic Acids Research | volume = 41 | issue = Web Server issue | pages = W557-61 | date = July 2013 | pmid = 23640334 | pmc = 3692062 | doi = 10.1093/nar/gkt328 }}

Taverna Workbench provided a desktop authoring environment and enactment engine for scientific workflows. The Taverna workflow engine was also available separately, as a Java API, command line tool or as a server.

Taverna was used by users in many domains, such as bioinformatics,{{cite journal | vauthors = Stevens RD, Robinson AJ, Goble CA | title = myGrid: personalised bioinformatics on the information grid | journal = Bioinformatics | volume = 19 | pages = i302-4 | year = 2003 | pmid = 12855473 | doi = 10.1093/bioinformatics/btg1041 | title-link = MyGrid | issue = Suppl 1 | author-link1 = Robert David Stevens | author-link3 = Carole Goble | doi-access = free }}{{cite journal | vauthors = Stevens RD, Tipney HJ, Wroe CJ, Oinn TM, Senger M, Lord PW, Goble CA, Brass A, Tassabehji M | display-authors = 6 | title = Exploring Williams-Beuren syndrome using myGrid | journal = Bioinformatics | volume = 20 | pages = i303-10 | date = August 2004 | issue = Suppl 1 | pmid = 15262813 | doi = 10.1093/bioinformatics/bth944 | author-link8 = Andy Brass | author-link7 = Carole Goble | author-link1 = Robert David Stevens | doi-access = free }} cheminformatics,{{cite journal | vauthors = Truszkowski A, Jayaseelan KV, Neumann S, Willighagen EL, Zielesny A, Steinbeck C | title = New developments on the cheminformatics open workflow environment CDK-Taverna | journal = Journal of Cheminformatics | volume = 3 | pages = 54 | date = December 2011 | pmid = 22166170 | pmc = 3292505 | doi = 10.1186/1758-2946-3-54 | doi-access = free }} medicine, astronomy,{{Cite book | vauthors = Hook RN, Romaniello M, Ullgrén M, Järveläinen P, Maisala S, Oittinen T, Savolainen V, Solin O, Tyynelä J, Peron M, Izzo C | display-authors = 6 | doi = 10.1007/978-3-540-76963-7_23 | chapter = ESO Reflex: A Graphical Workflow Engine for Running Recipes | title = The 2007 ESO Instrument Calibration Workshop | series = ESO Astrophysics Symposia European Southern Observatory | pages = 169–175 | year = 2008 | isbn = 978-3-540-76962-0 }} social science, music, and digital preservation.{{cite web |url= http://www.scape-project.eu/wp-content/uploads/2012/05/SCAPE_D15.1_ONB_v1.0.pdf | vauthors = Raditsch M, Schlarb S, Møldrup-Dalum P, Medjkoune L |title=Web content executable workflows for experimental executio |date=2012}}

Some of the services for the use in Taverna workflows could be discovered through the BioCatalogue - a public, centralised and curated registry of Life Science Web services. Taverna workflows could also be shared with other people through the myExperiment social web site for scientists.{{cite journal | vauthors = Goble CA, Bhagat J, Aleksejevs S, Cruickshank D, Michaelides D, Newman D, Borkum M, Bechhofer S, Roos M, Li P, De Roure D | display-authors = 6 | title = myExperiment: a repository and social network for the sharing of bioinformatics workflows | journal = Nucleic Acids Research | volume = 38 | issue = Web Server issue | pages = W677-82 | date = July 2010 | pmid = 20501605 | pmc = 2896080 | doi = 10.1093/nar/gkq429 | url = }} BioCatalogue and myExperiment are another two product from the myGrid consortium.

Taverna was used in over 350 organizations around the world, both academic and commercial. As of 2011, there have been over 80,000 downloads of Taverna across different versions.

On February 20, 2020, Apache Incubator retired the project and removed the code from its website.

Capabilities

Taverna workflows can invoke general SOAP/WSDL or REST Web services, and more specific SADI, BioMart, BioMoby and SoapLab Web services. It can also invoke R statistical services, local Java code, external tools on local and remote machines (via ssh), do XPath and other text manipulation, import a spreadsheet and include sub-workflows.

Taverna Workbench includes the ability to monitor the running of a workflow and to examine the provenance of the data produced, exposing details of the workflow run as a W3C PROV-O RDF provenance graph,{{Cite conference | doi = 10.1145/2457317.2457376| chapter = A workflow PROV-corpus based on Taverna and Wings| title = Proceedings of the Joint EDBT/ICDT 2013 Workshops on - EDBT '13| pages = 331| year = 2013| vauthors = Belhajjame K, Zhao J, Garijo D, Garrido A, Soiland-Reyes S, Alper P, Corcho O | isbn = 9781450315999}} within a structured Research Object bundle{{cite journal| vauthors = Soiland-Reyes S, Gamble M, Haines R |title=Research Object Bundle 1.0|url=https://w3id.org/bundle/2014-11-05/|website=researchobject.org|access-date=28 January 2015|ref=robundle|format=Specification|doi=10.5281/zenodo.12586|date=2014-11-05}} ZIP file that includes inputs, outputs, intermediate values and the executed workflow definition; together this format is called TavernaProv.{{cite journal |last1=Soiland-Reyes |first1=Stian |last2=Alper |first2=Pinar |last3=Goble |first3=Carole |title=Tracking workflow execution with TavernaProv |url=https://zenodo.org/record/51314 |website=zenodo.org |doi=10.5281/zenodo.51314 |doi-access=free |date=11 May 2016}}

Taverna includes the ability to search for services described in BioCatalogue to invoke from workflows. However, services do not need to be described within BioCatalogue to be included in workflows as they can be added from a WSDL Web Service description or entered as a REST URI pattern.

Taverna also includes the capability to search for workflows on myExperiment. The Taverna Workbench can download, modify and run workflows discovered on myExperiment, and also upload created workflows in order to share them with others using the social aspects of myExperiment.

Taverna workflows do not need to be executed within the Taverna Workbench. Workflows can also be run by:

  • a command line execution tool
  • remote execution server that allow Taverna workflows to be run on other machines, on computational grids, clouds, from Web pages and portals
  • online workflow designer and enactor OnlineHPC

Taverna allows pipelining and streaming of data.{{cite web|title=Implicit iteration|url=http://dev.mygrid.org.uk/wiki/display/tav250/Implicit+iteration|website=Taverna 2.5 User Manual|publisher=myGrid|access-date=28 January 2015|date=2014-09-09|archive-date=22 March 2015|archive-url=https://web.archive.org/web/20150322101049/http://dev.mygrid.org.uk/wiki/display/tav250/Implicit+iteration|url-status=dead}} This means that services downstream in the workflow can start as soon as the first data item is received, without waiting for the whole data list to become available from upstream services and iterations. Taverna services execute in parallel when possible, as Taverna workflows are primarily data-driven rather than control-driven.{{cite web|last1=Soiland-Reyes|first1=Stian | name-list-style = vanc |title=Parallel service invocations|url=http://taverna.knowledgeblog.org/2010/12/13/parallel-service-invocations/|website=The Taverna Knowledge Blog|publisher=knowledgeblog.org|access-date=28 January 2015|date=2010-12-13}}

Image:Taverna 2.1 Splashscreen.png]]

Open source community

Taverna has been an open-source project since 2003,{{cite web| vauthors = Soiland-Reyes S, Sufi S, Seaborne S |title=Taverna Proposal |url= https://wiki.apache.org/incubator/TavernaProposal#Inexperience_with_Open_Source |website=Incubator Wiki |publisher=Apache Software Foundation |access-date=28 January 2015 |date=2014-09-23 }} with contributors from multiple academic and industry institutions. In October 2014 Taverna became an independent Apache incubator project, and changed its name to Apache Taverna (incubating). The project is developing Apache Taverna 3.x,{{cite web|url=http://taverna.incubator.apache.org/download/|title=Download Apache Taverna|publisher=Apache Software Foundation|access-date=28 January 2015}} which license changed from LGPL 2.1 to Apache License 2.0.

References