List of cheminformatics toolkits

{{short description|None}}

Cheminformatics toolkits are notable software development kits that allow cheminformaticians to develop custom computer applications for use in virtual screening, chemical database mining, and structure-activity studies.{{cite book|title=Handbook of Chemoinformatics Algorithms|date=April 2010|publisher=Chapman & Hall|isbn=978-1420082920|author=Jean-Loup Faulon|author2=Andreas Bender}}{{cite book|last=Johann Gasteiger|title=Chemoinformatics|url=https://archive.org/details/isbn_9783527306817_0|url-access=registration|date=November 2003|publisher=Wiley-VCH|isbn=3527306811}} Toolkits are often used for experimentation with new methodologies. Their most important functions deal with the manipulation of chemical structures and comparisons between structures. Programmatic access is provided to properties of individual bonds and atoms.

Functionality

Toolkits provide the following functionality:

  • Read and save structures in various chemistry file formats.
  • Determine if one structure is a substructure of another (substructure matching).
  • Determine if two structures are equal (exact matching).
  • Identification of substructures common to structures in a set (maximal common substructure, MCS).
  • Disassemble molecules, splitting into fragments.
  • Assemble molecules from elements or submolecules.
  • Apply reactions on input reactant structures, resulting in output of reaction product structures.
  • Generate molecular fingerprints. Fingerprints are bit-vectors where individual bits correspond to the presence or absence of structural features. The most important use of fingerprints is in indexing of chemistry databases.

List of notable cheminformatics toolkits

class="wikitable"
NameLicenseAPIsHome PageNotes
CDKOpen sourceJava, R, Pythonhttps://cdk.github.io/{{cite journal|last1=Steinbeck C|author2=Han Y|author3=Kuhn S|author4=Horlacher O|author5=Luttmann E|author6=Willighagen E|title=The Chemistry Development Kit|journal=J. Chem. Inf. Comput. Sci.|year=2003|volume=43|issue=2|pages=493–500|doi=10.1021/ci025584y|first1=C.|pmid=12653513|pmc=4901983}}{{cite journal|last1=Steinbeck C.|author2=Hoppe C.|author3=Kuhn S.|author4=Floris M.|author5=Guha R.|author6=Willighagen E.L.|title=Recent Developments of the Chemistry Development Kit (CDK) - An Open-Source Java Library for Chemo- and Bioinformatics|journal=Curr. Pharm. Des.|year=2006|volume=12|issue=17|pages=2111–2120|doi=10.2174/138161206777585274|first1=Christoph|pmid=16796559|hdl=2066/35445|hdl-access=free}}{{cite journal |last1=Willighagen |first1=Egon L. |last2=Mayfield |first2=John W. |last3=Alvarsson |first3=Jonathan |last4=Berg |first4=Arvid |last5=Carlsson |first5=Lars |last6=Jeliazkova |first6=Nina |last7=Kuhn |first7=Stefan |last8=Pluskal |first8=Tomáš |last9=Rojas-Chertó |first9=Miquel |last10=Spjuth |first10=Ola |last11=Torrance |first11=Gilleain |last12=Evelo |first12=Chris T. |last13=Guha |first13=Rajarshi |last14=Steinbeck |first14=Christoph |title=The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching |journal=Journal of Cheminformatics |date=December 2017 |volume=9 |issue=1 |pages=33 |doi=10.1186/s13321-017-0220-4|pmid=29086040 |pmc=5461230 |s2cid=2019985 |doi-access=free }}
IndigoOpen sourceJava, .NET, Pythonhttps://github.com/epam/Indigo
Molecular Operating Environment (MOE)ProprietaryScientific Vector Languagehttps://web.archive.org/web/20160909172415/http://www.chemcomp.com/MOE-Cheminformatics_and_QSAR.htm
Open BabelOpen sourceC++, Python, Java, Perl, C#, Rubyhttp://openbabel.org/,reads and writes all chemical file formats.{{cite journal|last=O’Boyle N|author2=Banck M|author3=James C|author4=Morley C|author5=Vandermeersch T|author6=Hutchison G|title=Open babel: an open chemical|journal=Journal of Cheminformatics|year=2011|volume=3|issue=33|pages=33|doi=10.1186/1758-2946-3-33|pmid=21982300|pmc=3198950 |doi-access=free }}
RDKitBSD-3-Clause LicenseC++, Pythonhttps://www.rdkit.org/

References