COCOA (digital humanities)
{{Short description|Text file utility and file format}}
COCOA (an acronym derived from COunt and COncordance Generation on Atlas) was an early text file utility and associated file format for digital humanities, then known as humanities computing. It was approximately 4000 punched cards of FORTRAN and created in the late 1960s and early 1970s at University College London and the Atlas Computer Laboratory in Harwell, Oxfordshire. Functionality included word-counting and concordance building.{{Cite journal |author= Paul E. Corcoran |title= COCOA: A FORTRAN Program for Concordance and Word-count Processing of Natural Language Texts |journal= Behavior Research Methods & Instrumentation |volume= 6 |number= 6 |date= November 1974 |page= 566 |doi= 10.3758/BF03201351 |doi-access= free }}{{Cite journal |author= Colin Day and Ian Marriott |title= Software Reviews: COCOA: A Word Count and Concordance Generator |journal= Computers and the Humanities |volume= 10 |number= 1 |date= February 1976 |page= 56 |doi= 10.1007/BF02399143 |s2cid= 198177017 }}{{Cite web |author= D. B. Russell |title= COCOA - A Word Count and Concordance Generator |publisher= Associates Technology Literature Applications Society |year= 1965 |url= http://www.chilton-computing.org.uk/acl/applications/cocoa/p001.htm |accessdate= 20 October 2013 }}{{Cite web |title= The History of Humanities Computing |author= Susan Hockey |publisher= University of Illinois |url= http://nora.lis.uiuc.edu:3030/companion/view?docId=blackwell/9781405103213/9781405103213.xml&doc.view=print&chunk.id=ss1-2-1&toc.depth=1&toc.id=0 |url-status= dead |archiveurl= https://web.archive.org/web/20130918143131/http://nora.lis.uiuc.edu:3030/companion/view?docId=blackwell%2F9781405103213%2F9781405103213.xml&doc.view=print&chunk.id=ss1-2-1&toc.depth=1&toc.id=0 |archivedate= 18 September 2013 |accessdate= 20 October 2013 }}
Oxford Concordance Program
The Oxford Concordance Program format was a direct descendant of COCOA developed at Oxford University Computing Services. The Oxford Text Archive holds items in this format.{{Cite web |title= Concordia discordantium canonum ac primum de iure naturae et constitutionis |date= 14 January 1987 |publisher= University of Oxford Text Archive |accessdate= 20 October 2013 |url= http://ota.ox.ac.uk/desc/0699 |last1= Gratian |first1= 12th Cent }}
Later developments
The COCOA file format bears at least a passing similarity to the later markup languages such as SGML and XML. A noticeable difference with its successors is that COCOA tags are flat and not tree structured. In that format, every information type and value encoded by a tag should be considered true until the same tag changes its value. Members of the Text Encoding Initiative community maintain legacy support for COCOA,{{Cite web |title= This script is used to convert COCOA to TEI |format= XSLT |year= 2010 |author= James Cummings, Sebastian Rahtz |publisher= Oxford University |url= https://github.com/TEIC/Stylesheets/blob/dev/cocoa/cocoa-to-tei.xsl |accessdate= 3 April 2018 }}{{Cite web|url=https://github.com/TEIC/Stylesheets/tree/master/cocoa|title = Stylesheets/Cocoa at dev · TEIC/Stylesheets|website = GitHub}} although most in-demand texts and corpora have already been migrated to more widely understood formats such as TEI XML.{{Cite web|url=http://www.helsinki.fi/varieng/CoRD/corpora/HelsinkiCorpus/HC_XML.html|title = Corpus Resource Database (CoRD)}}