corpus manager
{{Short description|Software for searching a text collection}}
{{Use dmy dates|date=August 2016}}
A corpus manager (corpus browser or corpus query system) is a tool for multilingual corpus analysis, which allows effective searching in corpora.{{cite web |url=http://wiki.korpus.cz/doku.php/pojmy:korpusovy_manazer |title=Korpusový manažer
|author= |date=8 April 2015 |website=Wiki Český národní korpus
|publisher=Český národní korpus |access-date=18 April 2015}}
A corpus manager usually represents a complex tool that allows one to perform searches for language forms or sequences. It may provide information about the context or allow the user to search by positional attributes, such as lemma, tag, etc. These are called concordances. Other features include the ability to search for collocations, frequency statistics as well as metadata information about the processed text.{{cite journal|last1=Kouklakis|first1=George|last2=Mikros|first2=George|last3=Markopoulos|first3=George|last4=Koutsis|first4=Ilias|title=Corpus Manager A Tool for Multilingual Corpus Analysis|journal=Proceedings from Corpus Linguistics Conference|date=2007|pages=1–12|url=http://www.birmingham.ac.uk/documents/college-artslaw/corpus/conference-archives/2007/244Paper.pdf|publisher=University of Athens}}
The narrower meaning of corpus manager refers only to the server side or the corpus query engine, whereas the client side is simply called the user interface.
A corpus manager can be software installed on a personal computer or it might be provided as a web service.
List of corpus managers
- BNCweb[http://corpora.lancs.ac.uk/BNCweb/ interface to the British National Corpus] more about British National Corpus – a web-based interface for the British National Corpus
- CQPweb[https://cqpweb.lancs.ac.uk/ CQPweb Main Page] - a web-based interface for the study of a large variety of corpora including the Spoken BNC2014
- BYU-BNC[http://corpus.byu.edu/bnc/ BYU-BNC: BRITISH NATIONAL CORPUS] interface – a website that allows searches of the British National Corpora and others created at Brigham Young University
- Coma[https://exmaralda.org/en/corpus-manager-en/ EXMARaLDA Corpus-Manager] Hamburger Zentrum für Sprachkorpora – a tool extension of the system EXMARaLDA for working with oral corpora on a computer
- NoSketch Engine[http://nlp.fi.muni.cz/trac/noske NoSketch Engine (an open-source project combining Manatee, Bonito and Crystal into a powerful and free corpus management system)] – a free open-source corpus management system combining Manatee (back-end) and Bonito (web interface)
- KonText[https://kontext.korpus.cz/first_form A basic query interface for working with corpora] Institute of the Czech National Corpus (ICNC), Faculty of Arts, Charles University in Prague – an extended and modified web interface to NoSketch Engine (a Bonito replacement)
- Sketch Engine[http://sketchengine.co.uk/ The Sketch Engine homepage][http://www.uow.edu.au/~dlee/software.htm Concordancers, Search Engines, Text-analysis Tools] {{Webarchive|url=https://web.archive.org/web/20150315073352/http://www.uow.edu.au/~dlee/software.htm |date=15 March 2015 }} a list on University of Wollongong website – text corpus management and analysis software with more than 500 corpora in 90+ languages
- Spoco {{Citation |author=Ruprecht von Waldenfels|author2= Michał Woźniak |title= SpoCo - a simple and adaptable web interface for dialect corpora|journal= Journal for Language Technology and Computational Linguistics|volume= 31|issue=1 |page= 133-148 |year= 2016|doi= 10.21248/jlcl.31.2016.206 |access-date=|doi-access= free}}
- WordSmith Tools[http://www.lexically.net/wordsmith/index.html WordSmith Tools homepage] – a software package primarily for linguists