ISLRN


The ISLRN, or International Standard Language Resource Number, is a persistent unique identifier for language resources.[http://www.aclweb.org/anthology/W11-3310 Khalid Choukri, Jungyeul Park, Olivier Hamon, Victoria Arranz, Proposal for the International Standard Language Resource Number. Proceedings of Workshop on Language Resources, Technology and Services in the Sharing Paradigm, Chiang Mai, Thailand, November 12, 2011.]

Context

On November 18, 2013,[http://www.elra.info/en/dissemination/elra-events/elra-nlp12-meeting/paris-declaration/ NLP12 Paris Declaration] 12 major organisations (see list below) from the fields of Language Resources and Technologies, Computational Linguistics, and Digital Humanities held a cooperation meeting in Paris, France, and agreed to announce the establishment of the International Standard Language Resource Number (ISLRN), to be assigned to each Language Resource.

Four of the 12 organisations, marked ST in the list below, constitute the ISLRN Steering Committee (ST):

  • ADHO
  • ACL
  • Asian Federation of Natural Language Processing ST
  • COCOSDA, International Committee for the Coordination & Standardisation of Speech Databases and Assessment Techniques
  • ICCL (COLING)
  • European Data Forum
  • ELRA ST
  • [http://www.eamt.org/iamt.php IAMT, International Association for Machine Translation] {{Webarchive|url=https://web.archive.org/web/20100624162302/http://www.eamt.org/iamt.php |date=2010-06-24 }}
  • ISCA
  • LDC ST
  • [www.cocosda.org Oriental COCOSDA] ST
  • [http://rma.nwu.ac.za/ RMA, Language Resource Management Agency]

Size and Content

The Joint Research Centre (JRC),[https://ec.europa.eu/jrc/en/language-technologies Joint Research Centre] the European Commission's in-house science service, was the first organisation to adopt the ISLRN initiative and requested.

2500 resources and tools have been allocated an ISLRN. These resources include written data (Annotated corpus, Annotated text, List of misspelled words, Terminological database, Treebank, Wordnet, etc.) and speech corpora (Synthesised Speech, Transcripts and Audiovisual Recordings, Conversational Speech, Folk Sayings, etc.).

Objectives

Providing Language Resources with unique names and identifiers using a standardized nomenclature ensures that each Language Resource can be identified, and streamlines its citation with proper references in activities within Human Language Technology as well as in documents and scientific publications. Such a unique identifier also enhances reproducibility,[http://www.lrec-conf.org/proceedings/lrec2016/workshops/LREC2016Workshop-4REAL_Proceedings.pdf 4REAL Workshop on Research Results Reproducibility, and Resources Citation in Science and Technology of Language] an essential feature of scientific work.

References

{{Reflist}}