Origin of language

{{Short description|Relationship between language and human evolution}}

{{Use dmy dates|date=November 2024}}

{{Redirects here|Evolution of language|development of languages over time|Evolution of languages}}

{{Linguistics|Topics}}

The origin of language, its relationship with human evolution, and its consequences have been subjects of study for centuries. Scholars studying the origins of language draw inferences from evidence such as the fossil record, archaeological evidence, and contemporary language diversity. They also draw on studies of language acquisition and on comparisons between human language and systems of animal communication (particularly those of other primates).{{Cite news |last=Shah |first=Sonia |date=20 September 2023 |title=The Animals Are Talking. What Does It Mean? |url=https://www.nytimes.com/2023/09/20/magazine/animal-communication.html#permid=127890141 |url-status=live |archive-url=https://ghostarchive.org/archive/20230921140922/https://www.nytimes.com/2023/09/20/magazine/animal-communication.html#permid=127890141 |archive-date=21 September 2023 |access-date=21 September 2023 |work=The New York Times}} Many argue that the origins of language are closely related to the origins of modern human behavior, but there is little agreement about the facts and implications of this connection.

The shortage of direct, empirical evidence has caused many scholars to regard the entire topic as unsuitable for serious study; in 1866, the Linguistic Society of Paris banned any existing or future debates on the subject, a prohibition which remained influential across much of the Western world until the late twentieth century.{{cite book |last1=Żywiczyński |first1=Przemysław |last2=Wacewicz |first2=Slawomir |title=Statement of the Société de linguistique de Paris banning glottogenetic speculation. |date=August 2019 |doi=10.3726/b15805 |isbn=978-3-631-79394-7 |url=https://www.researchgate.net/figure/Statement-of-the-Societe-de-linguistique-de-Paris-banning-glottogenetic-speculation_fig1_338039384}} Various hypotheses have been developed on the emergence of language.{{Cite book |last1=Tallerman |first1=Maggie |author-link1=Maggie Tallerman |title=The Oxford handbook of language evolution |last2=Gibson |first2=Kathleen Rita |publisher=Oxford University Press |year=2012 |isbn=978-0-19-954111-9}} While Charles Darwin's theory of evolution by natural selection had provoked a surge of speculation on the origin of language over a century and a half ago, the speculations had not resulted in a scientific consensus by 1996.Müller, F. M. 1996 [1861]. The theoretical stage, and the origin of language. Lecture 9 from Lectures on the Science of Language. Reprinted in R. Harris (ed.), The Origin of Language. Bristol: Thoemmes Press, pp. 7–41. Nevertheless, academic interest in the topic returned in the early 1990s. Linguists, archaeologists, psychologists, and anthropologists have renewed the investigation into the origin of language with modern methods.{{Cite book |last1=Christiansen |first1=Morten H |author-link1=Morten H. Christiansen |title=Language evolution |last2=Kirby |first2=Simon |publisher=Oxford University Press |year=2003 |isbn=978-0-19-924484-3 |editor-last=Christiansen |editor-first=Morten H. 
|pages=77–93 |chapter=Language evolution: the hardest problem in science? |editor-last2=Kirby |editor-first2=Simon}}

Approaches

Attempts to explain the origin of language take a variety of forms:{{Cite book |last=Ulbæk |first=Ib |title=Approaches to the evolution of language: social and cognitive base |publisher=Cambridge University Press |year=1998 |isbn=978-0-521-63964-4 |editor-last=Hurford |editor-first=James R. |pages=30–43 |chapter=The origin of language and cognition |editor-last2=Studdert-Kennedy |editor-first2=Michael |editor-last3=Knight |editor-first3=Chris}}

  • "Continuity theories" build on the idea that language exhibits so much complexity that one cannot imagine it simply appearing from nothing in its final form; therefore it must have evolved from earlier pre-linguistic systems among humans' primate ancestors.
  • "Discontinuity theories" take the opposite approach, stating that language, as a unique trait that cannot be compared to anything found among non-humans, must have appeared fairly suddenly during the course of human evolution.
  • Some theories consider language mostly as an innate faculty—largely genetically encoded.
  • Other theories regard language as a mainly cultural system that is learned through social interaction.

Most linguistic scholars {{as of | 2024 | lc=on}} favor continuity-based theories, but they vary in how they hypothesize language development.{{cn|date=November 2024}} Some among those who consider language as mostly innate avoid speculating about specific precursors in nonhuman primates, stressing simply that the language faculty must have evolved gradually.{{Cite book |last=Pinker |first=Steven |title=The Language Instinct |publisher=W. Morrow & Co. |year=1994 |isbn=978-0-688-12141-9 |location=New York}}

Those who consider language as learned socially, such as Michael Tomasello, see it as developing from the cognitively controlled aspects of primate communication, which are mostly gestural rather than vocal.{{Cite book |last=Tomasello |first=Michael |author-link=Michael Tomasello |title=Communicating meaning: the evolution and development of language |publisher=L. Erlbaum |year=1996 |isbn=978-0-8058-2118-5 |editor-last=Velichkovskiĭ |editor-first=B. M. |location=Mahwah, NJ |chapter=The cultural roots of language |editor-last2=Rumbaugh |editor-first2=Duane M.}}{{Cite journal |last1=Pika |first1=Simone |last2=Mitani |first2=John |year=2006 |title=Referential gestural communication in wild chimpanzees (Pan troglodytes) |journal=Current Biology |volume=16 |issue=6 |pages=R191–R192 |bibcode=2006CBio...16.R191P |doi=10.1016/j.cub.2006.02.037 |issn=0960-9822 |pmid=16546066 |s2cid=2273018 |doi-access=free}} Where vocal precursors are concerned, many continuity theorists envisage language as evolving from early human capacities for song.{{Cite journal |last1=Dunn |first1=M. |last2=Greenhill |first2=S. J. |last3=Levinson |first3=S. C. |last4=Gray |first4=R. D. |date=May 2011 |title=Evolved structure of language shows lineage-specific trends in word-order universals |journal=Nature |volume=473 |issue=7345 |pages=79–82 |bibcode=2011Natur.473...79D |doi=10.1038/nature09923 |pmid=21490599 |s2cid=1588797 |hdl-access=free |hdl=11858/00-001M-0000-0013-3B19-B}}The Economist, "[http://www.economist.com/node/18557572?story_id=18557572 The evolution of language: Babel or babble?]", 16 April 2011, pp. 85–86.{{Cite book |last1=Cross |first1=Ian |url=http://www.music.org/pdf/summit/2014medium.pdf |title=The Prehistory of Language |last2=Woodruff |first2=Ghofur Eliot |date=23 April 2009 |publisher=Oxford University Press |isbn=978-0-19-156287-7 |editor-last=Botha |editor-first=Rudolf P. 
|pages=77–98 |chapter=Music as a Communicative medium |doi=10.1093/acprof:oso/9780199545872.003.0005 |editor-last2=Knight |editor-first2=Chris |chapter-url=https://books.google.com/books?id=36tLTfV_hLcC&pg=PA77}}{{Cite journal |last=Vaneechoutte |first=Mario |year=2014 |title=The Origin of Articulate Language Revisited: The Potential of a Semi-Aquatic Past of Human Ancestors to Explain the Origin of Human Musicality and Articulate Language |url=http://users.ugent.be/~mvaneech/Vaneechoutte.%202014.%20The%20origin%20of%20articulate%20language.pdf |journal=Human Evolution |volume=29 |pages=1–33}}

Noam Chomsky, a proponent of discontinuity theory, argues that a single change occurred in humans before they left Africa, coincident with the Great Leap approximately 100,000 years ago, in which a common language faculty developed in a group of humans and their descendants. Chomsky bases his argument on the observation that any human baby of any culture can be raised in a different culture and will completely assimilate the language and behavior of the culture in which they were raised. This implies that no major change to the human language faculty has occurred since humans left Africa.How Could Language Have Evolved, https://doi.org/10.1371/journal.pbio.1001934

Transcending the continuity-versus-discontinuity divide, some scholars view the emergence of language as the consequence of some kind of social transformation{{Cite book |last1=Knight |first1=Chris |url=http://www.chrisknight.co.uk/wp-content/uploads/2007/09/Knight-Power-Social-Conditions1.pdf |title=The Oxford handbook of language evolution |last2=Power |first2=Camilla |publisher=Oxford University Press |year=2012 |isbn=978-0-19-954111-9 |editor-last=Tallerman |editor-first=Maggie |pages=346–349 |chapter=Social conditions for the evolutionary emergence of language |editor-last2=Gibson |editor-first2=Kathleen R.}} that, by generating unprecedented levels of public trust, liberated a genetic potential for linguistic creativity that had previously lain dormant.{{Cite book |last=Rappaport |first=Roy |title=Ritual and religion in the making of humanity |publisher=Cambridge University Press |year=1999 |isbn=978-0-521-29690-8}}{{Cite journal |last=Knight |first=C. |year=2008 |title='Honest fakes' and language origins |url=http://www.chrisknight.co.uk/wp-content/uploads/2007/09/JCS_Knight_CRC.pdf |journal=Journal of Consciousness Studies |volume=15 |issue=10–11 |pages=236–248}}{{Cite book |last=Knight |first=Chris |url=http://www.chrisknight.co.uk/wp-content/uploads/2007/09/The-Origins-of-Symbolic-Culture.pdf |title=Homo Novus: a human without illusion |publisher=Springer |year=2010 |isbn=978-3-642-12141-8 |editor-last=Frey |editor-first=Ulrich J. 
|location=Berlin |pages=193–211 |chapter=The origins of symbolic culture |editor-last2=Störmer |editor-first2=Charlotte |editor-last3=Willführ |editor-first3=Kai P.}} "Ritual/speech coevolution theory" exemplifies this approach.{{Cite book |last=Knight |first=Chris |url=http://www.chrisknight.co.uk/wp-content/uploads/2007/09/knight_ritual_speech_coevolution.pdf |title=Approaches to the evolution of language: social and cognitive base |publisher=Cambridge University Press |year=1998 |isbn=978-0-521-63964-4 |editor-last=Hurford |editor-first=James R. |pages=68–91 |chapter=Ritual/speech coevolution: a solution to the problem of deception |editor-last2=Studdert-Kennedy |editor-first2=Michael |editor-last3=Knight |editor-first3=Chris}}{{Cite book |last=Knight |first=Chris |url=http://www.chrisknight.co.uk/wp-content/uploads/2008/01/knight-springer-online-fulltext.pdf |title=The evolution of language: proceedings of the 6th international conference (EVOLANG6), Rome, Italy, 12–15 April 200 |publisher=World Scientific |year=2006 |isbn=978-981-256-656-0 |editor-last=Cangelosi |editor-first=Angelo |pages=168–175 |chapter=Language co-evolved with the rule of law |editor-last2=Smith |editor-first2=Andrew D. M. |editor-last3=Kenny Smith}} Scholars in this intellectual camp point to the fact that even chimpanzees and bonobos have latent symbolic capacities that they rarely—if ever—use in the wild.{{Cite book |last1=Savage-Rumbaugh |first1=Sue |title=Machiavellian intelligence: social expertise and the evolution of intellect in monkeys, apes, and human |last2=McDonald |first2=Kelly |publisher=Clarendon |year=1988 |isbn=978-0-19-852175-4 |editor-last=Byrne |editor-first=Richard W. 
|location=Oxford |pages=224–237 |chapter=Deception and social manipulation in symbol-using apes |editor-last2=Whiten |editor-first2=Andrew}} Objecting to the sudden mutation idea, these authors argue that even if a chance mutation were to install a language organ in an evolving bipedal primate, it would be adaptively useless under all known primate social conditions. A very specific social structure – one capable of upholding unusually high levels of public accountability and trust – must have evolved before or concurrently with language to make reliance on "cheap signals" (e.g. words) an evolutionarily stable strategy.

Since the emergence of language lies so far back in human prehistory, the relevant developments have left no direct historical traces, and comparable processes cannot be observed today. Despite this, the emergence of new sign languages in modern times—Nicaraguan Sign Language, for example—may offer insights into the developmental stages and creative processes necessarily involved.Kegl, J., A. Senghas and M. Coppola (1998). Creation through Contact: Sign language emergence and sign language change in Nicaragua. In M. DeGraff (ed.), Language Creation and Change: Creolization, Diachrony and Development. Cambridge, Massachusetts: MIT Press. Another approach inspects early human fossils, looking for traces of physical adaptation to language use.{{Cite journal |last1=Lieberman |first1=P. |last2=Crelin |first2=E. S. |year=1971 |title=On the speech of Neandertal Man |journal=Linguistic Inquiry |volume=2 |pages=203–222}}{{Cite journal |last1=Arensburg |first1=B. |last2=Tillier |first2=A. M. |last3=Vandermeersch |first3=B. |last4=Duday |first4=H. |last5=Schepartz |first5=L. A. |last6=Rak |first6=Y. |year=1989 |title=A Middle Palaeolithic human hyoid bone |journal=Nature |volume=338 |issue=6218 |pages=758–760 |bibcode=1989Natur.338..758A |doi=10.1038/338758a0 |pmid=2716823 |s2cid=4309147}} In some cases, when the DNA of extinct humans can be recovered, the presence or absence of genes considered to be language-relevant—FOXP2, for example—may prove informative.{{Cite book |last1=Diller |first1=Karl C. |title=The cradle of language |last2=Cann |first2=Rebecca L. |publisher=Oxford University Press |year=2009 |isbn=978-0-19-954586-5 |editor-last=Botha |editor-first=Rudolf P. 
|pages=135–149 |chapter=Evidence Against a Genetic-Based Revolution in Language 50,000 Years Ago |editor-last2=Knight |editor-first2=Chris}} Another approach, this time archaeological, involves invoking symbolic behavior (such as repeated ritual activity) that may leave an archaeological trace—such as mining and modifying ochre pigments for body-painting—while developing theoretical arguments to justify inferences from symbolism in general to language in particular.{{Cite book |last1=Henshilwood |first1=Christopher Stuart |title=The cradle of language |last2=Dubreuil |first2=Benoît |publisher=Oxford University Press |year=2009 |isbn=978-0-19-954586-5 |editor-last=Botha |editor-first=Rudolf P. |pages=41–61 |chapter=Reading the Artefacts: Gleaning Language Skills From the Middle Stone Age in Southern Africa |editor-last2=Knight |editor-first2=Chris}}{{Cite book |last=Knight |first=Chris |title=The cradle of language |publisher=Oxford University Press |year=2009 |isbn=978-0-19-954586-5 |editor-last=Rudolf P Botha |pages=281–303 |chapter=Language, Ochre, and the Rule of Law |editor-last2=Chris Knight}}{{Cite book |last=Watts |first=Ian |title=The cradle of language |publisher=Oxford University Press |year=2009 |isbn=978-0-19-954586-5 |editor-last=Botha |editor-first=Rudolf P. |pages=62–92 |chapter=Red Ochre, Body Painting, and Language: Interpreting the Blombos Ochre |editor-last2=Knight |editor-first2=Chris}}

The time range for the evolution of language or its anatomical prerequisites extends, at least in principle, from the phylogenetic divergence of the human lineage from Pan (5 to 6 million years ago), through the appearance of Homo (2.3 to 2.4 million years ago), to the emergence of full behavioral modernity some 50,000–150,000 years ago. Few dispute that Australopithecus probably lacked vocal communication significantly more sophisticated than that of great apes in general,{{Cite journal |last=Arcadi |first=A. C. |date=August 2000 |title=Vocal responsiveness in male wild chimpanzees: implications for the evolution of language |journal=Journal of Human Evolution |volume=39 |issue=2 |pages=205–223 |bibcode=2000JHumE..39..205A |doi=10.1006/jhev.2000.0415 |pmid=10968929 |s2cid=7403772 |doi-access=free}} but scholarly opinions vary as to the developments since the appearance of Homo some 2.5 million years ago. Some scholars assume the development of primitive language-like systems (proto-language) as early as Homo habilis, while others place the development of symbolic communication only with Homo erectus (1.8 million years ago) or with Homo heidelbergensis (0.6 million years ago) and the development of language proper with Homo sapiens, currently estimated at less than 200,000 years ago.

Using statistical methods to estimate the time required to achieve the current spread and diversity in modern languages, Johanna Nichols—a linguist at the University of California, Berkeley—argued in 1998 that vocal languages must have begun diversifying in the human species at least 100,000 years ago.Johanna Nichols, 1998. The origin and dispersal of languages: Linguistic evidence. In Nina Jablonski and Leslie C. Aiello, eds., The Origin and Diversification of Language, pp. 127–70. (Memoirs of the California Academy of Sciences, 24.) San Francisco: California Academy of Sciences. Estimates of this kind are not universally accepted, but jointly considering genetic, archaeological, palaeontological, and much other evidence indicates that language likely emerged somewhere in sub-Saharan Africa during the Middle Stone Age, roughly contemporaneous with the speciation of Homo sapiens.{{Cite book |last1=Botha |first1=Rudolf P. |title=The cradle of language |last2=Knight |first2=Chris |publisher=Oxford University Press |year=2009 |isbn=978-0-19-954586-5}}

Language origin hypotheses

= Early speculations =

{{quotation|I cannot doubt that language owes its origin to the imitation and modification, aided by signs and gestures, of various natural sounds, the voices of other animals, and man's own instinctive cries.|Charles Darwin, 1871. The Descent of Man, and Selection in Relation to SexDarwin, C. (1871). The Descent of Man, and Selection in Relation to Sex, 2 vols. London: Murray, p. 56.}}

In 1861, historical linguist Max Müller published a list of speculative theories concerning the origins of spoken language:Müller, F. M. 1996 [1861]. The theoretical stage, and the origin of language. Lecture 9 from Lectures on the Science of Language. Reprinted in R. Harris (ed.), The Origin of Language. Bristol: Thoemmes Press, pp. 7–41.

  • Bow-wow. The bow-wow, or cuckoo, theory, which Müller attributed to the German philosopher Johann Gottfried Herder, saw early words as imitations of the cries of beasts and birds.
  • Pooh-pooh. The pooh-pooh theory saw the first words as emotional interjections and exclamations triggered by pain, pleasure, surprise, etc.
  • Ding-dong. Müller suggested what he called the ding-dong theory, which states that all things have a vibrating natural resonance, echoed somehow by humans in their earliest words.
  • Yo-he-ho. The yo-he-ho theory claims that language emerged from collective rhythmic labor; that is, the attempt to synchronize muscular efforts resulting in sounds such as heave alternating with sounds such as ho.
  • Ta-ta. The ta-ta theory did not feature in Max Müller's list, having been proposed in 1930 by Sir Richard Paget.Paget, R. 1930. Human speech: some observations, experiments, and conclusions as to the nature, origin, purpose and possible improvement of human speech. London: Routledge & Kegan Paul. According to the ta-ta theory, humans made the earliest words by tongue movements that mimicked manual gestures, rendering them audible.

Most scholars today consider all such theories not so much wrong—they occasionally offer peripheral insights—as naïve and irrelevant.Firth, J. R. 1964. The Tongues of Men and Speech. London: Oxford University Press, pp. 25–26.Stam, J. H. 1976. Inquiries into the origins of language. New York: Harper and Row, pp. 243–244. The problem with these theories is that they rest on the assumption that once early humans had discovered a workable mechanism for linking sounds with meanings, language would automatically have evolved.{{cn|date=December 2023}}

Much earlier, medieval Muslim scholars developed theories on the origin of language.{{Cite journal |last=Shah |first=Mustafa |date=January 2011 |title=Classical Islamic Discourse on the Origins of Language: Cultural Memory and the Defense of Orthodoxy |url=https://core.ac.uk/download/2793514.pdf |journal=Numen |volume=58 |issue=2–3 |pages=314–343 |doi=10.1163/156852711X562335 |s2cid=55165312 |via=CORE}}{{Cite journal |last=Weiss |first=B. |author-link=Bernard G. Weiss |year=1987 |title='Ilm al-wad': An Introductory Account of a Later Muslim Philological Science |journal=Arabica |volume=34 |issue=1 |pages=339–356 |doi=10.1163/157005887X00054 |s2cid=161187751}} Their theories were of five general types:{{Cite journal |last=Weiss |first=B. |author-link=Bernard G. Weiss |year=1974 |title=Medieval Muslim discussions of the origin of language |url=https://www.jstor.org/stable/pdf/43370636.pdf |journal=Zeitschrift der Deutschen Morgenländischen Gesellschaft |volume=124 |issue=1 |pages=33–41 |doi=10.1163/156852711X562335 |jstor=43370636 |s2cid=55165312 }}

  1. Naturalist: There is a natural relationship between expressions and the things they signify. Language thus emerged from a natural human inclination to imitate the sounds of nature.
  2. Conventionalist: Language is a social convention. The names of things are arbitrary inventions of humans.
  3. Revelationist: Language was gifted to humans by God, and it was thus God—and not humans—who named everything.
  4. Revelationist-Conventionalist: God revealed to humans a core base of language—enabling humans to communicate with each other—and then humans invented the rest of language.
  5. Non-Committal: The view that conventionalist and revelationist theories are equally plausible.

= Problems of reliability and deception =

{{further|Signalling theory}}

From the perspective of signalling theory, the main obstacle to the evolution of language-like communication in nature is not a mechanistic one. Rather, it is the fact that symbols (arbitrary associations of sounds or other perceptible forms with corresponding meanings) are unreliable and may well be false.{{Cite journal |last=Zahavi |first=A. |date=May 1993 |title=The fallacy of conventional signalling |journal=Philosophical Transactions of the Royal Society B: Biological Sciences |volume=340 |issue=1292 |pages=227–230 |bibcode=1993RSPTB.340..227Z |doi=10.1098/rstb.1993.0061 |pmid=8101657}}Zahavi, A. and A. Zahavi 1997. The Handicap Principle: A Missing Piece in Darwin's Puzzle. New York and Oxford: Oxford University Press. {{ISBN|9780190284589}}{{Cite journal |last=Smith |first=J. Maynard |year=1994 |title=Must reliable signals always be costly? |journal=Animal Behaviour |volume=47 |issue=5 |pages=1115–1120 |doi=10.1006/anbe.1994.1149 |issn=0003-3472 |s2cid=54274718}} The problem of reliability was not recognized at all by Darwin, Müller or the other early evolutionary theorists.

Animal vocal signals are, for the most part, intrinsically reliable. When a cat purrs, the signal constitutes direct evidence of the animal's contented state. The signal is trusted, not because the cat is inclined to be honest, but because it just cannot fake that sound. Primate vocal calls may be slightly more manipulable, but they remain reliable for the same reason—because they are hard to fake.{{Cite book |last=Goodall |first=Jane |url=https://archive.org/details/chimpanzeesofgom00good |title=The chimpanzees of Gombe: patterns of behavior |publisher=Belknap |year=1986 |isbn=978-0-674-11649-8 |location=Cambridge, MA}} Primate social intelligence is "Machiavellian"; that is, self-serving and unconstrained by moral scruples. Monkeys, apes and particularly humans often attempt to deceive each other, while at the same time remaining constantly on guard against falling victim to deception themselves.{{Cite book |last1=Byrne |first1=Richard W. |title=Machiavellian intelligence : social expertise and the evolution of intellect in monkeys, apes, and humans |last2=Whiten |first2=Andrew. |publisher=Clarendon |year=1988 |isbn=978-0-19-852175-4 |location=Oxford}}{{Cite journal |last=de Waal |first=Frans B. M. |year=2005 |title=Intentional Deception in Primates |journal=Evolutionary Anthropology |volume=1 |issue=3 |pages=86–92 |doi=10.1002/evan.1360010306 |s2cid=221736130}} Paradoxically, it is theorized that primates' resistance to deception is what blocks the evolution of their signalling systems along language-like lines. Language is ruled out because the best way to guard against being deceived is to ignore all signals except those that are instantly verifiable. Words automatically fail this test.

Words are easy to fake. Should they turn out to be lies, listeners will adapt by ignoring them in favor of hard-to-fake indices or cues. For language to work, listeners must be confident that those with whom they are on speaking terms are generally likely to be honest.{{Cite book |last=Power |first=Camilla |title=Approaches to the evolution of language: social and cognitive base |publisher=Cambridge University Press |year=1998 |isbn=978-0-521-63964-4 |editor-last=Hurford |editor-first=James R. |pages=111–129 |chapter=Old wives' tales: the gossip hypothesis and the reliability of cheap signals |editor-last2=Studdert-Kennedy |editor-first2=Michael |editor-last3=Chris Knight}} A peculiar feature of language is displaced reference, which means reference to topics outside the currently perceptible situation. This property prevents utterances from being corroborated in the immediate "here" and "now". For this reason, language presupposes relatively high levels of mutual trust in order to become established over time as an evolutionarily stable strategy. This stability is born of a longstanding mutual trust and is what grants language its authority. A theory of the origins of language must therefore explain why humans could begin trusting cheap signals in ways that other animals apparently cannot.

== The "mother tongues" hypothesis ==

The "mother tongues" hypothesis was proposed in 2004 as a possible solution to this problem.{{Cite book |last=Fitch |first=W. T. |title=Evolution of communication systems: a comparative approach |publisher=MIT Press |year=2004 |isbn=978-0-262-15111-5 |editor-last=Griebel |editor-first=Ulrike |location=Cambridge, MA |pages=275–296 |chapter=Kin selection and 'mother tongues': a neglected component in language evolution |editor-last2=Oller |editor-first2=D. Kimbrough |chapter-url=https://homepage.univie.ac.at/tecumseh.fitch/media/files/FitchKin2004_large.pdf}} W. Tecumseh Fitch suggested that the Darwinian principle of "kin selection"{{Cite journal |last=Hamilton |first=W. D. |year=1964 |title=The genetical evolution of social behaviour. I, II |journal=Journal of Theoretical Biology |volume=7 |issue=1 |pages=1–52 |bibcode=1964JThBi...7....1H |doi=10.1016/0022-5193(64)90038-4 |pmid=5875341 |s2cid=5310280}}—the convergence of genetic interests between relatives—might be part of the answer. Fitch suggests that languages were originally "mother tongues". If language evolved initially for communication between mothers and their own biological offspring, extending later to include adult relatives as well, the interests of speakers and listeners would have tended to coincide. Fitch argues that shared genetic interests would have led to sufficient trust and cooperation for intrinsically unreliable signals—words—to become accepted as trustworthy and so begin evolving for the first time.{{Cite book |last=Knight |first=Chris |title=The Evolutionary Emergence of Language |publisher=Cambridge University Press |year=2000 |isbn=978-0-521-78157-2 |pages=99–120 |chapter=Play as Precursor of Phonology and Syntax |doi=10.1017/cbo9780511606441.007 |s2cid=56418139}}

Critics of this theory point out that kin selection is not unique to humans.{{Cite book |last=Tallerman |first=Maggie |title=The evolutionary emergence of language: evidence and inference |publisher=Oxford University Press |year=2013 |isbn=978-0-19-965485-7 |editor-last=Botha |editor-first=Rudolf P. |pages=77–96 |chapter=Kin selection, pedagogy and linguistic complexity: whence protolanguage? |editor-last2=Everaert |editor-first2=Martin}} So even if one accepts Fitch's initial premises, the extension of the posited "mother tongue" networks from close relatives to more distant relatives remains unexplained. Fitch argues, however, that the extended period of physical immaturity of human infants and the postnatal growth of the human brain give the human-infant relationship a different and more extended period of intergenerational dependency than that found in any other species.

== The "obligatory reciprocal altruism" hypothesis ==

Ib Ulbæk invokes another standard Darwinian principle—"reciprocal altruism"{{Cite journal |last=Trivers |first=R. L. |year=1971 |title=The evolution of reciprocal altruism |journal=Quarterly Review of Biology |volume=46 |pages=35–57 |doi=10.1086/406755 |s2cid=19027999}}—to explain the unusually high levels of intentional honesty necessary for language to evolve. "Reciprocal altruism" can be expressed as the principle that if you scratch my back, I'll scratch yours. In linguistic terms, it would mean that if you speak truthfully to me, I'll speak truthfully to you. Ordinary Darwinian reciprocal altruism, Ulbæk points out, is a relationship established between frequently interacting individuals. For language to prevail across an entire community, however, the necessary reciprocity would have needed to be enforced universally instead of being left to individual choice. Ulbæk concludes that for language to evolve, society as a whole must have been subject to moral regulation.

Critics point out that this theory fails to explain when, how, why or by whom "obligatory reciprocal altruism" could possibly have been enforced. Various proposals have been offered to remedy this defect. A further criticism is that language does not work on the basis of reciprocal altruism anyway. Humans in conversational groups do not withhold information from all except listeners likely to offer valuable information in return. On the contrary, they seem to want to advertise to the world their access to socially relevant information, broadcasting that information without expectation of reciprocity to anyone who will listen.{{Cite book |last=Dessalles |first=Jean L. |title=Approaches to the evolution of language: social and cognitive base |publisher=Cambridge University Press |year=1998 |isbn=978-0-521-63964-4 |editor-last=James R. Hurford |pages=130–147 |chapter=Altruism, status and the origin of relevance |editor-last2=Michael Studdert-Kennedy |editor-last3=Chris Knight}}

== The gossip and grooming hypothesis ==

According to Robin Dunbar in his book Grooming, Gossip and the Evolution of Language, gossip does for group-living humans what manual grooming does for other primates: it allows individuals to service their relationships and so maintain their alliances on the basis of the principle: if you scratch my back, I'll scratch yours. Dunbar argues that as humans began living in increasingly large social groups, the task of manually grooming all one's friends and acquaintances became so time-consuming as to be unaffordable.{{Cite book |last=Dunbar |first=R. I. M. |title=Grooming, gossip and the evolution of language |publisher=Faber & Faber |year=1996 |isbn=978-0-571-17396-9 |location=London}} In response to this problem, humans developed "a cheap and ultra-efficient form of grooming", namely vocal grooming. To keep allies happy, one now needs only to "groom" them with low-cost vocal sounds, servicing multiple allies simultaneously while keeping both hands free for other tasks. Vocal grooming then evolved gradually into vocal language, initially in the form of "gossip". Dunbar's hypothesis seems to be supported by adaptations, in the structure of language, to the function of narration in general.{{Cite book |last=von Heiseler |first=Till Nikolaus |url=https://www.academia.edu/9648129 |title=Evolution of Language |publisher=World Scientific |year=2014 |editor-last=Cartmill |editor-first=R. L. C. |location=London |pages=114–121 |chapter=Language evolved for storytelling in a super-fast evolution}}

Critics of this theory point out that the efficiency of "vocal grooming"—the fact that words are so cheap—would have undermined its capacity to signal commitment of the kind conveyed by time-consuming and costly manual grooming.{{Cite book |last=Power |first=C. |title=Approaches to the Evolution of Language: Social and Cognitive Bases |publisher=Cambridge University Press |year=1998 |editor-last=Hurford |editor-first=J. R. |pages=111–129 |chapter=Old wives' tales: the gossip hypothesis and the reliability of cheap signals |editor-last2=Studdert-Kennedy |editor-first2=M. |editor-last3=Knight |editor-first3=C.}} A further criticism is that the theory does nothing to explain the crucial transition from vocal grooming—the production of pleasing but meaningless sounds—to the cognitive complexities of syntactical speech.

== Ritual/speech coevolution ==

The ritual/speech coevolution theory was originally proposed by social anthropologist Roy RappaportRappaport, R. A. 1999. "Ritual and Religion in the Making of Humanity." Cambridge University Press. before being elaborated by anthropologists such as Chris Knight,Knight, C. 1998. Ritual/speech coevolution: a solution to the problem of deception. In J. R. Hurford, M. Studdert-Kennedy and C. Knight (eds), Approaches to the Evolution of Language: Social and cognitive bases. Cambridge University Press, pp. 68–91. Jerome Lewis,Lewis, J. 2009. "As well as words: Congo Pygmy hunting, mimicry, and play." In R. Botha and C. Knight (eds), The Cradle of Language. Oxford: Oxford University Press, pp. 236–256. Nick Enfield,{{Cite journal |last=Enfield |first=N. J. |year=2010 |title=Without social context? |url=http://pubman.mpdl.mpg.de/pubman/item/escidoc:527132:11/component/escidoc:527220/Enfield_Science_Language%20Evolution_2010.pdf |journal=Science |volume=329 |issue=5999 |pages=1600–1601 |bibcode=2010Sci...329.1600E |doi=10.1126/science.1194229 |s2cid=143530707 |hdl-access=free |hdl=11858/00-001M-0000-0012-C777-5}} Camilla PowerPower, C. 1998. "Old wives' tales: the gossip hypothesis and the reliability of cheap signals." In J. R. Hurford, M. Studdert-Kennedy and C. Knight (eds), Approaches to the Evolution of Language: Social and Cognitive Bases. Cambridge University Press, pp. 111–129. and Ian Watts.Watts, I. 2009. Red ochre, body painting, and language: interpreting the Blombos ochre. In R. Botha and C. Knight (eds), The Cradle of Language. Oxford: Oxford University Press, pp. 62–92. Cognitive scientist and robotics engineer Luc SteelsSteels, Luc. 2009. "Is sociality a crucial prerequisite for the emergence of language?" In Rudolf P. Botha and Chris Knight (eds), The prehistory of language. Oxford: Oxford University Press. {{ISBN|978-0-19-954587-2}} is another prominent supporter of this general approach, as is biological anthropologist and neuroscientist Terrence Deacon.{{Cite book |last=Deacon |first=Terrence William |url=https://archive.org/details/symbolicspeciesc00deac |title=The symbolic species: the co-evolution of language and the brain |publisher=W. W. Norton |year=1997 |isbn=978-0-393-03838-5 |location=New York}} A more recent champion of the approach is the Chomskyan specialist in linguistic syntax, Cedric Boeckx.Boeckx, C. (2023) What made us "hunter-gatherers of words". Front. Neurosci. 17:1080861. {{doi|10.3389/fnins.2023.1080861}}.

These scholars argue that there can be no such thing as a "theory of the origins of language". This is because language is not a separate adaptation, but an internal aspect of something much wider—namely, the entire domain known to anthropologists as human symbolic culture.Knight, C. 2010. The origins of symbolic culture. In Ulrich J. Frey, Charlotte Störmer and Kai P. Willfuhr (eds) 2010. Homo Novus – A Human Without Illusions. Berlin, Heidelberg: Springer-Verlag, pp. 193–211. Attempts to explain language independently of this wider context have failed, say these scientists, because they are addressing a problem with no solution. Language would not work outside its necessary environment of confidence-building social mechanisms and institutions. For example, it would not work for a nonhuman ape communicating with others of its kind in the wild. Not even the cleverest nonhuman ape could make language work under such conditions.

{{quotation|Lie and alternative, inherent in language ... pose problems to any society whose structure is founded on language, which is to say all human societies. I have therefore argued that if there are to be words at all it is necessary to establish The Word, and that The Word is established by the invariance of liturgy.|Roy Rappaport{{Cite book |last=Rappaport |first=Roy A. |title=Ecology, Meaning, and Religion |publisher=North Atlantic |year=1979 |isbn=978-0-913028-54-4 |location=Richmond, CA |pages=201–211}}}}

Advocates of this school of thought point out that words are cheap. Should an especially clever nonhuman ape, or even a group of articulate nonhuman apes, try to use words in the wild, they would carry no conviction. The primate vocalizations that do carry conviction—those they actually use—are unlike words, in that they are emotionally expressive, intrinsically meaningful, and reliable because they are relatively costly and hard to fake.

Oral and gestural languages consist of pattern-making whose cost is essentially zero. As pure social conventions, signals of this kind cannot evolve in a Darwinian social world—they are a theoretical impossibility.Zahavi, A. 1993. "The fallacy of conventional signalling." Philosophical Transactions: Biological Sciences 340: 227–230, published by Royal Society. Being intrinsically unreliable, language works only if one can build up a reputation for trustworthiness within a certain kind of society—namely, one where symbolic cultural facts (sometimes called "institutional facts") can be established and maintained through collective social endorsement.Searle, J. R. 1996. The Construction of Social Reality. London: Penguin. In any hunter-gatherer society, the basic mechanism for establishing trust in symbolic cultural facts is collective ritual.Durkheim, E. 1947 [1915]. "Origins of these beliefs". Chapter VII. In É. Durkheim, The Elementary Forms of the Religious Life: A study in religious sociology. Trans. J. W. Swain. Glencoe, Illinois: The Free Press, pp. 205–239. Therefore, the task facing researchers into the origins of language is more multidisciplinary than is usually supposed. It involves addressing the evolutionary emergence of human ritual, kinship, religion and symbolic culture taken as a whole, with language an important but subsidiary component.

In a 2023 article, Cedric Boeckx endorses the Rappaport/Searle/Knight way of capturing the "special" nature of human words. Words are symbols. This means that, from a standpoint in Darwinian signal evolution theory, they are "patently false signals." Words are facts, but "facts whose existence depends entirely on subjective belief".Knight, C. 2010. The origins of symbolic culture. In Ulrich J. Frey, Charlotte Störmer and Kai P. Willfuhr (eds) 2010. Homo Novus – A Human Without Illusions. Berlin, Heidelberg: Springer-Verlag, pp. 193–211. In philosophical terms, they are "institutional facts": fictions that are granted factual status within human social institutions.Searle, J. R. 1996. "The Construction of Social Reality." London: Penguin. From this standpoint, according to Boeckx, linguistic utterances are symbolic to the extent that they are patent falsehoods serving as guides to communicative intentions. "They are communicatively useful untruths, as it were." The reason why words can survive among humans despite being false is largely a matter of trust. The corresponding origins theory is that language can only have begun to evolve from the moment humans started reciprocally faking in communicatively helpful ways, i.e., when they became capable of upholding the levels of trust necessary for linguistic communication to work.

The point here is that an ape or other nonhuman must always carry at least some of the burden of generating the trust necessary for communication to work. That is, in order to be taken seriously, each signal it emits must be a patently reliable one, trusted because it is rooted in some way in the real world. But now imagine what might happen under social conditions where trust could be taken for granted. The signaller could stop worrying about reliability and concentrate instead on perceptual discriminability. Carried to its conclusion, this should permit digital signaling—the cheapest and most efficient kind of communication.

From this philosophical standpoint, animal communication cannot be digital because it does not have the luxury of being patently false. Costly signals of any kind can only be evaluated on an analog scale. Put differently, truly symbolic, digital signals become socially acceptable only under highly unusual conditions—such as those internal to a ritually bonded community whose members are not tempted to lie.{{citation needed|date=November 2024}}

Critics of the ritual/speech coevolution theory include Noam Chomsky, who terms it the "non-existence" hypothesis: a denial of the very existence of language as an object of study for natural science.{{Cite journal |last=Chomsky |first=Noam |year=2011 |title=Language and Other Cognitive Systems. What is Special About Language? |journal=Language Learning and Development |volume=7 |issue=4 |pages=263–278 |doi=10.1080/15475441.2011.584041 |s2cid=122866773}} Chomsky's own theory is that language emerged in an instant and in perfect form,Chomsky, N. 2005. 'Three factors in language design.' Linguistic Inquiry 36(1): 1–22. prompting his critics, in turn, to retort that only something that does not exist (a theoretical construct or convenient scientific fiction) could possibly emerge in such a miraculous way. The controversy remains unresolved.

= Tool resiliency, grammar and language production =

Acheulean tool use began during the Lower Paleolithic approximately 1.75 million years ago. Studies focusing on the lateralization of Acheulean tool production and language production have noted similar areas of blood flow in the brain when participants engage in these activities separately; this suggests that the brain functions needed for the production of tools across generations are consistent with the brain systems required for producing language. Researchers used functional transcranial Doppler ultrasonography (fTCD) and had participants perform tool-making activities using the same methods as were used during the Lower Paleolithic, as well as a task designed specifically for word generation.{{Cite journal |last1=Uomini |first1=Natalie Thaïs |last2=Meyer |first2=Georg Friedrich |date=30 August 2013 |editor-last=Petraglia |editor-first=Michael D. |title=Shared Brain Lateralization Patterns in Language and Acheulean Stone Tool Production: A Functional Transcranial Doppler Ultrasound Study |journal=PLOS ONE |volume=8 |issue=8 |pages=e72693 |bibcode=2013PLoSO...872693U |doi=10.1371/journal.pone.0072693 |issn=1932-6203 |pmc=3758346 |pmid=24023634 |doi-access=free}} The purpose of this test was to focus on the planning aspect of Acheulean tool making and cued word generation in language (an example of cued word generation would be trying to list all words beginning with a given letter). Several researchers have proposed that language developed alongside tool use;{{Cite journal |last1=Stout |first1=Dietrich |last2=Chaminade |first2=Thierry |date=12 January 2012 |title=Stone tools, language and the brain in human evolution |journal=Philosophical Transactions of the Royal Society B: Biological Sciences |volume=367 |issue=1585 |pages=75–87 |doi=10.1098/rstb.2011.0099 |pmc=3223784 |pmid=22106428}}{{Cite journal |last1=Putt |first1=Shelby S. J. |last2=Anwarzai |first2=Zara |last3=Holden |first3=Chloe |last4=Ruck |first4=Lana |last5=Schoenemann |first5=P. Thomas |date=4 January 2022 |title=The evolution of combinatoriality and compositionality in hominid tool use: a comparative perspective |journal=International Journal of Primatology |volume=45 |issue=3 |pages=589–634 |doi=10.1007/s10764-021-00267-7 |issn=1573-8604 |s2cid=245654206}}{{Cite journal |last1=Barham |first1=Lawrence |last2=Everett |first2=Daniel |date=June 2021 |title=Semiotics and the Origin of Language in the Lower Palaeolithic |journal=Journal of Archaeological Method and Theory |volume=28 |issue=2 |pages=535–579 |doi=10.1007/s10816-020-09480-9 |issn=1072-5369 |s2cid=225509049 |doi-access=free}} however, until recently, there has been little empirical data to support these hypotheses. The study by Uomini and Meyer found evidence that the same brain areas are used in cued word generation and Acheulean tool production. The relationship between tool use and language production is found in working and planning memory respectively and was found to be similar across a variety of participants, adding to the evidence that these areas of the brain are shared. This evidence lends credibility to the theory that language developed alongside tool use in the Lower Paleolithic.

= Humanistic theory =

The humanistic tradition considers language as a human invention. Renaissance philosopher Antoine Arnauld gave a detailed description of his idea of the origin of language in Port-Royal Grammar. According to Arnauld, people are social and rational by nature, and this urged them to create language as a means to communicate their ideas to others. Language construction would have occurred through a slow and gradual process.{{Cite book |last1=Arnauld |first1=Antoine |author-link1=Antoine Arnauld |url=https://archive.org/details/portroyalgrammar0000lanc |title=General and Rational Grammar: The Port-Royal Grammar |last2=Lancelot |first2=Claude |publisher=Mouton |year=1975 |isbn=902793004X |location=The Hague |url-access=registration |orig-year=1660}} In later theory, especially in functional linguistics, the primacy of communication is emphasised over psychological needs.{{Cite book |last=Daneš |first=František |author-link=František Daneš |title=Functionalism in Linguistics |publisher=John Benjamins |year=1987 |isbn=9789027215246 |editor-last=Dirven |editor-first=R. |pages=3–38 |chapter=On Prague school functionalism in linguistics |editor-last2=Fried |editor-first2=V.}}

The exact way language evolved is, however, not considered vital to the study of languages. Structural linguist Ferdinand de Saussure abandoned evolutionary linguistics after coming to the firm conclusion that it would not be able to provide any further revolutionary insight once the major works in historical linguistics had been completed by the end of the 19th century. Saussure was particularly sceptical of the attempts of August Schleicher and other Darwinian linguists to access prehistorical languages through series of reconstructions of proto-languages.{{Cite book |last=Aronoff |first=Mark |url=https://langsci-press.org/catalog/book/151 |title=On Looking into Words (and Beyond): Structures, Relations, Analyses |publisher=SUNY Press |year=2017 |isbn=978-3-946234-92-0 |editor-last=Bowern |pages=443–456 |chapter=Darwinism tested by the science of language |access-date=3 March 2020 |editor-last2=Horn |editor-last3=Zanuttini}}

Saussure's solution to the problem of language evolution involves dividing theoretical linguistics in two. Evolutionary and historical linguistics are renamed as diachronic linguistics. It is the study of language change, but it has only limited explanatory power due to the inadequacy of all of the reliable research material that could ever be made available. Synchronic linguistics, in contrast, aims to widen scientists' understanding of language through a study of a given contemporary or historical language stage as a system in its own right.{{Cite book |last=de Saussure |first=Ferdinand |author-link=Ferdinand de Saussure |url=https://monoskop.org/images/0/0b/Saussure_Ferdinand_de_Course_in_General_Linguistics_1959.pdf |title=Course in general linguistics |publisher=Philosophy Library |year=1959 |isbn=978-0-231-15727-8 |location=New York |access-date=6 May 2020 |archive-url=https://web.archive.org/web/20190808231716/https://monoskop.org/images/0/0b/Saussure_Ferdinand_de_Course_in_General_Linguistics_1959.pdf |archive-date=8 August 2019 |url-status=dead |orig-year=1916}}

Although Saussure put much focus on diachronic linguistics, later structuralists who equated structuralism with synchronic analysis were sometimes criticised for ahistoricism. According to structural anthropologist Claude Lévi-Strauss, language and meaning, in opposition to "knowledge, which develops slowly and progressively", must have appeared in an instant.{{Cite book |last=Lévi-Strauss |first=Claude |title=Introduction to the work of Marcel Mauss |publisher=Routledge |year=1987 |isbn=0-7100-9066-8 |pages=59–60}}

Structuralism, as first introduced to sociology by Émile Durkheim, is nonetheless a type of humanistic evolutionary theory which explains diversification as necessitated by growing complexity.{{Cite book |last=Hejl |first=P. M. |title=Biology as Society, Society as Biology: Metaphors |publisher=Springer |year=2013 |isbn=9789401106733 |editor-last=Maasen |editor-first=Sabine |pages=155–191 |chapter=The importance of the concepts of 'organism' and 'evolution' in Emile Durkheim's division of social labor and the influence of Herbert Spencer |editor-last2=Mendelsohn |editor-first2=E. |editor-last3=Weingart |editor-first3=P.}} There was a shift of focus to functional explanation after Saussure's death. Functional structuralists including the Prague Circle linguists and André Martinet explained the growth and maintenance of structures as being necessitated by their functions. For example, novel technologies make it necessary for people to invent new words, but these may lose their function and be forgotten as the technologies are eventually replaced by more modern ones.

= Chomsky's single-step theory =

According to Chomsky's single-mutation theory, the emergence of language resembled the formation of a crystal; with digital infinity as the seed crystal in a super-saturated primate brain, on the verge of blossoming into the human mind, by physical law, once evolution added a single small but crucial keystone.Chomsky, N. (2004). Language and Mind: Current thoughts on ancient problems. Part I & Part II. In Lyle Jenkins (ed.), Variation and Universals in Biolinguistics. Amsterdam: Elsevier, pp. 379–405.{{Cite journal |last=Chomsky |first=N. |year=2005 |title=Three factors in language design |journal=Linguistic Inquiry |volume=36 |issue=1 |pages=1–22 |doi=10.1162/0024389052993655 |s2cid=14954986}} Thus, in this theory, language appeared rather suddenly within the history of human evolution. Chomsky, writing with computational linguist and computer scientist Robert C. Berwick, suggests that this scenario is completely compatible with modern biology. They note that "none of the recent accounts of human language evolution seem to have completely grasped the shift from conventional Darwinism to its fully stochastic modern version—specifically, that there are stochastic effects not only due to sampling like directionless drift, but also due to directed stochastic variation in fitness, migration, and heritability—indeed, all the "forces" that affect individual or gene frequencies{{Nbsp}}... All this can affect evolutionary outcomes—outcomes that as far as we can make out are not brought out in recent books on the evolution of language, yet would arise immediately in the case of any new genetic or individual innovation, precisely the kind of scenario likely to be in play when talking about language's emergence."

Citing evolutionary geneticist Svante Pääbo, they concur that a substantial difference must have occurred to differentiate Homo sapiens from Neanderthals to "prompt the relentless spread of our species, who had never crossed open water, up and out of Africa and then on across the entire planet in just a few tens of thousands of years.{{Nbsp}}... What we do not see is any kind of 'gradualism' in new tool technologies or innovations like fire, shelters, or figurative art." Berwick and Chomsky therefore suggest language emerged approximately between 200,000 years ago and 60,000 years ago (between the appearance of the first anatomically modern humans in southern Africa and the last exodus from Africa respectively). "That leaves us with about 130,000 years, or approximately 5,000–6,000 generations of time for evolutionary change. This is not 'overnight in one generation' as some have (incorrectly) inferred—but neither is it on the scale of geological eons. It's time enough—within the ballpark for what Nilsson and Pelger (1994) estimated as the time required for the full evolution of a vertebrate eye from a single cell, even without the invocation of any 'evo-devo' effects."{{Cite book |last1=Berwick |first1=Robert |title=Why Only Us: Language and Evolution |last2=Chomsky |first2=Noam |publisher=MIT Press |year=2016 |isbn=978-0-262-03424-1 |location=Cambridge, MA}}

The single-mutation theory of language evolution has been directly questioned on different grounds. A formal analysis of the probability of such a mutation taking place and going to fixation in the species has concluded that such a scenario is unlikely, with multiple mutations with more moderate fitness effects being more probable.{{Cite journal |last1=de Boer |first1=Bart |last2=Thompson |first2=Bill |last3=Ravignani |first3=Andrea |last4=Boeckx |first4=Cedric |date=16 January 2020 |title=Evolutionary Dynamics Do Not Motivate a Single-Mutant Theory of Human Language |journal=Scientific Reports |volume=10 |issue=1 |page=451 |bibcode=2020NatSR..10..451D |doi=10.1038/s41598-019-57235-8 |issn=2045-2322 |pmc=6965110 |pmid=31949223}} Another criticism has questioned the logic of the argument for single mutation and puts forward that from the formal simplicity of Merge, the capacity Berwick and Chomsky deem the core property of human language that emerged suddenly, one cannot derive the (number of) evolutionary steps that led to it.{{Cite journal |last1=Martins |first1=Pedro Tiago |last2=Boeckx |first2=Cedric |date=27 November 2019 |title=Language evolution and complexity considerations: The no half-Merge fallacy |journal=PLOS Biology |volume=17 |issue=11 |pages=e3000389 |doi=10.1371/journal.pbio.3000389 |issn=1545-7885 |pmc=6880980 |pmid=31774810 |doi-access=free}}
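The notion of a mutation "going to fixation" can be illustrated with a standard textbook approximation. The sketch below uses Kimura's diffusion formula for a single new mutant copy in an idealised diploid population; it is not the model used in the analysis cited above, and the function name, the selection coefficients, and the population size are hypothetical values chosen purely for illustration.

```python
import math


def fixation_probability(s: float, n: int) -> float:
    """Approximate chance that one new mutant copy eventually fixes.

    Assumptions (illustrative, not taken from the cited study):
    an idealised diploid population of n individuals whose effective
    size equals n, and a selective advantage s >= 0 for the mutant.
    """
    if s < 0:
        raise ValueError("this sketch only handles s >= 0")
    if s == 0:
        # A neutral mutation fixes with probability equal to its
        # initial frequency, 1 / (2n).
        return 1.0 / (2 * n)
    # Kimura's approximation: (1 - e^(-2s)) / (1 - e^(-4ns)).
    return math.expm1(-2 * s) / math.expm1(-4 * n * s)


if __name__ == "__main__":
    population = 10_000  # hypothetical population size
    for s in (0.0, 0.001, 0.01, 0.05):  # hypothetical advantages
        p = fixation_probability(s, population)
        print(f"s = {s:<5}  P(fixation) = {p:.6f}")
```

Under these assumptions, even a clearly advantageous new mutation is lost by chance far more often than it fixes, which gives a rough sense of why the likelihood of a single decisive mutation is open to formal scrutiny.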

= The Romulus and Remus hypothesis =

{{See also|Recursion#In language|Prefrontal synthesis}}

The Romulus and Remus hypothesis, proposed by neuroscientist Andrey Vyshedskiy, seeks to explain why the modern speech apparatus originated over 500,000 years before the earliest signs of modern human imagination. This hypothesis proposes that there were two phases that led to modern recursive language. The phenomenon of recursion occurs across multiple linguistic domains, arguably most prominently in syntax and morphology. Nesting a structure, such as a sentence or a word, within another structure of the same kind enables the generation of a potentially (countably) infinite number of new variations of that structure. For example, the base sentence [Peter likes apples.] can be nested in irrealis clauses to produce [Mary said [Peter likes apples.]], [Paul believed [Mary said [Peter likes apples.]]] and so forth.{{Cite book |last=Carnie |first=Andrew |author-link=Andrew Carnie |url=https://books.google.com/books?id=MFZ1UV3YGtgC |title=Syntax: A Generative Introduction |publisher=Wiley-Blackwell |year=2012 |isbn=978-0-470-65531-3 |edition=3rd |location=West Sussex |pages=20–21}}
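The nesting described above can be sketched as a short recursive function. This is only a toy illustration of clause embedding, not a model of any linguistic theory; the function name `embed` and the representation of each embedding clause as a (subject, verb) pair are assumptions made for the example.

```python
def embed(base: str, frames: list[tuple[str, str]]) -> str:
    """Nest a base sentence inside successive embedding clauses.

    `frames` lists hypothetical (subject, verb) pairs from the
    innermost clause to the outermost one.
    """
    if not frames:
        # Base case: the bare sentence in brackets.
        return f"[{base}]"
    # Recursive case: wrap the outermost clause around the result of
    # embedding everything that lies inside it.
    *inner, (subject, verb) = frames
    return f"[{subject} {verb} {embed(base, inner)}]"


if __name__ == "__main__":
    clauses = [("Mary", "said"), ("Paul", "believed")]
    for depth in range(len(clauses) + 1):
        print(embed("Peter likes apples.", clauses[:depth]))
```

Because the function calls itself once per embedding clause, the list of clauses can be extended indefinitely, and each longer list yields a new, longer sentence.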

The first phase includes the slow development of non-recursive language with a large vocabulary along with the modern speech apparatus, which includes changes to the hyoid bone, increased voluntary control of the muscles of the diaphragm, and the evolution of the FOXP2 gene, as well as other changes by 600,000 years ago.{{Cite journal |last1=Dediu |first1=Dan |last2=Levinson |first2=Stephen C. |year=2013 |title=On the antiquity of language: the reinterpretation of Neandertal linguistic capacities and its consequences |journal=Frontiers in Psychology |volume=4 |page=397 |doi=10.3389/fpsyg.2013.00397 |issn=1664-1078 |pmc=3701805 |pmid=23847571 |doi-access=free}} Then, the second phase was a rapid Chomskian single step, consisting of three distinct events that happened in quick succession around 70,000 years ago and allowed the shift from non-recursive to recursive language in early hominins.

  1. A genetic mutation that slowed down the prefrontal synthesis (PFS) critical period of at least two children that lived together.
  2. This allowed these children to create recursive elements of language such as spatial prepositions.
  3. Then this merged with their parents' non-recursive language to create recursive language.{{Cite journal |last=Vyshedskiy |first=Andrey |date=29 July 2019 |title=Language evolution to revolution: the leap from rich-vocabulary non-recursive communication system to recursive language 70,000 years ago was associated with acquisition of a novel component of imagination, called Prefrontal Synthesis, enabled by a mutation that slowed down the prefrontal cortex maturation simultaneously in two or more children – the Romulus and Remus hypothesis |journal=Research Ideas and Outcomes |volume=5 |doi=10.3897/rio.5.e38546 |issn=2367-7163 |doi-access=free}}

It is not enough for children to have a modern prefrontal cortex (PFC) to allow the development of PFS; the children must also be mentally stimulated and have recursive elements already in their language to acquire PFS. Since their parents would not have invented these elements yet, the children would have had to do it themselves, which is a common occurrence among young children that live together, in a process called cryptophasia.{{Cite journal |last=Bakker |first=Peter |date=July 1987 |title=Autonomous Languages of Twins |journal=Acta Geneticae Medicae et Gemellologiae: Twin Research |volume=36 |issue=2 |pages=233–238 |doi=10.1017/S0001566000004463 |issn=0001-5660 |pmid=3434134 |doi-access=free}} This means that delayed PFC development would have allowed more time to acquire PFS and develop recursive elements.

Delayed PFC development also comes with negative consequences, such as a longer period of reliance on one's parents to survive and lower survival rates. For modern language to have occurred, PFC delay had to have an immense survival benefit in later life, such as PFS ability. This suggests that the mutation that caused PFC delay and the development of recursive language and PFS occurred simultaneously, which lines up with evidence of a genetic bottleneck around 70,000 years ago.{{Cite journal |last1=Amos W. |last2=Hoffman J. I. |date=7 January 2010 |title=Evidence that two main bottleneck events shaped modern human genetic diversity |journal=Proceedings of the Royal Society B: Biological Sciences |volume=277 |issue=1678 |pages=131–137 |doi=10.1098/rspb.2009.1473 |pmc=2842629 |pmid=19812086}} This could have been the result of a few individuals who developed PFS and recursive language which gave them significant competitive advantage over all other humans at the time.

= Gestural theory =

The gestural theory states that human language developed from gestures that were used for simple communication.

Two types of evidence support this theory.

  1. Gestural language and vocal language depend on similar neural systems. The regions on the cortex that are responsible for mouth and hand movements border each other.
  2. Nonhuman primates can use gestures or symbols for at least primitive communication, and some of their gestures resemble those of humans, such as the "begging posture", with the hands stretched out, which humans share with chimpanzees.Premack, David & Premack, Ann James. The Mind of an Ape, {{ISBN|0-393-01581-5}}.{{Cite journal |last1=Pollick |first1=A. S. |last2=de Waal |first2=F. B. |date=May 2007 |title=Ape Gestures and Language Evolution |journal=Proceedings of the National Academy of Sciences |volume=104 |issue=19 |pages=8184–8189 |bibcode=2007PNAS..104.8184P |doi=10.1073/pnas.0702624104 |pmc=1876592 |pmid=17470779 |doi-access=free}}

Research has found strong support for the idea that oral communication and sign language depend on similar neural structures. Patients who used sign language, and who suffered from a left-hemisphere lesion, showed the same disorders with their sign language as vocal patients did with their oral language.{{Cite book |last=Kimura |first=Doreen |title=Neuromotor mechanisms in human communication |publisher=Oxford University Press |year=1993 |isbn=978-0-19-505492-7 |location=New York}} Other researchers found that the same left-hemisphere brain regions were active during sign language as during the use of vocal or written language.{{Cite journal |last=Newman |first=A. J. |display-authors=etal |year=2002 |title=A Critical Period for Right Hemisphere Recruitment in American Sign Language Processing |journal=Nature Neuroscience |volume=5 |issue=1 |pages=76–80 |doi=10.1038/nn775 |pmid=11753419 |s2cid=2745545}}

Primate gesture is at least partially genetic: different nonhuman apes will perform gestures characteristic of their species, even if they have never seen another ape perform that gesture. For example, gorillas beat their breasts. This shows that gestures are an intrinsic and important part of primate communication, which supports the idea that language evolved from gesture.{{Cite journal |last1=Arbib |first1=M. A. |last2=Liebal |first2=K |last3=Pika |first3=S. |date=December 2008 |title=Primate vocalization, gesture, and the evolution of human language |journal=Current Anthropology |volume=49 |issue=6 |pages=1053–1076 |doi=10.1086/593015 |pmid=19391445 |s2cid=18832100}}

Further evidence suggests that gesture and language are linked. In humans, manually gesturing has an effect on concurrent vocalizations, thus creating certain natural vocal associations of manual efforts. Chimpanzees move their mouths when performing fine motor tasks. These mechanisms may have played an evolutionary role in enabling the development of intentional vocal communication as a supplement to gestural communication. Voice modulation could have been prompted by preexisting manual actions.

From infancy, gestures both supplement and predict speech.{{Cite journal |last1=Capone |first1=Nina C. |last2=McGregor |first2=Karla K. |year=2004 |title=Gesture Development |journal=Journal of Speech, Language, and Hearing Research |volume=47 |issue=1 |pages=173–186 |doi=10.1044/1092-4388(2004/015) |pmid=15072537 |s2cid=7244799}}{{Cite journal |last1=Ozçalişkan |first1=S. |last2=Goldin-Meadow |first2=S. |date=July 2005 |title=Gesture is at the cutting edge of early language development |journal=Cognition |volume=96 |issue=3 |pages=B101–B113 |doi=10.1016/j.cognition.2005.01.001 |pmid=15996556 |s2cid=206863317}} In humans, gestures quickly change from a sole means of communication (from a very young age) to a supplemental and predictive behavior that is used despite the ability to communicate verbally. This, too, parallels the idea that gestures developed first and that language was subsequently built upon them.

Two possible scenarios have been proposed for the development of language,Rizzolatti, G. (2008). Giacomo Rizzolatti on the Evolution of Language. Retrieved from http://gocognitive.net/interviews/evolution-language-gestures{{full citation needed|date=January 2015}} one of which supports the gestural theory:

  1. Language developed from the calls of human ancestors.
  2. Language was derived from gesture.

The first perspective, that language evolved from the calls of human ancestors, seems logical because both humans and animals make sounds or cries. One anatomical objection is that the centre that controls calls in monkeys and other animals is located in a completely different part of the brain than in humans. In monkeys, this centre is located in the depths of the brain related to emotions. In the human system, it is located in an area unrelated to emotion. Humans can communicate simply to communicate, without emotions. So, anatomically, this scenario does not work. This suggests that language was derived from gesture (humans communicated by gesture first and sound was attached later).{{Cite journal |last=Kendon |first=Adam |date=February 2017 |title=Reflections on the "gesture-first" hypothesis of language origins |journal=Psychonomic Bulletin & Review |volume=24 |issue=1 |pages=163–170 |doi=10.3758/s13423-016-1117-3 |pmc=5325861 |pmid=27439503}}

The important question for gestural theories is why there was a shift to vocalization. Various explanations have been proposed:

  1. Human ancestors started to use more and more tools, meaning that their hands were occupied and could no longer be used for gesturing.{{Cite book |last=Corballis |first=Michael C. |title=The transition to language |publisher=Oxford University Press |year=2002 |isbn=978-0-19-925066-0 |editor-last=Wray |editor-first=Alison |pages=161–179}}
  2. Manual gesturing requires that speakers and listeners be visible to one another. In many situations, they might need to communicate, even without visual contact—for example after nightfall or when foliage obstructs visibility.
  3. A composite hypothesis holds that early language took the form of part gestural and part vocal mimesis (imitative 'song-and-dance'), combining modalities because all signals (like those of nonhuman apes and monkeys) still needed to be costly in order to be intrinsically convincing. In that event, each multi-media display would have needed not just to disambiguate an intended meaning but also to inspire confidence in the signal's reliability. The suggestion is that only once community-wide contractual understandings had come into force{{Cite book |last=Knight |first=Chris |url=http://www.chrisknight.co.uk/wp-content/uploads/2008/01/knight-springer-online-fulltext.pdf |title=The evolution of language: proceedings of the 6th international conference (EVOLANG6), Rome, Italy, 12–15 April 200 |publisher=World Scientific |year=2006 |isbn=9789812566560 |editor-last=Cangelosi |editor-first=Angelo |volume=7 |location=New Jersey |pages=109–128 |chapter=Language co-evolved with the rule of law |journal=Mind & Society |doi=10.1007/s11299-007-0039-1 |editor-last2=Smith |editor-first2=Andrew D. M. |editor-last3=Smith |editor-first3=Kenny |s2cid=143877486}} could trust in communicative intentions be automatically assumed, at last allowing Homo sapiens to shift to a more efficient default format. Since vocal distinctive features (sound contrasts) are ideal for this purpose, it was only at this point—when intrinsically persuasive body-language was no longer required to convey each message—that the decisive shift from manual gesture to the current primary reliance on spoken language occurred.{{Cite book |last=Knight |first=Chris |title=The Evolutionary emergence of language: social function and the origins of linguistic for |publisher=Cambridge University Press |year=2000 |isbn=978-0-521-78157-2 |editor-last=Chris Knight |pages=99–1119 |chapter=Play as precursor of phonology and syntax |editor-last2=Michael Studdert-Kennedy |editor-last3=James R. Hurford}}

A comparable hypothesis states that in 'articulate' language, gesture and vocalisation are intrinsically linked, as language evolved from equally intrinsically linked dance and song.

Humans still use manual and facial gestures when they speak, especially when people meet who have no language in common.{{Cite book |last1=Kolb, Bryan |title=Fundamentals of Human Neuropsychology |last2=Ian Q. Whishaw |publisher=Worth Publishers |year=2003 |isbn=978-0-7167-5300-1 |edition=5th |name-list-style=amp}} There are also a great number of sign languages still in existence, commonly associated with Deaf communities. These sign languages are equal in complexity, sophistication, and expressive power, to any oral language.Sandler, Wendy; & Lillo-Martin, Diane. (2006). Sign Language and Linguistic Universals. Cambridge University Press. The cognitive functions are similar and the parts of the brain used are similar. The main difference is that the "phonemes" are produced on the outside of the body, articulated with hands, body, and facial expression, rather than inside the body articulated with tongue, teeth, lips, and breathing.{{Cite book |last=Meena |first=Ram Lakhan |title=Current Trends of Applied Linguistics |publisher=K. K. Publications |year=2021 |page=48}} (Compare the motor theory of speech perception.)

Critics of gestural theory note that it is difficult to name serious reasons why the initial pitch-based vocal communication (which is present in primates) would be abandoned in favor of the much less effective non-vocal, gestural communication.{{Cite journal |last1=Hewes |first1=Gordon W. |last2=Andrew |first2=R. J. |last3=Carini |first3=Louis |last4=Choe |first4=Hackeny |last5=Gardner |first5=R. Allen |last6=Kortlandt |first6=A. |last7=Krantz |first7=Grover S. |last8=McBride |first8=Glen |last9=Nottebohm |first9=Fernando |last10=Pfeiffer |first10=John |last11=Rumbaugh |first11=Duane G. |last12=Steklis |first12=Horst D. |last13=Raliegh |first13=Michael J. |last14=Stopa |first14=Roman |last15=Suzuki |first15=Akira |year=1973 |title=Primate Communication and the Gestural Origin of Language [and Comments and Reply] |journal=Current Anthropology |volume=14 |issue=1/2 |pages=5–24 |doi=10.1086/201401 |jstor=2741093 |s2cid=146288708 |last16=Washburn |first16=S. L. |last17=Wescott |first17=Roger W.}} However, Michael Corballis has argued that primate vocal communication (such as alarm calls), unlike hand movement, cannot be controlled consciously, and is therefore not a credible precursor to human language. On this view, primate vocalization is instead homologous to, and continued in, involuntary human reflexes connected with basic emotions, such as screams or laughter (the fact that these can be faked does not disprove the existence of genuine involuntary responses to fear or surprise). Also, gesture is not generally less effective, and depending on the situation can even be advantageous, for example in a loud environment or where it is important to be silent, such as on a hunt.
Other challenges to the "gesture-first" theory have been presented by researchers in psycholinguistics, including David McNeill.{{Cite journal |last1=McNeill |first1=David |last2=Bertenthal |first2=Bennett |last3=Cole |first3=Jonathan |last4=Gallagher |first4=Shaun |date=April 2005 |title=Gesture-first, but no gestures? |journal=Behavioral and Brain Sciences |volume=28 |issue=2 |pages=138–139 |doi=10.1017/S0140525X05360031 |s2cid=51753637}}

= Tool-use associated sound in the evolution of language =

Proponents of the motor theory of language evolution have primarily focused on the visual domain and communication through observation of movements. The tool-use sound hypothesis suggests that the production and perception of sound also contributed substantially, particularly incidental sound of locomotion (ISOL) and tool-use sound (TUS).{{Cite journal |last=Larsson |first=M |year=2015 |title=Tool-use-associated sound in the evolution of language |journal=Animal Cognition |volume=18 |issue=5 |pages=993–1005 |doi=10.1007/s10071-015-0885-x |pmid=26118672 |s2cid=18714154}} Human bipedalism resulted in rhythmic and more predictable ISOL. That may have stimulated the evolution of musical abilities, auditory working memory, and the abilities to produce complex vocalizations and to mimic natural sounds.{{Cite journal |last=Larsson |first=M |year=2014 |title=Self-generated sounds of locomotion and ventilation and the evolution of human rhythmic abilities |journal=Animal Cognition |volume=17 |issue=1 |pages=1–14 |doi=10.1007/s10071-013-0678-z |pmc=3889703 |pmid=23990063}} Since the human brain proficiently extracts information about objects and events from the sounds they produce, TUS, and mimicry of TUS, might have achieved an iconic function. The prevalence of sound symbolism in many extant languages supports this idea. Self-produced TUS activates multimodal brain processing (motor neurons, hearing, proprioception, touch, vision), and TUS stimulates primate audiovisual mirror neurons, which is likely to stimulate the development of association chains. Tool use and auditory gestures involve motor-processing of the forelimbs, which is associated with the evolution of vertebrate vocal communication. The production, perception, and mimicry of TUS may have resulted in a limited number of vocalizations or protowords that were associated with tool use. A new way to communicate about tools, especially when out of sight, would have had a selective advantage.
A gradual change in acoustic properties, meaning, or both could have resulted in arbitrariness and an expanded repertoire of words. Humans have been increasingly exposed to TUS over millions of years, coinciding with the period during which spoken language evolved.

= Mirror neurons and language origins =

In humans, functional MRI studies have reported finding areas homologous to the monkey mirror neuron system in the inferior frontal cortex, close to Broca's area, one of the language regions of the brain. This has led to suggestions that human language evolved from a gesture performance/understanding system implemented in mirror neurons. Mirror neurons have been said to have the potential to provide a mechanism for action-understanding, imitation-learning, and the simulation of other people's behavior.Skoyles, John R., Gesture, Language Origins, and Right Handedness, Psychology: 11,#24, 2000 This hypothesis is supported by some cytoarchitectonic homologies between monkey premotor area F5 and human Broca's area.{{Cite journal |last1=Petrides |first1=M. |last2=Cadoret |first2=G. |last3=Mackey |first3=S. |date=June 2005 |title=Orofacial somatomotor responses in the macaque monkey homologue of Broca's area |journal=Nature |volume=435 |issue=7046 |pages=1235–1238 |bibcode=2005Natur.435.1235P |doi=10.1038/nature03628 |pmid=15988526 |s2cid=4397762}}

Rates of vocabulary expansion link to the ability of children to vocally mirror non-words and so to acquire the new word pronunciations. Such speech repetition occurs automatically, quickly{{Cite journal |last1=Porter |first1=R. J. |last2=Lubker |first2=J. F. |date=September 1980 |title=Rapid reproduction of vowel-vowel sequences: evidence for a fast and direct acoustic-motoric linkage in speech |journal=Journal of Speech and Hearing Research |volume=23 |issue=3 |pages=593–602 |doi=10.1044/jshr.2303.593 |pmid=7421161}} and separately in the brain to speech perception.{{Cite journal |last1=McCarthy |first1=R. |last2=Warrington |first2=E. K. |date=June 1984 |title=A two-route model of speech production. Evidence from aphasia. |journal=Brain |volume=107 |issue=2 |pages=463–485 |doi=10.1093/brain/107.2.463 |pmid=6722512 |doi-access=free}}{{Cite journal |last1=McCarthy |first1=R. A. |last2=Warrington |first2=E. K. |year=2001 |title=Repeating without semantics: surface dysphasia? |journal=Neurocase |volume=7 |issue=1 |pages=77–87 |doi=10.1093/neucas/7.1.77 |pmid=11239078 |s2cid=12988855}} Moreover, such vocal imitation can occur without comprehension such as in speech shadowing{{Cite journal |last=Marslen-Wilson |first=W. |year=1973 |title=Linguistic structure and speech shadowing at very short latencies |journal=Nature |volume=244 |issue=5417 |pages=522–523 |bibcode=1973Natur.244..522M |doi=10.1038/244522a0 |pmid=4621131 |s2cid=4220775}} and echolalia.{{Cite journal |last1=Fay |first1=W. H. |last2=Coleman |first2=R. O. 
|date=July 1977 |title=A human sound transducer/reproducer: temporal capabilities of a profoundly echolalic child |journal=Brain and Language |volume=4 |issue=3 |pages=396–402 |doi=10.1016/0093-934x(77)90034-7 |pmid=907878 |s2cid=29492873}} Further evidence for this link comes from a 2010 study in which the brain activity of two participants was measured using fMRI while they gestured words to each other in a game of charades, a modality that some have suggested might represent the evolutionary precursor of human language. Granger causality analysis of the data indicated that the mirror-neuron system of the observer reflects the pattern of activity in the motor system of the sender, supporting the idea that the motor concept associated with the words is transmitted from one brain to another using the mirror system.{{Cite journal |last1=Schippers |first1=M. B. |last2=Roebroeck |first2=A |last3=Renken |first3=R. |last4=Nanetti |first4=L. |last5=Keysers |first5=C. |year=2010 |title=Mapping the Information flow from one brain to another during gestural communication |journal=Proceedings of the National Academy of Sciences of the United States of America |volume=107 |issue=20 |pages=9388–9393 |bibcode=2010PNAS..107.9388S |doi=10.1073/pnas.1001791107 |pmc=2889063 |pmid=20439736 |doi-access=free}}

Not all linguists agree with the above arguments, however. In particular, supporters of Noam Chomsky argue against the possibility that the mirror neuron system can play any role in the hierarchical recursive structures essential to syntax.{{Cite book |last=Moro |first=Andrea |title=The boundaries of Babel: the brain and the enigma of impossible language |publisher=MIT Press |year=2008 |isbn=978-0-262-13498-9 |location=Cambridge, MA}}{{page needed|date=March 2017}}

= Putting-down-the-baby theory =

According to Dean Falk's "putting-down-the-baby" theory, vocal interactions between early hominid mothers and infants began a sequence of events that led, eventually, to human ancestors' earliest words.{{Cite journal |last=Falk |first=D. |date=August 2004 |title=Prelinguistic evolution in early hominins: whence motherese? |url=http://www.cogsci.ucsd.edu/~johnson/COGS260/Falk2004.pdf |url-status=dead |journal=Behavioral and Brain Sciences |volume=27 |issue=4 |pages=491–583 |doi=10.1017/s0140525x04000111 |pmid=15773427 |s2cid=39547572 |archive-url=https://web.archive.org/web/20140104205636/http://www.cogsci.ucsd.edu/~johnson/COGS260/Falk2004.pdf |archive-date=4 January 2014 |access-date=4 January 2014}} The basic idea is that evolving human mothers, unlike their counterparts in other primates, could not move around and forage with their infants clinging onto their backs. Loss of fur in the human case left infants with no means of clinging on. Frequently, therefore, mothers had to put their babies down. As a result, these babies needed to be reassured that they were not being abandoned. Mothers responded by developing 'motherese'—an infant-directed communicative system embracing facial expressions, body language, touching, patting, caressing, laughter, tickling, and emotionally expressive contact calls. The argument is that language developed out of this interaction.

In The Mental and Social Life of Babies, psychologist Kenneth Kaye noted that no usable adult language could have evolved without interactive communication between very young children and adults. "No symbolic system could have survived from one generation to the next if it could not have been easily acquired by young children under their normal conditions of social life."{{Cite book |last=Kaye |first=K. |url=https://archive.org/details/mentalsociallife0000kaye_a5t8/page/186 |title=The Mental and Social Life of Babies |publisher=University of Chicago Press |year=1982 |isbn=0-226-42848-6 |pages=[https://archive.org/details/mentalsociallife0000kaye_a5t8/page/186 186]}}

= From-where-to-what theory =

File:From where to what.png

The "from where to what" model is a language evolution model that is derived primarily from the organization of language processing in the brain into two structures: the auditory dorsal stream and the auditory ventral stream.{{Cite journal |last=Poliva |first=Oren |date=20 September 2017 |title=From where to what: a neuroanatomically based evolutionary model of the emergence of speech in humans |journal=F1000Research |volume=4 |page=67 |doi=10.12688/f1000research.6175.3 |issn=2046-1402 |pmc=5600004 |pmid=28928931 |doi-access=free}}{{Cite journal |last=Poliva |first=Oren |date=30 June 2016 |title=From Mimicry to Language: A Neuroanatomically Based Evolutionary Model of the Emergence of Vocal Language |journal=Frontiers in Neuroscience |volume=10 |page=307 |doi=10.3389/fnins.2016.00307 |issn=1662-453X |pmc=4928493 |pmid=27445676 |doi-access=free}} It hypothesizes seven stages of language evolution (see illustration). Speech originated for the purpose of exchanging contact calls between mothers and their offspring to find one another in the event they became separated (illustration part 1). The contact calls could be modified with intonations in order to express either a higher or lower level of distress (illustration part 2). The use of two types of contact calls enabled the first question-answer conversation. In this scenario, the child would emit a low-level distress call to express a desire to interact with an object, and the mother would respond with either another low-level distress call (to express approval of the interaction) or a high-level distress call (to express disapproval) (illustration part 3). Over time, the improved use of intonations and vocal control led to the invention of unique calls (phonemes) associated with distinct objects (illustration part 4). At first, children learned the calls (phonemes) from their parents by imitating their lip-movements (illustration part 5). 
Eventually, infants were able to encode all the calls (phonemes) into long-term memory. Consequently, mimicry via lip-reading became limited to infancy, and older children learned new calls through mimicry without lip-reading (illustration part 6). Once individuals became capable of producing sequences of calls, multi-syllabic words became possible, which increased the size of the vocabulary (illustration part 7). The use of words, composed of sequences of syllables, provided the infrastructure for communicating with sequences of words (i.e. sentences).

The theory's name is derived from the two auditory streams, which are both found in the brains of humans and other primates. The auditory ventral stream is responsible for sound recognition, and so it is referred to as the auditory "what" stream.{{Cite journal |last=Scott |first=S. K. |date=1 December 2000 |title=Identification of a pathway for intelligible speech in the left temporal lobe |journal=Brain |volume=123 |issue=12 |pages=2400–2406 |doi=10.1093/brain/123.12.2400 |issn=1460-2156 |pmc=5630088 |pmid=11099443}}{{Cite journal |last1=Davis |first1=Matthew H. |last2=Johnsrude |first2=Ingrid S. |date=15 April 2003 |title=Hierarchical Processing in Spoken Language Comprehension |journal=The Journal of Neuroscience |volume=23 |issue=8 |pages=3423–3431 |doi=10.1523/jneurosci.23-08-03423.2003 |issn=0270-6474 |pmc=6742313 |pmid=12716950 |doi-access=free}}{{Cite journal |last1=Petkov |first1=Christopher I. |last2=Kayser |first2=Christoph |last3=Steudel |first3=Thomas |last4=Whittingstall |first4=Kevin |last5=Augath |first5=Mark |last6=Logothetis |first6=Nikos K. |date=10 February 2008 |title=A voice region in the monkey brain |journal=Nature Neuroscience |volume=11 |issue=3 |pages=367–374 |doi=10.1038/nn2043 |issn=1097-6256 |pmid=18264095 |s2cid=5505773}} In primates, the auditory dorsal stream is responsible for sound localization, and thus it is called the auditory "where" stream. Only in humans (in the left hemisphere) is it also responsible for other processes associated with language use and acquisition, such as speech repetition and production, integration of phonemes with their lip movements, perception and production of intonations, phonological long-term memory (long-term memory storage of the sounds of words), and phonological working memory (the temporary storage of the sounds of words).{{Cite journal |last1=Buchsbaum |first1=Bradley R. |last2=Baldo |first2=Juliana |last3=Okada |first3=Kayoko |last4=Berman |first4=Karen F. 
|last5=Dronkers |first5=Nina |last6=D'Esposito |first6=Mark |last7=Hickok |first7=Gregory |date=December 2011 |title=Conduction aphasia, sensory-motor integration, and phonological short-term memory – An aggregate analysis of lesion and fMRI data |journal=Brain and Language |volume=119 |issue=3 |pages=119–128 |doi=10.1016/j.bandl.2010.12.001 |issn=0093-934X |pmc=3090694 |pmid=21256582}}{{Cite journal |last1=Warren |first1=Jane E. |last2=Wise |first2=Richard J.S. |last3=Warren |first3=Jason D. |date=December 2005 |title=Sounds do-able: auditory–motor transformations and the posterior temporal plane |journal=Trends in Neurosciences |volume=28 |issue=12 |pages=636–643 |doi=10.1016/j.tins.2005.09.010 |issn=0166-2236 |pmid=16216346 |s2cid=36678139}}{{Cite journal |last=Campbell |first=Ruth |date=12 March 2008 |title=The processing of audio-visual speech: empirical and neural bases |journal=Philosophical Transactions of the Royal Society of London B: Biological Sciences |volume=363 |issue=1493 |pages=1001–1010 |doi=10.1098/rstb.2007.2155 |issn=0962-8436 |pmc=2606792 |pmid=17827105}}{{Cite journal |last1=Kayser |first1=Christoph |last2=Petkov |first2=Christopher I. |last3=Logothetis |first3=Nikos K. |date=December 2009 |title=Multisensory interactions in primate auditory cortex: fMRI and electrophysiology |journal=Hearing Research |volume=258 |issue=1–2 |pages=80–88 |doi=10.1016/j.heares.2009.02.011 |issn=0378-5955 |pmid=19269312 |s2cid=31412246}}{{Cite journal |last1=Hickok |first1=Gregory |last2=Buchsbaum |first2=Bradley |last3=Humphries |first3=Colin |last4=Muftuler |first4=Tugan |date=1 July 2003 |title=Auditory–Motor Interaction Revealed by fMRI: Speech, Music, and Working Memory in Area Spt |journal=Journal of Cognitive Neuroscience |volume=15 |issue=5 |pages=673–682 |doi=10.1162/089892903322307393 |issn=1530-8898 |pmid=12965041}}{{Cite journal |last1=Schwartz |first1=M. F. |last2=Faseyitan |first2=O. |last3=Kim |first3=J. |last4=Coslett |first4=H. B. 
|date=20 November 2012 |title=The dorsal stream contribution to phonological retrieval in object naming |journal=Brain |volume=135 |issue=12 |pages=3799–3814 |doi=10.1093/brain/aws300 |issn=0006-8950 |pmc=3525060 |pmid=23171662}}{{Cite journal |last=Gow |first=David W. |date=June 2012 |title=The cortical organization of lexical knowledge: A dual lexicon model of spoken language processing |journal=Brain and Language |volume=121 |issue=3 |pages=273–288 |doi=10.1016/j.bandl.2012.03.005 |issn=0093-934X |pmc=3348354 |pmid=22498237}}{{Cite journal |last1=Buchsbaum |first1=Bradley R. |last2=D'Esposito |first2=Mark |date=May 2008 |title=The Search for the Phonological Store: From Loop to Convolution |journal=Journal of Cognitive Neuroscience |volume=20 |issue=5 |pages=762–778 |doi=10.1162/jocn.2008.20501 |issn=0898-929X |pmid=18201133 |s2cid=17878480}} Some evidence also indicates a role in recognizing others by their voices.{{Cite journal |last1=Lachaux |first1=Jean-Philippe |last2=Jerbi |first2=Karim |last3=Bertrand |first3=Olivier |last4=Minotti |first4=Lorella |last5=Hoffmann |first5=Dominique |last6=Schoendorff |first6=Benjamin |last7=Kahane |first7=Philippe |date=31 October 2007 |title=A Blueprint for Real-Time Functional Mapping via Human Intracranial Recordings |journal=PLOS ONE |volume=2 |issue=10 |pages=e1094 |bibcode=2007PLoSO...2.1094L |doi=10.1371/journal.pone.0001094 |issn=1932-6203 |pmc=2040217 |pmid=17971857 |doi-access=free}}{{Cite journal |last1=Jardri |first1=Renaud |last2=Houfflin-Debarge |first2=Véronique |last3=Delion |first3=Pierre |last4=Pruvo |first4=Jean-Pierre |last5=Thomas |first5=Pierre |last6=Pins |first6=Delphine |date=April 2012 |title=Assessing fetal response to maternal speech using a noninvasive functional brain imaging technique |journal=International Journal of Developmental Neuroscience |volume=30 |issue=2 |pages=159–161 |doi=10.1016/j.ijdevneu.2011.11.002 |issn=0736-5748 |pmid=22123457 |s2cid=2603226}} The emergence of each of these 
functions in the auditory dorsal stream represents an intermediate stage in the evolution of language.

A contact-call origin for human language is consistent with animal studies: as with human language, contact call discrimination in monkeys is lateralised to the left hemisphere.{{Cite journal |last1=Petersen |first1=M. |last2=Beecher |first2=M. |last3=Zoloth |last4=Moody |first4=D. |last5=Stebbins |first5=W. |date=20 October 1978 |title=Neural lateralization of species-specific vocalizations by Japanese macaques (Macaca fuscata) |journal=Science |volume=202 |issue=4365 |pages=324–327 |bibcode=1978Sci...202..324P |doi=10.1126/science.99817 |issn=0036-8075 |pmid=99817}}{{Cite journal |last1=Heffner |first1=H. |last2=Heffner |first2=R. |date=5 October 1984 |title=Temporal lobe lesions and perception of species-specific vocalizations by macaques |journal=Science |volume=226 |issue=4670 |pages=75–76 |bibcode=1984Sci...226...75H |doi=10.1126/science.6474192 |issn=0036-8075 |pmid=6474192}} In addition, knocking out language-related genes (such as FOXP2 and SRPX2) in mice resulted in pups no longer emitting contact calls when separated from their mothers.{{Cite journal |last1=Shu |first1=W. |last2=Cho |first2=J. Y. |last3=Jiang |first3=Y. |last4=Zhang |first4=M. |last5=Weisz |first5=D. |last6=Elder |first6=G. A. |last7=Schmeidler |first7=J. |last8=De Gasperi |first8=R. |last9=Sosa |first9=M. A. G. |date=27 June 2005 |title=Altered ultrasonic vocalization in mice with a disruption in the Foxp2 gene |journal=Proceedings of the National Academy of Sciences |volume=102 |issue=27 |pages=9643–9648 |bibcode=2005PNAS..102.9643S |doi=10.1073/pnas.0503739102 |issn=0027-8424 |pmc=1160518 |pmid=15983371 |doi-access=free}}{{Cite journal |last1=Sia |first1=G. M. |last2=Clem |first2=R. L. |last3=Huganir |first3=R. L. 
|date=31 October 2013 |title=The Human Language-Associated Gene SRPX2 Regulates Synapse Formation and Vocalization in Mice |journal=Science |volume=342 |issue=6161 |pages=987–991 |bibcode=2013Sci...342..987S |doi=10.1126/science.1245079 |issn=0036-8075 |pmc=3903157 |pmid=24179158}} Supporting this model is also its ability to explain unique human phenomena, such as the use of intonations when converting words into commands and questions, the tendency of infants to mimic vocalizations during the first year of life (and its disappearance later on) and the protruding and visible human lips, which are not found in other apes. This theory could be considered an elaboration of the putting-down-the-baby theory of language evolution.

= Grammaticalisation theory =

"Grammaticalization" is a continuous historical process in which free-standing words develop into grammatical appendages, while these in turn become ever more specialized and grammatical. An initially "incorrect" usage, in becoming accepted, leads to unforeseen consequences, triggering knock-on effects and extended sequences of change. Paradoxically, grammar evolves because, in the final analysis, humans care less about grammatical niceties than about making themselves understood.Sperber, D. and D. Wilson 1986. Relevance. Communication and cognition. Oxford: Blackwell. If this is how grammar evolves today, according to this school of thought, similar principles at work can be legitimately inferred among distant human ancestors, when grammar itself was first being established.{{Cite book |last=Deutscher |first=Guy |url=https://archive.org/details/unfoldingoflangu00deut |title=The unfolding of language: an evolutionary tour of mankind's greatest invention |publisher=Metropolitan |year=2005 |isbn=978-0-8050-7907-4 |location=New York}}Hopper, P. J. 1998. Emergent grammar. In M. Tomasello (ed.), The New Psychology of Language. Mahwah, NJ: Lawrence Erlbaum, 155–175.{{Cite book |last1=Heine |first1=Bernd |title=The genesis of grammar : a reconstructio |last2=Kuteva |first2=Tania |publisher=Oxford University Press |year=2007 |isbn=978-0-19-922777-8}}

In order to reconstruct the evolutionary transition from early language to languages with complex grammars, it is necessary to know which hypothetical sequences are plausible and which are not. In order to convey abstract ideas, the first recourse of speakers is to fall back on immediately recognizable concrete imagery, very often deploying metaphors rooted in shared bodily experience.Lakoff, G. and M. Johnson 1980. Metaphors We Live By. Chicago: University of Chicago Press. A familiar example is the use of concrete terms such as "belly" or "back" to convey abstract meanings such as "inside" or "behind". Equally metaphorical is the strategy of representing temporal patterns on the model of spatial ones. For example, English speakers might say "It is going to rain", modelled on "I am going to London." This can be abbreviated colloquially to "It's gonna rain." Even when in a hurry, English speakers do not say "I'm gonna London"—the contraction is restricted to the job of specifying tense. From such examples it can be seen why grammaticalisation is consistently unidirectional—from concrete to abstract meaning, not the other way around.

Grammaticalization theorists picture early language as simple, perhaps consisting only of nouns (p. 111). Even under that extreme theoretical assumption, however, it is difficult to imagine what would realistically have prevented people from using, say, "spear" as if it were a verb ("Spear that pig!"). People might have used their nouns as verbs or their verbs as nouns as occasion demanded. In short, while a noun-only language might seem theoretically possible, grammaticalization theory indicates that it cannot have remained fixed in that state for any length of time.{{Cite book |last1=Heine |first1=Bernd |title=The Oxford handbook of language evolution |last2=Kuteva |first2=Tania |publisher=Oxford University Press |year=2012 |isbn=978-0-19-954111-9 |editor-last=Maggie Tallerman |pages=512–527 |chapter=Grammaticalization theory as a tool for reconstructing language evolution |editor-last2=Kathleen R. Gibson}}

Creativity drives grammatical change. This presupposes a certain attitude on the part of listeners. Instead of punishing deviations from accepted usage, listeners must prioritise imaginative mind-reading. Imaginative creativity—emitting a leopard alarm when no leopard was present, for example—is not the kind of behaviour which, say, vervet monkeys would appreciate or reward.{{Cite journal |last1=Cheney |first1=Dorothy L. |last2=Seyfarth |first2=Robert M. |year=2005 |title=Constraints and preadaptations in the earliest stages of language evolution |url=http://www.psych.upenn.edu/~seyfarth/Publications/LinguisticReview.pdf |journal=The Linguistic Review |volume=22 |issue=2–4 |pages=135–159 |doi=10.1515/tlir.2005.22.2-4.135 |s2cid=18939193}} Creativity and reliability are incompatible demands; for "Machiavellian" primates as for animals generally, the overriding pressure is to demonstrate reliability.{{Cite book |last1=Maynard Smith |first1=John |title=Animal signals |last2=Harper |first2=David |publisher=Oxford University Press |year=2003 |isbn=978-0-19-852684-1 |location=New York}} If humans escape these constraints, it is because in their case, listeners are primarily interested in mental states.

To focus on mental states is to accept fictions—inhabitants of the imagination—as potentially informative and interesting. An example is metaphor: a metaphor is, literally, a false statement.Davidson, R. D. 1979. What metaphors mean. In S. Sacks (ed.), On Metaphor. Chicago: University of Chicago Press, pp. 29–45. In Romeo and Juliet, Romeo declares "Juliet is the sun!". Juliet is a woman, not a ball of plasma in the sky, but human listeners are not (or not usually) pedants insistent on point-by-point factual accuracy. They want to know what the speaker has in mind. Grammaticalisation is essentially based on metaphor. To outlaw its use would be to stop grammar from evolving and, by the same token, to exclude all possibility of expressing abstract thought.Lakoff, G. and R. Núñez 2000. Where mathematics comes from. New York: Basic Books.

A criticism of all this is that while grammaticalization theory might explain language change today, it does not satisfactorily address the really difficult challenge—explaining the initial transition from primate-style communication to language as it is known today. Rather, the theory assumes that language already exists. As Bernd Heine and Tania Kuteva acknowledge: "Grammaticalisation requires a linguistic system that is used regularly and frequently within a community of speakers and is passed on from one group of speakers to another". Outside modern humans, such conditions do not prevail.

= Evolution-progression model =

Human language is used for self-expression, but expression passes through different stages. Consciousness of self and of feelings represents the stage immediately prior to the external, phonetic expression of feelings in the form of sound (i.e. language). Intelligent animals such as dolphins, Eurasian magpies, and chimpanzees live in communities, wherein they assign themselves roles for group survival and show emotions such as sympathy.{{Cite journal |last=Gallup |first=G. G. Jr. |year=1970 |title=Chimpanzees: Self recognition |journal=Science |volume=167 |issue=3914 |pages=86–87 |bibcode=1970Sci...167...86G |doi=10.1126/science.167.3914.86 |pmid=4982211 |s2cid=145295899}} When such animals view their reflection (mirror test), they recognize themselves and exhibit self-consciousness.{{Cite journal |last=Mitchell |first=R. W. |year=1995 |title=Evidence of dolphin self-recognition and the difficulties of interpretation |journal=Consciousness and Cognition |volume=4 |issue=2 |pages=229–234 |doi=10.1006/ccog.1995.1029 |pmid=8521261 |s2cid=45507064}} Notably, humans evolved in a quite different environment from that of these animals. Human survival became easier with the development of tools, shelter, and fire, thus facilitating further advancement of social interaction, self-expression, and tool-making, for example for hunting and gathering.{{Cite journal |last=Ko |first=Kwang Hyun |year=2016 |title=Origins of human intelligence: The chain of tool-making and brain evolution |url=http://www.drustvo-antropologov.si/AN/PDF/2016_1/Anthropological_Notebooks_XXII_1_Ko.pdf |journal=Anthropological Notebooks |volume=22 |issue=1 |pages=5–22}} Increasing brain size allowed advanced provisioning and tools, and the technological advances of the Palaeolithic era, which built upon the earlier evolutionary innovations of bipedalism and hand versatility, allowed the development of human language.{{Citation needed|date=May 2018}}

= Self-domesticated ape theory =

According to a study investigating the song differences between white-rumped munias and their domesticated counterpart (the Bengalese finch), the wild munias use a highly stereotyped song sequence, whereas the domesticated ones sing a highly unconstrained song. In wild finches, song syntax is subject to female preference (sexual selection) and remains relatively fixed. However, in the Bengalese finch, natural selection is replaced by breeding, in this case for colorful plumage, and thus, decoupled from selective pressures, stereotyped song syntax is allowed to drift. It is replaced, supposedly within 1000 generations, by a variable and learned sequence. Wild finches, moreover, are thought incapable of learning song sequences from other finches.{{Cite journal |last1=Soma |first1=M. |last2=Hiraiwa-Hasegawa |first2=M. |last3=Okanoya |first3=K. |year=2009 |title=Early ontogenetic effects on song quality in the Bengalese finch (Lonchura striata var. domestica): laying order, sibling competition and song syntax |url=https://ir.soken.ac.jp/?action=repository_uri&item_id=3818 |journal=Behavioral Ecology and Sociobiology |volume=63 |issue=3 |pages=363–370 |doi=10.1007/s00265-008-0670-9 |bibcode=2009BEcoS..63..363S |s2cid=23137306}} In the field of bird vocalization, brains capable of producing only an innate song have very simple neural pathways: the primary forebrain motor centre, called the robust nucleus of arcopallium, connects to midbrain vocal outputs, which in turn project to brainstem motor nuclei. By contrast, in brains capable of learning songs, the arcopallium receives input from numerous additional forebrain regions, including those involved in learning and social experience. Control over song generation has become less constrained, more distributed, and more flexible.

One way to think about human evolution is that humans are self-domesticated apes. Just as domestication relaxed selection for stereotypic songs in the finches (mate choice was supplanted by choices made by the aesthetic sensibilities of bird breeders and their customers), so might human cultural domestication have relaxed selection on many of humans' primate behavioural traits, allowing old pathways to degenerate and reconfigure. Given the highly indeterminate way that mammalian brains develop (they basically construct themselves "bottom up", with one set of neuronal interactions preparing for the next round of interactions), degraded pathways would tend to seek out and find new opportunities for synaptic hookups. Such inherited de-differentiations of brain pathways might have contributed to the functional complexity that characterises human language. And, as exemplified by the finches, such de-differentiations can occur within very short time-frames.{{Cite journal |last1=Ritchie |first1=Graham |last2=Kirby |first2=Simon |year=2005 |title=Selection, domestication, and the emergence of learned communication systems |url=http://homepages.inf.ed.ac.uk/s0237680/pubs/ritchie_05_selection.pdf |url-status=dead |journal=Second International Symposium on the Emergence and Evolution of Linguistic Communication |archive-url=https://web.archive.org/web/20120121153322/http://homepages.inf.ed.ac.uk/s0237680/pubs/ritchie_05_selection.pdf |archive-date=21 January 2012}}

Speech and language for communication

{{See also|Animal communication|Animal language|Origin of speech}}

{{Multiple issues|Much of the language in this section is vague and does not match the encyclopedic tone.|section=y}}

A distinction can be drawn between speech and language. Language is not necessarily spoken: it might alternatively be written or signed. Speech is among a number of different methods of encoding and transmitting linguistic information, albeit arguably{{By whom|date=November 2024}} the most natural one.MacNeilage, P. 1998. Evolution of the mechanism of language output: comparative neurobiology of vocal and manual communication. In J. R. Hurford, M. Studdert Kennedy and C. Knight (eds), Approaches to the Evolution of Language. Cambridge University Press, pp. 222–241.

Some scholars, such as Noam Chomsky, view language as an initially cognitive development, its "externalisation" to serve communicative purposes occurring later in human evolution. According to one such school of thought, the key feature distinguishing human language is recursion,{{Cite journal |last1=Hauser |first1=M. D. |last2=Chomsky |first2=N. |last3=Fitch |first3=W. T. |date=November 2002 |title=The faculty of language: what is it, who has it, and how did it evolve? |url=http://www.chomsky.info/articles/20021122.pdf |url-status=dead |journal=Science |volume=298 |issue=5598 |pages=1569–1579 |doi=10.1126/science.298.5598.1569 |pmid=12446899 |archive-url=https://web.archive.org/web/20131228122250/http://www.chomsky.info/articles/20021122.pdf |archive-date=28 December 2013}} (in this context, the iterative embedding of phrases within phrases). Other scholars, notably Daniel Everett, deny that recursion is universal, citing certain languages (e.g. Pirahã) which allegedly{{By whom|date=November 2024}} lack this feature.{{Cite journal |last=Everett |first=Daniel L. |year=2005 |title=Cultural Constraints on Grammar and Cognition in Pirahã: Another Look at the Design Features of Human Language |url=http://www1.icsi.berkeley.edu/~kay/Everett.CA.Piraha.pdf |journal=Current Anthropology |volume=46 |issue=4 |pages=621–646 |doi=10.1086/431525 |s2cid=2223235 |hdl-access=free |hdl=2066/41103}}

The ability to ask questions is considered by some{{Like whom?|date=May 2021}} to distinguish language from non-human systems of communication.{{Cite book |last=Zhordania |first=I. M. |title=Who asked the first question : the origins of human choral singing, intelligence, language and speech |publisher=Logos Tbilisi Ivane Javakhishvili State University |year=2006 |isbn=9789994031818 |location=Tbilisi, Georgia}} Some captive primates (notably bonobos and chimpanzees), having learned to use rudimentary signing to communicate with their human trainers, proved able to respond correctly to complex questions and requests. Yet they failed to ask even the simplest questions themselves.{{Cite journal |last1=Savage-Rumbaugh |first1=E. Sue |last2=Murphy |first2=Jeannine |last3=Sevcik |first3=Rose A. |last4=Brakke |first4=Karen E. |last5=Williams |first5=Shelly L. |last6=Rumbaugh |first6=Duane M. |last7=Bates |first7=Elizabeth |year=1993 |title=Language Comprehension in Ape and Child |journal=Monographs of the Society for Research in Child Development |volume=58 |issue=3/4 |pages=i–252 |doi=10.2307/1166068 |jstor=1166068 |pmid=8366872}} By contrast, human children are able to ask their first questions (using only question intonation) during the babbling period of their development, long before they start using syntactic structures. Although babies from different cultures acquire native languages from their social environment, most languages of the world, whether tonal, non-tonal, intonational or accented, use a similar rising "question intonation" for yes–no questions.Bolinger, Dwight L. (Editor) 1972. Intonation. Selected Readings. Harmondsworth: Penguin, p. 314.{{Cite book |last=Cruttenden |first=Alan |title=Intonation |publisher=Cambridge University Press |year=1986 |isbn=978-0-521-26028-2 |pages=169–174}} Exceptions have been documented, however, such as a Korean dialect in which questions do not rise.Lee, Hye-Sook 2008. [https://www.isca-archive.org/speechprosody_2008/lee08_speechprosody.pdf Non-rising questions in North Keyonsang Korean.] In Proc. Speech Prosody 2008. p. 241. Retrieved 26 August 2024. The prevalence of rising question intonation has nonetheless been taken as strong evidence that it is close to universal. In general, according to some authors{{Like whom?|date=November 2024}}, sentence intonation/pitch is pivotal in spoken grammar and is the basic information used by children to learn the grammar of whatever language they acquire.

Cognitive development and language

Language users have high-level reference (or deixis): the ability to refer to things or states of being that are not in the immediate realm of the speaker. This ability is often related to theory of mind, or an awareness of the other as a being like the self with individual wants and intentions. According to Hauser, Chomsky and Fitch (2002), there are six main aspects of this high-level reference system:

  • Theory of mind
  • Capacity to acquire non-linguistic conceptual representations, such as the object/kind distinction
  • Referential vocal signals
  • Imitation as a rational, intentional system
  • Voluntary control over signal production as evidence of intentional communication
  • Number representation

= Theory of mind =

{{Main|Theory of mind}}

Simon Baron-Cohen (1999) argues that theory of mind must have preceded language use, based on evidence of use of the following characteristics as much as 40,000 years ago: intentional communication, repairing failed communication, teaching, intentional persuasion, intentional deception, building shared plans and goals, intentional sharing of focus or topic, and pretending. Moreover, Baron-Cohen argues that many primates show some, but not all, of these abilities.{{Citation needed|date=January 2014}} Call and Tomasello's research on chimpanzees supports this, in that individual chimps seem to understand that other chimps have awareness, knowledge, and intention, but do not seem to understand false beliefs. Many primates show some tendencies toward a theory of mind, but not a full one as humans have.{{Cite journal |last1=Tomasello |first1=Michael |last2=Call |first2=Josep |last3=Hare |first3=Brian |date=April 2003 |title=Chimpanzees understand psychological states – the question is which ones and to what extent |journal=Trends in Cognitive Sciences |volume=7 |issue=4 |pages=153–156 |doi=10.1016/S1364-6613(03)00035-4 |pmid=12691762 |s2cid=3390980}}

Ultimately, there is some consensus within the field that a theory of mind is necessary for language use. Thus, the development of a full theory of mind in humans was a necessary precursor to full language use.{{Cite journal |last1=Hale |first1=Courtney Melinda |last2=Tager-Flusberg |first2=Helen |date=June 2003 |title=The influence of language on theory of mind: a training study |journal=Developmental Science |volume=6 |issue=3 |pages=346–359 |doi=10.1111/1467-7687.00289 |pmc=1350918 |pmid=16467908}}

= Number representation =

In one study, rats and pigeons were required to press a button a certain number of times to get food. The animals discriminated numbers below four very accurately, but the error rate rose as the numbers increased. In another, the primatologist Tetsuro Matsuzawa attempted to teach chimpanzees Arabic numerals.{{Cite journal |last=Matsuzawa |first=Tetsuro |date=1985 |title=Use of numbers by a chimpanzee |url=https://www.nature.com/articles/315057a0 |journal=Nature |volume=315 |issue=6014 |pages=57–59 |bibcode=1985Natur.315...57M |doi=10.1038/315057a0 |pmid=3990808 |s2cid=4361089}} The difference between chimpanzees and humans in this regard was very large: it took the chimps thousands of trials to learn 1–9, with each number requiring a similar amount of training time. Children, by contrast, after learning the meaning of 1, 2 and 3 (and sometimes 4), easily comprehend (after the age of 5.5 to 6) the value of greater integers by using a successor function (i.e. 2 is 1 greater than 1, 3 is 1 greater than 2, 4 is 1 greater than 3; once 4 is reached it seems most children suddenly understand that the value of any integer n is 1 greater than the previous integer).{{Cite journal |last1=Cheung |first1=Pierina |last2=Rubenson |first2=Miriam |last3=Barner |first3=David |date=February 2017 |title=To infinity and beyond: Children generalize the successor function to all possible numbers years after learning to count |url=https://www.sciencedirect.com/science/article/abs/pii/S0010028516302006 |journal=Cognitive Psychology |volume=92 |pages=22–36 |doi=10.1016/j.cogpsych.2016.11.002 |pmid=27889550 |s2cid=206867905 |via=Science Direct}} Put simply, other primates learn the meaning of numbers one by one, similar to their approach to other referential symbols, while children first learn an arbitrary list of symbols (1, 2, 3, 4...) and then later learn their precise meanings.{{Cite journal |last=Carey |first=Susan |year=2001 |title=Cognitive Foundations of Arithmetic: Evolution and Ontogenesis |url=http://www.wjh.harvard.edu/~lds/pdfs/carey2001c.pdf |url-status=dead |journal=Mind and Language |volume=16 |issue=1 |pages=37–55 |doi=10.1111/1468-0017.00155 |archive-url=https://web.archive.org/web/20130725071406/http://www.wjh.harvard.edu/%7Elds/pdfs/carey2001c.pdf |archive-date=25 July 2013 |access-date=13 January 2014}} These results can be seen as evidence for the application of the "open-ended generative property" of language in human numeral cognition.

Linguistic structures

= Lexical-phonological principle =

Hockett (1966) details a list of features regarded as essential to describing human language.{{Cite journal |last=Hockett |first=Charles F. |year=1960 |title=The Origin of Speech |url=http://www.gifted.ucalgary.ca/dflynn/files/dflynn/Hockett60.pdf |url-status=dead |journal=Scientific American |volume=203 |issue=3 |pages=88–96 |bibcode=1960SciAm.203c..88H |doi=10.1038/scientificamerican0960-88 |pmid=14402211 |archive-url=https://web.archive.org/web/20140106173517/http://www.gifted.ucalgary.ca/dflynn/files/dflynn/Hockett60.pdf |archive-date=6 January 2014 |access-date=6 January 2014}} In the domain of the lexical-phonological principle, two features of this list are most important:

  • Productivity: users can create and understand completely novel messages.
      • New messages are freely coined by blending, analogizing from, or transforming old ones.
      • Either new or old elements are freely assigned new semantic loads by circumstances and context. This means that in every language, new idioms constantly come into existence.
  • Duality (of patterning): a large number of meaningful elements are made up of a conveniently small number of independently meaningless yet message-differentiating elements.

The sound system of a language is composed of a finite set of simple phonological items. Under the specific phonotactic rules of a given language, these items can be recombined and concatenated, giving rise to morphology and the open-ended lexicon. A key feature of language is that a simple, finite set of phonological items gives rise to an infinite lexical system wherein rules determine the form of each item, and meaning is inextricably linked with form. Phonological syntax, then, is a simple combination of pre-existing phonological units. Related to this is another essential feature of human language: lexical syntax, wherein pre-existing units are combined, giving rise to semantically novel or distinct lexical items.{{Citation needed paragraph|date=January 2014}}

Certain elements of the lexical-phonological principle are known to exist outside of humans. While all (or nearly all) have been documented in some form in the natural world, very few coexist within the same species. Birdsong, the songs of nonhuman apes, and the songs of whales all display phonological syntax, combining units of sound into larger structures apparently devoid of enhanced or novel meaning. Certain other primate species do have simple phonological systems with units referring to entities in the world. However, in contrast to human systems, the units in these primates' systems normally occur in isolation, betraying a lack of lexical syntax. There is new{{When|date=May 2021}} evidence to suggest that Campbell's monkeys also display lexical syntax, combining two calls (a predator alarm call with a "boom", the combination of which denotes a lessened threat of danger); however, it is still unclear whether this is a lexical or a morphological phenomenon.{{Cite journal |last1=Schlenker |first1=Philippe |last2=Chemla |first2=Emmanuel |last3=Arnold |first3=Kate |last4=Lemasson |first4=Alban |last5=Ouattara |first5=Karim |last6=Keenan |first6=Sumir |last7=Stephan |first7=Claudia |last8=Ryder |first8=Robin |last9=Zuberbühler |first9=Klaus |date=December 2014 |title=Monkey semantics: two 'dialects' of Campbell's monkey alarm calls |journal=Linguistics and Philosophy |volume=37 |issue=6 |pages=439–501 |doi=10.1007/s10988-014-9155-7 |s2cid=3428900}}

= Pidgins and creoles =

{{Main|Creole language|pidgin}}

Pidgins are significantly simplified languages with only rudimentary grammar and a restricted vocabulary. In their early stage, pidgins mainly consist of nouns, verbs, and adjectives with few or no articles, prepositions, conjunctions or auxiliary verbs. Often the grammar has no fixed word order and the words have no inflection.{{Cite book |last=Diamond |first=Jared M. |title=The third chimpanzee : the evolution and future of the human animal |publisher=HarperCollins |year=1992 |isbn=978-0-06-018307-3 |location=New York |pages=[https://archive.org/details/thirdchimpanzee00jare_0/page/141 141–167] |chapter=Bridges to human language |chapter-url=https://archive.org/details/thirdchimpanzee00jare_0/page/141}}

If contact is maintained between the groups speaking the pidgin for long periods of time, the pidgin may become more complex over many generations. If the children of one generation adopt the pidgin as their native language, it develops into a creole language, which becomes fixed and acquires a more complex grammar, with fixed phonology, syntax, morphology, and syntactic embedding. The syntax and morphology of such languages may often have local innovations not obviously derived from any of the parent languages.

Studies of creole languages around the world have suggested that they display remarkable similarities in grammar{{Citation needed|date=December 2018}} and develop uniformly from pidgins within a single generation. These similarities are apparent even when the creoles share no common language origins and developed in isolation from each other. Syntactic similarities include subject–verb–object (SVO) word order: even when creoles are derived from languages with a different word order, they often develop SVO order. Creoles also tend to have similar usage patterns for definite and indefinite articles, and similar movement rules for phrase structures, even when the parent languages do not.

Evolutionary timeline

{{Human timeline}}

= Primate communication =

Field primatologists can give useful insights into great ape communication in the wild. One notable finding is that nonhuman primates, including the other great apes, produce calls that are graded, as opposed to categorically differentiated, with listeners striving to evaluate subtle gradations in signallers' emotional and bodily states. Nonhuman apes seemingly find it extremely difficult to produce vocalisations in the absence of the corresponding emotional states. In captivity, nonhuman apes have been taught rudimentary forms of sign language or have been persuaded to use lexigrams—symbols that do not graphically resemble the corresponding words—on computer keyboards. Some nonhuman apes, such as Kanzi, have been able to learn and use hundreds of lexigrams.{{Cite book |last1=Savage-Rumbaugh |first1=E. Sue |title=Kanzi: the ape at the brink of the human mind |last2=Lewin |first2=Roger. |publisher=Wiley |year=1994 |isbn=978-0-471-58591-6 |location=New York}}{{Cite book |last1=Savage-Rumbaugh |first1=E. Sue |url=https://archive.org/details/apeslanguagehuma00sava |title=Apes, language, and the human mind |last2=Shanker |first2=Stuart. |last3=Taylor |first3=Talbot J. |publisher=Oxford University Press |year=1998 |isbn=978-0-19-510986-3 |location=New York}}

Broca's and Wernicke's areas in the primate brain are responsible for controlling the muscles of the face, tongue, mouth, and larynx, as well as recognizing sounds. Primates are known to make "vocal calls", and these calls are generated by circuits in the brainstem and limbic system.Freeman, Scott; Jon C. Herron., Evolutionary Analysis (4th ed.), Pearson Education, Inc. (2007), {{ISBN|0-13-227584-8}} pages 789–90

In the wild, the communication of vervet monkeys has been the most extensively studied. They are known to make up to ten different vocalizations. Many of these are used to warn other members of the group about approaching predators. They include a "leopard call", a "snake call", and an "eagle call".{{Cite journal |last1=Seyfarth |first1=Robert M. |last2=Cheney |first2=Dorothy L. |last3=Marler |first3=Peter |year=1980 |title=Vervet monkey alarm calls: Semantic communication in a free-ranging primate |journal=Animal Behaviour |volume=28 |issue=4 |pages=1070–1094 |doi=10.1016/S0003-3472(80)80097-2 |s2cid=53165940}} Each call triggers a different defensive strategy in the monkeys who hear the call and scientists were able to elicit predictable responses from the monkeys using loudspeakers and prerecorded sounds. Other vocalisations may be used for identification. If an infant monkey calls, its mother turns toward it, but other vervet mothers turn instead toward that infant's mother to see what she will do.{{Cite journal |last1=Arnold |first1=Kate |last2=Zuberbühler |first2=Klaus |year=2006 |title=Language evolution: Semantic combinations in primate calls |journal=Nature |volume=441 |issue=7091 |page=303 |bibcode=2006Natur.441..303A |doi=10.1038/441303a |pmid=16710411 |s2cid=4413635 |doi-access=free}}{{Cite news |last=Wade, Nicholas |date=23 May 2006 |title=Nigerian Monkeys Drop Hints on Language Origin |url=https://www.nytimes.com/2006/05/23/science/23lang.html |access-date=9 September 2007 |work=The New York Times}}

Similarly, researchers have demonstrated that chimpanzees (in captivity) use different "words" in reference to different foods. They recorded vocalisations that chimps made in reference, for example, to grapes, and then other chimps pointed at pictures of grapes when they heard the recorded sound.{{Cite thesis |last=Gibbons |first=Christopher M. |title=The referentiality of chimpanzee vocal signaling: behavioral and acoustic analysis of food barks |publisher=Ohio State University |url=http://rave.ohiolink.edu/etdc/view?acc_num=osu1173219994 |year=2007}}{{Cite journal |last1=Slocombe |first1=Katie E. |last2=Zuberbühler |first2=Klaus |year=2005 |title=Functionally Referential Communication in a Chimpanzee |url=http://doc.rero.ch/record/278602/files/Slocombe_K.-Functionally_refererential_20170201172432-HV.pdf |journal=Current Biology |volume=15 |issue=19 |pages=1779–1784 |bibcode=2005CBio...15.1779S |doi=10.1016/j.cub.2005.08.068 |pmid=16213827 |s2cid=6774592}}

= ''Ardipithecus ramidus'' =

A study published in HOMO: Journal of Comparative Human Biology in 2017 claims that Ardipithecus ramidus, a hominin dated at approximately 4.5 Ma, shows the first evidence of an anatomical shift in the hominin lineage suggestive of increased vocal capability.{{Cite journal |last1=Clark |first1=Gary |last2=Henneberg |first2=Maciej |year=2017 |title=Ardipithecus ramidus and the evolution of language and singing: An early origin for hominin vocal capability |journal=HOMO |volume=68 |issue=2 |pages=101–121 |doi=10.1016/j.jchb.2017.03.001 |pmid=28363458}} This study compared the skull of A. ramidus with 29 chimpanzee skulls of different ages and found that in numerous features A. ramidus clustered with the infant and juvenile measures as opposed to the adult measures. Such affinity with the shape dimensions of infant and juvenile chimpanzee skull architecture, it was argued, may have resulted in greater vocal capability. This assertion was based on the notion that the chimpanzee vocal tract ratios that prevent speech are a result of growth factors associated with puberty—growth factors absent in A. ramidus ontogeny. A. ramidus was also found to have a degree of cervical lordosis more conducive to vocal modulation when compared with chimpanzees as well as cranial base architecture suggestive of increased vocal capability.

What was significant in this study, according to the authors, was the observation that the changes in skull architecture that correlate with reduced aggression are the same changes necessary for the evolution of early hominin vocal ability. In integrating data on anatomical correlates of primate mating and social systems with studies of skull and vocal tract architecture that facilitate speech production, the authors argue that paleoanthropologists prior to their study have failed to understand the important relationship between early hominin social evolution and the evolution of our species' capacities for language.

While the skull of A. ramidus, according to the authors, lacks the anatomical impediments to speech evident in chimpanzees, it is unclear what the vocal capabilities of this early hominin were. While they suggest A. ramidus—based on similar vocal tract ratios—may have had vocal capabilities equivalent to a modern human infant or very young child, they concede this is a debatable and speculative hypothesis. However, they do claim that changes in skull architecture through processes of social selection were a necessary prerequisite for language evolution. As they write:

{{blockquote|We propose that as a result of paedomorphic morphogenesis of the cranial base and craniofacial morphology Ar. ramidus would have not been limited in terms of the mechanical components of speech production as chimpanzees and bonobos are. It is possible that Ar. ramidus had vocal capability approximating that of chimpanzees and bonobos, with its idiosyncratic skull morphology not resulting in any significant advances in speech capability. In this sense the anatomical features analysed in this essay would have been exapted in later more voluble species of hominin. However, given the selective advantages of pro-social vocal synchrony, we suggest the species would have developed significantly more complex vocal abilities than chimpanzees and bonobos.}}

= Early ''Homo'' =

Anatomically, some scholars believe that features of bipedalism developed in the australopithecines around 3.5 million years ago. Around this time, associated structural developments within the skull led to a more prominently L-shaped vocal tract.{{Cite book |last1=Aronoff |first1=Mark |title=The handbook of linguistics |last2=Rees-Miller |first2=Janie. |publisher=Blackwell |year=2001 |isbn=0-631-20497-0 |location=Malden, MA }}{{page needed|date=May 2020}} To generate the sounds modern Homo sapiens are capable of making, such as vowels, early Homo populations must have had a specifically shaped vocal tract and a lower-sitting larynx.{{Cite journal |last=Fitch |first=W. Tecumseh |year=2000 |title=The evolution of speech: a comparative review |journal=Trends in Cognitive Sciences |volume=4 |issue=7 |pages=258–267 |doi=10.1016/S1364-6613(00)01494-7 |pmid=10859570 |s2cid=14706592}} Earlier research suggested that Neanderthals were physically incapable of producing the full range of vocalisations seen in modern humans because of differences in larynx placement. Establishing distinct larynx positions through fossil remains of Homo sapiens and Neanderthals would support this theory; however, modern research has revealed that the hyoid bone was indistinguishable in the two populations. Though research has shown a lower-sitting larynx is important to producing speech, another theory states it may not be as important as once thought.{{Cite journal |last=Ohala |first=John J. |date=10 September 1987 |title=Experimental Phonology |journal=Annual Meeting of the Berkeley Linguistics Society |volume=13 |page=207 |doi=10.3765/bls.v13i0.1803 |issn=2377-1666 |doi-access=free}} Cataldo, Migliano, and Vinicius report that speech alone appears inadequate for transmitting stone tool-making knowledge, and suggest that speech may have emerged due to an increase in complex social interactions.{{Cite journal |last1=Cataldo |first1=D. M. |last2=Migliano |first2=A. B. |last3=Vinicius |first3=L. |date=19 January 2018 |title=Speech, stone tool-making and the evolution of language |journal=PLOS ONE |volume=13 |issue=1 |pages=e0191071 |bibcode=2018PLoSO..1391071C |doi=10.1371/journal.pone.0191071 |pmc=5774752 |pmid=29351319 |doi-access=free}}

= Archaic ''Homo sapiens'' =

{{redirect|Hmmmmm|Humming|Humming (disambiguation)}}

{{Further|Archaic humans}}

Steven Mithen proposed the term Hmmmmm for the pre-linguistic system of communication posited to have been used by archaic Homo, beginning with Homo ergaster and reaching the highest sophistication in the Middle Pleistocene with Homo heidelbergensis and Homo neanderthalensis. Hmmmmm is an acronym for holistic (non-compositional), manipulative (utterances are commands or suggestions, not descriptive statements), multi-modal (acoustic as well as gestural and facial), musical, and mimetic.{{Cite book |last=Mithen |first=Steven J. |title=The singing neanderthals: the origins of music, language, mind, and body |publisher=Harvard University Press |year=2006 |isbn=978-0-674-02192-1 |location=Cambridge, MA}}

== ''Homo erectus'' ==

Evidence for Homo erectus potentially using language comes in the form of Acheulean tool usage. The use of abstract thought in the formation of Acheulean hand axes coincides with the symbol creation necessary for simple language.{{Cite journal |last1=Barham |first1=Lawrence |last2=Everett |first2=Daniel |date=1 June 2021 |title=Semiotics and the Origin of Language in the Lower Palaeolithic |journal=Journal of Archaeological Method and Theory |volume=28 |issue=2 |pages=535–579 |doi=10.1007/s10816-020-09480-9 |issn=1573-7764 |s2cid=225509049 |doi-access=free}} Recent language theories present recursion as the unique facet of human language and theory of mind.{{Cite journal |last1=Vicari |first1=Giuseppe |last2=Adenzato |first2=Mauro |date=May 2014 |title=Is recursion language-specific? Evidence of recursive mechanisms in the structure of intentional action |url=https://linkinghub.elsevier.com/retrieve/pii/S1053810014000555 |journal=Consciousness and Cognition |volume=26 |pages=169–188 |doi=10.1016/j.concog.2014.03.010 |pmid=24762973 |s2cid=206955548 |hdl-access=free |hdl=2318/154505}}{{Cite journal |last=Corballis |first=Michael |date=2007 |title=The Uniqueness of Human Recursive Thinking |url=https://www.americanscientist.org/article/the-uniqueness-of-human-recursive-thinking |journal=American Scientist |volume=95 |issue=3 |page=240 |doi=10.1511/2007.65.240 |issn=0003-0996}} However, when language is broken down into its symbolic parts, separating meaning from the requirements of grammar, it becomes possible to see that language does not depend on either recursion or grammar. This is illustrated by speakers of the Pirahã language in Brazil, who have no myths or creation stories, no numbers and no colour terms in their language.{{Cite journal |last=Everett |first=Daniel L. |date=August 2005 |title=Cultural Constraints on Grammar and Cognition in Pirahã: Another Look at the Design Features of Human Language |journal=Current Anthropology |volume=46 |issue=4 |pages=621–646 |doi=10.1086/431525 |issn=0011-3204 |s2cid=2223235 |hdl-access=free |hdl=2066/41103}} The point is that, even if grammar was unavailable, the use of foresight, planning and symbolic thought can be taken as evidence of language as early as one million years ago with Homo erectus.

== ''Homo heidelbergensis'' ==

{{See also|Homo_heidelbergensis#Language|l1=Homo heidelbergensis: Language}}

Homo heidelbergensis was a close relative (most probably a migratory descendant) of Homo ergaster. Some researchers believe this species to have been the first hominin to make controlled vocalisations, possibly mimicking animal vocalisations, and that, as Homo heidelbergensis developed a more sophisticated culture, it proceeded from this point and possibly developed an early form of symbolic language.

== ''Homo neanderthalensis'' ==

{{See also|Neanderthal_behavior#Language|l1=Neanderthal behavior: Language}}

The discovery in 1989 of the (Neanderthal) Kebara 2 hyoid bone suggests that Neanderthals may have been anatomically capable of producing sounds similar to modern humans.{{Cite journal |last1=Arensburg |first1=B. |last2=Schepartz |first2=L. A. |last3=Tillier |first3=A. M. |last4=Vandermeersch |first4=B. |last5=Rak |first5=Y. |date=October 1990 |title=A reappraisal of the anatomical basis for speech in Middle Palaeolithic hominids |journal=American Journal of Physical Anthropology |volume=83 |issue=2 |pages=137–146 |doi=10.1002/ajpa.1330830202 |pmid=2248373}}{{Cite journal |last1=D'Anastasio |first1=R. |last2=Wroe |first2=S. |last3=Tuniz |first3=C. |last4=Mancini |first4=L. |last5=Cesana |first5=D. T. |last6=Dreossi |first6=D. |last7=Ravichandiran |first7=M. |last8=Attard |first8=M. |last9=Parr |first9=W. C. |last10=Agur |first10=Anne |last11=Capasso |first11=Luigi |display-authors=8 |year=2013 |title=Micro-biomechanics of the kebara 2 hyoid and its implications for speech in neanderthals |journal=PLOS ONE |volume=8 |issue=12 |pages=e82261 |bibcode=2013PLoSO...882261D |doi=10.1371/journal.pone.0082261 |pmc=3867335 |pmid=24367509 |doi-access=free}} The hypoglossal nerve, which passes through the hypoglossal canal, controls the movements of the tongue, which may have enabled voicing for size exaggeration (see size exaggeration hypothesis below) or may reflect speech abilities.{{Cite journal |last1=Jungers |first1=W. L. |last2=Pokempner |first2=A. A. |last3=Kay |first3=R. F. |last4=Cartmill |first4=M. |date=August 2003 |title=Hypoglossal canal size in living hominoids and the evolution of human speech. 
|url=http://www.baa.duke.edu/kay/site/riogallegos/PDFs/j74.pdf |url-status=dead |journal=Human Biology |volume=75 |issue=4 |pages=473–484 |doi=10.1353/hub.2003.0057 |pmid=14655872 |s2cid=30777048 |archive-url=https://web.archive.org/web/20070612035730/http://www.baa.duke.edu/kay/site/riogallegos/PDFs/j74.pdf |archive-date=12 June 2007}}{{Cite journal |last1=DeGusta |first1=D. |last2=Gilbert |first2=W. H. |last3=Turner |first3=S. P. |date=February 1999 |title=Hypoglossal canal size and hominid speech |journal=Proceedings of the National Academy of Sciences of the United States of America |volume=96 |issue=4 |pages=1800–1804 |bibcode=1999PNAS...96.1800D |doi=10.1073/pnas.96.4.1800 |pmc=15600 |pmid=9990105 |doi-access=free}}{{Cite book |last=Johansson |first=Sverker |title=Evolution of Language: Sixth International Conference, Rome |date=April 2006 |isbn=9789812566560 |pages=152–159 |chapter=Constraining the Time when Language Evolved |doi=10.1142/9789812774262_0020 |access-date=10 September 2007 |chapter-url=http://urn.kb.se/resolve?urn=urn:nbn:se:du-13687 |archive-url=https://web.archive.org/web/20061015133922/http://www.tech.plymouth.ac.uk/socce/evolang6/johansson_constraining.pdf |archive-date=15 October 2006 |url-status=dead}}{{Cite journal |last=Houghton |first=P. |date=February 1993 |title=Neandertal supralaryngeal vocal tract |journal=American Journal of Physical Anthropology |volume=90 |issue=2 |pages=139–146 |doi=10.1002/ajpa.1330900202 |pmid=8430750}}{{Cite journal |last1=Boë |first1=Louis-Jean |last2=Maeda |first2=Shinji |last3=Heim |first3=Jean-Louis |year=1999 |title=Neandertal man was not morphologically handicapped for speech |journal=Evolution of Communication |volume=3 |issue=1 |pages=49–77 |doi=10.1075/eoc.3.1.05boe}}

However, although Neanderthals may have been anatomically able to speak, Richard G. Klein in 2004 doubted that they possessed a fully modern language. He largely bases his doubts on the fossil record of archaic humans and their stone tool kit. Bart de Boer in 2017 acknowledges that there is no universally accepted reconstruction of the Neanderthal vocal tract; however, he notes similarities in the thoracic vertebral canal, potential air sacs, and hyoid bones between modern humans and Neanderthals, which suggest the presence of complex speech.de Boer, Bart (2017). "Evolution of speech and evolution of language". Psychonomic Bulletin & Review. 24 (1): 158–162. doi:10.3758/s13423-016-1130-6. ISSN 1069-9384. For two million years following the emergence of Homo habilis, the stone tool technology of hominins changed very little. Klein, who has worked extensively on ancient stone tools, describes the crude stone tool kit of archaic humans as impossible to break down into categories based on their function, and reports that Neanderthals seem to have had little concern for the final aesthetic form of their tools. Klein argues that the Neanderthal brain may not have reached the level of complexity required for modern speech, even if the physical apparatus for speech production was well-developed.{{Cite journal |last=Klarreich |first=E. |year=2004 |title=Biography of Richard G. Klein |journal=Proceedings of the National Academy of Sciences |volume=101 |issue=16 |pages=5705–5707 |bibcode=2004PNAS..101.5705K |doi=10.1073/pnas.0402190101 |pmc=395972 |pmid=15079069 |doi-access=free}}{{Cite web |last=Klein, Richard G. |title=Three Distinct Human Populations |url=http://www.accessexcellence.org/BF/bf02/klein/bf02e3.html |access-date=10 September 2007 |website=Biological and Behavioral Origins of Modern Humans |publisher=Access Excellence @ The National Health Museum}} The Neanderthals' level of cultural and technological sophistication remains controversial.{{Citation needed|date=May 2021}}

Computer simulations of the evolution of language have been interpreted as showing three stages in the evolution of syntax. On this model, Neanderthals are thought to have been at stage 2, with something more evolved than proto-language but not quite as complex as the language of modern humans.{{Cite journal |last=Marwick |first=Ben |year=2003 |title=Pleistocene Exchange Networks as Evidence for the Evolution of Language |journal=Cambridge Archaeological Journal |volume=13 |pages=67–81 |doi=10.1017/S0959774303000040 |s2cid=15514627 |hdl-access=free |hdl=1885/42089}}

Some researchers, applying auditory bioengineering models to computerised tomography scans of Neanderthal skulls, have asserted that Neanderthals had auditory capacity very similar to that of anatomically modern humans.{{Cite journal |last1=Conde-Valverde |first1=Mercedes |last2=Martínez |first2=Ignacio |last3=Quam |first3=Rolf M. |last4=Rosa |first4=Manuel |last5=Velez |first5=Alex D. |last6=Lorenzo |first6=Carlos |last7=Jarabo |first7=Pilar |last8=Bermúdez de Castro |first8=José María |last9=Carbonell |first9=Eudald |last10=Arsuaga |first10=Juan Luis |date=1 March 2021 |title=Neanderthals and Homo sapiens had similar auditory and speech capacities |url=https://www.nature.com/articles/s41559-021-01391-6 |journal=Nature Ecology & Evolution |volume=5 |issue=5 |pages=609–615 |bibcode=2021NatEE...5..609C |doi=10.1038/s41559-021-01391-6 |issn=2397-334X |pmid=33649543 |s2cid=232090739}} These researchers claim that this finding implies that "Neanderthals evolved the auditory capacities to support a vocal communication system as efficient as modern human speech."

= ''Homo sapiens'' =

{{See also|Anatomically modern humans|Behavioral modernity}}

Anatomically modern humans begin to appear in the fossil record in Ethiopia some 200,000 years ago.{{Cite journal |last1=Fleagle |first1=John G. |last2=Assefa |first2=Zelalem |last3=Brown |first3=Francis H. |last4=Shea |first4=John J. |year=2008 |title=Paleoanthropology of the Kibish Formation, southern Ethiopia: Introduction |journal=Journal of Human Evolution |volume=55 |issue=3 |pages=360–365 |bibcode=2008JHumE..55..360F |doi=10.1016/j.jhevol.2008.05.007 |pmid=18617219}} Although there is still much debate as to whether behavioural modernity emerged in Africa at around the same time, a growing number of archaeologists nowadays{{When|date=May 2021}} invoke the southern African Middle Stone Age use of red ochre pigments—for example at Blombos Cave—as evidence that modern anatomy and behaviour co-evolved.{{Cite journal |last1=Henshilwood |first1=C. S. |last2=d'Errico |first2=F. |last3=Yates |first3=R. |last4=Jacobs |first4=Z. |last5=Tribolo |first5=C. |last6=Duller |first6=G. A. T. |last7=Mercier |first7=N. |last8=Sealy |first8=J. C. |last9=Valladas |first9=H. |last10=Watts |first10=I. |last11=Wintle |first11=A. G. |year=2002 |title=Emergence of modern human behavior: Middle Stone Age engravings from South Africa |journal=Science |volume=295 |issue=5558 |pages=1278–1280 |bibcode=2002Sci...295.1278H |doi=10.1126/science.1067575 |pmid=11786608 |s2cid=31169551}} These archaeologists argue strongly that if modern humans at this early stage were using red ochre pigments for ritual and symbolic purposes, they probably had symbolic language as well.

According to the recent African origins hypothesis, from around 60,000 – 50,000 years ago{{Cite web |last=Minkel |first=J. R. |date=18 July 2007 |title=Skulls Add to "Out of Africa" Theory of Human Origins: Pattern of skull variation bolsters the case that humans took over from earlier species |url=http://www.sciam.com/article.cfm?articleID=DA5114C2-E7F2-99DF-30BBDDD4415DED90 |access-date=9 September 2007 |publisher=Scientific American.com}} a group of humans left Africa and began migrating to occupy the rest of the world, carrying language and symbolic culture with them.Chris Stringer, 2011. The Origin of Our Species. London: Penguin.

= Descended larynx =

{{More citations needed section|date=May 2021}}


The larynx (or voice box) is an organ in the neck housing the vocal folds, which are responsible for phonation. In humans, the larynx is descended. The human species is not unique in this respect: goats, dogs, pigs and tamarins lower the larynx temporarily, to emit loud calls.{{Cite journal |last=Fitch |first=W. T. |year=2000 |title=The phonetic potential of nonhuman vocal tracts: comparative cineradiographic observations of vocalizing animals |journal=Phonetica |volume=57 |issue=2–4 |pages=205–218 |doi=10.1159/000028474 |pmid=10992141 |s2cid=202652500}} Several deer species have a permanently lowered larynx, which may be lowered still further by males during their roaring displays.{{Cite journal |last1=Fitch |first1=W. T. |last2=Reby |first2=D. |date=August 2001 |title=The descended larynx is not uniquely human |journal=Proceedings of the Royal Society B |volume=268 |issue=1477 |pages=1669–1675 |doi=10.1098/rspb.2001.1704 |pmc=1088793 |pmid=11506679}} Lions, jaguars, cheetahs and domestic cats also do this.{{Cite journal |last1=Weissengruber |first1=G. E. |last2=Forstenpointner |first2=G. |last3=Peters |first3=G. |last4=Kübber-Heiss |first4=A. |last5=Fitch |first5=W. T. |date=September 2002 |title=Hyoid apparatus and pharynx in the lion (Panthera leo), jaguar (Panthera onca), tiger (Panthera tigris), cheetah (Acinonyxjubatus) and domestic cat (Felis silvestris f. 
catus) |journal=Journal of Anatomy |volume=201 |issue=3 |pages=195–209 |doi=10.1046/j.1469-7580.2002.00088.x |pmc=1570911 |pmid=12363272}} However, laryngeal descent in nonhumans (according to Philip Lieberman) is not accompanied by descent of the hyoid; hence the tongue remains horizontal in the oral cavity, preventing it from acting as a pharyngeal articulator.{{Cite journal |last=Lieberman |first=Philip |year=2007 |title=The Evolution of Human Speech: Its Anatomical and Neural Bases |url=http://www.cog.brown.edu/people/lieberman/pdfFiles/Lieberman%20P.%202007.%20The%20evolution%20of%20human%20speech,%20Its%20anatom.pdf |url-status=dead |journal=Current Anthropology |volume=48 |issue=1 |pages=39–66 |doi=10.1086/509092 |s2cid=28651524 |archive-url=https://web.archive.org/web/20140611203314/http://www.cog.brown.edu/people/lieberman/pdfFiles/Lieberman%20P.%202007.%20The%20evolution%20of%20human%20speech,%20Its%20anatom.pdf |archive-date=11 June 2014 |access-date=3 May 2009}}

{{Infobox anatomy

| Name = Larynx

| Latin =

| Image = Larynx external en.svg

| Caption = Anatomy of the larynx, anterolateral view

| Width =

| Image2 =

| Caption2 =

| Precursor =

| System =

| Artery =

| Vein =

| Nerve =

| Lymph =

}}

Despite all this, scholars remain divided as to how "special" the human vocal tract really is. It has been shown that the larynx does descend to some extent during development in chimpanzees, followed by hyoidal descent.{{Cite journal |last1=Nishimura |first1=T. |last2=Mikami |first2=A. |last3=Suzuki |first3=J. |last4=Matsuzawa |first4=T. |date=September 2006 |title=Descent of the hyoid in chimpanzees: evolution of face flattening and speech |journal=Journal of Human Evolution |volume=51 |issue=3 |pages=244–254 |bibcode=2006JHumE..51..244N |doi=10.1016/j.jhevol.2006.03.005 |pmid=16730049}} As against this, Philip Lieberman points out that only humans have evolved permanent and substantial laryngeal descent in association with hyoidal descent, resulting in a curved tongue and two-tube vocal tract with 1:1 proportions. He argues that Neanderthals and early anatomically modern humans could not have possessed supralaryngeal vocal tracts capable of producing "fully human speech".{{Cite journal |last1=Lieberman |first1=Philip |last2=McCarthy |first2=Robert C. |last3=Strait |first3=David |year=2006 |title=The Recent Origin of Human Speech |journal=The Journal of the Acoustical Society of America |volume=119 |issue=5 |page=3441 |bibcode=2006ASAJ..119.3441L |doi=10.1121/1.4786937}} Uniquely in the human case, simple contact between the epiglottis and velum is no longer possible, disrupting the normal mammalian separation of the respiratory and digestive tracts during swallowing. Since this entails substantial costs—increasing the risk of choking while swallowing food—we are forced to ask what benefits might have outweighed those costs. The obvious benefit—so it is claimed—must have been speech. But this idea has been vigorously contested. One objection is that humans are in fact not seriously at risk of choking on food: medical statistics indicate that accidents of this kind are extremely rare.M. Clegg 2001. 
The Comparative Anatomy and Evolution of the Human Vocal Tract. Unpublished thesis, University of London. Another objection is that in the view of most scholars, speech as it is known emerged relatively late in human evolution, roughly contemporaneously with the emergence of Homo sapiens.{{Cite journal |last1=Perreault |first1=C. |last2=Mathew |first2=S. |year=2012 |title=Dating the origin of language using phonemic diversity |journal=PLOS ONE |volume=7 |issue=4 |pages=e35289 |bibcode=2012PLoSO...735289P |doi=10.1371/journal.pone.0035289 |pmc=3338724 |pmid=22558135 |doi-access=free}} A development as complex as the reconfiguration of the human vocal tract would have required much more time, implying an early date of origin. This discrepancy in timescales undermines the idea that human vocal flexibility was initially driven by selection pressures for speech, and leaves open the possibility that it was selected for some other function, such as improved singing ability.

== Size exaggeration hypothesis ==

To lower the larynx is to increase the length of the vocal tract, in turn lowering formant frequencies so that the voice sounds "deeper", giving an impression of greater size. John Ohala argues that the function of the lowered larynx in humans, especially males, is probably to enhance threat displays rather than speech itself.John J. Ohala, 2000. [http://linguistics.berkeley.edu/~ohala/papers/lowered_larynx.pdf The irrelevance of the lowered larynx in modern Man for the development of speech.] Paris, ENST: The Evolution of Language, pp. 171–172. Ohala points out that if the lowered larynx were an adaptation for speech, adult human males would be expected to be better adapted in this respect than adult females, whose larynx is considerably less low. However, females outperform males in verbal tests,{{Cite journal |last1=Barel |first1=Efrat |last2=Tzischinsky |first2=Orna |date=June 2018 |title=Age and Sex Differences in Verbal and Visuospatial Abilities |journal=Advances in Cognitive Psychology |volume=2 |issue=14 |pages=51–61 |doi=10.5709/acp-0238-x |pmc=7186802 |pmid=32362962}} which contradicts that expectation and so counts against the view that the lowered larynx is an adaptation for speech.
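
As a rough illustration of the acoustic step in this argument, the vocal tract can be idealised as a uniform tube of length ''L'', closed at the glottis and open at the lips. This textbook simplification is an assumption introduced here for illustration, not a model taken from the sources cited in this section. The resonances (formants) of such a tube are

<math display="block">F_n = \frac{(2n - 1)\,c}{4L}, \qquad n = 1, 2, 3, \ldots</math>

where ''c'' is the speed of sound in the warm, humid air of the tract (taken here as about 350 m/s). With an illustrative length of 17 cm, the lowest formant is about 350 / (4 × 0.17) ≈ 515 Hz; lengthening the tube to 20 cm lowers it to 350 / (4 × 0.20) ≈ 438 Hz, and every higher formant falls by the same proportion. Real vocal tracts are not uniform tubes, so these figures indicate only the direction and approximate scale of the effect.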

W. Tecumseh Fitch likewise argues that size exaggeration was the original selective advantage of laryngeal lowering in the human species. Although (according to Fitch) the initial lowering of the larynx in humans had nothing to do with speech, the increased range of possible formant patterns was subsequently co-opted for speech. Size exaggeration remains the sole function of the extreme laryngeal descent observed in male deer. Consistent with the size exaggeration hypothesis, a second descent of the larynx occurs at puberty in humans, although only in males. In response to the objection that the larynx is descended in human females, Fitch suggests that mothers vocalizing to protect their infants would also have benefited from this ability.Fitch, W. T. (2002). Comparative vocal production and the evolution of speech: Reinterpreting the descent of the larynx. In A. Wray (ed.), The Transition to Language. Oxford: Oxford University Press, pp. 21–45.

= Phonemic diversity =

In 2011, Quentin Atkinson published a survey of phonemes from 500 different languages as well as language families and compared their phonemic diversity by region, number of speakers and distance from Africa. The survey revealed that African languages had the largest number of phonemes, and Oceania and South America had the smallest number. After allowing for the number of speakers, the phonemic diversity was compared to over 2000 possible origin locations. Atkinson's "best fit" model is that language originated in western, central, or southern Africa between 80,000 and 160,000 years ago. This predates the hypothesized southern coastal peopling of Arabia, India, southeast Asia, and Australia. It would also mean that the origin of language occurred at the same time as the emergence of symbolic culture.{{Cite journal |last=Atkinson |first=Quentin |year=2011 |title=Phonemic Diversity Supports a Serial Founder Effect Model of Language Expansion from Africa |url=http://www.stat.uchicago.edu/~pmcc/prelims/2011/Atkinson-346-9.pdf |journal=Science Magazine |volume=332 |issue=6027 |pages=346–349 |bibcode=2011Sci...332..346A |doi=10.1126/science.1199295 |pmid=21493858 |s2cid=42021647 |access-date=9 July 2017}}

Numerous linguists{{Cite journal |last1=Cysouw |first1=Michael |last2=Dediu |first2=Dan |last3=Moran |first3=Steven |date=10 February 2012 |title=Comment on "Phonemic Diversity Supports a Serial Founder Effect Model of Language Expansion from Africa" |journal=Science |volume=335 |issue=6069 |page=657 |bibcode=2012Sci...335..657C |doi=10.1126/science.1208841 |pmid=22323802 |ref=Cysouw 2012 |doi-access=free |hdl-access=free |hdl=11858/00-001M-0000-0012-1937-4}}{{Cite journal |last1=Wang |first1=Chuan-Chao |last2=Ding |first2=Qi-Liang |last3=Tao |first3=Huan |last4=Li |first4=Hui |date=10 February 2012 |title=Comment on "Phonemic Diversity Supports a Serial Founder Effect Model of Language Expansion from Africa" |url=https://www.science.org/doi/full/10.1126/science.1207846 |journal=Science |volume=335 |issue=6069 |page=657 |bibcode=2012Sci...335..657W |doi=10.1126/science.1207846 |pmid=22323803 |s2cid=31360222 |access-date=22 October 2023 |ref=Wang 2012}}{{Cite journal |last1=Pereltsvaig |first1=Asya |last2=Van Tuyl |first2=Rory |date=10 February 2012 |title=Comment on "Phonemic Diversity Supports a Serial Founder Effect Model of Language Expansion from Africa" |url=https://www.science.org/doi/full/10.1126/science.1209176 |journal=Science |volume=335 |issue=6069 |page=657 |bibcode=2012Sci...335..657V |doi=10.1126/science.1209176 |pmid=22323804 |access-date=22 October 2023 |ref=Pereltsvaig 2012}} have criticized Atkinson's paper as misrepresenting both the phonemic data and the processes of linguistic change, since language complexity does not necessarily correspond to age, and as failing to take into account the borrowing of phonemes from neighbouring languages, as some Bantu languages have done with click consonants. Attempts to replicate his method gave possible origins of language in the Caucasus and Turkmenistan, in addition to southern and eastern Africa.

History

= In religion and mythology =

{{Main|Mythical origins of language}}

{{See also|Divine language|Adamic language}}

[[File:Pieter Bruegel the Elder - The Tower of Babel (Vienna) - Google Art Project.jpg|thumb|''The Tower of Babel'' by Pieter Bruegel the Elder (1563)]]

The search for the origin of language has a long history in mythology. Most mythologies do not credit humans with the invention of language but speak of a divine language predating human language. Mystical languages used to communicate with animals or spirits, such as the language of the birds, are also common, and were of particular interest during the Renaissance.

Vāc is the Hindu goddess of speech, or "speech personified". As Brahman's "sacred utterance", she has a cosmological role as the "Mother of the Vedas". The Aztecs' story maintains that only a man, Coxcox, and a woman, Xochiquetzal, survived a flood, having floated on a piece of bark. They found themselves on land and had many children who were at first born unable to speak, but subsequently, upon the arrival of a dove, were endowed with language, although each one was given a different speech such that they could not understand one another.Turner, P. and Russell-Coulter, C. (2001) Dictionary of Ancient Deities (Oxford: OUP)

In the Old Testament, the Book of Genesis (chapter 11) says that God prevented the Tower of Babel from being completed through a miracle that made its construction workers start speaking different languages. After this, they migrated to other regions, grouped together according to which of the newly created languages they spoke, explaining the origins of languages and nations outside of the Fertile Crescent.{{Cite book |last=Pennock |first=Robert T. |url=https://books.google.com/books?id=aC1OccYnX0sC&q=Tower+of+Babel:+The+Evidence+Against+the+New+Creationism |title=Tower of Babel: The Evidence against the New Creationism |publisher=Bradford |year=2000 |isbn=978-0-262-66165-2}}

= Historical experiments =

{{Main|Language deprivation experiments}}

History contains a number of anecdotes about people who attempted to discover the origin of language by experiment. The first such tale was told by Herodotus (Histories 2.2). He relates that Pharaoh Psammetichus (probably Psammetichus I, 7th century BC) had two children raised by a shepherd, with the instructions that no one should speak to them, but that the shepherd should feed and care for them while listening to determine their first words. When one of the children cried "bekos" with outstretched arms the shepherd concluded that the word was Phrygian, because that was the sound of the Phrygian word for 'bread'. From this, Psammetichus concluded that the first language was Phrygian. King James IV of Scotland is said to have tried a similar experiment; his children were supposed to have spoken Hebrew.{{Cite book |last=Lindsay |first=Robert |url=https://archive.org/details/bub_gb_AKUvAAAAMAAJ |title=The history of Scotland: from 21 February 1436. to March, 1565. In which are contained accounts of many remarkable passages altogether differing from our other historians; and many facts are related, either concealed by some, or omitted by others |publisher=Baskett & Co. |year=1728 |page=[https://archive.org/details/bub_gb_AKUvAAAAMAAJ/page/n125 104]}}

Both the medieval monarch Frederick II and Akbar are said to have tried similar experiments; the children involved in these experiments did not speak. The current situation of deaf people also points in this direction.{{Clarify|date=May 2021}}

= History of research =

{{Main|Evolutionary linguistics}}

Modern linguistics did not begin until the late 18th century, and the Romantic or animist theses of Johann Gottfried Herder and Johann Christoph Adelung remained influential well into the 19th century. The question of language origin seemed inaccessible to methodical approaches, and in 1866 the Linguistic Society of Paris famously banned all discussion of the origin of language, deeming it to be an unanswerable problem. An increasingly systematic approach to historical linguistics developed in the course of the 19th century, reaching its culmination in the Neogrammarian school of Karl Brugmann and others.{{Citation needed|date=January 2014}}

However, scholarly interest in the question of the origin of language has revived only gradually from the 1950s on (and then controversially), with ideas such as universal grammar, mass comparison and glottochronology.{{Citation needed|date=January 2014}}

The "origin of language" as a subject in its own right emerged from studies in neurolinguistics, psycholinguistics and human evolution. The Linguistic Bibliography introduced "Origin of language" as a separate heading in 1988, as a sub-topic of psycholinguistics. Dedicated research institutes of evolutionary linguistics are a recent phenomenon, emerging only in the 1990s.{{Cite book |last=Meena |first=Ram Lakhan |url=https://books.google.com/books?id=y1Y7EAAAQBAJ&q=Dedicated+research+institutes+of+evolutionary+linguistics+are+a+recent+phenomenon,+emerging+only+in+the+1990s |title=Current Trends of Applied Linguistics |date=3 August 2021 |publisher=K.K. Publications |access-date=9 January 2022}}

See also

References

{{Duplicated citations|date=November 2024}}

{{Reflist|30em}}

Further reading

{{Duplicated citations|date=November 2024}}

{{columns-list|colwidth=30em|

  • {{Cite book |last=Allott |first=Robin. |title=The Motor Theory of Language Origin |publisher=Book Guild |year=1989 |isbn=978-0-86332-359-1 |location=Sussex, England}}
  • {{Cite book |last1=Armstrong |first1=David F. |title=Gesture and the Nature of Language |last2=Stokoe |first2=William C. |last3=Wilcox |first3=Sherman E. |publisher=Cambridge University Press |year=1995 |isbn=978-0-521-46772-8}}
  • {{Cite book |title=The Evolutionary Emergence of Language: Evidence and Inference |publisher=Oxford University Press |year=2013 |isbn=978-0-19-965484-0 |editor-last=Botha |editor-first=Rudolf P. |editor-last2=Everaert |editor-first2=Martin}}
  • {{Cite book |last1=Botha |first1=Rudolf P. |title=The Prehistory of Language |last2=Knight |first2=Chris |publisher=Oxford University Press |year=2009 |isbn=978-0-19-954587-2}}
  • {{Cite book |last=Burling |first=Robbins |title=The Talking Ape: How Language Evolved |publisher=Oxford University Press |year=2005 |isbn=978-0-19-927940-1}}
  • {{Cite book |last1=Cangelosi |first1=Angelo |title=Simulating the Evolution of Language |last2=Greco |first2=Alberto |last3=Harnad |first3=Stevan |author-link3=Stevan Harnad |publisher=Springer |year=2002 |isbn=978-1-85233-428-4 |editor-last=Cangelosi |editor-first=Angelo |location=London; New York |chapter=Symbol Grounding and the Symbolic Theft Hypothesis |editor-first2=Domenico |editor-last2=Parisi}}
  • {{Cite book |last=Corballis |first=Michael C. |url=https://archive.org/details/fromhandtomoutho0000corb |title=From Hand to Mouth: The Origins of Language |publisher=Princeton University Press |year=2002 |isbn=978-0-691-08803-7 |location=Princeton |url-access=registration}}
  • {{Cite book |last=Crystal |first=David |title=The Cambridge Encyclopedia of Language |publisher=Cambridge University Press |year=1997 |isbn=978-0-521-55967-6}}
  • de Grolier, E. (ed.), 1983. The Origin and Evolution of Language. Paris: Harwood Academic Publishers.
  • Dessalles, J-L., 2007. Why We Talk: The Evolutionary Origins of Language. Oxford University Press. {{ISBN|978-0199563463}}
  • {{Cite book |last1=Dor |first1=Dan |title=The Social Origins of Language |last2=Knight |first2=Chris |last3=Lewis |first3=Jerome |publisher=Oxford University Press |year=2015 |isbn=978-0-19-966533-4}}
  • {{Cite book |last1=Dunbar |first1=Robin Ian MacDonald |title=The Evolution of Culture: An Interdisciplinary View |last2=Knight |first2=Chris |last3=Power |first3=Camilla |publisher=Edinburgh University Press |year=1999 |isbn=978-0-7486-1076-1}}
  • {{Cite book |last=Everett |first=Daniel L. |author-link=Daniel Everett |title=How Language Began: The Story of Humanity's Greatest Invention |publisher=Liveright |year=2017 |isbn=978-0-87140-795-5 |location=New York}}
  • {{Cite book |last=Fitch |first=W. Tecumseh |title=The Evolution of Language |publisher=Cambridge University Press |year=2010 |isbn=978-0-521-67736-3}}
  • {{Cite book |last1=Givón |first1=Talmy |author-link1=Talmy Givón |title=The Evolution of Language out of Pre-Language |last2=Malle |first2=Bertram F |publisher=John Benjamins |year=2002 |isbn=978-1-58811-237-8}}
  • {{Cite book |last=Harnad |first=Stevan R. |author-link=Stevan Harnad |title=Origins and Evolution of Language and Speech |publisher=New York Academy of Sciences |year=1976 |isbn=0-89072-026-6 |editor-last=Steklis, Horst D. |series=Annals of the New York Academy of Sciences, v. 280 |location=New York |editor-last2=Lancaster, Jane}}
  • {{Cite book |last=Hillert |first=Dieter |author-link=Dieter Hillert |title=The Nature of Language: Evolution, Paradigms and Circuits |publisher=Springer Nature |year=2014 |isbn=978-1-4939-0609-3 |location=New York}}
  • {{Cite book |last=Hurford |first=James R. |url=http://www.lel.ed.ac.uk/~jim/rocapaper.pdf |title=Logical issues in language acquisition |publisher=Foris |year=1990 |isbn=9789067655064 |editor-last=Roca |editor-first=I. M. |location=Dordrecht, Holland Providence, RI |chapter=Nativist and Functional Explanations in Language Acquisition}}
  • {{Cite book |last=Hurford |first=James R. |title=The Origins of Meaning: Language in the Light of Evolution |publisher=Oxford University Press |year=2007 |isbn=978-0-19-920785-5}}
  • {{Cite book |last1=Hurford |first1=James R. |title=Approaches to the Evolution of Language: Social and Cognitive Bases |last2=Studdert-Kennedy |first2=Michael. |last3=Knight |first3=Chris |publisher=Cambridge University Press |year=1998 |isbn=978-0-521-63964-4}}
  • {{Cite book |last=Kenneally |first=Christine. |url=https://archive.org/details/firstwordsearchf00kenn |title=The First Word: The Search for the Origins of Language |publisher=Viking |year=2007 |isbn=978-0-670-03490-1 |location=New York}}
  • {{Cite journal |last=Knight |first=Chris |year=2016 |title=Puzzles and Mysteries in the Origin of Language |url=http://www.chrisknight.co.uk/wp-content/uploads/2016/09/Mysteries-and-puzzles-2016.pdf |journal=Language and Communication |volume=50 |pages=12–21 |doi=10.1016/j.langcom.2016.09.002}}
  • {{Cite book |last1=Knight |first1=Chris |title=The Evolutionary Emergence of Language: Social Function and the Origins of Linguistic Form |last2=Studdert-Kennedy |first2=Michael. |last3=Hurford |first3=James R. |publisher=Cambridge University Press |year=2000 |isbn=978-0-521-78157-2}}
  • {{Cite book |last=Komarova |first=Natalia L. |author-link=Natalia Komarova |url=https://archive.org/details/isbn_9785484010011/page/164 |title=History and mathematics. Analyzing and modeling global development |publisher=URSS |year=2006 |isbn=978-5-484-01001-1 |editor-last=Grinin |editor-first=L. E. |location=Moscow |pages=[https://archive.org/details/isbn_9785484010011/page/164 164–179] |chapter=Language and Mathematics: An evolutionary model of grammatical communication |editor-last2=De Munck |editor-first2=Victor C. |editor-last3=Korotaev |editor-first3=A. V.}}
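  • {{Cite book |last=Lenneberg |first=E. H. |title=Biological Foundations of Language |publisher=Wiley |year=1967 |isbn=9780471526261 |location=New York}}
  • {{Cite book |last=Leroi-Gourhan |first=A. |title=Gesture and Speech |translator-last=Berger |translator-first=A. Bostock |publisher=MIT Press |year=1993 |isbn=9780262121736 |location=Cambridge, MA}}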
  • {{Cite book |last=Lieberman |first=Philip |url=https://archive.org/details/uniquelyhumanevo00lieb |title=Uniquely Human: The Evolution of Speech, Thought, and Selfless Behavior |publisher=Harvard University Press |year=1991 |isbn=978-0-674-92182-5 |location=Cambridge, MA}}
  • {{Cite journal |last=Lieberman |first=P. |year=2007 |title=The Evolution of Human Speech: Its Anatomical and Neural Bases |url=http://www.cog.brown.edu/people/lieberman/pdfFiles/Lieberman%20P.%202007.%20The%20evolution%20of%20human%20speech,%20Its%20anatom.pdf |url-status=dead |journal=Current Anthropology |volume=48 |issue=1 |pages=39–66 |doi=10.1086/509092 |s2cid=28651524 |archive-url=https://web.archive.org/web/20140611203314/http://www.cog.brown.edu/people/lieberman/pdfFiles/Lieberman%20P.%202007.%20The%20evolution%20of%20human%20speech,%20Its%20anatom.pdf |archive-date=11 June 2014 |access-date=3 May 2009}}
  • {{Cite book |last=Lieberman |first=Philip |title=Toward an Evolutionary Biology of Language |publisher=Belknap Press of Harvard University Press |year=2006 |isbn=978-0-674-02184-6 |location=Cambridge, MA}}
  • {{Cite book |last=Logan |first=Robert K. |title=The Extended Mind: The Emergence of Language, the Human Mind and Culture |publisher=University of Toronto Press |year=2007 |isbn=9781442691803 |location=Toronto}}
  • {{Cite book |last=MacNeilage |first=P. |title=The Origin of Speech |publisher=Oxford University Press |year=2008 |isbn=9780199581580}}
  • {{Cite book |last=Mazlumyan |first=Victoria |title=Origins of Language and Thought |year=2008 |isbn=0977391515}}
  • {{Cite book |last=Mithen |first=Stephen |title=The Singing Neanderthals: The Origins of Music, Language, Mind and Body |year=2006 |isbn=9780753820513}}
  • {{Cite book |last=Pinker |first=Steven |title=The Language Instinct: How the Mind Creates Language |publisher=HarperPerennial ModernClassics |year=2007 |isbn=978-0-06-133646-1 |location=New York}}
  • {{Cite book |last=Tomasello |first=M. |title=Origins of Human Communication |publisher=MIT Press |year=2008 |isbn=9780262261203 |location=Cambridge, MA}}

}}