Unbounded Merge

Lively discussions and controversies have arisen from the contrasting gradual and emergent explanations for language’s evolution. In this letter, I would like to add further support for the view that the language faculty could not have evolved gradually.

In their essay, Robert Berwick and Noam Chomsky reply to Cedric Boeckx’s criticism of their book Why Only Us and defend the view that the language faculty comprises a distinctive human phenotype that resulted from a small rewiring of the brain.¹ They illustrate how a minimal change in a computational system may have broad consequences on the overall generative capacity of that system, much like a small rewriting of the brain may have led to the rapid emergence of the language faculty. Berwick and Chomsky point to a common confusion in conventional evolutionary theory between the cognitive ability for language and its externalization. They also note Eric Lenneberg had foreseen the inherent difficulty in any evolutionary explanation of human language:

We can no longer reconstruct what the selection pressures were or in what order they came, because we know too little that is securely established by hard evidence about the ecological and social conditions of fossil man. Moreover, we do not even know what the targets of actual selection were. This is particularly troublesome because every genetic alteration brings about several changes at once, some of which must be quite incidental to the selective process.²

Language is a generative system that yields unlimited production from finite means. The discrete infinity of language is problematic for any evolutionary explanation. How is it possible for this Basic Property of language to have evolved gradually? Riny Huijbregts proves that this is a logical impossibility.³ He summarizes the arguments for two conflicting views on the discrete infinity of language: language emerging as infinite, and language emerging as finite proto-language and then evolving to infinite language.⁴ He shows that the computational system underlying human language could not have been reached gradually because the infinite productivity generated by such system cannot be reached stepwise.

In evolutionist explanations of language, proto-syntax is an intermediate state of development: the pre-syntactic one-word stage leads to the proto-syntax two-word stage, which leads to modern syntax. Unsurprisingly, there is little consensus on what the properties of proto-language could be. For Derek Bickerton, proto-language had a large vocabulary, but no internal syntax.⁵ For James Hurford, proto-thought had something like predicate calculus, but no quantifier or logical name.⁶

Ray Jackendoff takes proto-language to be derived by proto-Merge, the precursor of full-fledged Merge.⁷ Proto-Merge would be an n-ary operation that generates flat concatenation or adjunction structures (Figure 1). It could not generate hierarchical structures (Fig. 2), which instead are derived by Merge (Fig. 3). Merge is a binary operation. It takes two syntactic objects, represented by variables α and β (Fig. 2), and derives a set.⁸ This operation is recursively unbounded. That is, it may reapply indefinitely to its own output. Unbounded Merge recursively combines two syntactic objects (Fig. 2a). It may also displace to a higher position in the hierarchical structure an object that has been previously merged, leaving the lower copy of the displaced constituent unpronounced (Fig. 2b). The syntactic derivations may also include silent constituents that do not result from their own displacement: for example, categories occupying in the position of α and β (Fig. 2). Silent constituents are derived by unbounded merge, interpreted at the semantic interface, but not externalized at the sensory-motor interface. This asymmetry is a feature of language design not taken into consideration in conventional explanations of language evolution, which are mainly focused on externalization.

As there are no recordings of proto-language, believers in the gradual evolution of language have attempted to identify empirical evidence for a previous evolutionary stage in modern languages. Two-word expressions with irregular grammatical properties, such as English exocentric VN compounds dare-devil and pick-pocket, have been identified as fossils of proto-language.⁹ According to Ljiljana Progovac and John Locke,

While these compounds violate several rules and principles of modern syntax, their structure, as well as their persistence, do provide some continuity with modern syntax. If so, then the syntax that supports their formation (proto-syntax) may have facilitated a transition from a pre-syntactic (one-word) stage to modern syntax.¹⁰

Such analyses are irrelevant to the problem of language evolution, given that the language faculty is an unbounded recursive system and that the gradual evolution of a finite language into infinite language is not a logical possibility. The notion that constructions like VN exocentric compounds are remnants from proto-language is nothing more than speculation. The alleged absence of a principled, Merge-based analysis of these constructions is not proof that they are fossils of proto-language. And in any case, such an analysis is already available.

VN exocentric compounds are not derived by proto-Merge and thus are not remnants of a finite proto-language.¹¹ Their underlying hierarchical structure is derived by full-fledged Merge and includes unpronounced constituents. This does not come as a surprise. The phenotypic property of the human language faculty is an unbounded hierarchical assembly of syntactic objects with two interfaces. The hierarchical structures are derived by the computational procedure of the language faculty and are interpreted by the semantic system and by the sensory system. Given this asymmetry, the externalization of a linguistic expression is not isomorphic to its semantic representation. Interface asymmetries are pervasive in language and are part of language design.¹² They are expected in the derivations of any linguistic expression, including one-word expressions.

One-word expressions, such as here and there, in expressions such as I stayed here and I went there, include an unpronounced locative or directional preposition at or to.¹³ In English, this preposition is omitted and is interpreted only at the semantic interface. This is also the case in Italian with qui (“here”) and li (“there”). The directional preposition a (“at” or “to”) is sometimes pronounced in varieties of Italian spoken in southern Italy, where aecche (literally, “at here”) and alocche (“at there”) closely relate to their Latin counterparts, ad hic and ad locum.¹⁴ This illustrates that the hierarchical assembly of linguistic expressions leading to the semantic interface may not coincide with the externalization of these expressions at the sensory interface.

The hierarchical structure derived by full-fledged Merge underlies any linguistic expression. A conventional evolutionary theory of the language faculty fails to distinguish the unbounded recursive assembly of syntactic objects from their externalization. The latter is subject to variation, contrary to the former, which is the human-specific phenotypic trait under investigation.

Progress has been made in characterizing the human specific phenotypic trait.¹⁵ This progress logically excludes any evolutionary explanation of the human language faculty, because of the impossibility of attaining the infinite generative system of the language faculty stepwise. The alleged proto-Merge analysis of apparently simple and irregular expressions is a stipulation and is irrelevant to the evolvability of recursive languages. An analysis based on full-fledged Merge of apparently simple and irregular expressions in conjunction with the asymmetry of the interfaces offers a deeper explanation for the human capacity for language than evolutionary proto-Merge analyses.

Further work on the biological underpinning of infinite languages, unbounded Merge, and design features such as interface asymmetries may lead to a deeper understanding of the language faculty and its emergence in Homo sapiens.

Anna Maria Di Sciullo

Robert Berwick and Noam Chomsky reply:

Anna Maria Di Sciullo’s general point is well-taken. Apparently simple and irregular expressions can be analyzed using full-fledged Merge and empty elements, without any new stipulations. This scientific parsimony is exactly why, at least for such cases, there is no need to posit some other operation alongside Merge, no matter what its origin. Even using the term proto-Merge lends it too much credibility. Proto-Merge is not even definable, and it does not simplify the evolutionary picture. For instance, there is no way to move from Jackendoff’s n-ary operations to Merge, even putting aside the question of whether an n-ary operation is simpler than Merge.

The examples of linguistic fossils that have been cited by Jackendoff and others are linguistic forms, such as daredevil. The latter term arose only in the early 1700s; nobody seriously suggests it is really 100,000, or more, years old. What about claiming that the n-ary operation itself is a fossil? As far as we know, Jackendoff does not suggest this, but if he did, it would be meaningless—like saying that Merge is a fossil, or the human heart is a fossil. We are not supposing that new operations were devised 20,000 years ago. As far as we know, the language faculty has remained intact since before the separation of modern humans when they exited Africa.

Are there any signs of an n-ary operation today? The only close candidate is unbounded, unstructured coordination: “I met someone young, tall, eager to go to college, ....” But n-ary operations are not the right system to describe examples like these. We have to account for the fact that the elements in such examples share a category: “young” and “tall” are both adjectives. This again begins to sound something suspiciously like Merge. We think this hunch is on the right track, but we do not have space to go into detail about it here. There is no version of proto-Merge apart from full-fledged Merge.

DOI: 10.37282/991819.19.47

Cedric Boeckx, “Not Only Us,” Inference: International Review of Science 3, no. 1 (2017). Robert Berwick and Noam Chomsky, Why Only Us: Language and Evolution (Cambridge, MA: MIT Press, 2016). ↩
Eric Lenneberg, “On Explaining Language,” Science, New Series 164, no. 3,880 (1969): 642. ↩
M. A. C. (Riny) Huybregts, “Infinite Generation of Language Unreachable from a Stepwise Approach,” Frontiers in Psychology (2019), doi:10.3389/fpsyg.2019.00425. ↩
E.g., Noam Chomsky, “The Language Capacity: Architecture and Evolution,” Psychonomic Bulletin and Review 24, no. 1 (2017): 200–03, doi:10.3758/s13423-016-1078-6, and related works; Evelina Fedorenko and Steven Piantadosi, “Infinitely Productive Language Can Arise from Chance under Communicative Pressure,” Journal of Language Evolution 2, no. 2 (2017): 141–47. ↩
Derek Bickerton, Language and Species (Chicago: University of Chicago Press, 1990). ↩
James Hurford, “Protothought Had No Logical Names,” New Essays on the Origin of Language, ed. Jürgen Trabant (Berlin: Mouton de Gruyter, 2001), 119–32. See also James Hurford, The Origins of Grammar: Language in the Light of Evolution II (Oxford: Oxford University Press, 2012). ↩
Ray Jackendoff, Foundations of Language: Brain, Meaning, Grammar, Evolution (Oxford: Oxford University Press, 2002). See also Ray Jackendoff, “Possible Stages in the Evolution of the Language Capacity,” Trends in Cognitive Sciences 3 (1999): 272–79; Ray Jackendoff, “What Is the Human Language Faculty? Two Views,” Language 87, no. 3 (2011): 586–624; and a reply to Jackendoff by Anna Maria Di Sciullo and Lyle Jenkins, “Biolinguistics and the Human Language Faculty,” Language 92, no. 3 (2017): e1–e32. ↩
Noam Chomsky, Angel Gallego, and Dennis Ott, “Generative Grammar and the Faculty of Language: Insights, Questions, and Challenges,” Catalan Journal of Linguistics 2 (2017). ↩
See Anna Maria Di Sciullo and Edwin William, On the Definition of Word (Cambridge, MA: MIT Press, 1987), for the internal structure of compounds, as well as Anna Maria Di Sciullo, Asymmetry in Morphology (Cambridge, MA: MIT Press, 2005). On the hierarchical structure of exocentric compounds, see Anna Maria Di Sciullo, “Decomposing Compounds,” SKASE Journal of Theoretical Linguistics 2 (2005): 14–33, and Anna Maria Di Sciullo, “Why Are Compounds Part of Human Language? A View from Asymmetry Theory,” in The Oxford Handbook of Compounding, ed. Rochelle Lieber and Pavol Štekauer (Oxford: Oxford University Press, 2011): 145–77. John Locke and Ljiljana Progovac, “The Urge to Merge: Ritual Insult and the Evolution of Syntax,” Biolinguistics 3, nos. 2–3 (2009): 337–54. ↩
Ljiljana Progovac and John Locke, “The Urge to Merge: Ritual Insult and the Evolution of Syntax,” Biolinguistics 3, nos. 2–3 (2009): 341. ↩
Anna Maria Di Sciullo, “Exocentric Compounds, Language and Proto-Language,” Language and Information Society 20 (2013): 1–26. See also Shigeru Miyagawa and Vitor Nóbrega, “The Precedence of Syntax in the Rapid Emergence of Human Language in Evolution as Defined by the Integration Hypothesis,” Frontiers in Psychology 6 (2015): 271, doi:10.3389/fpsyg.2015.00271. ↩
Anna Maria Di Sciullo, “Asymmetry and the Language Faculty,” Revista Linguí∫tica 13, no. 2 (2017): 88–107, doi:10.31513/linguistica.2017.v13n2a14030. ↩
See Henk Van Riemsdijk, A Case Study in Syntactic Markedness: The Binding Nature of Prepositional Phrases (Dordrecht: Foris Publications, 1978); Jerrold Katz and Paul Postal, An Integrated Theory of Linguistic Descriptions (Cambridge, MA: MIT Press, 1964); Richard Kayne, Movement and Silence (Oxford: Oxford University Press, 2005); Hans Broekhuis, “On Parameters and on Principles of Pronunciation,” in Organizing Grammar: Linguistic Studies in Honor of Henk van Riemsdijk, ed. Hans Broekhuis et al. (Berlin: Mouton de Gruyter, 2006): 289–99; Chris Collins, “Home Sweet Home,” NYU Working Papers in Linguistics 1 (2007): 1–34. ↩
Anna Maria Di Sciullo, “Variation in the Pronunciation/Silence of the Prepositions in Locative Determiners,” Proceedings of the Linguistic Society of America 2 (2017). ↩
See the sixty-eight papers assembled in Anna Maria Di Sciullo, ed., Biolinguistics Critical Concepts in Linguistics, 4 vols. (London: Routledge, 2017). ↩

Anna Maria Di Sciullo is Professor of Linguistics at the University of Quebec at Montreal.

Robert Berwick is a Professor in the Laboratory for Information and Decision Systems at MIT.

Noam Chomsky is Institute Professor and Professor of Linguistics (Emeritus) at MIT.

Letters to the Editors

More Letters for this Article