Syntax 265
11 Syntax
MARK C. BAKER
1 The Domain of Syntax
At its most basic level, syntax can be deﬁned as the branch of linguistics that
studies how the words of a language can be combined to make larger units,
such as phrases, clauses, and sentences.
As such, syntax is deeply concerned with the relationship between the ﬁnite
and the inﬁnite. Most languages have a ﬁnite number of basic words, but those
words can be put together to make an inﬁnite number of sentences. Thus, one
can buy a more or less complete dictionary of English, but it is unthinkable to
buy a similar reference book that would list all the sentences of English. This
is because there are rules and patterns that can be used in a dynamic way to
create and understand new English sentences. Syntax is the study of these
rules and patterns. For example, readers of this chapter will be aware of having
encountered the English words this, loves, elephant, and eggplant, and they
will ﬁnd them listed in their dictionaries. However, they will not be aware of
having encountered the English sentence in (1) before, and they will be unable
to “look it up”:
(1) This elephant loves eggplant.
Nevertheless, they will have no hesitancies about recognizing it as English, or
about being able to understand it and use it appropriately. Moreover, they will
feel very differently about (2), even though it is made up of the same familiar
elements:
(2) *Loves elephant eggplant this.
In this arrangement, the words are not well-formed, but rather are ungrammatical
(indicated by * symbol).
Syntax is primarily concerned with whether a sentence is “properly put
together” rather than whether it is meaningful, or silly, or bizarre. Thus, it is
266 Mark C. Baker
concerned with the difference between (1) and (2), more than in the difference
between (1) and the examples in (3) (Chomsky 1957: ch. 2).
(3) a. Those hippopotamuses hate asparagus.
b. #This eggplant loves elephants.
These sentences are all instances of the same basic pattern from a syntactic
point of view, even though they mean different things ((3b) in particular
being nonsensical, and unacceptable for that reason). What the new sentences
formed by the patterns of syntax actually mean is the primary concern of
semantics. However, aspects of how words are put together can inﬂuence how
the sentence is interpreted; hence these issues can come into the domain of
syntax as well. Indeed, there is a complex interaction between the disciplines
of syntax and semantics, and it is controversial whether the topics can truly be
distinguished.
The other branches of linguistics that are closely related to syntax are
morphology and discourse analysis: morphology because it builds the words
that are the starting point of syntax, and discourse analysis because it involves
the construction of sentences into even larger entities, such as texts. Again, it
is somewhat controversial to what extent syntax is a distinct topic from these
others. It is conceivable that sentences are built in accordance with the same
patterns and procedures as words or texts are. In that case, syntax would not
be a separate topic from morphology or discourse analysis. Be that as it may,
there is no doubt that some kind of syntax exists, and that it involves taking
the ﬁnite building blocks made available by a given language and putting
them together into an inﬁnite number of representations, which in turn can
express an inﬁnite number of thoughts.
2 The Chomskyan Perspective
The study of syntax has ﬂowered since 1950, having been given new impetus
by the work of Noam Chomsky. One small but essential part of Chomsky’s
contribution has been to bring certain insights and perspectives from the
study of formal languages in mathematics and computer science to bear on the
fundamental problems of syntax. For example, the simple mathematical concept
of a recursive function sheds crucial new light on how a ﬁnite number of words
could be pieced together to make an inﬁnite number of sentences (Chomsky
1975: chs 7, 8). In essence, a recursive function is one that is deﬁned in terms of
itself – a “circular” deﬁnition that succeeds in saying something substantive.
For example, suppose that English contains the following rules:
(4) a. A sentence (S) consists of a Noun Phrase (NP) followed by a Verb
Phrase (VP).
(S → NP + VP)
Syntax 267
b. A VP consists of a verb (V), possibly followed by an NP and / or a
clause (CP).
(VP → V, or V + NP, or V + CP, or V + NP + CP)
c. A CP consists of a S, possibly preceded by a complementizer (C)
(CP → S or C + S)
English also includes the following words, which are members of the indicated
lexical categories:
(5) Noun phrases: John, Mary, Bill
Verbs: believes, likes
Complementizer: that
This is a very small vocabulary, and a relatively short list of rules / patterns.
However, the set of rules is recursive in the sense that a sentence contains a VP,
and one of the things a VP can contain is a clause (CP), which always contains
a new S. This S in turn contains its own VP, which can contain yet another
S. And so on. Thus, with only these rules one can already make an inﬁnite
number of grammatical English sentences, depending on how many times one
puts a new S inside the verb phrase:
(6) Mary likes Bill.
John thinks that Mary likes Bill.
Bill thinks that John thinks that Mary likes Bill.
Mary thinks that Bill thinks that John thinks that Mary likes Bill.
John thinks that Mary thinks that Bill thinks that John thinks that Mary
likes Bill.
Note that each of these sentences expresses a distinct idea – one that is potentially
useful in the complex world of high school courtship. Clearly there is
much more to English than this, but it shows how the basic idea of recursion
can elegantly address the apparent paradox of how people who have a ﬁnite
amount of knowledge of English stored in their minds can produce and understand
an inﬁnite number of English sentences.
Another foundational contribution from Chomsky and the cognitive science
revolution of the 1950s and 1960s has more to do with the spirit of linguistic
inquiry than its technical machinery. This is the ethos that when one is trying
to discover the syntactic rules and patterns of a particular language, those
rules should be stated explicitly and precisely, with (ideally) nothing being
taken for granted or left to the imagination (Chomsky 1957: Preface). To see
why this has proved important, consider the difference between telling a friend
to do something and programming a computer to do something. Interacting
with your friends is generally easier, because you automatically expect the
friend to bring a great deal of common sense to bear on the request, ﬁlling in
the details and interpreting the command in the light of circumstances. In
268 Mark C. Baker
contrast, computers have no inherent common sense; they do all and only
what they are told in the most literal fashion. But while this can make working
with computers exasperating, it can also make it educational, because trying to
program a computer to do a task forces one to take stock of exactly what goes
into that task, with no question-begging or step-skipping. This experience
turns out to be highly relevant to the study of language. The syntax of English
often seems deceptively simple, particularly in a monolingual situation. Since
we all speak English (or another natural language), it is very easy to unconsciously
ﬁll in details without realizing it. If, however, one takes up the challenge
of ﬁnding the patterns and principles of syntax to the point where one
could (in principle) program a computer to judge sentences, manipulate them,
and interpret them the way we do, one is forced to seriously investigate many
issues that are otherwise easily overlooked. Sometimes this may seem obsessive
to the outside observer: syntacticians sometimes seem to be telling needlessly
complex stories when it is obvious that we say (1) and not (2) because
“(1) sounds better” or “(1) makes more sense.” But in fact linguists are at least
as interested in exactly what goes into this “linguistic common sense” as they
are about the salient and arbitrary rules of grammar that get most of people’s
attention (like “don’t end a sentence with a preposition”). In fact, many important
discoveries have been made by taking this perspective, some of which will
be mentioned below.
The broadly Chomskyan approach to syntax is often called formal or generative:
“formal” because it uses mathematical style formalisms and deﬁnitions
in presenting its analysis, such as the recursive rules above; “generative” because
it seeks to explicate how ﬂuent speakers can generate new well-formed
expressions of a language. Many current syntacticians still accept these labels
for their work, while others have reacted against them and what they see as
some deﬁciencies and excesses of this approach. However, it seems fair to say
that even those that react most loudly against the Chomskyan approach have
been positively inﬂuenced by its most basic ethos and by some of what has
been learned from that perspective. For purposes of this chapter I will attempt
to background these areas of disagreement, and foreground areas of relative
agreement, simply noting in passing some areas where interpretations differ
signiﬁcantly.
3 Basic Lessons of Syntactic Research
What then are some of the substantive discoveries that have been made by
adopting this perspective to syntax? Beyond the details of a vast number of
particular discoveries, one can identify at least three broad themes: (1) that
syntax is a vast topic; (2) that constraints are central to syntax; and (3) that
there is a large component of syntax that is common to all human languages.
I will discuss each of these themes in turn, using them as a context for presenting
some more speciﬁc material by way of illustration.
Syntax 269
3.1 The vastness of syntax
The ﬁrst important lesson that comes from several decades of research on syntax
is that there is much more to syntax than anyone ever imagined. Natural
languages turn out to be enormously complicated, once one takes up the “take
nothing for granted” attitude of the computer programmer. Prior to reﬂection,
it is natural to think of an activity like playing chess as the epitome of a complex
activity requiring great intelligence; in contrast, forming grammatical
English sentences seems like a very ordinary, routine behavior. After all, playing
chess requires conscious mental exertion and some people can do it much
better than others, whereas English sentences are formed automatically, almost
unconsciously by practically everyone. Indeed, the ﬁrst blush of success over
phrase structure rules could make one think that that is pretty much all there
is to it. However, the actual experience of computer programming has shown
vividly that the real complexity of the tasks is the reverse. Steady progress has
been made in programming computers to play chess, to the point that they now
beat even the best human players. In contrast, there is still no computer system
that can match the ability to judge, generate, and interpret English sentences
of an average ten-year-old child, in spite of the fact that enormous resources
have been devoted to the problem. Indeed, the longest grammars of the beststudied
languages are not close to complete descriptions, and new discoveries
about languages like English and French are still made on a regular basis.
3.1.1 Phrase Structure
To see where some of the complexities come from, let us return to the notion
of phrase structure. Rules like S → NP + VP and VP → V or V + NP express in
a succinct way several facts (Chomsky 1965: ch. 2). First, they express the fact
that all normal sentences must have at least a subject NP and a verb (except
for imperatives, and certain elliptical expressions). If the subject is omitted, the
sentence strikes native speakers of English as deviant.
(7) a. Mary likes the dress.
b. *Likes the dress.
Second, these rules express the fact that the subject appears before the verb,
and the object appears immediately after it. Thus, The dress likes Mary is a silly
sentence with a completely different meaning from (7a); this is because it
cannot be understood as having the object come before the verb and the subject
after, but only as the dress being the subject and Mary being the object. The
other logically possible rearrangements of the words are simply ungrammatical:
(8) a. *Mary the dress likes.
b. *Likes Mary the dress.
c. *Likes the dress Mary.
d. (*)The dress Mary likes. (This is OK in context as a topicalization)
270 Mark C. Baker
Third, the phrase structure rules as given also express the more subtle fact that
the object and the verb together form a tighter unit in English, with the subject
attached more loosely. Thus, in some special contexts one can put the verb and
the object together at the front of the sentence, leaving the subject behind (see
(9a)). In contrast, one can never put the subject and the verb at the front,
leaving the object behind ((9b)).
(9) I told you that John would like that dress, and indeed . . .
a. . . . Like that dress, John does.
b. *. . . John like, does that dress.
This unit that contains both the verb and the object but not the subject – the
phrase that appears before the comma in (9a) – is called the Verb Phrase.
Putting these pieces together, we can diagram the structure of a simple English
sentence as follows:
(10)
NP
likes
V
VP
Mary
NP
S
the dress
Similarly, the second sentence in (6) would have the phrase structure diagram
in (11); note the recursion, where one S is embedded inside another.
(11)
Billlikes
NPV
VP
Mary
NP
S
that
C
CP
thinks
V
VP
John
NP
S
Syntax 271
While these rules can be elaborated, reﬁned, and added to, this gives some
sense of how they are intended to work.
Already some complications arise. The phrase structure rules as given account
for the ungrammaticality of (7a) compared to (7b), where the subject has
been omitted. Now it is also ungrammatical to leave out the object of (7a):
(12) *John likes.
Indeed, (12) feels just as bad as (7b). However, unlike (7b), (12) can be generated
by the phrase structure rules we have given. The difference is intentional.
The reason is that (12) becomes grammatical when another verb is substituted
for likes; for example, one can say John smiles. However, sentences like (7b) are
impossible whatever verb is chosen (*Arrived the dress, *Tore the dress, etc.).
Thus, it is a property of the particular English verb like that it needs to appear
with an object, whereas it is a property of the English language as a whole that
sentences need to have subjects. This difference between the status of subjects
and objects is sometimes called the (extended) projection principle (Chomsky 1981).
In contrast, the fact that objects must be present if and only if the particular
verb of a clause speciﬁcally calls for one is sometimes called the theta criterion
(Chomsky 1981).
3.1.2 Transformational and interpretive rules
Rich as they are, there seems to be more still to syntax than phrase structure
rules. There are also certain systematic relationships between sentences that
need to be captured, for which Chomsky originally proposed the formal device
of the transformational rule (Chomsky 1957: ch. 7). These rules change one
phrase structure into another. For example, consider the following English
sentences:
(13) a. Mary will like the dress.
b. *Mary will like what?
c. What will Mary like?
d. *What will Mary like the dress?
e. *The dress will Mary like.
Like other languages, English has a special set of words that can take the place
of a missing piece of information in order to form a question; these are often
called wh-words after the letters that most of them begin with (who, what,
where, which . . . ). However, (13b) shows that it is not enough in English to
simply put one of these words in place of the desired piece of information.
Rather, the wh-word must appear at the beginning of the sentence, as in (13c).
Moreover, (13c) is a striking exception to the generalization made above that
the verb like must be immediately followed by an object. On the contrary, in
this environment like must not have an object after it, as shown by (13d). One
272 Mark C. Baker
natural way to describe this network of facts is to say that the initial structure
of (13c) is like that of (13a), with what substituted for the dress. However, the
structure is “transformed” by a rule that can be stated something like this:
(14) Move wh-words to before the ﬁrst word of an S that contains them.
This rule applies to interrogative NPs, but not to other kinds of NPs; hence (13e)
is not possible, even though it is perfectly parallel to (13c).
Transformational relationships like these turn out to be relatively common.
Indeed, (13) illustrates another one. Notice that the placement of the future
auxiliary element will is different in declarative sentences like (13a) and interrogative
ones like (13b). In declarative sentences, it always comes after the
subject and before the main verb, whereas in interrogative sentences it comes
between the wh-phrase and the subject. This change in word order is required
(see (15a)) and one cannot have a second auxiliary of the same type in the
usual position between the subject and the verb (15b).
(15) a. *What Mary will like?
b. *What will Mary will / may like?
Again, a movement transformation is a simple way to account for this fact:
(16) Shift the auxiliary verb to the front of the subject NP in sentences that
are interpreted as questions.
There may also be processes that apply to phrase structures that do not fall
under the general category of movement. Suppose that we expand our miniature
English dictionary to include NPs that are pronouns, such as he, she, and
it. Then the phrase structure rules generate examples like (17):
(17) John thinks that he likes Mary.
This sentence is ambiguous. Depending on context and expectations, it can
easily be interpreted as meaning that John thinks that John himself likes Mary,
or that John thinks that some other male we are discussing (say Bill) likes Mary.
This is an essential feature of how competent English speakers understand and
use sentences with pronouns. Now suppose that each distinct person referred
to is associated with a unique number, written as a subscript to the Noun
Phrase(s) that refer to it. Then the ambiguity in (17) can be captured by adding
a rule like the following (Chomsky 1973; this rule is given for illustrative
purposes; it is not now widely accepted, although the effect is genuine (Lasnik
1989: ch. 4)):
(18) Copy the referential index of an NP onto a pronoun as its referential
index.
Syntax 273
This is called an interpretive rule, rather than a transformational rule. It is
optional: one meaning of (17) comes from applying the rule, and the other
meaning comes from choosing not to apply it.
(19) a. John3 thinks that he3 likes Mary4, or (by (18))
b. John3 thinks that he2 likes Mary4.
Just as the transformational rules apply only to a speciﬁc set of items (whwords,
auxiliary verbs), so (18) applies only to pronouns. Thus, if the man
named John also happens to be a sentimental fool, one can in principle use
either the NP John or the sentimental fool to refer to him. However, if one
replaces the pronoun he in (17) with the NP the sentimental fool, the sentence
can only have a meaning like (19b), where three distinct people are being
discussed. This is because the sentimental fool is not a pronoun, and therefore
rule (18) does not apply.
The existence of transformational rules like (14) and (16) and interpretive
rules like (18) is more controversial than the existence of phrase structure.
These elements of a linguistic analysis can be eliminated; one simply has to
make up new phrase structure rules that will generate the desired sentences
directly (Gazdar et al. 1985). Some syntacticians prefer this alternative approach,
often for reasons of computational simplicity or formal elegance. However,
there is no doubt that there needs to be some kind of general mechanism for
expressing the relationships among the different parts of a syntactic structure
– such as the relationship between the sentence initial question word in (13c)
and the fact that the sentence has no NP in the object position of the verb
phrase.
3.2 The centrality of constraints
This is far from exhausting the domain of syntax, however. Indeed, it merely
sets the stage for what is perhaps the most interesting and signiﬁcant discovery
of all: the existence of constraints. When the various rules of syntax are
stated in their simplest and most general form, they typically “over-generate,”
producing a number of ungrammatical sentences along with the grammatical
ones. Therefore, syntacticians have proposed a system of constraints that prevent
these rules from running wild.
As an example, let us return to the rule that moves wh-words to the front of
sentences. This is a very general phenomenon, but it turns out to be far from
exceptionless. Consider, for example, the following more complex examples:
(20) a. John will think that Mary likes the dress
b. John will think Mary likes the dress.
c. What will John think Mary likes – ?
d. What will John think that Mary likes – ?
274 Mark C. Baker
e. Who will John think – likes the dress?
f. *Who will John think that – likes the dress?
(20a) and (20b) are both simple sentences, formed by the phrase structure rules
in (4). They differ only in whether the complementizer that is present or not: in
(20a) the CP inside the VP containing think is expanded as C + S (where that is
C); in (20b) the CP is expanded simply as S. Examples (20c) and (20d) are the
result of generating what as the object of the embedded sentence and moving
it to the front; both are grammatical, as expected. Examples (20e) and (20f)
are the result of substituting who for the subject NP of the embedded sentence
and moving it to the front. This time, we ﬁnd something unanticipated: (20f)
is ungrammatical for most speakers, even though our rules can construct this
sentence just as easily as the others. There seems to be a special, additional
constraint at work here, which can be phrased as follows (Chomsky and Lasnik
1977):
(21) Complementizer constraint
A sentence is ungrammatical if a complementizer comes immediately
before a verb (in languages where subject phrases are generally
obligatory).
Since only (20f) has the forbidden sequence of words . . . that – likes . . . it
alone is ruled out by (21), as desired.
There are other constraints on wh-movement as well. We have seen that
either a subject or an object can be replaced by a wh-word, which then moves
to the front of the clause. However, a difference between subjects and objects
arises when one tries to question only a part of the noun phrase. When a
question word replaces part of an object noun phrase and moves to the front,
the result is often perfectly acceptable, as in (22).
(22) a. You saw [a picture of John].
b. Who did you see [a picture of – ]?
However, when a question word replaces a similar part of a subject noun
phrase, the result is unacceptable:
(23) a. [A picture of John] disturbed you.
b. *Who did [a picture of – ] disturb you?
There is no obvious reason why (23b) should not be as useful or meaningful a
sentence as (22b); it just sounds awkward. To account for this, linguists have
stated the following condition (Huang 1982):
(24) The condition on extraction domains (CED):
A phrase X can move out of a phrase Y only if Y is immediately contained
in a Verb Phrase.
Syntax 275
Since the object is inside the VP, as shown by the phrase structure diagram in
(25), the movement in (22b) is possible:
(25)
However, the subject NP is not inside the Verb Phrase; therefore question
words cannot move out of the subject:
(26)
a picture of – NP
bother
V
VPNP
S
who
NP
S′
you
*
Thus, the condition on extraction domains correctly differentiates between
sentences like (22) and (23).
There are also interesting constraints on the pronoun interpretation rule in
(18). Suppose that the object of a sentence is a pronoun. That pronoun can
easily be understood as referring to something contained in the subject noun
phrase, as expected:
(27) John’s2 mother3 loves him2.
The situation is not symmetrical however. If we put a pronoun in the subject
position, it cannot normally be understood as referring to something contained
in the object. Thus, he cannot be John (or the mother) in sentences like (28), but
must be some third person.
a picture of –
OK
NP
see
V
VP
you
NP
S
who
NP
S′
276 Mark C. Baker
(28) *He2 loves John’s2 mother3
One might think that this is simply a matter of linear order – that a pronoun
can never refer to a noun phrase that comes after it. However, this is not true.
For example, in a sentence like (29), the pronoun his comes before John; nevertheless,
his can be understood as referring to John, at least for most English
speakers. (This interpretation is easier to get if one says the sentence with the
main stress on the verb loves, rather than on John.)
(29) (?)His2 mother3 loves John2.
Thus, we need to add a condition to (18), which by itself implies that pronouns
can always refer to any other NP in the same sentence (Lasnik 1989, Reinhart
1983). First, it helps to deﬁne the term c-command:
(30) C-command:
An element X c-commands another element Y if the ﬁrst phrase which
properly contains X also properly contains Y.
Given this, we can state the condition on pronouns as follows:
(31) Disjoint reference condition (DRC):
A pronoun X may not refer to the same thing as (have the same index
as) a nonpronominal NP Y if X c-commands Y.
This explains the pattern of facts as follows. Consider (32), which is the structure
of (27).
(32)
NP
loves
V
VP
S
him
N
John’s
NP
NP
mother
First we must check if the pronoun him c-commands the NP John. The ﬁrst
category that properly contains him is the VP, and John is outside this VP. Thus,
the pronoun does not c-command John in this structure. Therefore, the DRC
does not apply, and (18) can; thus, the pronoun may refer to the same thing as
“John.” However, (28) comes out differently:
Syntax 277
(33)
Here the ﬁrst category that contains the pronoun he is S, the whole sentence,
and S obviously contains the NP John as well. Therefore, the pronoun does
c-command the noun phrase in this structure, so the pronoun cannot refer to
the same thing as that noun phrase, by the DRC. Finally, (34) is the structure
of (29):
(34)
Here the pronoun his is not the whole subject, but only a part of the subject.
Thus, the ﬁrst phrase which properly contains his is the subject NP. This
subject NP does not include the object John. Therefore, the pronoun does not
c-command the NP in this example. Therefore, coreference is possible again.
Thus, we see that a condition deﬁned over the basic phrase structure of the
sentence accounts for the usage of pronouns in English; indeed, it does so better
than a simplistic rule that says that a pronoun must come after the Noun
Phrase that it refers to.
Much of the beauty of these constraints comes from the fact that in many cases
they regulate the operation of more than one transformational relationship
(Ross 1967, Chomsky 1973, Chomsky 1977). For example, in addition to the
transformations outlined above, English has a rule of relative clause formation
that makes noun phrases out of sentence-like structures. A simple example is:
(35) Mary likes the dress → The dress that Mary likes –
This is not the same process as question formation, but it is somewhat similar
in that a noun phrase (here the dress) comes to appear at the beginning of the
NP
loves
V
VP
S
John
N
his
NP
NP
mother
John’s mother
NNP
NP
loves
V
VP
he
NP
S
278 Mark C. Baker
clause, and there is a gap inside the clause where a noun phrase normally has
to be. Thus, this too can be thought of as a movement process. Now given the
way the constraints are set up, we automatically expect relative clause formation
to have the same limitations as question formation. This is correct. Like
question movement, relative clause formation cannot create a conﬁguration
where the complementizer that comes immediately before the verb. The examples
in (36) are exactly parallel to those in (20).
(36) a. The dress that John thinks Mary likes –
b. The dress that John thinks that Mary likes –
c. The woman that John thinks – likes the dress
d. *The woman that John thinks that – likes the dress
Similarly, relative clause formation obeys the CED, such that part of the object
can be moved, but not part of the subject. (37) is parallel to (22b) and (23b).
(37) a. The man that you saw a picture of
b. *The man that a picture of disturbed you
This is an important ﬁnding, because it shows that the constraints should not
be built into the individual transformations themselves; rather they are more
general, and have a semi-separate existence. A priori, one could write the
Complementizer Filter and the CED into the basic formulation of the question
movement transformation. The result would be a rule that is rather complex
and inelegant. Furthermore, the same complexities would have to be written
into the relative clause transformation – and into a number of other transformations
as well. Clearly, it is better to factor these out as separate conditions. This
has led to the view that the constraints are really more fundamental than the
transformations themselves, a shift from seeing language as fundamentally a
list of construction rules to seeing it as a system of constraints that must be
satisﬁed (Chomsky 1981).
Ongoing research has proceeded in the direction of uncovering constraints
that are more and more general. A good illustration is the so-called “shortest
move” constraint. Auxiliary inversion seems to be quite a different kind of
process from both question movement and relative clause formation: it moves
a very different kind of linguistic element (a verb, rather than a noun phrase)
to a somewhat different position under partially different circumstances. As a
result, constraints like the complementizer constraint and the CED do not seem
relevant to this particular transformation. Nevertheless, there are similarities.
Suppose that one tries to generate a sentence with two distinct wh-words, each
substituting for a different piece of desired information, as in (38a). The whmovement
rule must apply exactly once to such a structure (in English), moving
one of the wh-words to the beginning of the clause and leaving the other in
place, as shown by the grammatical example in (38b). Strikingly, one does not
have a choice as to which of the wh-words to move, as shown by (38c).
Syntax 279
(38) a. John gave what to who.
b. What did John give to who?
c. *Who did John give what to?
Something very similar is found with auxiliary inversion. (39a) shows a structure
that contains two distinct auxiliary verbs. Auxiliary inversion can apply once
and only once to this structure to create the grammatical Yes / No question in
(39b). Again, one does not have a choice of which auxiliary to move, as shown
by the ungrammaticality of (39c).
(39) a. John will have eaten the sandwich by noon.
b. Will John have eaten the sandwich by noon?
c. *Have John will eaten the sandwich by noon?
There is a simple generalization here: in both cases it is the element that starts
off closest to the front of the clause that must move. This has led linguists to
propose the “shortest move” condition, which favors shorter movements to
longer movements when there is a choice (Chomsky 1995). All the various
kinds of movements arguably obey this very general condition. This condition
can be looked at as a kind of economy condition, in the sense that it favors the
smallest possible adjustment to the structure. One strain of current research
explores the idea that all conditions can be reduced to a very small number of
these “super-conditions,” all of which have an economy ﬂavor (Chomsky 1995).
3.3 The similarity of all human language
A third major lesson that emerges from contemporary syntactic research
is that all natural languages are strikingly similar. This is certainly not one’s
ﬁrst impression. There are some six thousand languages currently spoken in
the world, many of which had relatively little contact with each other until
recently. When speakers of an Indo-European language ﬁrst start to learn a
Native American language, or an Australian Aboriginal language, or an East
Asian language they are usually more struck by difference than by similarity.
However, the differences seem to be largely on the surface. We have already
seen how once one starts to take nothing for granted one begins to discover
many unsuspected intricacies of English grammar. It is not surprising that
when one begins to study other languages from the same perspective, one
ﬁnds unsuspected intricacies in those languages too. What is surprising is that
the intricacies turn out to be largely the same, even across languages that are
geographically separated and historically unrelated. Moreover, this seems to
be more true the further one goes into the linguistic analysis. Thus, the phrase
structures of languages are not necessarily the same; for example, verbs and
prepositions come before their object noun phrases in English, but they come
after their objects in Japanese, Hindi, and many other languages. Similarly, the
280 Mark C. Baker
transformational processes of a language may or may not be the same. For
example, English has a rule that moves question words to the front of the
clause, but languages like Chinese, Japanese, and Hindi do not. But at the level
of syntactic constraints it is striking that constraints originally discovered for
languages like English and French often show up in other languages – or at
least it is possible to rephrase the constraint slightly so that it applies to all of
the languages. Thus, the syntax of natural languages is not only vaster than
we thought, but it is also more similar than we thought.
To get a ﬂavor of how this kind of reasoning goes, consider the Edo language,
spoken in Nigeria. This language had no contact with European languages
until recent times, and has had no impact on the development of syntactic
theory before now. Thus, it provides an interesting test case for the generality
of that theory. In fact, it can be shown that Edo is strikingly like English
in most of the syntactic features presented above. (The data in this section is
from O. T. Stewart, personal communication.)
First of all, the same basic phrase structure rules that work in English work
also in Edo. For example, the subject comes before the verb, and the object
comes immediately after it:
(40) Òzó guòghó àkhé.
Ozo broke pot.
“Ozo broke the pot.”
As in English, the subject noun phrase is obligatory in all clauses, but whether
an object is needed or not depends on which verb is chosen:
(41) a. Òzó só.
Ozo shouted
“Ozo shouted.”
b. *Guòghó àkhé.
broke pot
(“Pot” must be the subject, not the object: Àkhé guòghó “the pot
broke.”)
While Edo does not have any process that fronts the VP that is directly comparable
to the English one in (9a), it is possible to show that the object and verb
form a relatively tight unit to the exclusion of the subject, as in English. Thus,
the basic phrase structure patterns S → NP + VP and VP → V or V + NP (or
V + CP) are equally valid for both languages.
There are also similarities between the two languages at the level of transformations.
Edo, like English, has a rule that obligatorily moves question words
to the front of the clause:
(42) a. *Òzó ghá guòghó dèmwìn?
Ozo will break what
Syntax 281
b. Dèmwìn nè òzó ghá ghó!
ghó?
what that Ozo will break
“What will Ozo break?”
Edo also has the optional process of assigning a pronoun the same reference
as a noun that appears elsewhere in the sentence. Thus, (43) is ambiguous in
Edo, just as (17) is in English.
(43) Òzó hòó nè ò kpàá.
Ozo wants that he leave. (he = Ozo, or he = the other guy under
discussion)
However, Edo does not have the Auxiliary Inversion transformation. Thus,
in (42b) the future tense auxiliary ghá does not shift to before the subject in
Edo questions the way it does in English (*dèmwìn ghá òzó guòghó). Instead a
special functional word nè is inserted in this position. This kind of patterning
gives syntacticians the impression that, while languages are certainly not
identical, they seem to choose their syntactic structures from a limited set of
options that are universally available.
Finally, consider the level of constraints. Recall that English puts limitations
on when a wh-word can move to the front of the clause, such as the CED. The
same constraint holds in Edo. (44a) shows a sentence in which a clause is in
the object position with respect to the main verb ta “say.” (44b) shows that one
can generate a question word as the object of this embedded clause and then
move it to the front of the sentence as a whole in the usual way.
(44) a. Úyì tá wé!é òzó dé ímótò.
Uyi said that Ozo bought car.
“Uyi said that Ozo bought a car.”
b. Dèmwìn nè Úyì tá wé!é òzó dé – ?
what that Uyi said that Ozo bought.
“What did Uyi say that Ozo bought?”
In contrast, in (45a) there is a clause that functions as the subject of the main
verb yee “please.” All things being equal, it should be possible to replace the
object of this clause with a question word and then move that question word
to the front, forming the question in (45b). But this is impossible.
(45) a. Wèé òzó dé ímótò yèé Úyì.
that Ozo bought car please Uyi.
“That Ozo bought a car pleased Uyi.”
b. *Dèmwìn nè wèé Òzó dé – yèé Úyì?
what that that Ozo bought please Uyi.
“What did that Ozo bought please Uyi?”
(i.e. “What did it please Uyi that Ozo bought?”)
282 Mark C. Baker
Note that exactly the same judgments hold true of the English translations.
This conﬁrms that the same constraint on question movement holds in both
languages.
The DRC also applies in Edo, constraining how pronouns are interpreted.
Consider the following range of examples:
(46) a. Wé!
é né!
né ékítà lèlé Ìsòkèn yèé órè.
That the dog followed Isoken pleased her (OK her = Isoken)
b. Wé!
é né!
né ékítà lèlé érè yèé Ìsòkèn.
That the dog followed her pleased Isoken. (OK her = Isoken)
(47) a. Ìsòkèn hòó nè né!
né ékítà lèlé érè.
Isoken wants that the dog follow her. (OK her = Isoken)
b. Ò hòó né né!
né ékítà lèlé Ìsòkèn.
She wants that the dog follow Isoken. (Only OK if she ≠ Isoken)
The examples in (46) have an embedded clause that functions as the subject of
the main verb yèé “please”, and the speaker desires to express that the object
of the embedded verb (the person who is followed) is the same person as the
object of the main verb (the person who is pleased). This can be done by either
putting a pronoun in the embedded clause and a name in the main clause, or
vice versa. The examples in (47) are similar, except that this time the embedded
clause is the object of the main verb hòó “want.” Now if the speaker desires to
express that the subject of wanting is the same person as the intended object
of following, the options are restricted: the name must be used as the subject
of the sentence as a whole, and a pronoun must be used as the object of the
embedded clause, as in (47a). It is grammatical to use a pronoun as the subject
and a name as the embedded object, as in (47b), but then the pronoun can only
be interpreted as some third person, not Ìsòkèn. This is exactly what the DRC
predicts. In (47b) the smallest phrase containing the pronoun is the sentence
as a whole, which also contains Ìsòkèn. Therefore, the pronoun c-commands
the name, and coreference is prohibited. However, the pronoun does not
c-command the name in any of the other sentences in (46) or (47) (it is contained
in the embedded VP in (46b) and (47a), and in the main VP in (46a);
these phrases do not contain Ìsòkèn). Therefore, coreference is allowed. Again,
the same judgments are valid for the English translations, conﬁrming that the
same constraint holds in both languages.
Sometimes even when languages look quite different, they are actually responding
to a similar constraint in a different way. An interesting example
comes from the complementizer constraint, stated in (21). The conﬁguration
of complementizer wèé followed by verb is also avoided in Edo:
(48) a. Dèmwìn nè Úyì tá wèé òzó dé –
what that Uyi said that Ozo bought.
“What did Uyi say that Ozo bought?”
Syntax 283
b. *Dòmwàn nè Úyì tá wèé – dé ímótò.
who that Uyi said that buy car.
“Who did Uyi say that bought a car?”
However, English avoids the problematic conﬁguration by omitting the complementizer;
in contrast Edo avoids it by ﬁlling in the space left behind by the
question word with a pronoun, as shown in (49) (see (Koopman 1982) for
discussion of this pattern in Vata):
(49) a. *Dòmwàn nè Úyì tá dé ímótò.
who that Uyi said buy car.
“Who did Uyi say bought a car?”
b. Dòmwàn nè Úyì tá wèé ó dé ímótò.
who that Uyi said that he buy car.
“Who did Uyi say bought a car?”
Moreover, this difference in the two languages is not an arbitrary one.
Edo is different from English in that it normally requires embedded clauses
to have a complementizer; in symbols, CP → C + S is the only option in Edo.
Therefore, (49a) is ungrammatical in this language. On the other hand, there
are other situations in which pronouns show up where question words once
were in Edo. This then is a simple example of how even in areas where languages
look somewhat different, important similarities can be discerned, particularly
in the domain of constraints.
Over all, when one compares the syntax of different languages, they seem to
be making different choices from a similar range of options: they have the same
kinds of phrase structures, with possible differences in the order of the words;
they have the same kinds of transformations; they have the same general constraints,
although with different speciﬁc ways of satisfying them. This leads to
the idea that much of human language is somewhat like a kit that you buy at a
store: the basic pieces are prefabricated, although there is some variation in how
you assemble them (and much freedom in how you decorate the ﬁnal product).
Up to this point, there is reasonably broad agreement across a wide range
of syntacticians, although signiﬁcant differences in emphasis. Disagreement
becomes more vigorous on the question of how to interpret this interplay of
universal patterns and language-particular facts. Roughly three approaches
are current. The ﬁrst is the principles and parameters approach of Chomsky himself
and his nearer followers. This takes the view that the principles of language
are essentially universal, and they take the form of a certain number of
stipulated abstract constraints. The set of these principles is often referred to as
Universal Grammar. Language variation comes from ﬁxing certain parameters
– especially those that involve the idiosyncratic properties of the words of
a language (Chomsky 1981). This has been called a “constitutional view.”
The second approach is more “architectural” (Bach 1988). It emphasizes the
modes of composition of the words of languages (phrase structure rules or the
284 Mark C. Baker
equivalent) and tries to state those principles of combination in just such a
way that constraints like the CED emerge as theorems of the system. This is
roughly the view of more mathematically oriented theories like categorial grammar
and generalized phrase structure grammar. Third, there are a variety of
functionalist approaches. These see the universal constraints on language as
functional in nature: they are design features that facilitate language’s basic
purpose of communication, a goal which is taken to be essentially the same
across the species (see, for example, Croft 1995). On these views, languagespeciﬁc
idiosyncrasies are generally taken to have historical explanations, in
which some communicative strategy that was originally a dynamic option gets
entrenched as a habit, and thus becomes a “rule” of a particular language.
The CED facts surveyed above can provide an illustrative comparison of
these perspectives. I stated the CED as an extrinsic constraint, in the style of
the formal, “constitutional” approach: all movements are possible, unless they
violate this “law.” A functionalist approach, on the other hand, would investigate
the communicative purpose of asking a question and the communicative
purpose of expressing something as a direct object (rather than a subject
or a modiﬁer), looking for (in)compatibilities at that level. At ﬁrst, it seems
like it should be easy to determine which approach is correct by simple crosslinguistic
comparison. The formal approach predicts that CED-type restrictions
should appear when and only when there is a movement relation; the
exact communicative purpose of the movement is not directly relevant. For
example, (37) showed that the CED restricts the formation of relative clauses
as well as questions, even though the communicative effects of the two are
quite different. Conversely, I mentioned in passing that some languages form
questions simply by replacing the unknown material with a suitable interrogative
word, with no movement. On the formal view, it should be possible
to ask a wider range of questions in these languages than in English, because
the CED does not restrict this kind of question formation. For a number of
languages, this seems correct. Thus, (50) is a grammatical question in Chinese,
where the wh-phrase shei replaces an NP inside a clause that functions as the
subject of the sentence as a whole (Huang 1982):
(50) [Lisi da-le shei] shi ni hen bugaoxing? (Huang 1982: 496)
Lisi hit-ASP who make you very unhappy.
“Who did it make you very unhappy that Lisi hit?”
Strikingly, relative clause formation does involve movement of an NP in
Chinese – although to the end of the clause, rather than to the beginning of
the clause. Thus, the formal approach expects CED-style restrictions, and that
is what one ﬁnds:
(51) ??Lisi da-le – shi wo hen bugaoxing de neige ren
Lisi hit-ASP – make I very unhappy DE that man
“The man that it made me very unhappy that Lisi hit”
Syntax 285
However, there is also evidence that points the other way. For example, Foley
and Van Valin (1984: 22–4) mention that question words are not moved in the
Native American language Lakhota; yet the kinds of sentences that violate the
CED in English are also ungrammatical in Lakhota:
(52) #WiChaSa wã taku ophethZ ki he wãlaka he.
man a what buy the that you-see Q
“What did you see the man who bought?”
They point out that the badness of (52) makes more sense from the functionalist
perspective, assuming that there is some kind of basic incompatibility
between the goal of asking a question and the goal of expressing an object
modiﬁer. This functional style of explanation can rule out sentences like (52)
in the same way in Lakhota and English, irrespective of differences in how
each particular language chooses to express questions. Much work remains to
be done to sort out whether the formalist expectations or the functionalist
expectations ﬁt the facts better, or whether some kind of synthesis is called for.
This is an area in which further research and debate can be expected.
4 Universal Grammar and Parameterization:
A Concrete Example
So far, I have presented syntacticians’ claims about Universal Grammar only
by comparing English and Edo. While this is a fair test in as much as the languages
developed quite independently of each other, it is clear that they are
relatively similar typologically. In the rest of this chapter, I will illustrate the
claims of syntax by way of a more radical comparison, between English and
the Mohawk language. The analysis in this section, based on Baker (1991,
1996), is less a part of the standard canon than the material presented so far,
and some features of it are controversial. However, its leading idea has wide
acceptance, and it provides a good illustration of some characteristic features
of contemporary syntactic research.
If one offered a prize for the language that looks the least like English,
Mohawk would be a strong candidate. At ﬁrst glance, one ﬁnds differences
wherever one looks, starting with the most simple sentences. We saw that in
English and Edo a transitive verb must appear with two NPs, a subject and an
object. In Mohawk this is not so: either the subject or the object – or both – can
be left out, and the result is still a complete and grammatical sentence:
(53) a. Sak ranuhwe’s atya’tawi.
Sak likes dress
“Sak likes the dress.”
b. Ranuhwe’s atya’tawi.
likes dress
“He likes the dress.”
286 Mark C. Baker
c. Sak ranuhwe’s.
Sak likes
“Sak likes it.”
d. Ranuhwe’s.
likes
“He likes it.”
In English and Edo, the subject must come before the verb and the object
after; in Mohawk this is not so. Rather, the subject, the object and the verb can
appear in any of the six orders that are logically possible for a three word
sentence (see also (53a)):
(54) a. Ranuhwe’s Sak atya’tawi.
likes Sak dress
“Sak likes the dress.”
b. Sak atya’tawi ranuhwe’s.
Sak dress likes
c. Ranuhwe’s atya’tawi ne Sak.
like dress NE Sak
d. Atya’tawi ranuhwe’s ne Sak.
dress like NE Sak
e. Atya’tawi Sak ranuhwe’s.
dress Sak like
In English, the object and the verb sometimes move together as a unit; in
Mohawk this is never possible, as far as researchers have been able to determine.
Thus the phrase structure rules of English do not seem applicable to Mohawk
at all. Such languages are sometimes called nonconﬁgurational (Hale 1983).
Moving on to the domains of transformations and constraints, recall that in
English, a question word can move out of the direct object noun phrase to the
beginning of the clause, but question words cannot move out of subject noun
phrases (see (22) and (45)). One similarity between Mohawk and English is
that question words must move to the front of the clause in Mohawk too:
(55) Uhka washakokv’ ne Sak?
who see NE Sak
“Who did Sak see?”
However, in Mohawk when one tries to move a question word out of some
larger phrase that serves as the object, the result is just as ungrammatical as if
the question word moves out of a subject phrase. Thus, there is no difference
here between subjects and objects the way there is in English and Edo:
(56) a. *?Uhka wesatsituni’ ne akokara’?
who made-cry-you NE story
“Who did the story of make you cry?”
Syntax 287
b. *?Uhka senuhwe’s ne akokara’?
who you-like NE story
“Who do you like the story of?”
In English, the DRC says that a pronoun subject cannot refer to an NP contained
in the direct object, but a pronoun object can refer to an NP contained in
the subject ((27) and (28)). In contrast, both kinds of coreference are possible in
Mohawk.
(57) a. (Akauha) wa’akoya’takehnha’ ne Uwari akona’tsu.
her help NE Mary pot
“Mary’s pot helped her.” (OK her = Mary)
b. (Rauha) wahanohare’ ne Sak rao’share.
he wash NE Sak knife
“He washed Sak’s knife.” (OK he = Sak)
Thus, our ﬁrst impression is that the syntax of Mohawk is radically different
from the syntax of English, and (except for question movement) virtually none
of the rules and principles seem to carry over. Rather, it seems like, in order to
analyze Mohawk insightfully, one must start from scratch, building a theory
that accounts for Mohawk in its own terms.
There is another possibility, however. It could be that Mohawk is actually
rather like English, but we are looking at it wrong. In fact, when one looks a
little further, certain similarities begin to turn up. Consider once again the
DRC. Before, I only used this to explicate differences between subjects and
objects, but the condition is more general. For example, most English speakers
ﬁnd a difference between (58a) and (58b):
(58) a. I hired him2 [because John2 was a good worker].
b. *I told him2 [that John2 was a good worker].
Superﬁcially, these sentences look very similar, the main difference being the
ﬁrst verb. However, the pronoun him can be understood as referring to John
in the ﬁrst sentence, but not in the second. Why should this be? Part of the
answer is that the embedded clauses in these two sentences are performing
very different functions. In (58b), the clause is basically a special kind of object.
As such, it can be replaced by a normal NP object:
(59) I told him [the news].
However, a similar replacement is not possible in (58a):
(60) *I hired him [the reason].
The embedded clause is not an object of the verb at all in (58a); rather, it is a
modiﬁer that helps put the main clause into context. As a result, the embedded
clause is optional in (58a); if it is omitted, the sentence is still grammatical and
288 Mark C. Baker
complete. In contrast, (58b) sounds incomplete without the embedded clause.
These differences translate into a difference in the phrase structure of the
sentences. Since the clause of tell functions as a kind of direct object, it makes
sense that this would be generated inside the VP, in the same kind of position
as other direct objects:
(61)
On the other hand, since the because clause is a modiﬁer of the sentence as a
whole, and only loosely attached to it, its phrase structure is as in (62):
(62)
John . . .that
SC
CP
him
NP
VP
V
told
S
NP
I
NP
hired
V
VP
I
NP
S
him
S
John . . .
SC
CP
because
Now we can apply the DRC to these structures and we get the results we
want. In (61) the object pronoun c-commands the NP John inside the embedded
clause; therefore they cannot refer to the same thing. In (62), however, the
object pronoun does not c-command the NP “John” inside the because-clause:
the ﬁrst phrase that properly contains the pronoun is VP, and this VP does not
contain the modiﬁer. Therefore, the pronoun can refer to the same thing as the
NP, just as we observed.
Signiﬁcantly, comparable sentences in Mohawk elicit the same pattern of
judgments, as shown in (63).
(63) a. (Rauha) wahi’nha’ne’ ne tsi Sak rayo’tvhseriyo.
him I-hired because Sak is-a-good-worker
“I hired him because Sak is a good worker.” (OK him = Sak)
Syntax 289
b. (Rauha) wahihrori’ tsi Sak rayo’tvhseriyo.
him I-told that Sak is-a-good-worker
“I told him that Sak is a good worker.” (Not OK: him = Sak)
Again, the pronoun object of the ﬁrst verb can refer to an NP in the embedded
clause when the verb means “tell,” but not when it means “hire.” Perhaps then
the DRC applies in Mohawk after all.
Consider next the CED. Given the structures in (61) and (62), the CED predicts
that wh-words should be able to move out of that-clauses but not becauseclauses,
because only that-clauses are immediately contained in VP. This is true
in English:
(64) You think [that Mary kicked John.]
Who do you think [that Mary kicked – ]?
(65) You cried [because Mary kicked John.]
*Who did you cry [because Mary kicked – ]?
(In fact, this kind of contrast was part of the original motivation for the CED in
Huang (1982).) Strikingly, the same contrast is found in Mohawk. Sentences
like (66a) are perfectly grammatical, while sentences like (66b) are not:
(66) Uhka ihsehre’ Uwari wahuwarasvtho’?
Who you-think Mary kicked
“Who do you think Mary kicked?”
(67) *Uhka wa’tesahvrehte’ ne tsi Uwari wahuwarasvtho’?
who you-shouted because Mary kicked
“Who did you shout because Mary kicked?”
Thus, there is evidence that the CED applies in Mohawk too.
So far, we have something of a paradox. When we look at noun phrase
objects and subjects, Mohawk and English seem completely different. However,
when we look at embedded clauses, they seem very much the same.
How can this conﬂicting evidence be reconciled?
In fact, there is a rather simple way of resolving the paradox. (Formal) syntactic
analysis characteristically comes in two steps: ﬁrst, one determines the
basic structures of sentences in the language, and, second, one deﬁnes conditions
over those structures. Given this, the fact that one gets different grammaticality
patterns in Mohawk from English is not necessarily due to a difference
in the conditions, but could be due to a difference in the basic structure that
the conditions apply to. Let us then hypothesize that the conditions in Mohawk
are indeed the same as those in English, based on the evidence from embedded
clauses. Now, reconsider sentence (57b), repeated as (68).
290 Mark C. Baker
(68) (Rauha) wahanohare’ ne Sak rao’share.
he wash NE Sak knife
“He washed Sak’s knife.” (OK he = Sak)
Here the pronoun subject can be coreferent with the NP Sak inside the direct
object, unlike in English. If the DRC really holds in Mohawk, this can only mean
that the object NP is in some other position where it is not c-commanded by
the pronoun subject. In other words, it must be outside of the smallest phrase
that properly contains the subject. Is there such a position? The answer is yes:
we saw above that because-clauses can be attached loosely to the sentence as a
whole. This is only legitimate because the sentence is complete without the
because-clause. However, the same is true of object NPs in Mohawk; sentences
are complete without them (see (53c)). Thus, the structure of (68) is (69).
(69)
More generally, we can conjecture subject and object NPs are always loosely
attached to the clause in this way in Mohawk.
This hypothesis about Mohawk sentence structure immediately solves some
other problems as well. Another difference between Mohawk and English was
that question words cannot move out of object NPs in Mohawk, as shown in
(56b). This now makes perfect sense. The NP that we are trying to move out of
is not a true object after all, but rather a loosely attached NP, in the same kind
of position as because-clauses:
(70)
?
washed
V
VP
he
NP
S
S
knife
NNP
NP
Sak’s
like
?V
VP
you
NP
NPS
S
who
NP
S′
– story
Syntax 291
Since the loosely attached NP is not in the VP, it follows from the CED that a
question word cannot move out of this NP. These sentences fall into place with
no further adjustments.
The important thing to realize here is that the DRC and the CED both seem
to go wrong in Mohawk – but crucially they go wrong in the same way. They both
assume that object NPs are inside the VP. Once this assumption about basic
structure is corrected, the conditions consistently give the correct results after
all. Our ﬁrst impression was that Mohawk was different from English in almost
every imaginable way. We have now corrected that impression; in fact, Mohawk
differs from English in only one way, as stated in (71) (see Jelinek 1984; the
idea apparently has roots as far back as Humboldt’s remarks on Nahuatl in the
ﬁrst half of the nineteenth century).
(71) NPs are true subjects and objects in English, but are loosely attached to
the sentence as a whole in Mohawk.
Since this affects the basic structure of the sentence, and many conditions are
stated in terms of that structure, it causes ripples throughout the grammar of
Mohawk. Nevertheless, most of the syntactic principles are the same.
In fact, there is reason to think that even (71) overstates the differences
between Mohawk and English. Consider again the fact that the subject and
object in Mohawk do not need to appear:
(72) Ranuhwe’s. (= (53d))
likes
“He likes it.”
Why is such a sentence possible? The verb nuhwe’ “like” is similar to its English
translation in being a transitive verb; logically speaking, it is a two-place
predicate. Thus, it should need to have two NPs, almost as a point of logical
necessity. The reader may have noticed that when NPs are missing, the gaps
are interpreted as pronouns – “he” and “it” in this case. Moreover, what pronoun
to use is determined by agreement preﬁxes on the verb. Many languages
such as Spanish use different verb forms for different subjects, depending on
whether it is ﬁrst person, or second person, or whatever:
(73) a. (el) habla español. “He speaks Spanish.”
b. (tu) hablas español. “You speak Spanish.”
c. (yo) hablo español. “I speak Spanish.”
Moreover, languages that have enough of this kind of agreement typically
allow one to omit the pronoun, because the information is already there on the
verb; thus, it is normal to say simply Hablo español. Linguists often assume that
there is still a pronoun present in the syntactic structure of such a sentence, but
it is not pronounced. Now Mohawk is like Spanish in these respects, only
more so: Mohawk verbs change form not only depending on the subject, but
depending on the object as well. Thus, it is not surprising that Mohawks can
292 Mark C. Baker
leave out both subject and object pronouns unless they want to give them
special emphasis.
With this in mind, we can reconsider Mohawk structures in (69) and (70).
These structures are peculiar from the English perspective because there is no
direct object (or the direct object is in the wrong place, depending on your
terminology). However, given Mohawk’s rich agreement system, it is reasonable
to say that there is an object present, but it is simply one of the invisible
pronouns just discussed. Then a more complete structure for (68) would be:
(74)
NP
washed
V
VP
he
NP
S
S
knife
NNP
NP
Sak’s
it
This structure does not look so unfamiliar. English too can have structures in
which the true object is a pronoun, and some NP is attached to the sentence to
clarify what that pronoun refers to, as shown in (75).
(75) a. That dress, he really likes it.
b. He really likes it, that dress.
c. That dress, he really likes it, John.
This phenomenon is called dislocation. It is colloquial and not particularly common
in English, but it does exist. Thus, it is not really accurate to say that
English and Mohawk have different structures; it is more accurate to say that
Mohawk uses dislocation all the time, while English does not.
Finally, we can ask why NPs must be dislocated in Mohawk but not in
English. Moreover, recall that embedded clauses do not need to be dislocated;
rather they can appear inside the VP in Mohawk just as in English, as shown
in (63)–(67). In fact, a similar contrast between NPs and clauses is found in
English, although in a different part of the grammatical system. Some verbs
like believe can have either an NP or a clause as their object:
(76) a. People generally believe [John]NP.
b. People generally believe [that John is telling the truth]CP.
However, when these verbs are put in the passive voice, the situation changes.
The clause can still remain in its usual position after the verb, but the object
NP cannot. Rather, it must be “dislocated” – in this case to the subject position:
Syntax 293
(77) a. It is generally believed [that John is telling the truth]CP.
b. *It is generally believed [John]NP.
(Compare: [John]NP is generally believed –.)
Apparently putting the verb in the passive form disrupts its ability to take an
NP object, but has no effect on clauses. This has led linguists to propose a
principle which I state very informally as follows (a part of the case ﬁlter;
Chomsky 1981):
(78) If an afﬁx (of a certain type) is attached to a verb, it can no longer take an
NP as a direct object.
Now we know that every transitive verb in Mohawk has afﬁxes attached to it.
In particular, it has an afﬁx that agrees with the object in person, number, and
gender. Suppose this agreement afﬁx is in the same class as the passive voice
afﬁx. Then it follows that NPs will never be in the normal object position in
Mohawk; they will always be dislocated. There will be no similar effect on
clauses, however. If this is correct, then all of the many differences between
Mohawk and English actually boil down to this:
(79) All Mohawk verbs have afﬁxes that agree with their subject and object
(but English verbs do not).
There is nothing mysterious or complex about (79); on the contrary, it is one of
the ﬁrst things one must face when learning Mohawk.
In closing, let us reconsider the basic question of whether Mohawk is like
English. Crucially, the answer depends on what one means by the question. In
one sense, Mohawk is obviously very different. In particular, the basic structure
of most sentences in Mohawk is very different from that of comparable
sentences in English. As a result, words can appear in any order, pronouns
have different possibilities of interpretation, question words face different
limitations, and so on. However, in another sense, the syntax of Mohawk and
English is virtually identical. This comes out most clearly if one makes a list of
the syntactic principles that hold in both languages and compares it to a list of
the differences:
(80) a. Common elements (“principles”)
The disjoint reference condition
The condition on extraction domains
Question words become clause-initial
Clauses must be “logically complete” (have the right number of NPs)
NPs can be “dislocated” to the edge of a sentence
The case ﬁlter, etc.
b. Differences (“parameters”)
Verbs agree with both subject and object in Mohawk, not English
294 Mark C. Baker
Almost all the principles are the same, and the one difference is a rather superﬁcial
one. However, that one difference just happens to be like a pebble that is
in exactly the right position to start an avalanche: it has repercussions throughout
the grammar.
Thus, this extended example should help to explicate what syntacticians
mean when they speak of “Universal Grammar,” and when they say that “all
human languages are essentially the same.” This claim makes sense, but only
when one shifts from looking at languages as some kind of list of sentence
structures to looking at them as a set of constraints in the minds of speakers
that the sentences they form must satisfy. In other words, one must look not at
particular Mohawk examples like (53) to (57), but rather at the abstract characterization
of Mohawk in (80) that induces those examples. Chomsky (1986)
refers to this as the shift from looking at “Externalized language” to looking at
“Internalized language,” a part of his emphasis on the importance of a psychological
interpretation of syntactic theory. Hopefully, this example also illustrates
something of the intellectual interest that surrounds the task of trying to
tease out what is universal in human language and what is idiosyncratic and
particular, which is a central focus of current syntactic research.
Generative Grammar 295
12 Generative Grammar
THOMAS WASOW
1 Introduction
1.1 “Grammar”
To most people, the word “grammar” suggests a normative enterprise, dictating
what constitutes correct language use. For example, many educated English
speakers would identify at least ﬁve supposed grammatical “errors” in the
following sentence:
(1) Hopefully, we will be able to easily ﬁgure out who to talk to.
Yet native speakers of American English also certainly recognize that (1) would
be an entirely acceptable and natural sounding sentence in ordinary discourse.
Indeed, the supposedly “correct” alternative (2) would be an awkward and
affected way of expressing the thought.
(2) I hope that we shall be able easily to ﬁgure out to whom to talk.
Modern linguistics has little use for this prescriptive conception of grammar.
Linguists are more interested in the knowledge of English that allows native
speakers to judge (1) as fully acceptable and (2) as somewhat less natural. The
prescriptions of traditional grammar are attempts to impose the speech patterns
of one region, class, ethnicity, or generation on speakers belonging to other
groups. They may be of interest to sociologists, historians, and political scientists,
but they tell us very little about the nature of language.
Language is a natural phenomenon, constituting an essential component of
every human society. Linguistics is concerned with studying languages and
language in general, much as biology studies living species and life in general.
From this scientiﬁc perspective, the norms of prescriptive grammar are to