Semantics: - Yilmaz Kilicaslan

advertisement
Semantics: Handout 5
Analysis of Propositions into Predicates and Constituents
Yılmaz Kılıçaslan
October 15, 2009
1 Need for a more expressive language
Now that we have a system of propositional logic at our disposal, we can translate (indicative)
sentences of English into that system. In this way, we can formally capture the validity of
argument (1) and the validity argument (2) (cf. Handout 3) (along with those of (7) and (8)).
Below are the representations of these arguments in L0, respectively:
(1)
p q
¬p
------q
(2)
p q
¬r  ¬ p
¬r
------q
(1) is an instance of the valid argument schema (11) in Handout 3, which is repeated here as
(3):
(3)
A or B
Not A
-------B
and (2) comprises an instance of the well-known modus ponens:
(4)
If A then B
A
----B
where the premises are the second and third premises of (2) and the conclusion is ¬ p, and an
instance of (3), where the premises are the first premise of (2) and the sub-conclusion
obtained by (4) and the conclusion is q.
However, translating third, fourth and fifth arguments in Handout 3 into L0 would result in the
loss of some parts of their meanings upon which a valid argumentation would be based. Such
valid arguments will be left to be treated later on (cf. Handout 6). Let us now look at another
case of a valid argument whose translation into L0 will bring about the same sort of intolerable
loss in meaning:
1
(5)
Muhammad Ali is taller than Richard Nixon.
Richard Nixon is taller than Noam Chomsky.
------------------------------------------------------Muhammad Ali is taller than Noam Chomsky.
This is a valid argument. However, that cannot be shown if we translate the premises and
conclusion into propositional letters:
(6)
p
q
--r
Given that propositional letters are logical variables which could, in principle, stand for any
statement, (6) cannot be considered a valid argument schema (or, more precisely, an instance
of a valid argument schema). The problem with L0 when dealing with arguments like (5) is
that propositions are treated as the most detailed level of analysis. What we need is an
analysis of the propositions constituting (5) into a structure where the individuals and the
relation between them are represented. In what follows, we will see how statements can be
given such a deeper analysis.
2 Predicate-Constituent Analysis
Let us start with simple statements with a clear subject-predicate structure:
(7)
Jacques Chirac is bald.
(8)
Noam Chomsky snores.
Each of the sentences has one part which refers to a property (being bald and snoring) and
another part which refers to some entity (Jacques Chirac and Noam Chomsky). Accordingly,
we need to have (individual) constants / names which are always interpreted in such a way
that they refer to an entity (that is, an individual or an object) and predicate constants which
are always interpreted such that they refer to all kinds of properties which entities (of some
particular sort) may or may not have. Note that individual constants and predicate constants
are logical variables.1 Let us use lowercase letters for individual constants and capital letters
for predicate constants. A well-formed formula corresponding to a sentence can be made by
prefixing a predicate letter to a constant occurring in brackets. If we have some particular
interpretation of the sentences in mind, then we may choose suggestive letters. So, (7) and (8)
might, for example, be represented as (9) and (10), respectively:
(9)
B(j)
(10)
S(n)
1
Recall that the expressions which the validity of argument schemata of a logical system depends on are called
the logical constants of that logical system, as their meaning is completely fixed in the system. Linguistic items
whose meaning does not take part in determining the validity of arguments are treated as logical variables. The
denotation of items of the latter type varies from one model to another.
2
Until the end of the nineteenth century, statements with subject-predicate structure were the
only individual statements which were taken seriously. This was the only structure which
Aristotle ascribed to propositions. Besides these, however, there are other kinds of individual
statements which from a logical point of view cannot profitably be analyzed in terms of
subjects and predicates. Sentences which say that two entities bear some particular
relationship to each other are a case in point. The premises and conclusion of argument (5) are
examples of such sentences. These sentences too could be translated in the subject-predicate
schema. The first premise would be translated as R(m), with m translating Muhammad Ali,
and R standing for is taller than Richard Nixon, whereas the second would come out as N(r),
with r for Richard Nixon and N for is taller than Noam Chomsky, and the conclusion would
read N(m). But the argument schema:
(11)
R(m)
N(r)
------N(m)
cannot be shown to be valid.
What we need is an approach to the structure of propositions where not only subject-predicate
forms like:
(12)
a is P
but also relational forms like:
(13)
a R b (a bears the relation R to b)
can find an equally important place. It was Gottlob Frege who first appreciated the importance
of relational propositions. He removed the grammatical notion of subject from the central
place it had previously occupied in logic. This gives way to the concept of a constituent, a
term which refers to an entity. Any number of different constituents may appear in relational
propositions, and none enjoys a privileged position above the others. There is no need to
single out a unique subject. Frege’s own example motivating this departure from traditional
practice is still quite instructive. He notes that the sentence
(14)
The Greeks defeated the Persians at Plataea.
which would appear to be about the Greeks (as subject), is in fact synonymous with the
passive construction:
(15)
The Persians were defeated by the Greeks at Plataea.
If subjects are to be singled out, the Persians would seem to be the subject of (15). The lesson
to be drawn is that neither of the constituents the Greeks and the Persians is logically more
important than the other. There may be differences between (14) and (15), but they do not
belong to logic.
Having relaxed the structure of propositions in the way indicated above, (5) can now be
turned into the schema:
3
(16)
T(m, r)
T(r, n)
--------T(m, n)
where T stands for is bigger than and m, r, and n for Muhammad Ali, Richard Nixon and
Noam Chomsky, respectively. Once we add an extra premise stating the transitivity of the
relation R for all constituent entities, we will be able to show that the argument is valid. The
construction of this latter premise requires us to employ quantifiers and variables which will
be introduced later on (cf. Handout 6). We will now introduce the formal specifications of two
simple languages, L1 and L1E, which are essentially the propositional calculus with
propositions analyzed into predicates and constituents.
3 The Language L1
In this new language, the propositional letters will disappear and individual and predicate
symbols will be our new basic expressions. The rest of the syntax will be formulated
accordingly.
Syntax of L1
A. Basic Expressions:
Category
Basic Expressions
Names
d, n, j, and m
One-place predicates M, B
Two-place predicates K, L
B. Formation Rules:
1. If δ is a one-place predicate and α is a name, then δ(α) is a sentence.
2. If γ is a two-place predicate and α and β are names, then γ(α, β) is a sentence.
3. If φ is a sentence, then ¬φ is a sentence.
4. If φ and ψ are sentences, then [φ  ψ] is a sentence.
5. If φ and ψ are sentences, then [φ  ψ] is a sentence.
6. If φ and ψ are sentences, then [φ  ψ] is a sentence.
7. If φ and ψ are sentences, then [φ  ψ] is a sentence.
L1 has two kinds of formation rules. There are rules used to produce atomic sentences by
combining predicates with an appropriate number of names. These are rules 1. and 2. above.
There are also rules which form a sentence out of one or more other sentences (3.-7. above).
4
Semantics of L1
We assume that names take individuals as their semantic values. This valuation is certainly
congruent with our common sense. The way we determine the valuations of predicates is
probably a less intuitive one. One-place and two-place predicates are assumed to take as their
semantic values sets of individuals and sets of pairs of individuals, respectively. For the sake
of simplicity, our universe will consist of just four individuals, namely, Richard Nixon, Noam
Chomsky, John Mitchell and Muhammad Ali. Each semantic valuation will be a function. A
particular model set up in this way is given below:
A. Basic Expressions:
[d] = Richard Nixon
[n] = Noam Chomsky
[j] = Jacque Chirac
[m] = Muhammad Ali
[M] = the set of all living people with moustaches
[B] = the set of all living people who are bald
[K] = the set of all pairs of living people such that the first knows the second
[L] = the set of all pairs of living people such that the first loves the second
Our strategy for determining the semantic values of the sentences will strictly follow the
Principle of Compositionality. Having assigned a semantic value to each basic expression
(e.g., as above), the semantic rules will determine the semantic value of each larger
constituent in terms of the semantic values of its components by taking into consideration
how these constituents are syntactically combined:
B. Semantic Rules:
1. If δ is a one-place predicate and α is a name, then δ(α) is true iff [α]  [δ].
2. If γ is a two-place predicate and α and β are names, then γ(α, β) is true iff <[α], [β]>  [δ].
3. If φ is a sentence, then ¬φ is true iff φ is not true.
4. If φ and ψ are sentences, then [φ  ψ] is true iff φ and ψ are true.
5. If φ and ψ are sentences, then [φ  ψ] is true iff either φ or ψ is true.
6. If φ and ψ are sentences, then [φ  ψ] is true iff either φ is false or ψ is true.
7. If φ and ψ are sentences, then [φ  ψ] is true iff either φ and ψ are both true or else φ and
ψ are both false.
Note that we have introduced the connectives ¬,, , , and  syncategorematically this
time. We could, however, have treated them as basic expressions in, for example, a category
of sentence conjunctions in a way similar to what we did when specifying L0 (cf. Handout 5).
Similarly, several other aspects of our syntax or semantics could have received alternative
formulations. A good way of illustrating which features of L1 are crucial and which are
5
matters of conventions is to design a different language. This will be a language that
resembles English to a much greater degree than L1. The syntactic design of the new
language, which we will call L1E, will be somehow different but it will be a semantically
similar language:
4 The Language L1E
Syntax of L1E
Instead of recursive definitions, let us use a (context-free) phrase-structure grammar of the
sort linguists are accustomed to in order to specify the syntax of L1E.
(17)
N  Sadie
N  Liz
N  Hank
Vi  snores
Vi  sleeps
Vi  is-boring
Vt  loves
Vt  hates
Vt  is-boring
Conj  and
Conj  or
Neg  it-is-not-the-case-that
S  S Conj S
S  Neg S
S  N VP
VP  Vi
VP  Vt N
Semantics of L1E
Even though we have used a phrase-structure grammar to formulate the syntax of L1E we will
still adopt the Principle of Compositionality in our interpretation of L1E in much the same way
as with L1. We will assign a semantic value to each lexical item as we did to each basic
expression in L1, and for each syntactic constituent there will be a rule for determining its
semantic value from the semantic value(s) of its sub-constituents. Though we may think of the
phrase-structure rules as defining a tree “from top to bottom,” our semantic rules will be
formulated as proceeding from the bottom of the tree (its terminal nodes) to the top.
Nevertheless, there will be a semantic rule corresponding to each phrase structure rule in the
semantics of L1E, just as there was a one-to-one correspondence between syntactic and
6
semantic rules in L1. More formally, for each phrase structure rule α  β1 β2 … βn, there will
be a semantic rule for determining the semantic value of the constituent labeled α in a tree in
terms of the semantic values of the constituents β1 β2 … βn which the node α dominates.
We will first assign semantic values to the names. As with L1, we will want names to denote
individuals, so we may assign the names Sadie, Liz and Hank values as follows:
(18)
[Sadie] = Anwar Sadat
[Liz] = Queen Elizabeth II
[Hank] = Henry Kissinger
We wish to give the semantic values of intransitive verbs, Vi , the effect of singling out a set of
individuals, just as we did with the one place-predicates of L1. We will achieve this effect in a
different way, however.
Sets of individuals are in a one-to-one correspondence with functions that map individuals to
0 or 1. If A is the set of individuals and S is any subset of A, we define a function on the set A
by letting
(19)
fs(a) =
1 if a  S
0 if a  S
The semantic values of Vi’s in L1E will all be characteristic functions of sets of individuals.
Assuming for the sake of simplicity that the three individuals mentioned in (18) are the only
individuals in the world, we might for example stipulate that the semantic values of our Vi’s
are the following functions:
(20)
Anwar Sadat
[snores] =
[sleeps] =
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
[is boring] =
1
Queen Elizabeth II
Henry Kissinger
0
7
Let us now ask what sort of values the Vt’s should have. Semantically they seem to express
relations between individuals. According to the grammar, a Vt followed by an N forms a VP.
Therefore, a VP can have as its semantic value a function from individuals to truth values. We
need the semantic value of a Vt to be something that maps the semantic value of an N (i.e. an
individual) into the semantic value of a VP (i.e. a function from individuals to truth values).
Thus, we take the value of a Vt to be a function which yields other functions as its “outputs”.
Its domain will be the set of individuals, and its co-domain (i.e. set within which its “outputs”
must lie) will be the set of all functions from individuals to truth values. The following are the
semantic values of the Vt’s:
8
(21) [loves] =
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
Anwar Sadat
Anwar Sadat
0
1
Queen Elizabeth II
Henry Kissinger
Anwar Sadat
0
1
Queen Elizabeth II
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
9
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
Anwar Sadat
Anwar Sadat
0
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
[hates] =
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
10
0
[is-taller-than] =
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
Anwar Sadat
Anwar Sadat
0
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
0
Anwar Sadat
1
Queen Elizabeth II
Henry Kissinger
11
0
The remaining lexical items are the negation operator it-is-not-the-case-that and the twocoordinating conjunctions and and or. We will assume that these have semantic values as
defined in the semantic specification of Lo. Below are the functions assigned as semantic
values to these items:
(22)
[it-is-not-the-case-that] =
1
0
0
1
<1, 1>
[and] =
1
<0, 1>
<1, 0>
[or] =
<0, 0>
0
<1, 1>
1
<0, 1>
<1, 0>
<0, 0>
0
As for the semantic rules, they should be something like the following:
(23)
If α is γ, where γ is any lexical category and β is any lexical item, then [α] = [β].
β
(24)
If α is VP and β is Vi, then [α] = [β].
β
(25)
If α is N and β is VP, and if γ is S, then [γ] = [β]([α]).
α
(26)
β
If α is Vt and β is N, and if γ is VP, then [γ] = [α]([β]).
α
β
12
(27)
If α is Neg and φ is S , and if ψ is S, then [ψ] = [α]([φ]).
α
(28)
β
If α is Conj and φ is S , and if ψ is S, and  is S , then [] = [α](<[φ], [ψ]>).
φ
α ψ
5 A Comparison of L1 and L1E
The syntax and semantics of the two languages we have just described are similar in the
overall effect of their semantic interpretations but differ in a number of details. The
differences between the two languages may be summarized as follows:
1. While the basic expressions of L1 look like those of the formal languages found in
logic textbooks, as does the ordering of these in formulas, the lexical items of L1E are
deliberately designed to resemble English, and the word order in L1E is much like that
of English sentences.
2. We used recursive definitions to specify the syntax of L1, but we used a context-free
phrase structure grammar to specify that of L1E.
3. Our semantic rule for L1 classified sentences as “true” or “false” in the meta-language,
while those of L1E simply assigned 1 or 0 as the semantic value of a sentence.
4. The semantic values of one-place predicates were sets in L1, but the semantic values of
Vi’s were characteristic functions of sets in L1E.
5. The semantic values of two-place predicates were sets of pairs in L1E, but the semantic
values of Vt’s in L1E were functions from individuals to the kind of semantic values
assigned to Vi’s.
6. The logical symbols ¬,, , , and  were treated syncategorematically in L1E but the
corresponding lexical items it-is-not-the-case-that, and, etc., in L1E were introduced as
members of the category Neg or Conj, and their semantic values were therefore
defined as functions on truth values (or pairs of them), independently of the semantic
rules corresponding to S  Neg S and S  S Conj S.
However, these six characteristics are not in any essential way tied to the differences between
natural languages and the formal languages of logicians, but rather are independent and
somewhat arbitrary choices which we made for convenience. We could in fact alter the syntax
or semantics of L1 or L1E with respect to almost any one of these characteristics while keeping
the other five the same.
6 Some Features Essential to Truth-Conditional Semantics
Let us now summarize the essential points that are common to the applications of truthconditional semantics we have presented so far:
1. Any truth-conditional semantics is always tightly interconnected with the syntax of the
language in question.
13
2. The syntax is required to have at least a set of syntactic categories, one of which is the
category “sentence” or something of the sort – the category associated with truth or
falsity.
3. There must be some initial assignment of expressions of the language to the syntactic
categories and, since in every interesting case we will be dealing with an infinite
language, there must e rules which effect the assignment of the remaining well-formed
expressions to their respective categories.
4. There must be a set of things which can be assigned as semantic values. In the system
assumed thus far these are (i) a set of individuals (ii) a set of truth values, and (iii)
various functions constructed out of these by means of set theory.
5. A specification is needed for each syntactic category of the type of semantic value that
is to be assigned to expressions of that category (e.g., names are to have individuals
assigned, etc.) Sentences are to be assigned truth values.
6. There must be a set of semantic rules specifying how the semantic value of any
complex expression is determined in terms of the semantic values of its components.
7. Each of the basic expressions must be assigned a specific semantic value of the
appropriate type.
7 The Notion of Truth Relative to a Model
If we look at the list of essential components of a truth conditional semantics given above, we
see that the items in the list fall into two broad classes corresponding to the kinds of factors
which go into determining the semantic values of a sentence. We could say, roughly, that a
particular sentence gets the semantic value that it does – is either true or false – because of
certain formal structural properties that it has on the one hand, and on the other hand because
of certain facts about the world. To take an example from L1E, the sentence Liz snores gets the
semantic value 1 (true) because all the following considerations interact to yield this result:
1. Liz has as its semantic value Queen Elizabeth II.
2. snores has as its semantic value a function from individuals to truth values which
maps Queen Elizabeth II into 1.
3. Liz is a name and snores is an intransitive verb, and the truth value of any sentence
composed of a name plus an intransitive verb is determined by applying the function
which is the semantic value of the intransitive verb to the argument which is the
semantic value of the name.
Of these, the last is a theory-internal condition which is specified as a part of our semantic
theory for L1E in particular. The first two, however, are conditions which have to do with
assumed facts about the connections between the language and the world and facts about the
way the world is.
All this discussion is by way of introducing the notion of a model. Formally, a model is an
ordered pair < A, F >, where A is a set, the set of individuals, and F is a function which
assigns semantic values of the appropriate sort to basic expressions. All the rest is taken as the
fixed part of the semantics for a particular language, and we may then examine the effect on
the semantic values of expressions in the language as we allow the model to vary.
14
PS: The content and examples of this handout are adopted from Dowty et al (1981) and
Gamut (1991).
References
Dowty D.R., Wall R. E. and Peters S. (1981) Introduction to Montague Semantics. Dordrecht,
The Netherlands: Kluwer Academic Publishers.
Gamut, L.T.F. (1991) Introduction to Logic: Logic, Language, and Meaning (Volume 1).
Chicago: The university of Chicago Press.
15
Download