How Cognition Could Be Computing
Semiotic Systems, Computers, & the Mind
William J. Rapaport
Department of Computer Science & Engineering,
Department of Philosophy, Department of Linguistics,
and Center for Cognitive Science
rapaport@buffalo.edu
http://www.cse.buffalo.edu/~rapaport
Summary
• Computationalism = cognition is computable.
– Mental processes can be the result of algorithmic procedures…
– …that can be affected by emotions/attitudes/individual histories.
– Computers that implement these (cognitive) procedures
really exhibit those mental processes.
• They are “semiotic” (= sign-using) systems.
• They really think.
– Computers can possess minds.
• “Syntactic semantics” explains how all this is possible.
I. What Is “Computationalism”?
• What is AI?
– Not “artificial”: It’s a computational theory
– Not about “intelligence”: It’s about cognition
• Better name:
– Computational cognition
• cf. “computational linguistics”
“computational statistics”
“computational geometry”, etc.
What Is “Computationalism”?
• “Computationalism” =? cognition is computation
– Hobbes, McCulloch/Pitts, Putnam, Fodor, Pylyshyn, …
– interesting, worth exploring, possibly true
– BUT:
• Not what “computational” usually means!
• What should “computationalism” be?
• Must preserve crucial insight:
– cognition is explainable via mathematical theory of computation
• First, some definitions…
Preliminary Definitions
A. “Cognition”
≈ whatever cognitive scientists study, including:
• believing
• consciousness
• emotion
• language
• learning
• memory
• perception
• planning
• problem solving
• reasoning
• representation
  – including categorization, concepts, mental imagery, etc.
• sensation
• thought, etc.
Preliminary Definitions
B. “Algorithm” (informal notion)
• A is an algorithm for executor E to accomplish goal G ≈ (informally):
  1. A is a procedure (a finite set/sequence of statements/rules/instructions)
     such that each statement/rule/instruction S is such that:
     a) S is composed of a finite # of symbols/marks from a finite alphabet
     b) S is unambiguous for E—i.e.:
        i. E knows how to execute S
        ii. E can execute S
        iii. S can be executed in finite time
        iv. after executing S, E knows what to do next
  2. A halts (≈ takes finite time)
  3. A halts with G accomplished
• (a worked sketch follows below)
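For concreteness, here is a minimal sketch (mine, not from the slides) of a procedure satisfying this informal definition: Euclid's algorithm for greatest common divisors, in Python, with the clauses of the definition noted in comments.

def gcd(m: int, n: int) -> int:
    # Executor E: any device that can compare, assign, & take remainders.
    # Each instruction is drawn from a finite alphabet (clause 1a),
    # is unambiguous for E & executable in finite time (clause 1b i-iii),
    # & determines what E does next (clause 1b iv).
    while n != 0:
        m, n = n, m % n    # replace (m, n) by (n, m mod n)
    return m               # halts (clause 2) with goal G (the GCD) accomplished (clause 3)

print(gcd(12, 18))   # -> 6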
Preliminary Definitions
• “Effective”
  – Church: left undefined
  – Rosser:
    • each step is precisely determined
    • “method” produces an answer…
    • …in a finite # of steps
  – Markov:
    • “process” produces an answer
  – Kleene:
    • effective procedure = algorithm
  – Knuth:
    • all operations can be done exactly, in finite time
Preliminary Definitions
C. “Algorithm” (formal notion)
  – A is an algorithm (formally) =
    A is (logically equivalent to) a Turing machine
• Church-Turing Thesis:
  – algorithm (informal) = algorithm (formal)
Preliminary Definitions
D. “Computable”
  – task/goal/field of study G is computable
    iff
    ∃ algorithm(s) (formal) for G
The Proper Treatment of Computationalism
• Computationalism ≠ cognition is computation
The Proper Treatment of Computationalism
• Computationalism = cognition is computable
  – i.e., ∃ algorithm(s) that compute cognitive functions
a) Basic research question of computational cognitive science:
• How much of cognition is computable?
b) Working assumption of computational cognitive science:
• All cognition is computable
c) …
Proper Treatment of Computationalism
c) Implementational implication (multiple realization):
  • If cognition is computable, then:
    – anything that implements cognitive computations
      would be cognitive (would really think)
    – even if humans don’t do it that way!
  • Piccinini:
    – neural spike trains are not representable as digit strings;
    – ∴ not computational
  • BUT:
    – it does not follow that the functions whose O/P they produce are not computable
  • ∴ possibly: human cognition is computable but not computed
Proper Treatment of Computationalism
• 2 Views of Cognitive Science:
  1. How do humans cognize? (part of CogSci)
     • Might not be computationally.
     • BUT: Can abstract away from the specifically human
       to a more general issue:
  2. How is cognition possible? (also part of CogSci)
     • Might be computable.
The Proper Treatment of Computationalism
• 2 Views of “Computationalism”
  1. Cognition is computation
     – “strong” / “narrow” / “nearsighted” view
     – the mind or brain is a computer
       • how the M/B does what it does is by computing
  vs. the proper treatment:
  2. Cognition is computable
     – “weak” / “wide” / “farsighted” view
     – what the mind or brain does can be described in computational terms
       • how it does it is a matter for neuroscience to determine
Proper Treatment of Computationalism
• Turing on his test:
– “the use of words and general educated opinion
will alter so much
that one will be able to speak of machines thinking
without expecting to be contradicted.”
• “general educated opinion”
– changes when we abstract & generalize
• “the use of words”
– changes when reference shifts
from word’s initial / narrow application
to more abstract / general phenomenon
• cf. “fly”, “compute”, “algorithm”
• ditto for “cognition” / “think”
II. Syntactic Semantics
as a theory underlying computationalism
1. Cognition is internal
   • Cognitive agents have direct access
     only to internal representatives of external objects
2. Semantics is syntactic
   • Words, meanings, & semantic relations between them
     are all syntactic items
3. Understanding is recursive
   • Recursive Case:
     – We understand one thing in terms of
       another that must already be understood;
   • Base Case:
     – We understand something in terms of itself
       (syntactic understanding)
Syntactic Semantics
1. Internalism:
   Cognitive agents have direct access only
   to internal representatives of external objects
   – A cognitive agent understands the world by
     “pushing the world into the mind” (Jackendoff 2002)
   – ∴ Both words & their meanings
     (including external objects)
     are represented internally in a single LOT
     • Humans: biological neural network
       – Adrian 1926, 1928: nervous systems transduce different physical
         stimuli into a common internal medium (cf. Piccinini)
     • Computers:
       – artificial neural network
       – symbolic knowledge-representation & reasoning system
Syntactic Semantics: Internalism
• Hume: argument from double vision
• Kant: “noumena” vs. “phenomena”
• Ayer: argument from illusion
• Fodor anti-Putnam: “methodological solipsism”
& G.Segal anti-Burge: “individualism”
• Pylyshyn:
– “output of sensory transducers
is the only contact the cognitive system ever has
with the environment”
• Changizi: argument from time delay in perception
Syntactic Semantics
2. (Internalism ⇒) Syntacticism
   • Words, meanings, & semantic relations
     between them are all syntactic
   • syntax = study of relations among members of a single set
     – set of signs / marks / neurons / …
   • semantics = study of relations between members of two sets
     – set of signs / marks / neurons / …
     – & set of (external) meanings / …
       – (with its own syntax!)
   • “Pushing” meanings into same set as symbols for them
     allows semantics to be done syntactically
     • turns semantic relations between 2 sets (internal signs, external meanings)
       into syntactic relations among the marks of a single (internal) LOT
     • e.g.: truth tables & formal semantics are both syntactic
     • e.g.: neurons representing both signs & external meanings
   ⇒ Symbol-manipulating computers can do semantics by doing syntax
Syntactic Semantics: Syntacticism
• “Syntactic semantics” underlies the Semantic Web:
– “syntactic” info on web pages
gains meaning from…
– “syntactic” info (metadata encoded in RDF) in HTML source files
– metadata annotates webpage data
– metadata provides “semantic interpretation” of webpage data
Syntactic Semantics
3. Understanding is recursive:
– Recursive cases:
  • We understand a syntactic domain (SYN-1) indirectly
    by interpreting it in terms of a semantic domain (SEM-1)
    – but SEM-1 must be antecedently understood
  • SEM-1 can be understood by considering it as a syntactic domain SYN-2
    interpreted in terms of yet another semantic domain
    – which also must be antecedently understood, etc.
– Base case:
  • A domain that is understood directly (i.e., not “antecedently”)
    – in terms of itself
    – i.e., syntactically
      » & perhaps holistically
– (a toy sketch of this recursion follows below)
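To make the recursion concrete, here is a toy sketch (an illustrative analogy of mine, not a model from the slides; the dictionary contents are made-up assumptions): each term is interpreted in terms of a further domain until interpretation bottoms out in a base domain characterized only by relations among its own members.

INTERPRETATIONS = {    # SYN-1 -> SEM-1, SEM-1 (taken as SYN-2) -> SEM-2, ...
    "the numeral '3'": "the number three",
    "the number three": "the third natural number",
}

BASE_DOMAIN = {        # understood in terms of itself: internal relations only
    "the third natural number": "the successor of the second natural number",
}

def understand(term: str) -> str:
    if term in BASE_DOMAIN:                    # base case: syntactic understanding
        return term + " (" + BASE_DOMAIN[term] + ")"
    return understand(INTERPRETATIONS[term])   # recursive case: needs an antecedently understood domain

print(understand("the numeral '3'"))   # bottoms out in the base domain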
Syntactic Semantics: Recursiveness
• Syntactic understanding:
– “the meaning of an internal state
(which may or may not be linked to an external state of affairs)
for the system itself
is most naturally defined in terms of
that state’s relations to its other states.”
• Edelman, Shimon (2008), “On the Nature of Minds, or: Truth and Consequences”,
JETAI 20: 181-196; quote on pp.188-189.
• I.e., syntactically
III. Rapaport’s Thesis
• Syntax suffices for semantic cognition
• ∴ cognition is computable
• & ∴ computers are capable of thinking
James H. Fetzer’s Thesis
• It doesn’t,
  – it isn’t,
  – & they aren’t
Fetzer’s Thesis
• Computers differ from cognitive agents in 3 ways:
1. statically (symbolically)
2. dynamically (algorithmically)
3. affectively (emotionally)
• Simulation is not the real thing
Fetzer’s Static Difference
ARGUMENT 1: Computers are mark-manipulating systems,
minds are not.
Premise 1: Computers manipulate marks on the basis of their sizes, shapes,
and relative locations.
Premise 2: (a) These shapes, sizes, and relative locations exert causal influence
upon computers, but (b) do not stand for anything for those systems.
Premise 3: Minds operate by utilizing signs that stand for other things in some
respect or other for them as sign-using (or “semiotic”) systems.
__________________________________________________________________
Conclusion 1: Computers are not semiotic (or sign-using) systems.
___________________________________________________________________
Conclusion 2: Computers are not the possessors of minds.
Figure 9. The Static Difference
The Static Difference
• Static Premise 1:
  – Is computer manipulation of symbols independent of meaning?
    • depends on what ‘meaning’ means:
      1. Computational symbol-manipulation is independent of
         external, 3rd-person meaning imposed on the symbols
      2. But not independent of internal, 1st-person meaning
         – arises from syntactic relations among internal symbols
         – “intrinsic” meaning
The Static Difference
• Static Premise 2b:
– The symbols that computers manipulate
“do not” stand for anything for those computers.
• But:
– Fetzer’s locution allows for the possibility that
symbols could stand for something for the computer
– Insofar as they could,
such machines might be capable of thinking
• He should have said “could not stand for anything”
– But then he’d be wrong :-)
Fetzer |- Computers Are Not Semiotic Systems
1. In a “semiotic system” (e.g., a mind):
   – something (S) is a sign of something (x) for somebody (z)
   a) x “grounds” sign S
   b) x “is an interpretant w.r.t. a context” to sign-user z
   c) S is in a “causation” relation with z
Fetzer |- Computers Are Not Semiotic Systems
2. In a computer (I/O) system:
a) input i (playing role of sign S)
is in a “causation” relation with
computer c (playing role of sign-user z)
b) output o (playing role of thing x)
is in an “interpretant” relation with computer c
c) BUT: No “grounding” relation between i & o
Fetzer |- Computers Are Not Semiotic Systems
3. ∴ Computers only have causal relationships,
   no mediation between I/P & O/P (?!)
4. But semiotic systems require such mediation
   • Peirce:
     interpretant is “mediately determined by” the sign
     – [ “interpretant” is really the sign-user’s mental concept
       of the thing x (!!) ]
5. ∴ Computers are not semiotic systems
6. But minds are.
7. ∴ Minds are not computers
   & computers can’t be minds.
Incardona |- Computers Are Semiotic Systems!
1. Something is a semiotic system iff
   it carries out a process that mediates
   between a sign & its interpretant
   • Semiotic systems interpret signs
2. Algorithms describe processes
   that mediate between I/Ps & O/Ps
   • An algorithm’s O/P is an interpretation of its I/P
   • Algorithms ground the I/O relation
3. Computers are algorithm machines.
4. ∴ Computers are semiotic systems
The Static Difference
• Argument that computers are semiotic systems
from embedding in the world:
– Fetzer’s (counter?)example:
• “A red light at an intersection stands for
applying the brakes and coming to a complete halt, only
proceeding when the light turns green,
for those who know ‘the rules of the road’.”
– Can such a red light stand for
  applying the brakes, etc., for a computer?
  • It could, if the computer “knows the rules of the road”
  • But a computer can “know” those rules…
    – if it has those rules stored in a knowledge base
    – and if it uses those rules to drive a vehicle
      (a toy sketch follows below)
      » cf. Stanley the VW (2005 DARPA Grand Challenge)
* Parisien & Thagard 2008, “Robosemantics:
How Stanley Represents the World”, Minds & Machines
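A minimal sketch (mine, not a description of Stanley's actual software; the rule contents and names are illustrative assumptions) of what it means for the rules to be both stored in a knowledge base and used to act:

# Toy "rules of the road" knowledge base.
RULES_OF_THE_ROAD = {
    "red":    "apply brakes and come to a complete halt",
    "green":  "proceed",
    "yellow": "prepare to stop",
}

def act_on_light(light_color: str) -> str:
    # The light can "stand for" an action for this system only because the
    # stored rule is looked up and actually used to select behavior.
    return RULES_OF_THE_ROAD.get(light_color, "proceed with caution")

print(act_on_light("red"))   # -> apply brakes and come to a complete halt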
The Static Difference
• Does a calculator that computes GCDs understand
what it’s doing?
– Fetzer & Rapaport: No
• Could a computer that computes GCDs understand
what it’s doing?
– Fetzer: No
– Rapaport & Goldfain: Yes, it could…
• as long as it had enough background / contextual / supporting information
• a computer with a full-blown theory of math
at the level of an algebra student learning GCDs
could understand GCDs as well as the student
The Static Difference
• Goldfain |- Computers could be semiotic systems:
– G1: The natural #s that a cognitive agent refers to
are denoted by a sequence of unique marks
exemplifying a finite initial segment of the natural-# structure.
– G2: Such a finite initial segment can be generated by a computational
cognitive agent (a computer) via perception & action in the world
during an act of counting (e.g., using Lisp’s “gensym”)
• they have a history of how they became marks
that signify something for the agent (the computer).
– G3: These marks (e.g., b4532, b182, b9000…) have no meaning
for a human user who lacks access to their ordering.
– G4: Such private marks (“numerons”) are associable
with publicly meaningful marks (“numerlogs”)
• e.g., b4532 denotes the same number as “1”,
b182 denotes the same number as “2”, etc.
– G5: A computational cognitive agent (a computer) can do math
solely on the basis of its numerons.
– C1: ∴ These marks stand for something for the computer (the agent).
– C2: & we can check the math because of G4.
  (a toy gensym-style sketch follows below)
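A minimal Python sketch along the lines of G1-G5 (my illustration, not Goldfain's code; the mark format and the ten-element initial segment are assumptions): the agent generates arbitrary private marks by counting, adds purely by position in its own sequence, and the public association of G4 lets us check its arithmetic.

import random

_used = set()
def gensym(prefix: str = "b") -> str:
    # Return a fresh, arbitrary mark (cf. Lisp's gensym), e.g. "b4532".
    while True:
        mark = prefix + str(random.randint(100, 9999))
        if mark not in _used:
            _used.add(mark)
            return mark

# G1/G2: the agent counts, keeping its own record of the order of generation.
numerons = [gensym() for _ in range(10)]
# G3: without access to this ordering, the marks mean nothing to an observer.
# G4: association with public numerals ("numerlogs") lets us check the math.
numerlogs = {mark: i + 1 for i, mark in enumerate(numerons)}

def add(x: str, y: str) -> str:
    # G5: addition defined solely by position in the agent's numeron sequence
    # (sums beyond the generated segment are out of range in this toy).
    return numerons[numerons.index(x) + numerons.index(y) + 1]

two, three = numerons[1], numerons[2]
print(add(two, three), "denotes", numerlogs[add(two, three)])   # e.g.: b7081 denotes 5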
Summary: No “Static Differences”
• Both computers & minds manipulate marks
• The marks can “stand for something”
for both computers & minds
• Computers (and minds) are “semiotic systems”
• Computers can possess minds
Fetzer’s Dynamic Difference
ARGUMENT 2: Computers are governed by algorithms, but
minds are not.
Premise 1: Computers are governed by programs,
which are causal models of algorithms.
Premise 2: Algorithms are effective decision procedures for arriving at definite
solutions to problems in a finite number of steps.
Premise 3: Most human thought processes, including dreams, daydreams,
and ordinary thinking, are not procedures for arriving at solutions
to problems in a finite number of steps.
______________________________________________________________________
Conclusion 1: Most human thought processes are not governed by
programs as causal models of algorithms.
_______________________________________________________________________
Conclusion 2: Minds are not computers.
Figure 10. The Dynamic Difference
The Dynamic Difference
• Premises 1 & 2:
– Def of ‘algorithm’ is OK
– But algorithms may be the wrong entity
• may need a more general notion of “procedure”
(Shapiro)
• like an algorithm, but:
– need not halt
– need not yield “correct” output
The Dynamic Difference
• Premise 3: Most human thinking is not algorithmic
– Dreams are not algorithms
– Ordinary stream-of-consciousness thinking is not “algorithmic”
• BUT:
– Some human thought processes may indeed not be algorithms
– But that’s not the real issue, which is:
• Could there be algorithms/procedures that produce these
(or other mental states or processes)?
– If dreams are our interpretations of random neuron firings during sleep,
as if they were due to external causes…
• …then: if non-dream neuron-firings are computable
(& there’s every reason to think they are)
then so are dreams
– Stream of consciousness might be computable
  • e.g., via spreading activation in a semantic network
    (a toy sketch follows below)
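A minimal sketch of spreading activation over a tiny semantic network (the network contents, link weights, and decay value are illustrative assumptions of mine, not a model from the slides): activating one concept partially activates its associates, and repeatedly following the most active node yields a loose, associative "stream" of concepts.

NETWORK = {    # concept -> associated concepts, with link weights
    "coffee":  {"morning": 0.9, "cup": 0.7},
    "morning": {"alarm": 0.8, "coffee": 0.5},
    "cup":     {"kitchen": 0.6},
    "alarm":   {"work": 0.7},
    "kitchen": {"cup": 0.4},
    "work":    {"coffee": 0.6},
}

def stream(start, steps=5, decay=0.5):
    activation = {node: 0.0 for node in NETWORK}
    activation[start] = 1.0
    thought = [start]
    for _ in range(steps):
        spread = {node: 0.0 for node in NETWORK}
        for node, act in activation.items():
            for neighbor, weight in NETWORK[node].items():
                spread[neighbor] += act * weight * decay   # pass on decayed activation
        activation = spread
        thought.append(max(activation, key=activation.get))   # most active concept comes to mind next
    return thought

print(stream("coffee"))   # -> ['coffee', 'morning', 'alarm', 'work', ...]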
The Dynamic Difference
• Whether a mental state/process is computable
is at least an empirical question
– Must avoid the Hubert Dreyfus fallacy:
• one philosopher’s idea of a non-computable process
is another computer scientist’s research project
• what no one has yet written a program for
is not thereby necessarily non-computable
– In fact:
Mueller, Erik T. (1990), Daydreaming in Humans & Machines:
A Computer Model of the Stream of Thought (Ablex)
• Cf. Edelman, Shimon (2008), Computing the Mind (Oxford)
–  burden of proof is on Fetzer!
The Dynamic Difference
• Dynamic Conclusion 2:
– Are minds computers?
• Maybe, maybe not
• I prefer to say (with Shimon Edelman, et al.):
– The (human) mind is a virtual machine,
computationally implemented
(in the nervous system)
Summary: No “Dynamic Difference”
• All (human) thought processes are/might be
describable by algorithms/procedures
= computationalism properly treated
Fetzer’s Affective Difference
ARGUMENT 3: Mental thought transitions are affected by emotions, attitudes,
and histories, but computers are not.
Premise 1: Computers are governed by programs,
which are causal models of algorithms.
Premise 2: Algorithms are effective decision procedures,
which are not affected by emotions, attitudes, or histories.
Premise 3: Mental thought transitions are affected by values of variables
that do not affect computers.
_____________________________________________________________________
Conclusion 1:
The processes controlling mental thought transitions are
fundamentally different than those that control
computer procedures.
_____________________________________________________________________
Conclusion 2:
Minds are not computers.
Figure 11. The Affective Difference
Contra Affective Premises 2 & 3:
• Programs can be based on (idiosyncratic)
emotions, attitudes, & histories
– Rapaport-Ehrlich contextual vocabulary acquisition program
• Learns a meaning for an unfamiliar word from:
– the word’s textual context
– integrated with the reader’s idiosyncratic …
» “denotations”, “connotations”,
» emotions, attitudes, histories,
» & prior beliefs
– Sloman, Picard, Thagard
• Developing computational theories of affect, emotion, etc.
• Emotions, attitudes, & histories can affect computers that model them.
Summary: No “Affective Differences”
• Processes controlling mental thought transitions
are not fundamentally different from
those controlling algorithms/procedures.
• Algorithms can take emotions/attitudes/histories
into account.
• Both computers & minds can be affected by
emotions/attitudes/histories
The Matter of Simulation
ARGUMENT 4: Digital machines can nevertheless simulate thought processes
and other forms of human behavior.
Premise 1: Computer programmers and those who design the systems that they
control can increase their performance capabilities,
making them better and better simulations.
Premise 2: Their performance capabilities may be closer and closer
approximations to the performance capabilities of human beings
without turning them into thinking things.
Premise 3:
Indeed, the static, dynamic, and affective differences that
distinguish computer performance from human
performance preclude them from being thinking things.
______________________________________________________________________________
Conclusion: Although the performance capabilities of digital machines can
become better and better approximations of human behavior,
they are still not thinking things.
Argument from Simulation
• Agreed:
A computer that “simulates” some process P
is not necessarily “really” doing P
– But what is “really doing P” vs. “simulating P”?
– What is the “scope” of a simulation?
• Computer simulations of hurricanes
don’t get real people really wet
– Real people are outside the scope of the simulation
– BUT: a computer simulation of a hurricane could get
simulated people simulatedly wet
• Computer simulation of the daily operations of a bank
is not thereby the daily operations of a (real) bank
– BUT: I can do my banking online
– Simulations can be used as if they were real
Argument from Simulation
• Some simulations of Xs are real Xs:
– scale model of a scale model of X is a scale model of X
– Xeroxed/faxed/PDF copies of documents are those
documents
– A computer that simulates an “informational process”
is thereby actually doing that informational process
• Because
a computer simulation of information is information…
Argument from Simulation
• Computer simulation of a picture is a picture
– digital photography
• Computer simulation of language is language
– computers really do parse sentences (Woods)
– IBM’s Watson really answers questions
• Computer simulation of math is math
– “A simulation of a computation and the computation itself are
equivalent: try to simulate the addition of 2 and 3, and the
result will be just as good as if you ‘actually’ carried out the
addition—that is the nature of numbers” (Edelman)
• Computer simulation of reasoning is reasoning
– automated theorem proving, computational logic,…
Argument from Simulation
• Computer simulation of cognition is cognition
– “if the mind is a computational entity,
a simulation of the relevant computations
would constitute its fully functional replica”
(Edelman)
– cf. “implementational implication”
Summary:
Simulation Can Be(come) the Real Thing
• Close approximation to human thought processes
can turn computers into thinking things
– only asymptotically?
– actually?
• cf. Turing on “general educated opinion”
& “the use of words”
Summary
• Computers are “semiotic (sign-using) systems”.
• Computationalism (properly treated) = cognition is computable…
• …not necessarily computational.
• Any non-computable residue will be negligible
– Mental processes are describable (governable) by algorithmic procedures…
– …that can be affected by emotions/attitudes/individual histories.
– Computers that implement these cognitive procedures
really exhibit those cognitive behaviors.
• They really think.
– Computers can possess minds.
• “Syntactic semantics” explains how all this is possible.