FROM HERE TO HUMAN-LEVEL AI

Page created by Gilbert Paul

Uncategorized

English

Like
Share
Embed
Fullscreen
Slides
Download HTML
Download PDF
Abuse

←

→

Page content transcription

If your browser does not render page correctly, please read the page content below

FROM HERE TO HUMAN-LEVEL AI

John McCarthy
Computer Science Department
Stanford University
Stanford, CA 94305
jmc@cs.stanford.edu
http://www-formal.stanford.edu/jmc/

Abstract clude nonmonotonic reasoning, ap-
proximate concepts, formalized con-
texts and introspection.
It is not surprising that reaching
human-level AI has proved to be dif-
ficult and progress has been slow—
though there has been important 1 What is Human-Level AI?
progress. The slowness and the de-
mand to exploit what has been dis- The first scientific discussion of human level
covered has led many to mistakenly machine intelligence was apparently by Alan
redefine AI, sometimes in ways that Turing in the lecture [Turing, 1947]. The no-
preclude human-level AI—by rele- tion was amplified as a goal in [Turing, 1950],
gating to humans parts of the task but at least the latter paper did not say what
that human-level computer programs would have to be done to achieve the goal.
would have to do. In the terminology Allen Newell and Herbert Simon in 1954 were
of this paper, it amounts to settling the first people to make a start on program-
for a bounded informatic situation in- ming computers for general intelligence. They
stead of the more general common were over-optimistic, because their idea of
sense informatic situation. what has to be done to achieve human-level in-
Overcoming the “brittleness” of telligence was inadequate. The General Prob-
present AI systems and reaching lem Solver (GPS) took general problem solv-
human-level AI requires programs ing to be the task of transforming one expres-
that deal with the common sense sion into another using an allowed set of trans-
informatic situation—in which the formations.
phenomena to be taken into account Many tasks that humans can do, humans can-
in achieving a goal are not fixed in not yet make computers do. There are two ap-
advance. proaches to human-level AI, but each presents
We discuss reaching human-level AI, difficulties. It isn’t a question of deciding be-
emphasizing logical AI and especially tween them, because each should eventually
emphasizing representation problems succeed; it is more a race.
of information and of reasoning.
Ideas for reasoning in the com- 1. If we understood enough about how the
mon sense informatic situation in- human intellect works, we could simulate

it. However, we don’t have have suffi- A formal theory in the physical sciences deals
cient ability to observe ourselves or others with a bounded informatic situation. Scientists
to understand directly how our intellects decide informally in advance what phenomena
work. Understanding the human brain to take into account. For example, much ce-
well enough to imitate its function there- lestial mechanics is done within the Newtonian
fore requires theoretical and experimental gravitational theory and does not take into ac-
success in psychology and neurophysiol- count possible additional effects such as out-
ogy. 1 See [Newell and Simon, 1972] for gassing from a comet or electromagnetic forces
the beginning of the information process- exerted by the solar wind. If more phenom-
ing approach to psychology. ena are to be considered, a person must make
a new theory. Probabilistic and fuzzy uncer-
2. To the extent that we understand the tainties can still fit into a bounded informatic
problems achieving goals in the world system; it is only necessary that the set of pos-
presents to intelligence we can write intel- sibilities (sample space) be bounded.
ligent programs. That’s what this article
is about. Most AI formalisms also work only in a
bounded informatic situation. What phenom-
ena to take into account is decided by a person
What problems does the world present to in-
before the formal theory is constructed. With
telligence? More narrowly, we consider the
such restrictions, much of the reasoning can be
problems it would present to a human scale
monotonic, but such systems cannot reach hu-
robot faced with the problems humans might
man level ability. For that, the machine will
be inclined to relegate to sufficiently intelli-
have to decide for itself what information is
gent robots. The physical world of a robot
relevant. When a bounded informatic system
contains middle sized objects about which its
is appropriate, the system must construct or
sensory apparatus can obtain only partial in-
choose a limited context containing a suitable
formation quite inadequate to fully determne
theory whose predicates and functions connect
the effects of its future actions. Its mental
to the machine’s inputs and outputs in an ap-
world includes its interactions with people and
propriate way. The logical tool for this is non-
also meta-information about the information
monotonic reasoning.
it has or can obtain.
Our approach is based on what we call the 2 The Common Sense Informatic
common sense informatic situation. In order Situation
to explain the common sense informatic situ-
ation, we contrast it with the bounded infor- Contention: The key to reaching
matic situation that characterizes both formal human-level AI is making systems that
scientific theories and almost all (maybe all) operate successfully in the common
experimental work in AI done so far.2 sense informatic situation.
1
Recent work with positron emission tomography has In general a thinking human is in what we call
identified areas of the brain that consume more glucose the common sense informatic situation first
when a person is doing mental arithmetic. This knowledge
will help build AI systems only when it becomes possible discussed in3 [McCarthy, 1989]. It is more
to observe what is going on in these areas during mental general than any bounded informatic situation.
arithmetic.
2
The textbook [David Poole and Goebel, 1998] puts it The known facts are incomplete, and there is
this way. “To get human-level computational intelligence no a priori limitation on what facts are rel-
it must be the agent itself that decides how to divide up
3
the world, and which relationships to reason about. http://www-formal.stanford.edu/jmc/ailogic.html

evant. It may not even be decided in ad- sure with altitude. However, in every case,
vance what phenomena are to be taken into the physics knowledge is embedded in com-
account. The consequences of actions cannot mon sense knowledge. Thus before one can
be fully determined. The common sense in- use Galileo’s law of falling bodies s = 21 gt2 , one
formatic situation necessitates the use of ap- needs common sense information about build-
proximate concepts that cannot be fully de- ings, their shapes and their roofs.
fined and the use of approximate theories in- Bounded informatic situations are obtained by
volving them. It also requires nonmonotonic nonmonotonically inferring that only the phe-
reasoning in reaching conclusions. nomena that somehow appear to be relevant
The common sense informatic situation also are relevant. In the barometer example, the
includes some knowledge about the system’s student was expected to infer that the barom-
mental state. eter was only to be used in the conventional
way for measuring air pressure. For example,
A nice example of the common sense infor-
matic situation is illustrated by an article in a reasoning system might do this by apply-
the American Journal of Physics some years ing circumscription to a predicate relevant in a
ago. It discussed grading answers to a physics formalism containing also metalinguistic infor-
problem. The exam problem is to find the mation, e.g. that this was a problem assigned
height of a building using a barometer. The in a physics course. Formalizing relevance in
intended solution is to measure the air pres- a useful way promises to be difficult.
sure at the top and bottom of the building Common sense facts and common sense rea-
and multiply the difference by the ratio of the soning are necessarily imprecise. The impreci-
density of mercury to the density of air. sion necessitated by the common sense infor-
However, other answers may be offered. (1) matic situation applies to computer programs
drop the barometer from the top of the build- as well as to people.
ing and measure the time before it hits the Some kinds of imprecision can be represented
ground. (2) Measure the height and length of numerically and have been explored with the
the shadow of the barometer and measure the aid of Bayesian networks, fuzzy logic and simi-
length of the shadow of the building. (3) Rap- lar formalisms. This is in addition to the study
pel down the building using the barometer as of approximation in numerical analysis and the
a measuring rod. (4) Lower the barometer on physical sciences.
a string till it reaches the ground and measure
the string. (5) Offer the barometer to the jani- 3 The Use of Mathematical Logic
tor of the building in exchange for information
about the height. (6) Ignore the barometer, What about mathematical logical languages?
count the stories of the building and multiply
by ten feet. Mathematical logic was devised to formal-
ize precise facts and correct reasoning. Its
Clearly it is not possible to bound in advance founders, Leibniz, Boole and Frege, hoped to
the common sense knowledge of the world use it for common sense facts and reasoning,
that may be relevant to grading the prob- not realizing that the imprecision of concepts
lem. Grading some of the solutions requires used in common sense language was often a
knowledge of the formalisms of physics and the necessary feature and not always a bug. The
physical facts about the earth, e.g. the law biggest success of mathematical logic was in
of falling bodies or the variation of air pres- formalizing mathematical theories. Since the

common sense informatic situation requires AI formalism make programs reason logically.
using imprecise facts and imprecise reason- However, we have to extend logic and extend
ing, the use of mathematical logic for common the programs that use it in various ways.
sense has had limited success. This has caused
One important extension was the development
many people to give up. Gradually, extended of modal logic starting in the 1920s and using
logical languages and even extended forms of it to treat modalities like knowledge, belief and
mathematical logic are being invented and de- obligation. Modalities can be treated either
veloped. by using modal logic or by reifying concepts
It is necessary to distinguish between mathe- and sentences within the standard logic. My
matical logic and particular mathematical log- opinion is that reification in standard logic is
ical languages. Particular logical languages more powerful and will work better.
are determined by a particular choice of con-
A second extension was the formalization of
cepts and the predicate and function symbols
nonmonotonic reasoning beginning in the late
to represent them. Failure to make the dis- 1970s—with circumscription and default logic
tinction has often led to error. When a par- and their variants as the major proposals.
ticular logical language has been shown inad- Nonmonotonic logic has been studied both as
equate for some purpose, some people have pure mathematics and in application to AI
concluded that logic is inadequate. Different problems, most prominently to the formaliza-
concepts and different predicate and function tion of action and causality. Several variants
symbols might still succeed. In the words of of the major formalisms have been devised.
the drive-in movie critic of Grapevine, Texas,
“I’m surprised I have to explain this stuff.” Success so far has been moderate, and it isn’t
clear whether greater success can be obtained
The pessimists about logic or some particular by changing the the concepts and their rep-
set of predicates might try to prove a theorem resentation by predicate and function symbols
about its inadequacies for expressing common or by varying the nonmonotonic formalism. 5
sense.4
We need to distinguish the actual use of logic
Since it seems clear that humans don’t use from what Allen Newell, [Newell, 1981] and
logic as a basic internal representation formal- [Newell, 1993], calls the logic level and which
ism, maybe something else will work better was also proposed in [McCarthy, 1979]6 .
for AI. Researchers have been trying to find
this something else since the 1950s but still
4 Approximate Concepts and
haven’t succeeded in getting anything that is
Approximate Theories
ready to be applied to the common sense in-
formatic situation. Maybe they will eventually Other kinds of imprecision are more funda-
succeed. However, I think the problems listed mental for intelligence than numerical impre-
in the later sections of this article will apply cision. Many phenomena in the world are ap-
to any approach to human-level AI. propriately described in terms of approximate
Mathematical logic has been concerned with concepts. Although the concepts are impre-
how people ought to think rather than how cise, many statements using them have precise
people do think. We who use logic as a basic truth values. We offer two examples: the con-
5
One referee for KR96 foolishly and arrogantly pro-
4
Gödel’s theorem is not relevant to this, because the posed rejecting a paper on the grounds that the inadequacy
question is not one of decideability or of characterizing of circumscription for representing action was known.
6
truth. http://www-formal.stanford.edu/jmc/ascribing.html

cept of Mount Everest and the concept of the posals for handling nonmonotonic reasoning.
welfare of a chicken. The exact pieces of rock In particular, getting from the common sense
and ice that constitute Mount Everest are un-
informatic situation to a bounded informatic
clear. For many rocks, there is no truth of the
situation needs nonmonotonic reasoning.
matter as to whether it is part of Mount Ever-
est. Nevertheless, it is true without qualifica-
6 Elaboration Tolerance
tion that Edmund Hillary and Tenzing Norgay
climbed Mount Everest in 1953 and that John
Human abilities in the common sense infor-
McCarthy never set foot on it.
matic situation also include what may be
The point of this example is that it is possi- called elaboration tolerance—the ability to
ble and even common to have a solid knowl- elaborate a statement of some facts without
edge structure from which solid conclusions having to start all over. Thus when we begin
can be inferred based on a foundation built on to think about a problem, e.g. determining
the quicksand of approximate concepts with- the height of a building, we form a bounded
out definite extensions. context and try to solve the problem within it.
As for the chicken, it is clear that feeding it However, at any time more facts can be added,
helps it and wringing its neck harms it, but e.g. about the precision with which the time
it is unclear what its welfare consists of over for the barometer to fall can be estimated us-
the course of the decade from the time of its ing a stop watch and also the possibilities of
hatching. Is it better off leading a life of poul- acquiring a stop watch.
try luxury and eventually being slaughtered Elaboration Tolerance7 discusses about 25
or would it be better off escaping the chicken elaborations of the Missionaries and Cannibals
yard and taking its chances on starvation and problem.
foxes? There is no truth of the matter to be
What I have so far said so far about ap-
determined by careful investigation of chick-
proximate concepts, nonmonotonic reasoning
ens. When a concept is inherently ap-
and elaboration tolerance is independent of
proximate, it is a waste of time to try to
whether mathematical logic, human language
give it a precise definition. Indeed differ-
or some other formalism is used.
ent efforts to define such a concept precisely
will lead to different results—if any. In my opinion, the best AI results so far have
been obtained using and extending mathemat-
Most human common sense knowledge in-
ical logic.
volves approximate concepts, and reaching
human-level AI requires a satisfactory way
of representing information involving approxi- 7 Formalization of Context
mate concepts.
A third extension of mathematical logic in-
volves formalizing the notion of context8
5 Nonmonotonic Reasoning [McCarthy, 1993]. Notice that when logical
theories are used in human communication
Common sense reasoning is also imprecise in and study, the theory is used in a context
that it draws conclusions that might not be which people can discuss from the outside. If
made if there were more information. Thus computers are to have this facility and are to
common sense reasoning is nonmonotonic. I 7
http://www-formal.stanford.edu/jmc/elaboration.html
will not go into the details of any of the pro- 8
http://www-formal.stanford.edu/jmc/context.html

work within logic, then the “outer” logical lan- travel, but the travel agent will not tell
guage needs names for contexts and sentences his customer to be sure and wear clothes.
giving their relations and a way of entering a
context. Clearly human-level AI requires rea- • The ramification problem concerns how to
soning about context. treat side-effects of events other than the
principal effect mentioned in the event de-
Human-level AI also requires the ability to
scription.
transcend the outermost context the system
has used so far. Besides in [McCarthy, 1993],
this is also discussed in Making Robots Each of these involves elaboration tolerance,
Conscious of their Mental States9 e.g. adding descriptions of the effects of
[McCarthy, 1996]. additional events without having to change
the descriptions of the events already de-
Further work includes [Buvač, 1996] and scribed. When I wrote about applications of
[Buvač et al., 1995]. circumscription to formalizing common
sense10 [McCarthy, 1986], I hoped that a sim-
8 Reasoning about ple abnormality theory would suffice for all of
Events—Especially Actions them. That didn’t work out when I tried it,
but I still think a common nonmonotonic rea-
Reasoning about actions has been a major AI soning mechanism will work. Tom Costello’s
activity, but this paper will not discuss my or draft “The Expressive Power of Circumscript-
other people’s current approaches, concentrat- tion” 11 argues that simple abnormality theo-
ing instead on the long range problem of reach- ries have the same expressive power as more
ing human level capability. We regard actions elaborate nonmonotonic formalisms that have
as particular kinds of events and therefore pro- been proposed.
pose subsuming reasoning about actions under
the heading of reasoning about events. Human level intelligence requires reasoning
about strategies of action, i.e. action pro-
Most reasoning about events has concerned grams. It also requires considering multiple
determining the effects of an explicitly given actors and also concurrent events and contin-
sequence of actions by a single actor. Within uous events. Clearly we have a long way to
this framework various problems have been go.
studied.
Some of these points are discussed in a draft
• The frame problem concerns not having on narrative12 [McCarthy, 1995].
to state what does not change when an
event occurs. 9 Introspection

• The qualification problem concerns not People have a limited ability to observe their
having to state all the preconditions of an own mental processes. For many intellectual
action or other event. The point is both tasks introspection is irrelevant. However, it
to limit the set of preconditions and also is at least relevant for evaluating how one is
to jump to the conclusion that unstated using one’s own thinking time. Human-level
others will be fulfilled unless there is evi- AI will require introspective ability.
dence to the contrary. For example, wear-
10
ing clothes is a precondition for airline http://www-formal.stanford.edu/jmc/applications.html
11
http://www-formal.stanford.edu/tjc/expressive.html
9 12
http://www-formal.stanford.edu/jmc/consciousness.html http://www-formal.stanford.edu/jmc/narrative.html

That robots also need introspection13 summarized as that of succeeding in the com-
is argued and how to do it is discussed in mon sense informatic situation.
[McCarthy, 1996].
The problems include:

10 Heuristics
common sense knowledge of the world
The largest qualitative gap between human Many important aspects of what this
performance and computer performance is in knowledge is in and how it can be
the area of heuristics, even though the gap is represented are still unsolved questions.
disguised in many applications by the millions- This is particularly true of knowledge of
fold speed advantage of computers. The gen- the effects of actions and other events.
eral purpose theorem proving programs run
very slowly, and the special purpose programs epistemologically adequate languages
are very specialized in their heuristics. These are languages for expressing
what a person or robot can ac-
I think the problem lies in our present in- tually learn about the world15
ability to give programs domain and prob- [McCarthy and Hayes, 1969].
lem dependent heuristic advice. In my Ad-
vice Taker paper14 [McCarthy, 1959] I adver- elaboration tolerance What a person
tised that the Advice Taker would express its knows can be elaborated without starting
heuristics declaratively. Maybe that will work, all over.
but neither I nor anyone else has been able to
get a start on the problem in the ensuing al- nonmonotonic reasoning Perhaps new sys-
most 40 years. Josefina Sierra-Santibanez re- tems are needed.
ports on some progress in a forthcoming arti-
cle. contexts as objects This subject is just be-
Another possibility is to express the advice in ginning. See the references of section 7.
a procedure modification language, i.e. to ex-
tend elaboration tolerance to programs. Of introspection AI systems will need to exam-
course, every kind of modularity, e.g. object ine their own internal states.
orientation, gives some elaboration tolerance,
but these devices haven’t been good enough. action The present puzzles of formalizing ac-
tion should admit a uniform solution.
Ideally, a general purpose reasoning system
would be able to accept advice permitting it
to run at a fixed ratio speed of speeds to a I doubt that a human-level intelligent program
special purpose program, e.g. at 1/20 th the will have structures corresponding to all these
speed. entities and to the others that might have been
listed. A generally intelligent logical program
11 Summary probably needs only its monotonic and non-
monotonic reasoning mechanisms plus mecha-
Conclusion: Between us and human-level in- nisms for entering and leaving contexts. The
telligence lie many problems. They can be rest are handled by particular functions and
predicates.
13
http://www-formal.stanford.edu/jmc/consciousness.html
14 15
http://www-formal.stanford.edu/jmc/mcc59.html http://www-formal.stanford.edu/jmc/mcchay69.html

12 Remarks and Acknowledgements It will be much more scientifically satisfying to
understand human level artificial intelligence
1. To what extent will all these problems logically than just achieve it by a computer-
have to be faced explicitly by people ized evolutionary process that produced an in-
working with neural nets and connection- telligent but incomprehensible result. In fact,
ist systems? The systems I know about the logical approach would be worth pursuing
are too primitive for the problems even to even if the intellectually lazy evolutionary ap-
arise. However, more ambitious systems proach won the race.
will inhabit the common sense informatic
situation. They will have to be elabora- References
tion tolerant and will require some kind
of mental model of the consequences of [Buvač, 1996] Buvač, S. (1996). Quantifica-
actions. tional logic of context. In Proceedings of
the Thirteenth National Conference on Ar-
2.
tificial Intelligence.
3. I got useful suggestions from Eyal Amir,
Saša Buvač and Tom Costello. [Buvač et al., 1995] Buvač, S., Buvač, V., and
Mason, I. A. (1995). Metamathematics of
4. Some additional relevant papers are in my contexts. Fundamenta Informaticae, 23(3).
book [McCarthy, 1990] and on my Web
site16 . [David Poole and Goebel, 1998] David Poole,
A. M. and Goebel, R. (1998). Computa-
5. My understanding that I should prepare a tional Intelligence. Oxford.
printable version of this invited talk came
rather late. I expect that both the spoken [McCarthy, 1959] McCarthy, J. (1959). Pro-
version and the 1996 November Web ver- grams with Common Sense17 . In Mecha-
sion will have better explanations of the nisation of Thought Processes, Proceedings
important concepts. of the Symposium of the National Physics
Laboratory, pages 77–84, London, U.K. Her
6. This work was partly supported by ARPA Majesty’s Stationery Office. Reprinted in
(ONR) grant N00014-94-1-0775. McC90.

13 Conclusion [McCarthy, 1979] McCarthy, J. (1979). As-
cribing mental qualities to machines18 . In
Many will find dismayingly large the list of Ringle, M., editor, Philosophical Perspec-
tasks that must be accomplished in order to tives in Artificial Intelligence. Harvester
to reach human-level logical intelligence. Per- Press. Reprinted in [McCarthy, 1990].
haps fewer but more powerful ideas would sim-
plify the list. Others will claim that a sys- [McCarthy, 1986] McCarthy, J. (1986). Ap-
tem that evolves intelligence as life does will plications of Circumscription to Formaliz-
be more straightforward to build. Maybe, but ing Common Sense Knowledge19 . Artifi-
the advocates of that approach have been at it cial Intelligence, 28:89–116. Reprinted in
as long as we have and still aren’t even close. [McCarthy, 1990].
17
http://www-formal.stanford.edu/jmc/mcc59.html
So it’s a race. 18
http://www-formal.stanford.edu/jmc/ascribing.html
16 19
http://www-formal.stanford.edu/jmc/ http://www-formal.stanford.edu/jmc/applications.html

[McCarthy, 1989] McCarthy, J. (1989). Ar-                   [Newell and Simon, 1972] Newell, A. and Si-
  tificial Intelligence, Logic and Formalizing                mon, H. A. (1972). Human Problem Solving.
  Common Sense20 . In Thomason, R., edi-                      Prentice–Hall, Englewood Cliffs, NJ.
  tor, Philosophical Logic and Artificial Intel-
  ligence. Klüver Academic.                                [Turing, 1950] Turing, A. (1950). Computing
                                                              machinery and intelligence. Mind.
[McCarthy, 1990] McCarthy, J. (1990). For-
                                                            [Turing, 1947] Turing, A. M. (1947). Lec-
  malization of common sense, papers by John
                                                              ture to the london mathematical society.
  McCarthy edited by V. Lifschitz. Ablex.
                                                              In The Collected Works of A. M. Tur-
                                                              ing, volume Mechanical Intelligence. North-
[McCarthy, 1993] McCarthy,    J.    (1993).
                                 21                           Holland. This was apparently the first pub-
  Notes on Formalizing Context .         In
                                                              lic introduction of AI, typescript in the
  IJCAI-93.      Available on http://www-
                                                              King’s College archive, the book is 1992.
  formal.stanford.edu/jmc/.

[McCarthy, 1995] McCarthy, J. (1995). Situa-
  tion Calculus with Concurrent Events and
  Narrative22 . Contents subject to change.
  Reference will remain.

[McCarthy, 1996] McCarthy, J. (1996). Mak-
  ing Robots Conscious of their Mental
  States23 . In Muggleton, S., editor, Machine
  Intelligence 15. Oxford University Press.

[McCarthy and Hayes, 1969] McCarthy,       J.
  and Hayes, P. J. (1969). Some Philosophical
  Problems from the Standpoint of Artificial
  Intelligence24 . In Meltzer, B. and Michie,
  D., editors, Machine Intelligence 4, pages
  463–502. Edinburgh University Press.

[Newell, 1981] Newell, A. (1981). The knowl-
  edge level. AI Magazine, 2(2):1–20. Origi-
  nally delivered as the Presidential Address,
  American Association for Artificial Intelli-
  gence, AAAI80, Stanford, CA, August 1980.

[Newell, 1993] Newell, A. (1993). Reflections
  on the knowledge level. Artificial Intelli-
  gence, 59(1-2):31–38.
 20
    http://www-formal.stanford.edu/jmc/ailogic.html
 21
    http://www-formal.stanford.edu/jmc/context.html
 22
    http://www-formal.stanford.edu/jmc/narrative.html
 23
    http://www-formal.stanford.edu/jmc/consciousness.html
 24
    http://www-formal.stanford.edu/jmc/mcchay69.html

You can also read