Dr. Gerald Tesauro, the IBM Research scientist who taught
Watson how to make
wagers when its Jeopardy!, has been named an Association for the
Advancement of Artificial Intelligence (AAAI) Fellow. His development of TD-Gammon,
“a self-teaching neural network that learned to play backgammon at human world
championship level,” and work applying machine learning across disciplines from
computer virus recognition to computer chess, and other fields made him an
ideal candidate for the association’s title.
You’ve worked on machines that play Jeopardy!, chess and
backgammon. What is the significance of
machines that can play games?
![]() |
Dr. Gerald Tesauro |
By working in these domains, researchers made enormous progress in search, learning, and simulation techniques, to the point where the best computers now surpass the best humans in virtually all classic board games. As a result, AI is now moving on to tackle real-world ambiguity head-on.
In the Jeopardy! Grand Challenge, we still had a game
environment with precise rules of play, but now had to deal with highly
ambiguous natural-language questions, having no explicitly defined meaning.
Looking forward, the next "Drosophila of AI" may be in life-like
virtual reality games, such as World of Warcraft. In such environments, AI
software would need to move simulated bodies via simulated physics, and would
need to engage in deep dialogues (including bargaining, persuasion, etc.) with
other human or computerized players.
How does a machine learning to play a game translate to
things like e-commerce and virus recognition?
One aspect of learning in games is learning how to detect
generalizable structure in a game state (i.e., "pattern recognition")
that is useful for categorizing or evaluating the state. This type of learning
directly carries over to virus recognition, where we look for patterns in the
raw binaries of .EXE files that may indicate likelihood of infection. The other
main aspect is learning how to make the best decision (i.e., select the best
move) to achieve the player's long-range objectives.
By developing general-purpose Reinforcement Learning algorithms in game environments, we were able to then directly apply those algorithms in both e-commerce (submitting the optimal bid in a double-auction marketplace) as well as in autonomic computing (dynamically assigning server capacity to transactional workloads in data centers).
By developing general-purpose Reinforcement Learning algorithms in game environments, we were able to then directly apply those algorithms in both e-commerce (submitting the optimal bid in a double-auction marketplace) as well as in autonomic computing (dynamically assigning server capacity to transactional workloads in data centers).
Now that Watson is working in medicine and customer
service, what new things are you teaching it?
Personally I'm not teaching it anything. My motto for
Machine Learning is "Human out of the loop." Actually, I'm part of a
big team that is articulating IBM's vision and roadmap for "Cognitive
Computing." Besides Watson, IBM has many other technology components that
contribute to Cognitive Computing, such as SyNAPSE, a computational platform
that leverages brain architecture principles, and IMARS, which provides
semantically meaningful labeling of raw multimedia (speech, image, video, etc.)
content.
My colleagues and I are working out how to combine our various technology offerings to create an enhanced version of Watson, with sufficient capabilities at natural language dialogue, massive-scale multi-modal inference, etc., to participate as a genuine partner in a collaborative problem-solving team.
My colleagues and I are working out how to combine our various technology offerings to create an enhanced version of Watson, with sufficient capabilities at natural language dialogue, massive-scale multi-modal inference, etc., to participate as a genuine partner in a collaborative problem-solving team.
What are you working on now? Where else can theoretical and
applied machine learning be used?
Guess what -- it's all about Analytics on Big Data. One
current topic is choosing what data to train on in a high-volume streaming
environment. Imagine there is so much data coming in so rapidly that you could not
keep up if you looked at all of it. So,
the question is, how do you choose the best subset to examine, given that you
can never see the full data for any example?
I'm also using massive amounts of weather data from geosynchronous satellites to learn predictive models of available solar energy, over a wide range of spatial and temporal scales. Accurate predictions could result in billions of dollars of spending reductions in the US on unnecessary backup capacity by the utility companies.
I'm also using massive amounts of weather data from geosynchronous satellites to learn predictive models of available solar energy, over a wide range of spatial and temporal scales. Accurate predictions could result in billions of dollars of spending reductions in the US on unnecessary backup capacity by the utility companies.
What does it mean to you to be named an AAAI Fellow?
I've already been honored by the many colleagues who have
built upon my work, and many students who have been inspired to seek careers
related to AI. But it's a special honor and privilege to be officially
recognized by the leading professional society devoted to AI, and to be counted
in the company of so many esteemed earlier Fellows, including all of the
founders of the field.
Few years ago (2009), I discovered the first Mario AI competition, an IA contest.
ReplyDeleteSince then, I see games as "testbeds" and a bright idea to let IA programs learn from other (more & more realistic) programs.
I wrote in my 2009 post A bright idea to let IA programs learn from other programs:
"I see that bright idea may bring more:
- video games could provide a testbed for optical recognition, move detection recognition, etc.
- these games could be a platform for decision-making algorithms.
- online game platforms could provide a way of testing collaboration.
- etc.
While online games, like WoW, are mainly targeting human users, it may be interesting for IA developers to be able to plug their IA to a drive a character avatar. The success of such avatar, among other real users' avatars, could be somewhat seen as a the result of a kind of Turing test. Plugging IA for driving avatars, and developing an open community for exchanging such IA details and code, may help IA development.
Well, there is a coevolution involving video games and IA. This coevolution has already started because IA drives non-human characters of a game. The various environments games provide, and the development of the game industry, may be an (interesting) opportunity to help IA development."
Another post in 2010 was the follow-up of this IA/game relationship trend: Starcraft enables to improve IA while prodiving (business) game API.
So, I am glad to have read here others in the IA field are thinking about WoW (for example) for improving IA.
Gaming is always a part of learning as we seen in kids. Of course kids can be told as neural biological machines playing games to learn. From your article it seems that machines are no more kids, entering into their teens. Development of general-purpose Reinforcement Learning algorithms in game environments could be competing in the real world may not be far away
ReplyDelete