
|
PhD Student Department of Computing Science University of Alberta |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
amw8@ualberta.ca (780) 906-4587 |
How to learn and represent grounded knowledge of a physically embodied agent is a key open problem in artificial intelligence (AI). This problem is challenging because grounding knowledge often necessitates learning at a fine-time scale from low-level sensorimotor data like IR distance sensors and motor velocities. There is also a significant computational challenge. The agent has limited resources with which to learn a large amount of knowledge about a high-dimensional, non-stationary environment. Many of the languages proposed for representing agent knowledge trade-off expressiveness with learnability. Expressive languages, like first order logic, Bayes networks and fuzzy logic can represent a wide range of knowledge, about people, objects and relationships but are computationally expensive to update and often require supervision. Differential equation models (from operations research) and transition matrices (from reinforcement learning), on the other hand, can be learned efficiently from data but are much less expressive. These low-level models cannot directly represent abstract, temporally-extended predictions and require significant prior knowledge. There remains room for exploring new approaches to learning expressive knowledge from experience generated by an autonomous robot.
The goal of my doctoral research is to build a mobile robot that can learn a significant amount of knowledge from its interaction with the world: a demonstration of a complete working system on a robot. To achieve this goal I will design and build an architecture that integrates: (1) representing and updating knowledge about the world, (2) generating behaviour and (3) maintaining internal state of the robot. I will use a new approach to knowledge representation based on value functions and on other ideas and algorithms from reinforcement learning. The scope of my research will include how to scale the system, given this representation of knowledge, by adapting the behaviour generation and state representation update procedures based on experience generated by the robot. My architecture will be the first real-time architecture that can learn flexible, general purpose knowledge about the world directly from sensorimotor data generate by the Critterbot.

The Critterbot
Mail:
|
Department of Computing Science University of Alberta Edmonton, Alberta T6G 2E8 Canada |