Presentation

advertisement
Resource-Bounded Machines are
Motivated to be Effective, Efficient,
and Curious
Bas R. Steunebrink, Jan Koutník, Kristinn R.
Thórisson, Eric Nivel, Jürgen Schmidhuber
The Swiss AI Lab IDSIA, USI & SUPSI,
Reykjavik University
The Main Argument
1. Explicitly acknowledge resource constraints
2. Identify the constrained resources
3. Design AI system to be driven by better and
better resource utilization
4. Order activities around resource utilization
5. Emergent result: effectiveness, efficiency &
curiosity
Fundamental Resources
Resource
Energy
Input
Time
Compression
Efficiency
Learning
Effectiveness
Drive
(unnamed)
Curiosity
(unnamed)
To improve
Work
Play
Dream
Resource Compression: Why?
• Less time & energy spent  more reward
(resources shared with other agents)
• Less time & energy spent  more left for
future tasks
• Compression of input = learning  better
prepared for unknown future
Driven by Resource Compression Progress
Minimize resource consumption through:
1. Knowledge
– Learn new more effective & efficient routines
– Curious exploration to “fill knowledge gaps”
2. Architecture
– Re-encode known routines for more effective &
efficient execution
– Example: self-compilation
Work—Play—Dream Framework
• Utilize the kinds of activities afforded by the
patterns of interaction with human teachers /
supervisors / users
• Work: fulfill main purpose, no exploration,
store interesting / unexpected events
• Play: curiosity-driven exploration, perform
experiments, may still requires supervision
• Dream: analyze unprocessed events, selfcompilation, task invention for Play
WPD Framework, cont.
• Work—Play—Dream are processes, not states
• Can run in parallel, but sometimes not possible
due to resource/situational constraints
• Combination of Work & Play leads to creativity
• Dreaming may be a necessary side-effect for any
system constrained in computational power and
memory
– “Tired” = buffers reach capacity
– “Dream” = process input history backlog
AERA: An Explicitly Resource-bounded Architecture
•
•
•
•
•
Knowledge is operationally constructive
Model-based & model-driven, hierarchically
Simulation through forward chaining
Planning through backward chaining
Compilation of useful & reliable chains
– Originally for scalability
– Now realized to satisfy the architectural way of
achieving resource compression: re-encoding
How AERA does resource compression
• Consider 3 resources: Time, RAM, HDD
• Time more precious than memory
• Self-compilation leads to better resource
utilization
• Thus AERA must be motivated to self-compile
• Simple analysis of control values yields goals that
give rise to empirical testing of unstable models
• Crux: scheduling needed  Work, Play, Dream
Conclusion
• AERA is being developed as a cognitive
architecture towards AGI
–
–
–
–
Based on many firm principles, but not curiosity
But all ingredients for curiosity are present
Learning & re-encoding both possible!
Thanks to self-compilation ability
• Resource usage compression = principled middle
ground
– No twisting of AERA or Theory of Curiosity
– AERA still based on solid principles
– Curiosity generalized to resources-bounded view
Download