EMBODIED AGENTSIN AUGMENTED & VIRTUAL REALITIESCourse E6998-004, Dept. of Computer Science, Columbia University, Fall 2002Prof. Kris R. Thórisson, Ph.D. |
|||||||||
|
|||||||||
1 |
Concepts |
|
| |
Why do we need
architectures? |
|
Ymir: quick summary + Bird's-eye view |
||
| |
Blackboards |
|
Trace examples |
||
2 |
Why Do We Need An Architecture? |
|
| |
A.I. (and multimodal communicative systems design) is about complexity management
|
|
Most psycological and A.I. research has worked on isolated parts of the psyche
|
||
| |
We need a coherent architectural plan to make use of these isolated pieces of information |
|
3 |
Architectural Goals |
|
Holistic architectures can provide communicative interfaces that exploit ...
|
||
System Characteristics Wanted:
|
||
| |
4 |
Ymir Approach |
|
General
-- not "vertical", special-function |
||
Integration -- not divide-and-conquer |
||
| |
Use Existing
Solutions -- create new where needed |
|
"Expert System/Engineering Approach" -- create system by hand |
||
Use blackboards extensively |
||
5 |
Origin of Blackboards |
|
Oliver Selfridge - 1959: "Daemons" paper |
||
HEARSAY-II speech
recognition, CMU - 1980 |
||
Widely used in
AI since |
||
6 |
Ymir |
|
Three Blackboards:
|
||
|
||
![]() |
||
7 |
Functional Sketchboard |
|
"Functional Sketches" of user actions:
E.g.: The user makes a gesture. If the gesture is deictic, we need to look in the direction that she’s pointing. The function of the gesture is to direct our attention in a certain spatial way; however, we may not know what’s there to be seen, it may not even be a deictic gesture - hence “sketches” |
||
The FS is based on the hypothesis that a functional analysis is a precursor to (higher-priority than) multimodal interpretation and real-time multimodal action generation |
||
8 |
Content Blackboard |
|
Communication between the Process Control Layer (PCL) and the Content Layer (CL)
|
||
Processes in the PCL post messages about their own activities and read messages about activities in the CL |
||
| Processes in the CL post messages about their own activities and read messages about what’s happening in the PCL | ||
9 |
Motor Feedback Blackboard |
|
Communication between Action Scheduler and
|
||
Data: status
of ongoing motor actions executing in the AS
|
||
10 |
Blackboard Benefits |
|
Design
|
||
Efficient method to deal with ...
|
||
Isolation of communicative data and data processing |
||
Supports incremental, opportunistic processing |
||
Better forward migration support |
||
11 |
Gandalf |
|
26 Perceptual Modules |
||
35 Decision
Modules |
||
83 Behavior Modules |
||
1 1/2 Knowledge Base |
||
Design Basis- Speech & Dialogue
- Face-to-Face Interaction
- Gaze
- Gesture
|
||
12 |
Ymir BB Trace Example 1 |
|
This example shows the series of system events that happen during and after the user's multimodal communicative act "Delete that box [deictic gesture]" |
13 |
Ymir BB Trace Example 2 |
|
This graph shows internal states of Gandalf during interaction over a 16 second interval (each vertical line marks a second). The highly rythmic gaze pattern observed at the top of the graph indicates a bad eye calibration. Notice that modules 1 through 34 all relate to the user’s behavior |
14 |
Blackboard Implementations |
|
Most are based on event-triggers |
||
Ymir blackboards:
'passive' - modules (agents) determined when to access, what to retrieve,
etc. |
||
More specialized implementations include BBs for planning, with three-part storage: Plans, planning data, planning heuristics |
||
15 |
Other Architectures |
|
Subsumption
architecture [Brooks]
|
||
POSH (parallel-rooted, ordered, slip-stack, hierarchical) reactive plans [Bryson]
|
||
2002©K.R.Thórisson