Current Publications
-
LEAP: LLM-Generation of Egocentric Action Programs
We introduce LEAP (illustrated in Figure 1), a novel method for generating video-grounded action programs through use of a Large Language Model (LLM). These action programs represent the motoric, perceptual,...
-
Mid-Vision Feedback for Convolutional Neural Networks
Feedback plays a prominent role in biological vision, where perception is modulated based on agents’ continuous interactions with the world, and evolving expectations and world model. We introduce a novel...
-
Therbligs in Action: Video Understanding through Motion Primitives
Eadom Dessalene, Michael Maynord, Cornelia Fermüller, Yiannis Aloimonos. University of Maryland, College Park, MD 20742, USA. In this paper we introduce a rule-based,...
-
Egocentric Object Manipulation Graphs
We introduce Egocentric Object Manipulation Graphs (Ego-OMG) – a novel representation for activity modeling and anticipation of near future actions integrating three components: 1) semantic temporal structure of activities, 2) short-term...
-
Data-Driven Goal Generation for Integrated Cognitive Systems
We describe our Meta-cognitive, Integrated, Dual-Cycle Architecture (MIDCA), whose purpose is to provide agents with a greater capacity for acting in an open world and dealing with unexpected events. We...