Month: April 2025
-
LEAP: LLM-Generation of Egocentric Action Programs
We introduce LEAP (illustrated in Figure 1), a novel method for generating video-grounded action programs through use of a Large Language Model (LLM). These action programs represent the motoric, perceptual,...
-
Mid-Vision Feedback for Convolutional Neural Networks
Feedback plays a prominent role in biological vision, where perception is modulated based on agents’ continuous interactions with the world, and evolving expectations and world model. We introduce a novel...
-
Therbligs in Action: Video Understanding through Motion Primitives
Therbligs in Action: Video Understanding through Motion PrimitivesEadom Dessalene, Michael Maynord, Cornelia Ferm¨uller, Yiannis AloimonosUniversity of Maryland, College ParkCollege Park, MD 20742, USA{edessale,maynord,fermulcm,[email protected]} In this paper we introduce a rule-based,...
-
Egocentric Object Manipulation Graphs
We introduce Egocentric Object Manipulation Graphs (Ego-OMG) – a novel repre-sentation for activity modeling and anticipation of near future actions integratingthree components: 1) semantic temporal structure of activities, 2) short-term...