Latest events

Search events

Enter a keyword to search events within the date range above.

AIML Connect Fridays: Towards the 3D Human Foundation Agent

Professor Michael Black

This talk will describe current progress on building a 3D Human Foundation Agent (HFA) that can perceive the world and the humans in it. The HFA is a digitally embodied agent that understands human behavior and responds to it using its 鈥渕otor system鈥 to translate its goals into 3D actions. The Human Foundation Agent must (1) perceive human movement in 3D, (2) understand the goals, implications, and emotions inherent in that movement, and (3) plan and generate natural motor activity to (4) drive a digital or physical embodiment that interacts with real or virtual humans in real or virtual 3D worlds. This talk will focus on current progress and the path to building HFAs through 3D human motion capture from video, synthetic training data, generative behaviour modelling, AI-driven graphics, and large vision-language models that are fine-tuned to understand 3D humans. HFAs will radically change how people interact with machines. So much so that a child born today will have trouble imagining a world in which technology doesn鈥檛 understand their motions and behaviours.

[Read more about AIML Connect Fridays: Towards the 3D Human Foundation Agent]

AIML Research Seminar: What Makes AI Safe for Human Decision-Making?聽

Lana Tikhomirov

Abstract: Lana is a cognitive psychology student at AIML investigating medical decision-making with AI. She incorporates knowledge of human factors, expertise, and cognitive science with areas of bioethics and AI safety. She will cover the work of her PhD, findings, and future directions.

[Read more about AIML Research Seminar: What Makes AI Safe for Human Decision-Making?聽]

AIML Special Presentation: From Instructional Diagrams to Real-World Assembly

Jiahao Zhang

Abstract: From instructional diagrams to real-world assembly, understanding and bridging the gap between diagrammatic instructions and practical actions is a significant challenge. To address this, we introduce the Ikea Assembly in the Wild (IAW) Dataset, which comprises 183 hours of diverse furniture assembly videos and nearly 8,300 corresponding illustrations from assembly manuals, annotated with their ground truth alignments. Leveraging this dataset, we tackle three key tasks: First, we align segments of in-the-wild assembly videos with corresponding instructional diagrams using a supervised contrastive learning approach that captures subtle diagrammatic details. Second, we predict the precise start and end times of the steps outlined in the manuals within the video sequences, enabling accurate temporal grounding. Finally, we demonstrate a furniture assembly pipeline where furniture parts are selected and their 6D poses are predicted based on cues from the instructional diagrams. Our methods significantly advance multimodal alignment and learning for practical assembly, achieving state-of-the-art performance on tasks such as retrieval, temporal grounding, and 3D part assembly.

[Read more about AIML Special Presentation: From Instructional Diagrams to Real-World Assembly]

AIML Special Presentation: Several Recent Results in 4D Reconstruction and Generative Modelling

Dr Vladislav Golyanik

Dr Golyanik鈥檚 talk focuses on several recent approaches for generative modelling and 3D reconstruction of non-rigid scenes from various conditioning signals and inputs, such as textual prompts, RGB images, or event streams. He also outlines promising directions for how the field could benefit from modern and upcoming quantum hardware. 

[Read more about AIML Special Presentation: Several Recent Results in 4D Reconstruction and Generative Modelling]

RSS News Feed