Speaker: Haotian Liu (UW-Madison)
Title: Steerable Visual Intelligence
Time: Mar 8, 2024, 12:30 PM - 1:30 PM CT
Abstract: Understanding and reasoning about the visual world based on human instructions has long been a challenging problem. The previous paradigm, which trained supervised models on many sub-tasks and unified them into a large system, was not streamlined and offered limited steerability. In this talk, I will introduce two of my recent works, REACT and the LLaVA series, which approach this problem by enhancing customizability through retrieval and improving steerability with natural language instructions. We demonstrate that REACT and the LLaVA series offer a promising path toward building customizable large multimodal models that follow human intent at an affordable cost. Finally, I will present several future directions I am eager to explore in building next-generation steerable visual intelligence systems.
Bio: Haotian Liu is a final-year PhD student at the University of Wisconsin-Madison, advised by Prof. Yong Jae Lee. His research focuses on computer vision and vision-language multimodal learning. His recent work has centered on building customizable and steerable large models that follow human intent, including instruction-following multimodal models, controllable image generation, and customizable foundation models. He co-organized the 1st and 2nd Workshop on Computer Vision in the Wild at ECCV 2022 and CVPR 2023.
Location: Engineering Research Building (1500 Engineering Drive) Room 514
Oct 1, 2024