Unscripted Grounded Visual Learning - Rescheduled

Stella Yu

Computer vision has made remarkable advances through data-driven learning of image-text associations. Large-scale vision and language models like CLIP, SAM, and ChatGPT can generate compelling descriptions of images. However, these models, trained with scripted data and limited grounding, often struggle to provide detailed visual evidence and to generalize across a diverse range of infrequent visual concepts during testing. In contrast, human infants develop robust visual understanding from limited experiences, even before acquiring language. This contrast raises crucial questions: What are we missing? Do we not see without naming our visual experiences? Can vision be developed entirely from visual data without predefined labels and semantic knowledge? I will present our research progress on how we can computationally learn to abstract and generalize visual concepts directly from images and videos.

Speaker: Stella Yu, University of Michigan

Editor's Note: This talk has been rescheduled for May 27, 2025.

Thursday, 03/20/25

04:00 PM - 04:50 PM

Contact:

Website: Click to Visit

Cost:

Free

Save this Event:

iCalendar
Google Calendar
Yahoo! Calendar
Windows Live Calendar

Sonoma State Dept. of Engineering Science

1801 East Cotati Ave
Cerent Engineering Science Complex, Salazar Hall Room #2009A
Rohnert Park, CA 94928

Phone: (707) 664-2030
Website: Click to Visit

<						>
S	M	T	W	T	F	S
			01	02	03	04
05	06	07	08	09	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

Thursday, 03/20/25

Contact:

Cost:

Save this Event:

Sonoma State Dept. of Engineering Science

Categories: