Multimodal Learning from Videos: Self-Supervised Pre-Training, Post-Training Alignment, and Benchmarks

Shranjeno v:
Bibliografske podrobnosti
izdano v:ProQuest Dissertations and Theses (2025)
Glavni avtor: Sarkar, Pritam
Drugi avtorji: Posen, Aaron, Beirami, Ahmad, Ebrahimi, Sayna, Arık, Sercan, Pfister, Tomas
Izdano:
ProQuest Dissertations & Theses
Teme:
Online dostop:Citation/Abstract
Full Text - PDF
Oznake: Označite
Brez oznak, prvi označite!