Alessio Tonioni
6
Papers
57
Total Citations
Papers (6)
Text-Conditioned Resampler For Long Form Video Understanding
ECCV 2024arXiv
24
citations
Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos
CVPR 2025
14
citations
Active Data Curation Effectively Distills Large-Scale Multimodal Models
CVPR 2025
14
citations
Test-Time Visual In-Context Tuning
CVPR 2025
4
citations
UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint
ICCV 2025arXiv
1
citations
Zero-Shot Styled Text Image Generation, but Make It Autoregressive
CVPR 2025
0
citations