Yann LeCun
13 Papers
991 Total Citations
Papers (13)
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
CVPR 2024
570
citations
Navigation World Models
CVPR 2025, arXiv
136
citations
Layer by Layer: Uncovering Hidden Representations in Language Models
ICML 2025
118
citations
Transformers without Normalization
CVPR 2025, arXiv
96
citations
Scaling Language-Free Visual Representation Learning
ICCV 2025, arXiv
39
citations
Hierarchical World Models as Visual Whole-Body Humanoid Controllers
ICLR 2025, arXiv
20
citations
RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training
CVPR 2025
8
citations
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
ICLR 2025
4
citations
Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation
CVPR 2025
0
citations
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
ICCV 2025
0
citations
How Learning by Reconstruction Produces Uninformative Features For Perception
ICML 2024
0
citations
The Entropy Enigma: Success and Failure of Entropy Minimization
ICML 2024
0
citations
Stochastic positional embeddings improve masked image modeling
ICML 2024
0
citations