Koustuv Sinha
4
Papers
46
Total Citations
Papers (4)
Scaling Language-Free Visual Representation Learning
ICCV 2025 (arXiv)
39 citations
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
ICLR 2025
7 citations
Controlling Multimodal LLMs via Reward-guided Decoding
ICCV 2025 (arXiv)
0 citations
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
ICCV 2025
0 citations