Rama Chellappa
32
Papers
63
Total Citations
Papers (32)
Jack of All Tasks, Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model
CVPR 2024 · arXiv
50
citations
Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
CVPR 2025
9
citations
DuoLoRA: Cycle-consistent and Rank-disentangled Content-Style Personalization
ICCV 2025 · arXiv
3
citations
MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild
NeurIPS 2025 · arXiv
1
citations
Medical World Model
ICCV 2025
0
citations
FaceXFormer: A Unified Transformer for Facial Analysis
ICCV 2025 · arXiv
0
citations
ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models
ICCV 2025 · arXiv
0
citations
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
ICCV 2025 · arXiv
0
citations
TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision
ICCV 2025 · arXiv
0
citations
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
CVPR 2024 · arXiv
0
citations
SAINT: Spatially Aware Interpolation NeTwork for Medical Slice Synthesis
CVPR 2020 · arXiv
0
citations
3DRegNet: A Deep Neural Network for 3D Point Registration
CVPR 2020 · arXiv
0
citations
Hierarchical Video Prediction Using Relational Layouts for Human-Object Interactions
CVPR 2021
0
citations
HyperSegNAS: Bridging One-Shot Neural Architecture Search With 3D Medical Image Segmentation Using HyperNet
CVPR 2022 · arXiv
0
citations
EyePAD++: A Distillation-Based Approach for Joint Eye Authentication and Presentation Attack Detection Using Periocular Images
CVPR 2022
0
citations
Segment and Complete: Defending Object Detectors Against Adversarial Patch Attacks With Robust Patch Detection
CVPR 2022 · arXiv
0
citations
HaLP: Hallucinating Latent Positives for Skeleton-Based Self-Supervised Learning of Actions
CVPR 2023 · arXiv
0
citations
PASS: Protected Attribute Suppression System for Mitigating Bias in Face Recognition
ICCV 2021 · arXiv
0
citations
The Pursuit of Knowledge: Discovering and Localizing Novel Categories Using Dual Memory
ICCV 2021 · arXiv
0
citations
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
ICCV 2023 · arXiv
0
citations
MOST: Multiple Object Localization with Self-Supervised Transformers for Object Discovery
ICCV 2023 · arXiv
0
citations
SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining
ICCV 2023 · arXiv
0
citations
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos
ICCV 2023 · arXiv
0
citations
The Devil is in the Details: Self-Supervised Attention for Vehicle Re-Identification
ECCV 2020
0
citations
Visual Question Answering on Image Sets
ECCV 2020
0
citations
Where in the World Is This Image? Transformer-Based Geo-Localization in the Wild
ECCV 2022
0
citations
Dual Manifold Adversarial Robustness: Defense against Lp and non-Lp Adversarial Attacks
NeurIPS 2020 · arXiv
0
citations
Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation
NeurIPS 2020 · arXiv
0
citations
Sleeper Agent: Scalable Hidden Trigger Backdoors for Neural Networks Trained from Scratch
NeurIPS 2022 · arXiv
0
citations
FeLMi: Few shot Learning with hard Mixup
NeurIPS 2022
0
citations
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks
NeurIPS 2023 · arXiv
0
citations
Certified Robustness via Dynamic Margin Maximization and Improved Lipschitz Regularization
NeurIPS 2023 · arXiv
0
citations