Rama Chellappa

32 Papers

63 Total Citations

Papers (32)

Jack of All Tasks, Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model

Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval

DuoLoRA: Cycle-consistent and Rank-disentangled Content-Style Personalization

MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild

NeurIPS 2025

Medical World Model

FaceXFormer: A Unified Transformer for Facial Analysis

ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models

Enrich and Detect: Video Temporal Grounding with Multimodal LLMs

TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision

Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training

SAINT: Spatially Aware Interpolation NeTwork for Medical Slice Synthesis

3DRegNet: A Deep Neural Network for 3D Point Registration

Hierarchical Video Prediction Using Relational Layouts for Human-Object Interactions

HyperSegNAS: Bridging One-Shot Neural Architecture Search With 3D Medical Image Segmentation Using HyperNet

EyePAD++: A Distillation-Based Approach for Joint Eye Authentication and Presentation Attack Detection Using Periocular Images

Segment and Complete: Defending Object Detectors Against Adversarial Patch Attacks With Robust Patch Detection

HaLP: Hallucinating Latent Positives for Skeleton-Based Self-Supervised Learning of Actions

PASS: Protected Attribute Suppression System for Mitigating Bias in Face Recognition

The Pursuit of Knowledge: Discovering and Localizing Novel Categories Using Dual Memory

EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone

MOST: Multiple Object Localization with Self-Supervised Transformers for Object Discovery

SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining

STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos

The Devil is in the Details: Self-Supervised Attention for Vehicle Re-Identification

Visual Question Answering on Image Sets

Where in the World Is This Image? Transformer-Based Geo-Localization in the Wild

Dual Manifold Adversarial Robustness: Defense against Lp and non-Lp Adversarial Attacks

NeurIPS 2020

Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation

NeurIPS 2020

Sleeper Agent: Scalable Hidden Trigger Backdoors for Neural Networks Trained from Scratch

NeurIPS 2022

FeLMi: Few shot Learning with hard Mixup

Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks

NeurIPS 2023

Certified Robustness via Dynamic Margin Maximization and Improved Lipschitz Regularization

NeurIPS 2023