Vishal M. Patel

23
Papers
237
Total Citations

Papers (23)

JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation

CVPR 2024
47
citations

View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network

CVPR 2024
32
citations

Distilling Multi-modal Large Language Models for Autonomous Driving

CVPR 2025
27
citations

MonoDiff: Monocular 3D Object Detection and Pose Estimation with Diffusion Models

CVPR 2024
24
citations

Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset

CVPR 2025
14
citations

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

NeurIPS 2025
14
citations

LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation

CVPR 2024
13
citations

AWRaCLe: All-Weather Image Restoration Using Visual In-Context Learning

AAAI 2025
12
citations

The Power of Context: How Multimodality Improves Image Super-Resolution

CVPR 2025
12
citations

STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models

CVPR 2025
12
citations

Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions

ECCV 2024arXiv
10
citations

GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration

CVPR 2025
8
citations

Perception in Reflection

ICML 2025
7
citations

Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection

ECCV 2024
3
citations

SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing

AAAI 2025
2
citations

CrowdDiff: Multi-hypothesis Crowd Density Estimation using Diffusion Models

CVPR 2024
0
citations

Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image

CVPR 2024
0
citations

SINR: Sparsity Driven Compressed Implicit Neural Representations

CVPR 2025
0
citations

CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation

CVPR 2024
0
citations

Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning

CVPR 2025
0
citations

SegFace: Face Segmentation of Long-Tail Classes

AAAI 2025
0
citations

Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models

CVPR 2025
0
citations

MIRE: Matched Implicit Neural Representations

CVPR 2025
0
citations