Vishal M. Patel
23
Papers
237
Total Citations
Papers (23)
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
CVPR 2024
47
citations
View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network
CVPR 2024
32
citations
Distilling Multi-modal Large Language Models for Autonomous Driving
CVPR 2025
27
citations
MonoDiff: Monocular 3D Object Detection and Pose Estimation with Diffusion Models
CVPR 2024
24
citations
Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset
CVPR 2025
14
citations
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning
NeurIPS 2025
14
citations
LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation
CVPR 2024
13
citations
AWRaCLe: All-Weather Image Restoration Using Visual In-Context Learning
AAAI 2025
12
citations
The Power of Context: How Multimodality Improves Image Super-Resolution
CVPR 2025
12
citations
STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models
CVPR 2025
12
citations
Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions
ECCV 2024arXiv
10
citations
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
CVPR 2025
8
citations
Perception in Reflection
ICML 2025
7
citations
Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection
ECCV 2024
3
citations
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing
AAAI 2025
2
citations
CrowdDiff: Multi-hypothesis Crowd Density Estimation using Diffusion Models
CVPR 2024
0
citations
Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image
CVPR 2024
0
citations
SINR: Sparsity Driven Compressed Implicit Neural Representations
CVPR 2025
0
citations
CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation
CVPR 2024
0
citations
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
CVPR 2025
0
citations
SegFace: Face Segmentation of Long-Tail Classes
AAAI 2025
0
citations
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
CVPR 2025
0
citations
MIRE: Matched Implicit Neural Representations
CVPR 2025
0
citations