Yu-Chiang Frank Wang
38
Papers
88
Total Citations
Papers (38)
SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation
ECCV 2024
57
citations
Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models
ECCV 2024arXiv
16
citations
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
CVPR 2025arXiv
9
citations
Seg2Reg: Differentiable 2D Segmentation to 1D Regression Rendering for 360 Room Layout Reconstruction
CVPR 2024
5
citations
RAPPER: Reinforced Rationale-Prompted Paradigm for Natural Language Explanation in Visual Question Answering
ICLR 2024
1
citations
Segment Anything, Even Occluded
CVPR 2025
0
citations
UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing
CVPR 2025
0
citations
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
CVPR 2025
0
citations
Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation
ICCV 2025
0
citations
Continual Personalization for Diffusion Models
ICCV 2025
0
citations
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction
NeurIPS 2025arXiv
0
citations
Language-Guided Transformer for Federated Multi-Label Classification
AAAI 2024
0
citations
GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding
CVPR 2024
0
citations
Propagated Image Filtering
CVPR 2015
0
citations
Learning Cross-Domain Landmarks for Heterogeneous Domain Adaptation
CVPR 2016
0
citations
Multi-Label Zero-Shot Learning With Structured Knowledge Graphs
CVPR 2018arXiv
0
citations
Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation
CVPR 2018arXiv
0
citations
Towards Scene Understanding: Unsupervised Monocular Depth Estimation With Semantic-Aware Representation
CVPR 2019
0
citations
Spot and Learn: A Maximum-Entropy Patch Sampler for Few-Shot Image Classification
CVPR 2019
0
citations
Convolution in the Cloud: Learning Deformable Kernels in 3D Graph Convolution Networks for Point Cloud Analysis
CVPR 2020
0
citations
Learning Identity-Invariant Motion Representations for Cross-ID Face Reenactment
CVPR 2020
0
citations
LayoutTransformer: Scene Layout Generation With Conceptual and Spatial Diversity
CVPR 2021
0
citations
NeurMiPs: Neural Mixture of Planar Experts for View Synthesis
CVPR 2022arXiv
0
citations
Scene Graph Expansion for Semantics-Guided Image Outpainting
CVPR 2022arXiv
0
citations
Bias-Eliminating Augmentation Learning for Debiased Federated Learning
CVPR 2023
0
citations
Unsupervised Domain Adaptation With Imbalanced Cross-Domain Data
ICCV 2015
0
citations
No More Discrimination: Cross City Adaptation of Road Scene Segmenters
ICCV 2017arXiv
0
citations
Recover and Identify: A Generative Dual Model for Cross-Resolution Person Re-Identification
ICCV 2019
0
citations
Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation
ICCV 2019
0
citations
Efficient Model Personalization in Federated Learning via Client-Specific Prompt Generation
ICCV 2023arXiv
0
citations
Learning to Learn in a Semi-Supervised Fashion
ECCV 2020
0
citations
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
CVPR 2025
0
citations
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
CVPR 2025
0
citations
3D Gaussian Inpainting with Depth-Guided Cross-View Consistency
CVPR 2025
0
citations
Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration
CVPR 2025
0
citations
Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering
CVPR 2025
0
citations
A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation
NeurIPS 2018
0
citations
Adversarial Teacher-Student Representation Learning for Domain Generalization
NeurIPS 2021
0
citations