Yu-Chiang Frank Wang

38
Papers
88
Total Citations

Papers (38)

SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation

ECCV 2024
57
citations

Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models

ECCV 2024
16
citations

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

CVPR 2025arXiv
9
citations

Seg2Reg: Differentiable 2D Segmentation to 1D Regression Rendering for 360 Room Layout Reconstruction

CVPR 2024
5
citations

RAPPER: Reinforced Rationale-Prompted Paradigm for Natural Language Explanation in Visual Question Answering

ICLR 2024
1
citations

Segment Anything, Even Occluded

CVPR 2025
0
citations

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

CVPR 2025
0
citations

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

CVPR 2025
0
citations

Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation

ICCV 2025
0
citations

Continual Personalization for Diffusion Models

ICCV 2025
0
citations

EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction

NeurIPS 2025arXiv
0
citations

Language-Guided Transformer for Federated Multi-Label Classification

AAAI 2024
0
citations

GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding

CVPR 2024
0
citations

Propagated Image Filtering

CVPR 2015
0
citations

Learning Cross-Domain Landmarks for Heterogeneous Domain Adaptation

CVPR 2016
0
citations

Multi-Label Zero-Shot Learning With Structured Knowledge Graphs

CVPR 2018arXiv
0
citations

Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation

CVPR 2018arXiv
0
citations

Towards Scene Understanding: Unsupervised Monocular Depth Estimation With Semantic-Aware Representation

CVPR 2019
0
citations

Spot and Learn: A Maximum-Entropy Patch Sampler for Few-Shot Image Classification

CVPR 2019
0
citations

Convolution in the Cloud: Learning Deformable Kernels in 3D Graph Convolution Networks for Point Cloud Analysis

CVPR 2020
0
citations

Learning Identity-Invariant Motion Representations for Cross-ID Face Reenactment

CVPR 2020
0
citations

LayoutTransformer: Scene Layout Generation With Conceptual and Spatial Diversity

CVPR 2021
0
citations

NeurMiPs: Neural Mixture of Planar Experts for View Synthesis

CVPR 2022arXiv
0
citations

Scene Graph Expansion for Semantics-Guided Image Outpainting

CVPR 2022arXiv
0
citations

Bias-Eliminating Augmentation Learning for Debiased Federated Learning

CVPR 2023
0
citations

Unsupervised Domain Adaptation With Imbalanced Cross-Domain Data

ICCV 2015
0
citations

No More Discrimination: Cross City Adaptation of Road Scene Segmenters

ICCV 2017arXiv
0
citations

Recover and Identify: A Generative Dual Model for Cross-Resolution Person Re-Identification

ICCV 2019
0
citations

Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation

ICCV 2019
0
citations

Efficient Model Personalization in Federated Learning via Client-Specific Prompt Generation

ICCV 2023arXiv
0
citations

Learning to Learn in a Semi-Supervised Fashion

ECCV 2020
0
citations

VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models

CVPR 2025
0
citations

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

CVPR 2025
0
citations

3D Gaussian Inpainting with Depth-Guided Cross-View Consistency

CVPR 2025
0
citations

Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration

CVPR 2025
0
citations

Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering

CVPR 2025
0
citations

A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation

NeurIPS 2018
0
citations

Adversarial Teacher-Student Representation Learning for Domain Generalization

NeurIPS 2021
0
citations