Most Cited 2025 "4d radar" Papers
22,274 papers found • Page 99 of 112
Conference
Effective Diffusion Transformer Architecture for Image Super-Resolution
Kun Cheng, Lei Yu, Zhijun Tu et al.
Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation
Zesen Cheng, Kehan Li, Li Hao et al.
Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment
Zhixin Cheng, Jiacheng Deng, Xinjun Li et al.
Zero-Shot Scene Change Detection
Kyusik Cho, Dong Yeop Kim, Euntai Kim
Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting
Dasol Choi, Dongbin Na
SIDL: A Real-World Dataset for Restoring Smartphone Images with Dirty Lenses
Sooyoung Choi, Sungyong Park, Heewon Kim
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces
Wonhyeok Choi, Kyumin Hwang, Minwoo Choi et al.
MASS: Overcoming Language Bias in Image-Text Matching
Jiwan Chung, Seungwon Lim, Sangkyu Lee et al.
AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples
Antonio Emanuele Cinà, Jérôme Rony, Maura Pintor et al.
GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud
Tao Dai, Yanzi Wang, Jianyu Xiong et al.
Harmonious Music-driven Group Choreography with Trajectory-Controllable Diffusion
Yuqin Dai, Wanlu Zhu, Ronghui Li et al.
Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation
Quan Dao, Hao Phung, Trung Tuan Dao et al.
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas, Matthew Shreve, Xuelu Li et al.
Single Exposure Quantitative Phase Imaging with a Conventional Microscope Using Diffusion Models
Gabriel della Maggiora, Luis Alberto Croquevielle, Harry Horsley et al.
Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling
Hui Deng, Jiawei Shi, Zhen Qin et al.
DiffCorr: Conditional Diffusion Model with Reliable Pseudo-Label Guidance for Unsupervised Point Cloud Shape Correspondence
Jiacheng Deng, Jiahao Lu, Zhixin Cheng et al.
Adaptive Siamese Masked Autoencoder with Global Optimization for Unsupervised Point Cloud Shape Correspondence
Jiacheng Deng, Jiahao Lu
OTIAS: OcTree Implicit Adaptive Sampling for Multispectral and Hyperspectral Image Fusion
Shangqi Deng, Jun Ma, Liang-Jian Deng et al.
Boundary-Aware Temporal Dynamic Pseudo-Supervision Pairs Generation for Zero-Shot Natural Language Video Localization
Xiongwen Deng, Haoyu Tang, Han Jiang et al.
Occlusion-Insensitive Talking Head Video Generation via Facelet Compensation
Yuhui Deng, Yuqin Lu, Yangyang Xu et al.
Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models
Guanqi Ding, Chengyu Yang, Shuhui Wang et al.
Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
Yanbo Ding, Shaobin Zhuang, Kunchang Li et al.
AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds
Ziheng Ding, Xiaze Zhang, Qi Jing et al.
GarFast: Realistic and Fast Garment Transfer with a Simplified Parser-Free Approach
Chenghu Du, Junyin Wang, Yi Rong et al.
Latent Diffusion-Enhanced Virtual Try-On via Optimized Pseudo-Label Generation
Chenghu Du, Junyin Wang, Feng Yu et al.
HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions
Keyu Du, Hao Xu, Haipeng Li et al.
A Diffusion-Based Framework for Occluded Object Movement
Zheng-Peng Duan, Jiawei Zhang, Siyu Liu et al.
IniRetinex: Rethinking Retinex-type Low-Light Image Enhancer via Initialization Perspective
Guodong Fan, Zishu Yao, Guang-Yong Chen et al.
Vision-guided Text Mining for Unsupervised Cross-modal Hashing with Community Similarity Quantization
Haozhi Fan, Yuan Cao
EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs
Zhen Fan, Peng Dai, Zhuo Su et al.
CoSDA: Enhancing the Robustness of Inversion-based Generative Image Watermarking Framework
Han Fang, Kejiang Chen, Zijin Yang et al.
SSUN-Net: Spatial-Spectral Prior-Aware Unfolding Network for Pan-Sharpening
Shijie Fang, Hongping Gan
AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scenes
Chaoran Feng, Wangbo Yu, Xinhua Cheng et al.
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering
Chun-Mei Feng, Yang Bai, Tao Luo et al.
Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration
Siyang Feng, Huadeng Wang, Chu Han et al.
HDLayout: Hierarchical and Directional Layout Planning for Arbitrary Shaped Visual Text Generation
Tonghui Feng, Chunsheng Yan, Qianru Wang et al.
Simplifying Control Mechanism in Text-to-Image Diffusion Models
Zhida Feng, Li Chen, Yuenan Sun et al.
BGHR: Bridging the Gap Between HBox-Supervised and RBox-Supervised Oriented Object Detection via Adaptive Fine-Grained Sample Mining
Chenlin Fu, Yingying Zhu
Foundation Model Driven Appearance Extraction for Robust Multiple Object Tracking
Teng Fu, Haiyang Yu, Ke Niu et al.
MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform Watermark
Keke Gai, Dongjue Wang, Jing Yu et al.
DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction
Lianqiang Gan, Junyu Lai, Jingze Ju et al.
PNVC: Towards Practical INR-based Video Compression
Ge Gao, Ho Man Kwan, Fan Zhang et al.
AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context Learning
Jun Gao, Qian Qiao, Tianxiang Wu et al.
TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations
Mingze Gao, Jingyu Liu, Mingda Li et al.
EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction
Chengjie Ge, Xueyang Fu, Peng He et al.
Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning
Shiping Ge, Qiang Chen, Zhiwei Jiang et al.
ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis
Xinyu Geng, Jiaming Wang, Xiaolin Huang et al.
MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning
Shengbo Gu, Yu-Kun Qiu, Yu-Ming Tang et al.
OT-StainNet: Optimal Transport Driven Semantic Matching for Weakly Paired H&E-to-IHC Stain Transfer
Xianchao Guan, Yifeng Wang, Ye Zhang et al.
You Should Learn to Stop Denoising on Point Clouds in Advance
Chuchen Guo, Weijie Zhou, Zheng Liu et al.
Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resection with Pringle Maneuver
Diandian Guo, Weixin Si, Zhixi Li et al.
Enhancing Low-Rank Adaptation with Recoverability-Based Reinforcement Pruning for Object Counting
Haojie Guo, Junyu Gao, Yuan Yuan
MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance
Jialong Guo, Ke Liu, Jiangchao Yao et al.
PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts
Kun Guo, Qiang Ling
OpenVIS: Open-vocabulary Video Instance Segmentation
Pinxue Guo, Hao Huang, Peiyang He et al.
SpikeGS: Reconstruct 3D Scene Captured by a Fast-Moving Bio-Inspired Camera
Yijia Guo, Liwen Hu, Yuanxi Bai et al.
VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
Yongxin Guo, Jingyu Liu, Mingda Li et al.
LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies
Ameer Hamza, Abdullah, Yong Hyun Ahn et al.
DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving
Wencheng Han, Dongqian Guo, Cheng-Zhong Xu et al.
ID-Sculpt: ID-aware 3D Head Generation from Single In-the-wild Portrait Image
Jinkun Hao, Junshu Tang, Jiangning Zhang et al.
Efficient Online Training for Zero-Shot Time-Lapse Microscopy Denoising and Super-Resolution
Ruian He, Ri Cheng, Xinkai Lyu et al.
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
Xu He, Zhiyong Wu, Xiaoyu Li et al.
Long-Tailed Out-of-Distribution Detection: Prioritizing Attention to Tail
Yina He, Lei Peng, Yongcun Zhang et al.
FashionTailor: Controllable Clothing Editing for Human Images with Appearance Preserving
Jie Hou, Jianghong Ma, Xiangyu Mu et al.
Prompt Tuning In a Compact Attribute Space
Shiyu Hou, Tianfei Zhou, Shuai Zhang et al.
BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation
Xiaolu Hou, Mingcheng Li, Dingkang Yang et al.
Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References
Teng-Fang Hsiao, Bo-Kai Ruan, Hong-Han Shuai
GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution
Jintong Hu, Bin Xia, Bin Chen et al.
VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression
Qiang Hu, Houqiang Zhong, Zihan Zheng et al.
Identity-Text Video Corpus Grounding
Bin Huang, Xin Wang, Hong Chen et al.
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Binyuan Huang, Yuqing Wen, Yucheng Zhao et al.
Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening
Jie Huang, Rui Huang, Jinghao Xu et al.
AUTE: Peer-Alignment and Self-Unlearning Boost Adversarial Robustness for Training Ensemble Models
Lifeng Huang, Tian Su, Chengying Gao et al.
EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding
Muye Huang, Han Lai, Xinyu Zhang et al.
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
Qihan Huang, Siming Fu, Jinlong Liu et al.
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Shaofei Huang, Rui Ling, Hongyu Li et al.
DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors
Tianyu Huang, Haoze Zhang, Yihan Zeng et al.
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence
Wenbo Huang, Jinghui Zhang, Guang Li et al.
CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction
Xiang Huang, Qing Zhang, Jian-Fang Hu et al.
Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine
Xiaoshuang Huang, Lingdong Shen, Jia Liu et al.
PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration
Xiaoshui Huang, Zhou Huang, Yifan Zuo et al.
Medical MLLM Is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models
Xijie Huang, Xinyuan Wang, Hantao Zhang et al.
L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection
Xun Huang, Ziyu Xu, Hai Wu et al.
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
Yongle Huang, Haodong Chen, Zhenbang Xu et al.
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model
Yunlong Huang, Junshuo Liu, Ke Xian et al.
EGSRAL:An Enhanced 3D Gaussian Splatting Based Renderer with Automated Labeling for Large-Scale Driving Scene
Yixiong Huo, Guangfeng Jiang, Hongyang Wei et al.
High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion
Junhwa Hur, Charles Herrmann, Saurabh Saxena et al.
Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior
Lee Hyoseok, Kyeong Seon Kim, Kwon Byung-Ki et al.
VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
Muhammet Furkan Ilaslan, Ali Köksal, Kevin Qinghong Lin et al.
Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks
Alexander Jaus, Constantin Marc Seibold, Simon Reiß et al.
Game4Loc: A UAV Geo-Localization Benchmark from Game Data
Yuxiang Ji, Boyong He, Zhuoyue Tan et al.
Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection
Mingda Jia, Liming Zhao, Ge Li et al.
FlexiTex: Enhancing Texture Generation via Visual Guidance
Dadong Jiang, Xianghui Yang, Zibo Zhao et al.
ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling
Jianan Jiang, Hao Tang, Zhilin Jiang et al.
SCCS: Deep Neural Spectral Clustering for Self-Supervised Subcellular Structure Segmentation
Jimao Jiang, Diya Sun, Tianbing Wang et al.
Restabilizing Diffusion Models with Predictive Noise Fusion Strategy for Image Super-Resolution
Luoqian Jiang, Yong Guo, Bingna Xu et al.
Query Quantized Neural SLAM
Sijia Jiang, Jing Hua, Zhizhong Han
Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective
Can Jin, Tianjin Huang, Yihua Zhang et al.
Pedestrian Attribute Recognition: A New Benchmark Dataset and a Large Language Model Augmented Framework
Jiandong Jin, Xiao Wang, Qian Zhu et al.
A Method for Enhancing Generalization of Adam by Multiple Integrations
Long Jin, Han Nong, Liangming Chen et al.
Bridging the Semantic Granularity Gap Between Text and Frame Representations for Partially Relevant Video Retrieval
WooJin Jun, WonJun Moon, Cheol-Ho Cho et al.
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis
Gyeongjin Kang, Younggeun Lee, Seungjun Oh et al.
DiffusionREC: Diffusion Model with Adaptive Condition for Referring Expression Comprehension
Jingcheng Ke, Waikeung Wong, Jia Wang et al.
PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling
Donghyun Kim, Hyeonkyeong Kwon, Yumin Kim et al.
Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration
Hyeonseok Kim, Byeongkeun Kang, Yeejin Lee
APR-RD: Complemental Two Steps for Self-Supervised Real Image Denoising
Hyunjun Kim, Nam Ik Cho
DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation
Jisoo Kim, Jungbin Cho, Joonho Park et al.
ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder
Jungho Kim, Changwon Kang, Dongyoung Lee et al.
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation
Seyeon Kim, Siyoon Jin, Jihye Park et al.
TSDF-Based Efficient Motion-Compensated Temporal Interpolation for 3D Dynamic Sequences
Soowoong Kim, Minseong Kwon, Junho Choi et al.
ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning
Taewhan Kim, Soeun Lee, Si-Woo Kim et al.
Sequence Matters: Harnessing Video Models in 3D Super-Resolution
Hyun-kyu Ko, Dongheok Park, Youngin Park et al.
UniDet3D: Multi-dataset Indoor 3D Object Detection
Maksim Kolodiazhnyi, Anna Vorontsova, Matvey Skripkin et al.
Do Not DeepFake Me: Privacy-Preserving Neural 3D Head Reconstruction Without Sensitive Images
Jiayi Kong, Xurui Song, Shuo Huai et al.
Real-Time Neural Denoising with Render-Aware Knowledge Distillation
Mengxun Kong, Jie Guo, Chen Wang et al.
Stable Mean Teacher for Semi-supervised Video Action Detection
Akash Kumar, Sirshapan Mitra, Yogesh Singh Rawat
A Unified Degradation-Robust Approach to SSL and UDA for 3D Medical Images
Suruchi Kumari, Pravendra Singh
SAFIRE: Segment Any Forged Image Region
Myung-Joon Kwon, Wonjun Lee, Seung-Hun Nam et al.
Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training
Yunwei Lan, Zhigao Cui, Chang Liu et al.
Color Transfer with Modulated Flows
Maria Larchenko, Alexander Lobashev, Dmitry Guskov et al.
Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space
Hyunjee Lee, Youngsik Yun, Jeongmin Bae et al.
NBA3D: Neighbor-Based Confidence Adjustment for 3D Rare Object Detection Using LiDAR
Jooyoung Lee, Jaeyoon Lee, Jongwon Choi
MAMS: Model-Agnostic Module Selection Framework for Video Captioning
Sangho Lee, Il Yong Chun, Hogun Park
Enabling Region-Specific Control via Lassos in Point-Based Colorization
Sanghyeon Lee, Jooyeol Yun, Jaegul Choo
Concept Matching with Agent for Out-of-Distribution Detection
Yuxiao Lee, Xiaofeng Cao, Jingcai Guo et al.
FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-from-gradients
Jiaqi Leng, Yakun Ju, Yuanxu Duan et al.
Disentangled Motion Modeling for Video Frame Interpolation
Jaihyun Lew, Jooyoung Choi, Chaehun Shin et al.
StyO: Stylize Your Face in Only One-Shot
Bonan Li, Zicheng Zhang, Xuecheng Nie et al.
FEAST-Mamba: FEAture and SpaTial Aware Mamba Network with Bidirectional Orthogonal Fusion for Cross-Modal Point Cloud Segmentation
Chade Li, Pengju Zhang, Bo Liu et al.
RemDet: Rethinking Efficient Model Design for UAV Object Detection
Chen Li, Rui Zhao, Zeyu Wang et al.
U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation
Chenxin Li, Xinyu Liu, Wuyang Li et al.
Consistency of Compositional Generalization Across Multiple Levels
Chuanhao Li, Zhen Li, Chenchen Jing et al.
An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques
Chunxiao Li, Xiaoxiao Wang, Boming Miao et al.
Cascaded Diffusion Models for Virtual Try-On: Improving Control and Resolution
Guangyuan Li, Yongkang Wang, Junsheng Luan et al.
MaskViM: Domain Generalized Semantic Segmentation with State Space Models
Jiahao Li, Yang Lu, Yuan Xie et al.
Know Where You Are From: Event-Based Segmentation via Spatio-Temporal Propagation
Ke Li, Gengyu Lyu, Hao Chen et al.
Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization
Maodong Li, Chao Zheng, Jian Wang et al.
REGNav: Room Expert Guided Image-Goal Navigation
Pengna Li, Kangyi Wu, Jingwen Fu et al.
Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning
Rong Li, Liang Li, Jiehua Zhang et al.
Enhancing Generalizability via Utilization of Unlabeled Data for Occupancy Perception
Ruihang Li, Tao Li, Shanding Ye et al.
A Compact Implicit Neural Representation for Efficient Storage of Massive 4D Functional Magnetic Resonance Imaging
Ruoran Li, Runzhao Yang, Wenxin Xiang et al.
DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs
Shiyu Li, Pengxu Wei, Pengchong Qiao et al.
Transferable Adversarial Face Attack with Text Controlled Attribute
Wenyun Li, Zheng Zhang, Xiangyuan Lan et al.
MambaLCT: Boosting Tracking via Long-term Context State Space Model
Xiaohai Li, Bineng Zhong, Qihua Liang et al.
PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium
Xinzhe Li, Jiahui Zhan, Shengfeng He et al.
Mamba-CAD: State Space Model for 3D Computer-Aided Design Generative Modeling
Xueyang Li, Yunzhong Lou, Yu Song et al.
StructSR: Refuse Spurious Details in Real-World Image Super-Resolution
Yachao Li, Dong Liang, Tianyu Ding et al.
Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study
Zhangheng Li, Tianlong Chen, Linyi Li et al.
ProsodyTalker: 3D Visual Speech Animation via Prosody Decomposition
Zonglin Li, Xiaoqian Lv, Qinglin Liu et al.
Exploring the Potential of Large Vision-Language Models for Unsupervised Text-Based Person Retrieval
Zongyi Li, Li Jianbo, Yuxuan Shi et al.
Semantic-guided Masked Mutual Learning for Multi-modal Brain Tumor Segmentation with Arbitrary Missing Modalities
Guoyan Liang, Qin Zhou, Zhe Wang et al.
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion
Li Liang, Naveed Akhtar, Jordan Vice et al.
S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field
Zixi Liang, Guowei Xu, Haifeng Wu et al.
Progressive Distribution Matching for Federated Semi-Supervised Learning
Dongping Liao, Xitong Gao, Yabo Xu et al.
Multi-Granularity Video Object Segmentation
Sangbeom Lim, Seongchan Kim, Seungjun An et al.
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder
Ente Lin, Xujie Zhang, Fuwei Zhao et al.
Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
Jiaqi Lin, Zhihao Li, Binxiao Huang et al.
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation
Jiayi Lin, Jiabo Huang, Jian Hu et al.
Memory Efficient Matting with Adaptive Token Routing
Yiheng Lin, Yihan Hu, Chenyi Zhang et al.
AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement
Yunlong Lin, Tian Ye, Sixiang Chen et al.
Deep Hierarchies and Invariant Disease-Indicative Feature Learning for Computer Aided Diagnosis of Multiple Fundus Diseases
Yuxin Lin, Wei Wang, Xiaoling Luo et al.
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference
Zhihang Lin, Mingbao Lin, Luxi Lin et al.
SOVGaussian: Sparse-View 3D Gaussian Splatting for Open-Vocabulary Scene Understanding
Peng Ling, Tiao Tan, Jiaqi Lin et al.
Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations
Decheng Liu, Zongqi Wang, Chunlei Peng et al.
UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer
Delong Liu, Zhaohui Hou, Mingjie Zhan et al.
Zero-Shot Noise2Mean: Gap Minimization for Efficient Denoising from a Single Noisy Image
Duo Liu, Yiqi Shi, Guoyin Zhang et al.
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation
Hongjian Liu, Qingsong Xie, Tianxiang Ye et al.
PEIE: Physics Embedded Illumination Estimation for Adaptive Dehazing
Huaizhuo Liu, Hai-Miao Hu, Yonglong Jiang et al.
TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation
Jiajie Liu, Mengyuan Liu, Hong Liu et al.
Union Is Strength! Unite the Power of LLMs and MLLMs for Chart Question Answering
Jiapeng Liu, Liang Li, Shihao Rao et al.
UP-Restorer: When Unrolling Meets Prompts for Unified Image Restoration
Minghao Liu, Wenhan Yang, Jinyi Luo et al.
Path-Adaptive Matting for Efficient Inference Under Various Computational Cost Constraints
Qinglin Liu, Zonglin Li, Xiaoqian Lv et al.
DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments
Shuhong Liu, Xiang Chen, Hongming Chen et al.
VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization
Tao Liu, Ziyang Ma, Qi Chen et al.
Multi-view Consistent 3D Panoptic Scene Understanding
Xianzhu Liu, Xin Sun, Haozhe Xie et al.
Unlocking the Potential of Reverse Distillation for Anomaly Detection
Xinyue Liu, Jianyuan Wang, Biao Leng et al.
Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation
Yajie Liu, Guodong Wang, Jinjin Zhang et al.
DoGA: Enhancing Grounded Object Detection via Grouped Pre-Training with Attributes
Yang Liu, Feng Hou, Yunjie Peng et al.
Towards Robust Visual Question Answering via Prompt-Driven Geometric Harmonization
Yishu Liu, Jiawei Zhu, Congcong Wen et al.
See Through Their Minds: Learning Transferable Brain Decoding Models from Cross-Subject fMRI
Yulong Liu, Yongqiang Ma, Guibo Zhu et al.
SCOPE: Sign Language Contextual Processing with Embedding from LLMs
Yuqi Liu, Wenqian Zhang, Sihan Ren et al.
Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning
Yuti Liu, Shice Liu, Junyuan Gao et al.
Training Verification-Friendly Neural Networks via Neuron Behavior Consistency
Zongxin Liu, Zhe Zhao, Fu Song et al.
Robust SAM: On the Adversarial Robustness of Vision Foundation Models
Jiahuan Long, Zhengqin Xu, Tingsong Jiang et al.
RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba
Andong Lu, Wanyu Wang, Chenglong Li et al.
Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators
Bin Lu, Xinyu Xiao, Changzhou Zhang et al.
DeMo: Deep Motion Field Consensus with Learnable Kernels for Two-view Correspondence Learning
Yifan Lu, Jiajun Le, Zizhuo Li et al.
Generative Video Diffusion for Unseen Novel Semantic Video Moment Retrieval
Dezhao Luo, Shaogang Gong, Jiabo Huang et al.
Beyond Pixel and Object: Part Feature as Reference for Few-Shot Video Object Segmentation
Naisong Luo, Guoxin Xiong, Tianzhu Zhang
Privacy-Preserving Low-Rank Adaptation Against Membership Inference Attacks for Latent Diffusion Models
Zihao Luo, Xilie Xu, Feng Liu et al.
Revisiting Change Captioning from Self-supervised Global-Part Alignment
Feixiao Lv, Rui Wang, Lihua Jing
ScaleMatch: Multi-scale Consistency Enhancement for Semi-supervised Semantic Segmentation
Liang Lv, Lefei Zhang
Step-Calibrated Diffusion for Biomedical Optical Image Restoration
Yiwei Lyu, Sung Jik Cha, Cheng Jiang et al.
Aligning and Prompting Anything for Zero-Shot Generalized Anomaly Detection
Jitao Ma, Weiying Xie, Hangyu Ye et al.
Does VLM Classification Benefit from LLM Description Semantics?
Pingchuan Ma, Lennart Rietdorf, Dmytro Kotovenko et al.
Instruct Where the Model Fails: Generative Data Augmentation via Guided Self-contrastive Fine-tuning
Weijian Ma, Ruoxin Chen, Keyue Zhang et al.
A Trusted Lesion-assessment Network for Interpretable Diagnosis of Coronary Artery Disease in Coronary CT Angiography
Xinghua Ma, Xinyan Fang, Mingye Zou et al.
Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts
Yue Ma, Yingqing He, Hongfa Wang et al.
Few-Shot Fine-Grained Image Classification with Progressively Feature Refinement and Continuous Relationship Modeling
Zhen-Xiang Ma, Zhen-Duo Chen, Tai Zheng et al.
OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem
Xinji Mai, Haoran Wang, Zeng Tao et al.