Most Cited 2025 "training data bias" Papers
Towards Better Robustness Against Natural Corruptions in Document Tampering Localization
Huiru Shao, Kaizhu Huang, Wei Wang et al.
SpeHeaTal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis
Yi Shi, Yun-Kai Wang, Xu-Peng Tian et al.
Enhancing Generalizability in Molecular Conformation Generation with METRIZATION-Informed Geometric Diffusion Pretraining
Xiaozhuang Song, Yuzhao Tu, Hangting Ye et al.
Embedding Robust Watermarking into Pattern to Protect the Copyright of Ceramic Artifacts
Lei Tan, Yuliang Xue, Guobiao Li et al.
PScalpel: A Machine Learning-based Guider for Protein Phase-Separating Behaviour Alteration
Jia Wang, Liyan Zhu, Zhe Wang et al.
VisRec: A Semi-Supervised Approach to Visibility Data Reconstruction in Radio Astronomy
Ruoqi Wang, Haitao Wang, Qiong Luo et al.
FMPM-DNet: Hyperspectral Pansharpening Dynamic Network Based on Feature Modulation and Probability Mask
Xiaozheng Wang, Yong Yang, Shuying Huang et al.
Aerodynamic Coefficients Prediction via Cross-Attention Fusion and Physical-Informed Training
Yueqing Wang, Peng Zhang, Yushuang Liu et al.
Generalized Implicit Neural Representations for Dynamic Molecular Surface Modeling
Fang Wu, Bozhen Hu, Stan Z. Li
Vision Transformers Beat WideResNets on Small Scale Datasets Adversarial Robustness
Juntao Wu, Ziyu Song, Xiaoyu Zhang et al.
MultiSFL: Towards Accurate Split Federated Learning via Multi-Model Aggregation and Knowledge Replay
Zeke Xia, Ming Hu, Dengke Yan et al.
DearLLM: Enhancing Personalized Healthcare via Large Language Models-Deduced Feature Correlations
Yongxin Xu, Xinke Jiang, Xu Chu et al.
PriFold: Biological Priors Improve RNA Secondary Structure Predictions
Chenchen Yang, Hao Wu, Tao Shen et al.
Disentangled Table-Graph Representation for Interpretable Transmission Line Fault Location
Na Yu, Yutong Deng, Shunyu Liu et al.
Accurate Nucleic Acid-Binding Residue Identification Based Domain-Adaptive Protein Language Model and Explainable Geometric Deep Learning
Wenwu Zeng, Liangrui Pan, Boya Ji et al.
SWAMamba: A Sliding Window Attention Mamba Framework for Predicting Translation Elongation Rates
Xi Zeng, Fei Ni, Shaoqing Jiao et al.
Portcullis: A Scalable and Verifiable Privacy Gateway for Third-Party LLM Inference
Jiangou Zhan, Wenhui Zhang, Zheng Zhang et al.
BERT-Based Code Learning for Exception Localization and Type Prediction
Chongyu Zhang, Qiping Tao, Liangyu Chen et al.
Motif-Oriented Representation Learning with Topology Refinement for Drug-Drug Interaction Prediction
Ran Zhang, Xuezhi Wang, Guannan Liu et al.
TC-Diffuser: Bi-Condition Multi-Modal Diffusion for Tropical Cyclone Forecasting
Shiqi Zhang, Pan Mu, Cheng Huang et al.
Formal Synthesis of Barrier Certificates Using Fourier Kolmogorov-Arnold Network
Xiongqi Zhang, Junwei Xu, Yang Wang et al.
Drawing Informative Gradients from Sources: A One-stage Transfer Learning Framework for Cross-city Spatiotemporal Forecasting
Yudong Zhang, Xu Wang, Xuan Yu et al.
A Gaussian Filter-Based 3D Registration Method for Series Section Electron Microscopy
Zhenbang Zhang, Hongjia Li, Zhiqiang Xu et al.
Multi-Perspective Consolidation Enhanced Cognitive Diagnosis via Conditional Diffusion Model
Guanhao Zhao, Zhenya Huang, Cheng Cheng et al.
DeNC: Unleash Neural Codecs in Video Streaming with Diffusion Enhancement
Qihua Zhou, Ruibin Li, Jingcai Guo et al.
Text-Guided Fine-grained Counterfactual Inference for Short Video Fake News Detection
Linlin Zong, Wenmin Lin, Jiahui Zhou et al.
Dynamic Interactive Bimodal Hypergraph Networks for Emotion Recognition in Conversations
Xuping Chen, Wuzhen Shi
Symbolic Functional Decomposition: A Reconfiguration Approach
Mateus de Oliveira Oliveira, Wim Van Den Broeck
MSAmba: Exploring Multimodal Sentiment Analysis with State Space Models
Xilin He, Haijian Liang, Boyi Peng et al.
CraftFactory: A Conditioned Control Policy Benchmark for Compositional Generalization
Jinbing Hou, Youpeng Zhao, Jian Zhao
AFFAKT: A Hierarchical Optimal Transport Based Method for Affective Facial Knowledge Transfer in Video Deception Detection
Zihan Ji, Xuetao Tian, Ye Liu
Deep Reinforcement Learning with Time-Scale Invariant Memory
Md Rysul Kabir, James Mochizuki-Freeman, Zoran Tiganj
MI-CAPTCHA: Enhance the Security of CAPTCHA Using Mooney Images
Jingmeng Li, Lukang Fu, Surun Yang et al.
Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision
Wei Liu, Li Yang, Mingxuan Zhao et al.
Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition Through Contrastive Learning
Yan-Kai Liu, Jinyu Cai, Bao-Liang Lu et al.
SpikingYOLOX: Improved YOLOX Object Detection with Fast Fourier Convolution and Spiking Neural Networks
Wei Miao, Jiangrong Shen, Qi Xu et al.
Knowledge-Enhanced Hierarchical Heterogeneous Graph for Personality Identification with Limited Training Data
Yuxuan Song, Qiudan Li, Yilin Wu et al.
A Multi-Focus-Driven Multi-Branch Network for Robust Multimodal Sentiment Analysis
Chuanqi Tao, Jiaming Li, Tianzi Zang et al.
Alignment of CNN and Human Judgments of Geometric and Topological Concepts
Neha Upadhyay, Vijay Marupudi, Kamala Varma et al.
DDJND: Dual Domain Just Noticeable Difference in Multi-Source Content Images with Structural Discrepancy
Miaohui Wang, Zhenming Li, Wuyuan Xie
DepMGNN: Matrixial Graph Neural Network for Video-based Automatic Depression Assessment
Zijian Wu, Leijing Zhou, Shuanglin Li et al.
Leveraging Asynchronous Spiking Neural Networks for Ultra Efficient Event-Based Visual Processing
DingYi Zeng, Yuchen Wang, Honglin Cao et al.
Learning Concept Prerequisite Relation via Global Knowledge Relation Optimization
Miao Zhang, Jiawei Wang, Kui Xiao et al.
SalM²: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention
Chunyu Zhao, Wentao Mu, Xian Zhou et al.
Look Around Before Locating: Considering Content and Structure Information for Visual Grounding
Shiyi Zheng, Peizhi Zhao, Zhilong Zheng et al.
PerReactor: Offline Personalised Multiple Appropriate Facial Reaction Generation
Hengde Zhu, Xiangyu Kong, Weicheng Xie et al.
Aspect Enhancement and Text Simplification in Multimodal Aspect-Based Sentiment Analysis for Multi-Aspect and Multi-Sentiment Scenarios
Linlin Zhu, Heli Sun, Qunshu Gao et al.
Progressive Self-Learning for Domain Adaptation on Symbolic Regression of Integer Sequences
Yaohui Zhu, Kaiming Sun, Zhengdong Luo et al.
HSRDiff: A Hierarchical Self-Regulation Diffusion Model for Stochastic Semantic Segmentation
Han Yang, Chuanguang Yang, Zhulin An et al.
AQUAFace: Age-Invariant Quality Adaptive Face Recognition for Unconstrained Selfie vs ID Verification
Shivang Agarwal, Jyoti Chaudhary, Sadiq Siraj Ebrahim et al.
CA-MLIF: Cross-Attention and Multimodal Low-Rank Interaction Fusion Framework for Tumor Prognostic Prediction
Yajun An, Jiale Chen, Huan Lin et al.
Frozen Language Models Are Gradient Coherence Rectifiers in Vision Transformers
Lichen Bai, Zixuan Xiong, Hai Lin et al.
Dual Manifold Regularization Steered Robust Representation Learning for Point Cloud Analysis
Jian Bi, Qianliang Wu, Jianjun Qian et al.
Divide-and-Conquer: Tree-structured Strategy with Answer Distribution Estimator for Goal-Oriented Visual Dialogue
Shuo Cai, Xinzhe Han, Shuhui Wang
Deep Graph Online Hashing for Multi-Label Image Retrieval
Yuan Cao, Xiangru Chen, Zifan Liu et al.
KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences
Keng-Wei Chang, Zi-Ming Wang, Shang-Hong Lai
Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion
Haipeng Chen, Yuheng Yang, Yingda Lyu
Adversarial Learning Under Hybrid Perturbations for Robust Acute Lymphoblastic Leukemia Classification
Jie Chen, Xinyuan Liu, Xintong Liu et al.
Contrasting Adversarial Perturbations: The Space of Harmless Perturbations
Lu Chen, Shaofeng Li, Benhao Huang et al.
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen, Yue Ma, Hongfa Wang et al.
Unsupervised Degradation Representation Aware Transform for Real-World Blind Image Super-Resolution
Sen Chen, Hongying Liu, Chaowei Fang et al.
DiffDVC: Accurate Event Detection for Dense Video Captioning via Diffusion Models
Wei Chen, Jianwei Niu, Xuefeng Liu et al.
3D Measurement of Complex Textured Objects Based on Bidirectional Fringe Projection
Yuchong Chen, Jian Yu, Shaoyan Gai et al.
Unsupervised Diffusion-Based Degradation Modeling for Real-World Super-Resolution
Yuying Chen, Mingde Yao, Wenbo Li et al.
EvHDR-GS: Event-guided HDR Video Reconstruction with 3D Gaussian Splatting
Zehao Chen, Zhan Lu, De Ma et al.
VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping
Zheng Chen, Yu Zeng, Zehui Chen et al.
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Zhipeng Chen, Lan Yang, Yonggang Qi et al.
3DPGS: 3D Probabilistic Graph Search for Archaeological Piece Grouping
Junfeng Cheng, Yingkai Yang, Tania Stathaki
Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation
Zesen Cheng, Kehan Li, Li Hao et al.
SIDL: A Real-World Dataset for Restoring Smartphone Images with Dirty Lenses
Sooyoung Choi, Sungyong Park, Heewon Kim
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces
Wonhyeok Choi, Kyumin Hwang, Minwoo Choi et al.
MASS: Overcoming Language Bias in Image-Text Matching
Jiwan Chung, Seungwon Lim, Sangkyu Lee et al.
GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud
Tao Dai, Yanzi Wang, Jianyu Xiong et al.
Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation
Quan Dao, Hao Phung, Trung Tuan Dao et al.
DiffCorr: Conditional Diffusion Model with Reliable Pseudo-Label Guidance for Unsupervised Point Cloud Shape Correspondence
Jiacheng Deng, Jiahao Lu, Zhixin Cheng et al.
Adaptive Siamese Masked Autoencoder with Global Optimization for Unsupervised Point Cloud Shape Correspondence
Jiacheng Deng, Jiahao Lu
OTIAS: OcTree Implicit Adaptive Sampling for Multispectral and Hyperspectral Image Fusion
Shangqi Deng, Jun Ma, Liang-Jian Deng et al.
Boundary-Aware Temporal Dynamic Pseudo-Supervision Pairs Generation for Zero-Shot Natural Language Video Localization
Xiongwen Deng, Haoyu Tang, Han Jiang et al.
Occlusion-Insensitive Talking Head Video Generation via Facelet Compensation
Yuhui Deng, Yuqin Lu, Yangyang Xu et al.
Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models
Guanqi Ding, Chengyu Yang, Shuhui Wang et al.
AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds
Ziheng Ding, Xiaze Zhang, Qi Jing et al.
GarFast: Realistic and Fast Garment Transfer with a Simplified Parser-Free Approach
Chenghu Du, Junyin Wang, Yi Rong et al.
Latent Diffusion-Enhanced Virtual Try-On via Optimized Pseudo-Label Generation
Chenghu Du, Junyin Wang, Feng Yu et al.
HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions
Keyu Du, Hao Xu, Haipeng Li et al.
IniRetinex: Rethinking Retinex-type Low-Light Image Enhancer via Initialization Perspective
Guodong Fan, Zishu Yao, Guang-Yong Chen et al.
Vision-guided Text Mining for Unsupervised Cross-modal Hashing with Community Similarity Quantization
Haozhi Fan, Yuan Cao
CoSDA: Enhancing the Robustness of Inversion-based Generative Image Watermarking Framework
Han Fang, Kejiang Chen, Zijin Yang et al.
SSUN-Net: Spatial-Spectral Prior-Aware Unfolding Network for Pan-Sharpening
Shijie Fang, Hongping Gan
Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration
Siyang Feng, Huadeng Wang, Chu Han et al.
HDLayout: Hierarchical and Directional Layout Planning for Arbitrary Shaped Visual Text Generation
Tonghui Feng, Chunsheng Yan, Qianru Wang et al.
Simplifying Control Mechanism in Text-to-Image Diffusion Models
Zhida Feng, Li Chen, Yuenan Sun et al.
BGHR: Bridging the Gap Between HBox-Supervised and RBox-Supervised Oriented Object Detection via Adaptive Fine-Grained Sample Mining
Chenlin Fu, Yingying Zhu
Foundation Model Driven Appearance Extraction for Robust Multiple Object Tracking
Teng Fu, Haiyang Yu, Ke Niu et al.
MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform Watermark
Keke Gai, Dongjue Wang, Jing Yu et al.
DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction
Lianqiang Gan, Junyu Lai, Jingze Ju et al.
AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context Learning
Jun Gao, Qian Qiao, Tianxiang Wu et al.
TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations
Mingze Gao, Jingyu Liu, Mingda Li et al.
MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning
Shengbo Gu, Yu-Kun Qiu, Yu-Ming Tang et al.
OT-StainNet: Optimal Transport Driven Semantic Matching for Weakly Paired H&E-to-IHC Stain Transfer
Xianchao Guan, Yifeng Wang, Ye Zhang et al.
Enhancing Low-Rank Adaptation with Recoverability-Based Reinforcement Pruning for Object Counting
Haojie Guo, Junyu Gao, Yuan Yuan
SpikeGS: Reconstruct 3D Scene Captured by a Fast-Moving Bio-Inspired Camera
Yijia Guo, Liwen Hu, Yuanxi Bai et al.
ID-Sculpt: ID-aware 3D Head Generation from Single In-the-wild Portrait Image
Jinkun Hao, Junshu Tang, Jiangning Zhang et al.
Efficient Online Training for Zero-Shot Time-Lapse Microscopy Denoising and Super-Resolution
Ruian He, Ri Cheng, Xinkai Lyu et al.
FashionTailor: Controllable Clothing Editing for Human Images with Appearance Preserving
Jie Hou, Jianghong Ma, Xiangyu Mu et al.
Prompt Tuning In a Compact Attribute Space
Shiyu Hou, Tianfei Zhou, Shuai Zhang et al.
Identity-Text Video Corpus Grounding
Bin Huang, Xin Wang, Hong Chen et al.
AUTE: Peer-Alignment and Self-Unlearning Boost Adversarial Robustness for Training Ensemble Models
Lifeng Huang, Tian Su, Chengying Gao et al.
CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction
Xiang Huang, Qing Zhang, Jian-Fang Hu et al.
Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection
Mingda Jia, Liming Zhao, Ge Li et al.
ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling
Jianan Jiang, Hao Tang, Zhilin Jiang et al.
SCCS: Deep Neural Spectral Clustering for Self-Supervised Subcellular Structure Segmentation
Jimao Jiang, Diya Sun, Tianbing Wang et al.
Restabilizing Diffusion Models with Predictive Noise Fusion Strategy for Image Super-Resolution
Luoqian Jiang, Yong Guo, Bingna Xu et al.
Query Quantized Neural SLAM
Sijia Jiang, Jing Hua, Zhizhong Han
A Method for Enhancing Generalization of Adam by Multiple Integrations
Long Jin, Han Nong, Liangming Chen et al.
Bridging the Semantic Granularity Gap Between Text and Frame Representations for Partially Relevant Video Retrieval
WooJin Jun, WonJun Moon, Cheol-Ho Cho et al.
DiffusionREC: Diffusion Model with Adaptive Condition for Referring Expression Comprehension
Jingcheng Ke, Waikeung Wong, Jia Wang et al.
Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration
Hyeonseok Kim, Byeongkeun Kang, Yeejin Lee
APR-RD: Complemental Two Steps for Self-Supervised Real Image Denoising
Hyunjun Kim, Nam Ik Cho
TSDF-Based Efficient Motion-Compensated Temporal Interpolation for 3D Dynamic Sequences
Soowoong Kim, Minseong Kwon, Junho Choi et al.
ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning
Taewhan Kim, Soeun Lee, Si-Woo Kim et al.
Sequence Matters: Harnessing Video Models in 3D Super-Resolution
Hyun-kyu Ko, Dongheok Park, Youngin Park et al.
A Unified Degradation-Robust Approach to SSL and UDA for 3D Medical Images
Suruchi Kumari, Pravendra Singh
NBA3D: Neighbor-Based Confidence Adjustment for 3D Rare Object Detection Using LiDAR
Jooyoung Lee, Jaeyoon Lee, Jongwon Choi
Enabling Region-Specific Control via Lassos in Point-Based Colorization
Sanghyeon Lee, Jooyeol Yun, Jaegul Choo
Concept Matching with Agent for Out-of-Distribution Detection
Yuxiao Lee, Xiaofeng Cao, Jingcai Guo et al.
FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-from-gradients
Jiaqi Leng, Yakun Ju, Yuanxu Duan et al.
FEAST-Mamba: FEAture and SpaTial Aware Mamba Network with Bidirectional Orthogonal Fusion for Cross-Modal Point Cloud Segmentation
Chade Li, Pengju Zhang, Bo Liu et al.
An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques
Chunxiao Li, Xiaoxiao Wang, Boming Miao et al.
Cascaded Diffusion Models for Virtual Try-On: Improving Control and Resolution
Guangyuan Li, Yongkang Wang, Junsheng Luan et al.
MaskViM: Domain Generalized Semantic Segmentation with State Space Models
Jiahao Li, Yang Lu, Yuan Xie et al.
Know Where You Are From: Event-Based Segmentation via Spatio-Temporal Propagation
Ke Li, Gengyu Lyu, Hao Chen et al.
Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization
Maodong Li, Chao Zheng, Jian Wang et al.
Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning
Rong Li, Liang Li, Jiehua Zhang et al.
Enhancing Generalizability via Utilization of Unlabeled Data for Occupancy Perception
Ruihang Li, Tao Li, Shanding Ye et al.
DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs
Shiyu Li, Pengxu Wei, Pengchong Qiao et al.
Mamba-CAD: State Space Model for 3D Computer-Aided Design Generative Modeling
Xueyang Li, Yunzhong Lou, Yu Song et al.
StructSR: Refuse Spurious Details in Real-World Image Super-Resolution
Yachao Li, Dong Liang, Tianyu Ding et al.
Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study
Zhangheng Li, Tianlong Chen, Linyi Li et al.
ProsodyTalker: 3D Visual Speech Animation via Prosody Decomposition
Zonglin Li, Xiaoqian Lv, Qinglin Liu et al.
Exploring the Potential of Large Vision-Language Models for Unsupervised Text-Based Person Retrieval
Zongyi Li, Li Jianbo, Yuxuan Shi et al.
Semantic-guided Masked Mutual Learning for Multi-modal Brain Tumor Segmentation with Arbitrary Missing Modalities
Guoyan Liang, Qin Zhou, Zhe Wang et al.
Progressive Distribution Matching for Federated Semi-Supervised Learning
Dongping Liao, Xitong Gao, Yabo Xu et al.
Multi-Granularity Video Object Segmentation
Sangbeom Lim, Seongchan Kim, Seungjun An et al.
Memory Efficient Matting with Adaptive Token Routing
Yiheng Lin, Yihan Hu, Chenyi Zhang et al.
Deep Hierarchies and Invariant Disease-Indicative Feature Learning for Computer Aided Diagnosis of Multiple Fundus Diseases
Yuxin Lin, Wei Wang, Xiaoling Luo et al.
SOVGaussian: Sparse-View 3D Gaussian Splatting for Open-Vocabulary Scene Understanding
Peng Ling, Tiao Tan, Jiaqi Lin et al.
UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer
Delong Liu, Zhaohui Hou, Mingjie Zhan et al.
Zero-Shot Noise2Mean: Gap Minimization for Efficient Denoising from a Single Noisy Image
Duo Liu, Yiqi Shi, Guoyin Zhang et al.
PEIE: Physics Embedded Illumination Estimation for Adaptive Dehazing
Huaizhuo Liu, Hai-Miao Hu, Yonglong Jiang et al.
Union Is Strength! Unite the Power of LLMs and MLLMs for Chart Question Answering
Jiapeng Liu, Liang Li, Shihao Rao et al.
UP-Restorer: When Unrolling Meets Prompts for Unified Image Restoration
Minghao Liu, Wenhan Yang, Jinyi Luo et al.
Path-Adaptive Matting for Efficient Inference Under Various Computational Cost Constraints
Qinglin Liu, Zonglin Li, Xiaoqian Lv et al.
Multi-view Consistent 3D Panoptic Scene Understanding
Xianzhu Liu, Xin Sun, Haozhe Xie et al.
Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation
Yajie Liu, Guodong Wang, Jinjin Zhang et al.
DoGA: Enhancing Grounded Object Detection via Grouped Pre-Training with Attributes
Yang Liu, Feng Hou, Yunjie Peng et al.
Towards Robust Visual Question Answering via Prompt-Driven Geometric Harmonization
Yishu Liu, Jiawei Zhu, Congcong Wen et al.
See Through Their Minds: Learning Transferable Brain Decoding Models from Cross-Subject fMRI
Yulong Liu, Yongqiang Ma, Guibo Zhu et al.
Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning
Yuti Liu, Shice Liu, Junyuan Gao et al.
Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators
Bin Lu, Xinyu Xiao, Changzhou Zhang et al.
DeMo: Deep Motion Field Consensus with Learnable Kernels for Two-view Correspondence Learning
Yifan Lu, Jiajun Le, Zizhuo Li et al.
Beyond Pixel and Object: Part Feature as Reference for Few-Shot Video Object Segmentation
Naisong Luo, Guoxin Xiong, Tianzhu Zhang
Revisiting Change Captioning from Self-supervised Global-Part Alignment
Feixiao Lv, Rui Wang, Lihua Jing
ScaleMatch: Multi-scale Consistency Enhancement for Semi-supervised Semantic Segmentation
Liang Lv, Lefei Zhang
Aligning and Prompting Anything for Zero-Shot Generalized Anomaly Detection
Jitao Ma, Weiying Xie, Hangyu Ye et al.
Instruct Where the Model Fails: Generative Data Augmentation via Guided Self-contrastive Fine-tuning
Weijian Ma, Ruoxin Chen, Keyue Zhang et al.
A Trusted Lesion-assessment Network for Interpretable Diagnosis of Coronary Artery Disease in Coronary CT Angiography
Xinghua Ma, Xinyan Fang, Mingye Zou et al.
Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts
Yue Ma, Yingqing He, Hongfa Wang et al.
Few-Shot Fine-Grained Image Classification with Progressively Feature Refinement and Continuous Relationship Modeling
Zhen-Xiang Ma, Zhen-Duo Chen, Tai Zheng et al.
OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem
Xinji Mai, Haoran Wang, Zeng Tao et al.
Sp3ctralMamba: Physics-Driven Joint State Space Model for Hyperspectral Image Reconstruction
Ge Meng, Jingyan Tu, Jingjia Huang et al.
Qua2SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models
Keith G. Mills, Mohammad Salameh, Ruichen Chen et al.
Energy vs. Noise: Towards Robust Temporal Action Localization in Open-World
Chenyu Mu, Jiahua Li, Kun Wei et al.
Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space
Linchao Pan, Can Gao, Jie Zhou et al.
Fair Training with Zero Inputs
Wenjie Pan, Jianqing Zhu, Huanqiang Zeng
Procedure Knowledge Decoupled Distillation Strategy for Procedure Planning in Instructional Videos
Xiaotian Pan, Zhaobo Qi, Xin Sun et al.
Point Cloud Semantic Segmentation with Sparse and Inhomogeneous Annotations
Zhiyi Pan, Nan Zhang, Wei Gao et al.
Partially Blinded Unlearning: Class Unlearning for Deep Networks from Bayesian Perspective
Subhodip Panda, Shashwat Sourav, Prathosh A.P.
Beyond Text: Fine-Grained Multi-Modal Fact Verification with Hypergraph Transformers
Hui Pang, Chaozhuo Li, Litian Zhang et al.
SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models
Joon Hyun Park, Kumju Jo, Sungyong Baik
CDE-Learning: Camera Deviation Elimination Learning for Unsupervised Person Re-identification
Jinjia Peng, Songyu Zhang, Huibing Wang
Boosting Image De-Raining via Central-Surrounding Synergistic Convolution
Long Peng, Yang Wang, Xin Di et al.
3D-aware Select, Expand, and Squeeze Token for Aerial Action Recognition
Luying Peng, Xiangbo Shu, Yazhou Yao et al.
OAMaskFlow: Occlusion-Aware Motion Mask for Scene Flow
Xiongfeng Peng, Zhihua Liu, Weiming Li et al.
HVDualformer: Histogram-Vision Dual Transformer for White Balance
Yan-Tsung Peng, Guan-Rong Chen
Leveraging Anatomical Consistency for Multi-Object Detection in Ultrasound Images via Source-free Unsupervised Domain Adaptation
Bin Pu, Xingguo Lv, Jiewen Yang et al.
Dive into Aerial Remote Sensing Underwater Depth Estimation with Hyperspectral Imagery
Jiahao Qi, Xingyue Liu, Chen Chen et al.
PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement
Wei Qian, Gaoji Su, Dan Guo et al.
Holistic Correction with Object Prototype for Video Object Segmentation
Shengye Qiao, Changqun Xia, Yanjie Liang et al.
Integrating Low-Level Visual Cues for Enhanced Unsupervised Semantic Segmentation
Yuhao Qing, Dan Zeng, Shaorong Xie et al.
High-Fidelity Polarimetric Implicit 3D Reconstruction with View-Dependent Physical Representation
Yu Qiu, Sijia Wen, Hainan Zhang et al.
HSOD-BIT-V2: A Challenging Benchmark for Hyperspectral Salient Object Detection
Yuhao Qiu, Shuyan Bai, Tingfa Xu et al.
Universal Features Guided Zero-Shot Category-Level Object Pose Estimation
Wentian Qu, Chenyu Meng, Heng Li et al.
CDTR: Semantic Alignment for Video Moment Retrieval Using Concept Decomposition Transformer
Ran Ran, Jiwei Wei, Xiangyi Cai et al.
GenHMR: Generative Human Mesh Recovery
Muhammad Usama Saleem, Ekkasit Pinyoanuntapong, Pu Wang et al.
In2NeCT: Inter-class and Intra-class Neural Collapse Tuning for Semantic Segmentation of Imbalanced Remote Sensing Images
Junao Shen, Qiyun Hu, Tian Feng et al.
Neural Block Compression: Variable Bitrates Feature Blocks for Texture Representation
Rui Shi, Yishun Dou, Zhong Zheng et al.
OGP-Net: Optical Guidance Meets Pixel-Level Contrastive Distillation for Robust Multi-Modal and Missing Modality Segmentation
Aniruddh Sikdar, Jayant Teotia, Suresh Sundaram
Fine-Grained Perception in Panoramic Scenes: A Novel Task, Dataset, and Method for Object Importance Ranking
Jia Song, Chenglizhao Chen, Xu Yu et al.
CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks
Wenfeng Song, Yang Ding, Fei Hou et al.
Temporal Coherent Object Flow for Multi-Object Tracking
Zikai Song, Run Luo, Lintao Ma et al.