Most Cited 2025 "cognitive reasoning" Papers

22,274 papers found • Page 102 of 112

#20201

Towards Better Robustness Against Natural Corruptions in Document Tampering Localization

Huiru Shao, Kaizhu Huang, Wei Wang et al.

AAAI 2025
#20202

SpeHeaTal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis

Yi Shi, Yun-Kai Wang, Xu-Peng Tian et al.

AAAI 2025 • arXiv:2502.13192
#20203

Enhancing Generalizability in Molecular Conformation Generation with METRIZATION-Informed Geometric Diffusion Pretraining

Xiaozhuang Song, Yuzhao Tu, Hangting Ye et al.

AAAI 2025
#20204

Embedding Robust Watermarking into Pattern to Protect the Copyright of Ceramic Artifacts

Lei Tan, Yuliang Xue, Guobiao Li et al.

AAAI 2025
#20205

PScalpel: A Machine Learning-based Guider for Protein Phase-Separating Behaviour Alteration

Jia Wang, Liyan Zhu, Zhe Wang et al.

AAAI 2025
#20206

VisRec: A Semi-Supervised Approach to Visibility Data Reconstruction in Radio Astronomy

Ruoqi Wang, Haitao Wang, Qiong Luo et al.

AAAI 2025
#20207

FMPM-DNet: Hyperspectral Pansharpening Dynamic Network Based on Feature Modulation and Probability Mask

Xiaozheng Wang, Yong Yang, Shuying Huang et al.

AAAI 2025
#20208

Aerodynamic Coefficients Prediction via Cross-Attention Fusion and Physical-Informed Training

Yueqing Wang, Peng Zhang, Yushuang Liu et al.

AAAI 2025
#20209

Generalized Implicit Neural Representations for Dynamic Molecular Surface Modeling

Fang Wu, Bozhen Hu, Stan Z. Li

AAAI 2025
#20210

Vision Transformers Beat WideResNets on Small Scale Datasets Adversarial Robustness

Juntao Wu, Ziyu Song, Xiaoyu Zhang et al.

AAAI 2025
#20211

MultiSFL: Towards Accurate Split Federated Learning via Multi-Model Aggregation and Knowledge Replay

Zeke Xia, Ming Hu, Dengke Yan et al.

AAAI 2025
#20212

DearLLM: Enhancing Personalized Healthcare via Large Language Models-Deduced Feature Correlations

Yongxin Xu, Xinke Jiang, Xu Chu et al.

AAAI 2025
#20213

PriFold: Biological Priors Improve RNA Secondary Structure Predictions

Chenchen Yang, Hao Wu, Tao Shen et al.

AAAI 2025
#20214

Disentangled Table-Graph Representation for Interpretable Transmission Line Fault Location

Na Yu, Yutong Deng, Shunyu Liu et al.

AAAI 2025
#20215

Accurate Nucleic Acid-Binding Residue Identification Based Domain-Adaptive Protein Language Model and Explainable Geometric Deep Learning

Wenwu Zeng, Liangrui Pan, Boya Ji et al.

AAAI 2025
#20216

SWAMamba: A Sliding Window Attention Mamba Framework for Predicting Translation Elongation Rates

Xi Zeng, Fei Ni, Shaoqing Jiao et al.

AAAI 2025
#20217

Portcullis: A Scalable and Verifiable Privacy Gateway for Third-Party LLM Inference

Jiangou Zhan, Wenhui Zhang, Zheng Zhang et al.

AAAI 2025
#20218

BERT-Based Code Learning for Exception Localization and Type Prediction

Chongyu Zhang, Qiping Tao, Liangyu Chen et al.

AAAI 2025
#20219

Motif-Oriented Representation Learning with Topology Refinement for Drug-Drug Interaction Prediction

Ran Zhang, Xuezhi Wang, Guannan Liu et al.

AAAI 2025
#20220

TC-Diffuser: Bi-Condition Multi-Modal Diffusion for Tropical Cyclone Forecasting

Shiqi Zhang, Pan Mu, Cheng Huang et al.

AAAI 2025
#20221

Formal Synthesis of Barrier Certificates Using Fourier Kolmogorov-Arnold Network

Xiongqi Zhang, Junwei Xu, Yang Wang et al.

AAAI 2025
#20222

Drawing Informative Gradients from Sources: A One-stage Transfer Learning Framework for Cross-city Spatiotemporal Forecasting

Yudong Zhang, Xu Wang, Xuan Yu et al.

AAAI 2025
#20223

A Gaussian Filter-Based 3D Registration Method for Series Section Electron Microscopy

Zhenbang Zhang, Hongjia Li, Zhiqiang Xu et al.

AAAI 2025
#20224

Multi-Perspective Consolidation Enhanced Cognitive Diagnosis via Conditional Diffusion Model

Guanhao Zhao, Zhenya Huang, Cheng Cheng et al.

AAAI 2025
#20225

DeNC: Unleash Neural Codecs in Video Streaming with Diffusion Enhancement

Qihua Zhou, Ruibin Li, Jingcai Guo et al.

AAAI 2025
#20226

Text-Guided Fine-grained Counterfactual Inference for Short Video Fake News Detection

Linlin Zong, Wenmin Lin, Jiahui Zhou et al.

AAAI 2025
#20227

Dynamic Interactive Bimodal Hypergraph Networks for Emotion Recognition in Conversations

Xuping Chen, Wuzhen Shi

AAAI 2025
#20228

Symbolic Functional Decomposition: A Reconfiguration Approach

Mateus de Oliveira Oliveira, Wim Van Den Broeck

AAAI 2025 • arXiv:2601.08354
#20229

MSAmba: Exploring Multimodal Sentiment Analysis with State Space Models

Xilin He, Haijian Liang, Boyi Peng et al.

AAAI 2025
#20230

CraftFactory: A Conditioned Control Policy Benchmark for Compositional Generalization

Jinbing Hou, Youpeng Zhao, Jian Zhao

AAAI 2025
#20231

AFFAKT: A Hierarchical Optimal Transport Based Method for Affective Facial Knowledge Transfer in Video Deception Detection

Zihan Ji, Xuetao Tian, Ye Liu

AAAI 2025 • arXiv:2412.08965
#20232

Deep Reinforcement Learning with Time-Scale Invariant Memory

Md Rysul Kabir, James Mochizuki-Freeman, Zoran Tiganj

AAAI 2025 • arXiv:2412.15292
#20233

MI-CAPTCHA: Enhance the Security of CAPTCHA Using Mooney Images

Jingmeng Li, Lukang Fu, Surun Yang et al.

AAAI 2025
#20234

Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision

Wei Liu, Li Yang, Mingxuan Zhao et al.

AAAI 2025
#20235

Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition Through Contrastive Learning

Yan-Kai Liu, Jinyu Cai, Bao-Liang Lu et al.

AAAI 2025
#20236

SpikingYOLOX: Improved YOLOX Object Detection with Fast Fourier Convolution and Spiking Neural Networks

Wei Miao, Jiangrong Shen, Qi Xu et al.

AAAI 2025
#20237

Knowledge-Enhanced Hierarchical Heterogeneous Graph for Personality Identification with Limited Training Data

Yuxuan Song, Qiudan Li, Yilin Wu et al.

AAAI 2025
#20238

A Multi-Focus-Driven Multi-Branch Network for Robust Multimodal Sentiment Analysis

Chuanqi Tao, Jiaming Li, Tianzi Zang et al.

AAAI 2025
#20239

Alignment of CNN and Human Judgments of Geometric and Topological Concepts

Neha Upadhyay, Vijay Marupudi, Kamala Varma et al.

AAAI 2025
#20240

DDJND: Dual Domain Just Noticeable Difference in Multi-Source Content Images with Structural Discrepancy

Miaohui Wang, Zhenming Li, Wuyuan Xie

AAAI 2025
#20241

DepMGNN: Matrixial Graph Neural Network for Video-based Automatic Depression Assessment

Zijian Wu, Leijing Zhou, Shuanglin Li et al.

AAAI 2025
#20242

Leveraging Asynchronous Spiking Neural Networks for Ultra Efficient Event-Based Visual Processing

DingYi Zeng, Yuchen Wang, Honglin Cao et al.

AAAI 2025
#20243

Learning Concept Prerequisite Relation via Global Knowledge Relation Optimization

Miao Zhang, Jiawei Wang, Kui Xiao et al.

AAAI 2025
#20244

SalM²: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention

Chunyu Zhao, Wentao Mu, Xian Zhou et al.

AAAI 2025
#20245

Look Around Before Locating: Considering Content and Structure Information for Visual Grounding

Shiyi Zheng, Peizhi Zhao, Zhilong Zheng et al.

AAAI 2025
#20246

PerReactor: Offline Personalised Multiple Appropriate Facial Reaction Generation

Hengde Zhu, Xiangyu Kong, Weicheng Xie et al.

AAAI 2025
#20247

Aspect Enhancement and Text Simplification in Multimodal Aspect-Based Sentiment Analysis for Multi-Aspect and Multi-Sentiment Scenarios

Linlin Zhu, Heli Sun, Qunshu Gao et al.

AAAI 2025
#20248

Progressive Self-Learning for Domain Adaptation on Symbolic Regression of Integer Sequences

Yaohui Zhu, Kaiming Sun, Zhengdong Luo et al.

AAAI 2025
#20249

HSRDiff: A Hierarchical Self-Regulation Diffusion Model for Stochastic Semantic Segmentation

Han Yang, Chuanguang Yang, Zhulin An et al.

AAAI 2025
#20250

AQUAFace: Age-Invariant Quality Adaptive Face Recognition for Unconstrained Selfie vs ID Verification

Shivang Agarwal, Jyoti Chaudhary, Sadiq Siraj Ebrahim et al.

AAAI 2025
#20251

CA-MLIF: Cross-Attention and Multimodal Low-Rank Interaction Fusion Framework for Tumor Prognostic Prediction

Yajun An, Jiale Chen, Huan Lin et al.

AAAI 2025
#20252

Frozen Language Models Are Gradient Coherence Rectifiers in Vision Transformers

Lichen Bai, Zixuan Xiong, Hai Lin et al.

AAAI 2025
#20253

Dual Manifold Regularization Steered Robust Representation Learning for Point Cloud Analysis

Jian Bi, Qianliang Wu, Jianjun Qian et al.

AAAI 2025
#20254

Divide-and-Conquer: Tree-structured Strategy with Answer Distribution Estimator for Goal-Oriented Visual Dialogue

Shuo Cai, Xinzhe Han, Shuhui Wang

AAAI 2025 • arXiv:2502.05806
#20255

Deep Graph Online Hashing for Multi-Label Image Retrieval

Yuan Cao, Xiangru Chen, Zifan Liu et al.

AAAI 2025
#20256

KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences

Keng-Wei Chang, Zi-Ming Wang, Shang-Hong Lai

AAAI 2025 • arXiv:2412.20767
#20257

Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion

Haipeng Chen, Yuheng Yang, Yingda Lyu

AAAI 2025
#20258

Adversarial Learning Under Hybrid Perturbations for Robust Acute Lymphoblastic Leukemia Classification

Jie Chen, Xinyuan Liu, Xintong Liu et al.

AAAI 2025
#20259

Contrasting Adversarial Perturbations: The Space of Harmless Perturbations

Lu Chen, Shaofeng Li, Benhao Huang et al.

AAAI 2025 • arXiv:2402.02095
#20260

Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

Qihua Chen, Yue Ma, Hongfa Wang et al.

AAAI 2025
#20261

Unsupervised Degradation Representation Aware Transform for Real-World Blind Image Super-Resolution

Sen Chen, Hongying Liu, Chaowei Fang et al.

AAAI 2025
#20262

DiffDVC: Accurate Event Detection for Dense Video Captioning via Diffusion Models

Wei Chen, Jianwei Niu, Xuefeng Liu et al.

AAAI 2025
#20263

3D Measurement of Complex Textured Objects Based on Bidirectional Fringe Projection

Yuchong Chen, Jian Yu, Shaoyan Gai et al.

AAAI 2025
#20264

Unsupervised Diffusion-Based Degradation Modeling for Real-World Super-Resolution

Yuying Chen, Mingde Yao, Wenbo Li et al.

AAAI 2025
#20265

EvHDR-GS: Event-guided HDR Video Reconstruction with 3D Gaussian Splatting

Zehao Chen, Zhan Lu, De Ma et al.

AAAI 2025
#20266

VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping

Zheng Chen, Yu Zeng, Zehui Chen et al.

AAAI 2025
#20267

VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis

Zhipeng Chen, Lan Yang, Yonggang Qi et al.

AAAI 2025 • arXiv:2412.11594
#20268

3DPGS: 3D Probabilistic Graph Search for Archaeological Piece Grouping

Junfeng Cheng, Yingkai Yang, Tania Stathaki

AAAI 2025
#20269

Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation

Zesen Cheng, Kehan Li, Li Hao et al.

AAAI 2025
#20270

SIDL: A Real-World Dataset for Restoring Smartphone Images with Dirty Lenses

Sooyoung Choi, Sungyong Park, Heewon Kim

AAAI 2025
#20271

Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces

Wonhyeok Choi, Kyumin Hwang, Minwoo Choi et al.

AAAI 2025 • arXiv:2503.22209
#20272

MASS: Overcoming Language Bias in Image-Text Matching

Jiwan Chung, Seungwon Lim, Sangkyu Lee et al.

AAAI 2025 • arXiv:2501.11469
#20273

GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud

Tao Dai, Yanzi Wang, Jianyu Xiong et al.

AAAI 2025
#20274

Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation

Quan Dao, Hao Phung, Trung Tuan Dao et al.

AAAI 2025
#20275

DiffCorr: Conditional Diffusion Model with Reliable Pseudo-Label Guidance for Unsupervised Point Cloud Shape Correspondence

Jiacheng Deng, Jiahao Lu, Zhixin Cheng et al.

AAAI 2025
#20276

Adaptive Siamese Masked Autoencoder with Global Optimization for Unsupervised Point Cloud Shape Correspondence

Jiacheng Deng, Jiahao Lu

AAAI 2025
#20277

OTIAS: OcTree Implicit Adaptive Sampling for Multispectral and Hyperspectral Image Fusion

Shangqi Deng, Jun Ma, Liang-Jian Deng et al.

AAAI 2025
#20278

Boundary-Aware Temporal Dynamic Pseudo-Supervision Pairs Generation for Zero-Shot Natural Language Video Localization

Xiongwen Deng, Haoyu Tang, Han Jiang et al.

AAAI 2025
#20279

Occlusion-Insensitive Talking Head Video Generation via Facelet Compensation

Yuhui Deng, Yuqin Lu, Yangyang Xu et al.

AAAI 2025
#20280

Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models

Guanqi Ding, Chengyu Yang, Shuhui Wang et al.

AAAI 2025
#20281

AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds

Ziheng Ding, Xiaze Zhang, Qi Jing et al.

AAAI 2025
#20282

GarFast: Realistic and Fast Garment Transfer with a Simplified Parser-Free Approach

Chenghu Du, Junyin Wang, Yi Rong et al.

AAAI 2025
#20283

Latent Diffusion-Enhanced Virtual Try-On via Optimized Pseudo-Label Generation

Chenghu Du, Junyin Wang, Feng Yu et al.

AAAI 2025
#20284

HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions

Keyu Du, Hao Xu, Haipeng Li et al.

AAAI 2025 • arXiv:2503.07019
#20285

IniRetinex: Rethinking Retinex-type Low-Light Image Enhancer via Initialization Perspective

Guodong Fan, Zishu Yao, Guang-Yong Chen et al.

AAAI 2025
#20286

Vision-guided Text Mining for Unsupervised Cross-modal Hashing with Community Similarity Quantization

Haozhi Fan, Yuan Cao

AAAI 2025
#20287

CoSDA: Enhancing the Robustness of Inversion-based Generative Image Watermarking Framework

Han Fang, Kejiang Chen, Zijin Yang et al.

AAAI 2025
#20288

SSUN-Net: Spatial-Spectral Prior-Aware Unfolding Network for Pan-Sharpening

Shijie Fang, Hongping Gan

AAAI 2025
#20289

Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration

Siyang Feng, Huadeng Wang, Chu Han et al.

AAAI 2025
#20290

HDLayout: Hierarchical and Directional Layout Planning for Arbitrary Shaped Visual Text Generation

Tonghui Feng, Chunsheng Yan, Qianru Wang et al.

AAAI 2025
#20291

Simplifying Control Mechanism in Text-to-Image Diffusion Models

Zhida Feng, Li Chen, Yuenan Sun et al.

AAAI 2025
#20292

BGHR: Bridging the Gap Between HBox-Supervised and RBox-Supervised Oriented Object Detection via Adaptive Fine-Grained Sample Mining

Chenlin Fu, Yingying Zhu

AAAI 2025
#20293

Foundation Model Driven Appearance Extraction for Robust Multiple Object Tracking

Teng Fu, Haiyang Yu, Ke Niu et al.

AAAI 2025
#20294

MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform Watermark

Keke Gai, Dongjue Wang, Jing Yu et al.

AAAI 2025
#20295

DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction

Lianqiang Gan, Junyu Lai, Jingze Ju et al.

AAAI 2025
#20296

AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context Learning

Jun Gao, Qian Qiao, Tianxiang Wu et al.

AAAI 2025
#20297

TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations

Mingze Gao, Jingyu Liu, Mingda Li et al.

AAAI 2025
#20298

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning

Shengbo Gu, Yu-Kun Qiu, Yu-Ming Tang et al.

AAAI 2025 • arXiv:2502.02372
#20299

OT-StainNet: Optimal Transport Driven Semantic Matching for Weakly Paired H&E-to-IHC Stain Transfer

Xianchao Guan, Yifeng Wang, Ye Zhang et al.

AAAI 2025
#20300

Enhancing Low-Rank Adaptation with Recoverability-Based Reinforcement Pruning for Object Counting

Haojie Guo, Junyu Gao, Yuan Yuan

AAAI 2025
#20301

SpikeGS: Reconstruct 3D Scene Captured by a Fast-Moving Bio-Inspired Camera

Yijia Guo, Liwen Hu, Yuanxi Bai et al.

AAAI 2025
#20302

ID-Sculpt: ID-aware 3D Head Generation from Single In-the-wild Portrait Image

Jinkun Hao, Junshu Tang, Jiangning Zhang et al.

AAAI 2025 • arXiv:2406.16710
#20303

Efficient Online Training for Zero-Shot Time-Lapse Microscopy Denoising and Super-Resolution

Ruian He, Ri Cheng, Xinkai Lyu et al.

AAAI 2025
#20304

FashionTailor: Controllable Clothing Editing for Human Images with Appearance Preserving

Jie Hou, Jianghong Ma, Xiangyu Mu et al.

AAAI 2025
#20305

Prompt Tuning In a Compact Attribute Space

Shiyu Hou, Tianfei Zhou, Shuai Zhang et al.

AAAI 2025
#20306

Identity-Text Video Corpus Grounding

Bin Huang, Xin Wang, Hong Chen et al.

AAAI 2025
#20307

AUTE: Peer-Alignment and Self-Unlearning Boost Adversarial Robustness for Training Ensemble Models

Lifeng Huang, Tian Su, Chengying Gao et al.

AAAI 2025
#20308

CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction

Xiang Huang, Qing Zhang, Jian-Fang Hu et al.

AAAI 2025
#20309

Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection

Mingda Jia, Liming Zhao, Ge Li et al.

AAAI 2025 • arXiv:2412.08506
#20310

ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling

Jianan Jiang, Hao Tang, Zhilin Jiang et al.

AAAI 2025 • arXiv:2406.11551
#20311

SCCS: Deep Neural Spectral Clustering for Self-Supervised Subcellular Structure Segmentation

Jimao Jiang, Diya Sun, Tianbing Wang et al.

AAAI 2025
#20312

Restabilizing Diffusion Models with Predictive Noise Fusion Strategy for Image Super-Resolution

Luoqian Jiang, Yong Guo, Bingna Xu et al.

AAAI 2025
#20313

Query Quantized Neural SLAM

Sijia Jiang, Jing Hua, Zhizhong Han

AAAI 2025 • arXiv:2412.16476
#20314

A Method for Enhancing Generalization of Adam by Multiple Integrations

Long Jin, Han Nong, Liangming Chen et al.

AAAI 2025 • arXiv:2412.12473
#20315

Bridging the Semantic Granularity Gap Between Text and Frame Representations for Partially Relevant Video Retrieval

WooJin Jun, WonJun Moon, Cheol-Ho Cho et al.

AAAI 2025
#20316

DiffusionREC: Diffusion Model with Adaptive Condition for Referring Expression Comprehension

Jingcheng Ke, Waikeung Wong, Jia Wang et al.

AAAI 2025
#20317

Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration

Hyeonseok Kim, Byeongkeun Kang, Yeejin Lee

AAAI 2025 • arXiv:2509.08280
#20318

APR-RD: Complemental Two Steps for Self-Supervised Real Image Denoising

Hyunjun Kim, Nam Ik Cho

AAAI 2025
#20319

TSDF-Based Efficient Motion-Compensated Temporal Interpolation for 3D Dynamic Sequences

Soowoong Kim, Minseong Kwon, Junho Choi et al.

AAAI 2025
#20320

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning

Taewhan Kim, Soeun Lee, Si-Woo Kim et al.

AAAI 2025 • arXiv:2412.19289
#20321

Sequence Matters: Harnessing Video Models in 3D Super-Resolution

Hyun-kyu Ko, Dongheok Park, Youngin Park et al.

AAAI 2025 • arXiv:2412.11525
#20322

A Unified Degradation-Robust Approach to SSL and UDA for 3D Medical Images

Suruchi Kumari, Pravendra Singh

AAAI 2025
#20323

NBA3D: Neighbor-Based Confidence Adjustment for 3D Rare Object Detection Using LiDAR

Jooyoung Lee, Jaeyoon Lee, Jongwon Choi

AAAI 2025
#20324

Enabling Region-Specific Control via Lassos in Point-Based Colorization

Sanghyeon Lee, Jooyeol Yun, Jaegul Choo

AAAI 2025 • arXiv:2412.13469
#20325

Concept Matching with Agent for Out-of-Distribution Detection

Yuxiao Lee, Xiaofeng Cao, Jingcai Guo et al.

AAAI 2025 • arXiv:2405.16766
#20326

FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-from-gradients

Jiaqi Leng, Yakun Ju, Yuanxu Duan et al.

AAAI 2025
#20327

FEAST-Mamba: FEAture and SpaTial Aware Mamba Network with Bidirectional Orthogonal Fusion for Cross-Modal Point Cloud Segmentation

Chade Li, Pengju Zhang, Bo Liu et al.

AAAI 2025
#20328

An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques

Chunxiao Li, Xiaoxiao Wang, Boming Miao et al.

AAAI 2025 • arXiv:2412.09063
#20329

Cascaded Diffusion Models for Virtual Try-On: Improving Control and Resolution

Guangyuan Li, Yongkang Wang, Junsheng Luan et al.

AAAI 2025
#20330

MaskViM: Domain Generalized Semantic Segmentation with State Space Models

Jiahao Li, Yang Lu, Yuan Xie et al.

AAAI 2025
#20331

Know Where You Are From: Event-Based Segmentation via Spatio-Temporal Propagation

Ke Li, Gengyu Lyu, Hao Chen et al.

AAAI 2025
#20332

Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization

Maodong Li, Chao Zheng, Jian Wang et al.

AAAI 2025
#20333

Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning

Rong Li, Liang Li, Jiehua Zhang et al.

AAAI 2025
#20334

Enhancing Generalizability via Utilization of Unlabeled Data for Occupancy Perception

Ruihang Li, Tao Li, Shanding Ye et al.

AAAI 2025
#20335

DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs

Shiyu Li, Pengxu Wei, Pengchong Qiao et al.

AAAI 2025
#20336

Mamba-CAD: State Space Model for 3D Computer-Aided Design Generative Modeling

Xueyang Li, Yunzhong Lou, Yu Song et al.

AAAI 2025
#20337

StructSR: Refuse Spurious Details in Real-World Image Super-Resolution

Yachao Li, Dong Liang, Tianyu Ding et al.

AAAI 2025 • arXiv:2501.05777
#20338

Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study

Zhangheng Li, Tianlong Chen, Linyi Li et al.

AAAI 2025
#20339

ProsodyTalker: 3D Visual Speech Animation via Prosody Decomposition

Zonglin Li, Xiaoqian Lv, Qinglin Liu et al.

AAAI 2025
#20340

Exploring the Potential of Large Vision-Language Models for Unsupervised Text-Based Person Retrieval

Zongyi Li, Li Jianbo, Yuxuan Shi et al.

AAAI 2025
#20341

Semantic-guided Masked Mutual Learning for Multi-modal Brain Tumor Segmentation with Arbitrary Missing Modalities

Guoyan Liang, Qin Zhou, Zhe Wang et al.

AAAI 2025 • arXiv:2507.07592
#20342

Progressive Distribution Matching for Federated Semi-Supervised Learning

Dongping Liao, Xitong Gao, Yabo Xu et al.

AAAI 2025
#20343

Multi-Granularity Video Object Segmentation

Sangbeom Lim, Seongchan Kim, Seungjun An et al.

AAAI 2025 • arXiv:2412.01471
#20344

Memory Efficient Matting with Adaptive Token Routing

Yiheng Lin, Yihan Hu, Chenyi Zhang et al.

AAAI 2025 • arXiv:2412.10702
#20345

Deep Hierarchies and Invariant Disease-Indicative Feature Learning for Computer Aided Diagnosis of Multiple Fundus Diseases

Yuxin Lin, Wei Wang, Xiaoling Luo et al.

AAAI 2025
#20346

SOVGaussian: Sparse-View 3D Gaussian Splatting for Open-Vocabulary Scene Understanding

Peng Ling, Tiao Tan, Jiaqi Lin et al.

AAAI 2025
#20347

UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer

Delong Liu, Zhaohui Hou, Mingjie Zhan et al.

AAAI 2025 • arXiv:2412.09389
#20348

Zero-Shot Noise2Mean: Gap Minimization for Efficient Denoising from a Single Noisy Image

Duo Liu, Yiqi Shi, Guoyin Zhang et al.

AAAI 2025
#20349

PEIE: Physics Embedded Illumination Estimation for Adaptive Dehazing

Huaizhuo Liu, Hai-Miao Hu, Yonglong Jiang et al.

AAAI 2025
#20350

Union Is Strength! Unite the Power of LLMs and MLLMs for Chart Question Answering

Jiapeng Liu, Liang Li, Shihao Rao et al.

AAAI 2025
#20351

UP-Restorer: When Unrolling Meets Prompts for Unified Image Restoration

Minghao Liu, Wenhan Yang, Jinyi Luo et al.

AAAI 2025
#20352

Path-Adaptive Matting for Efficient Inference Under Various Computational Cost Constraints

Qinglin Liu, Zonglin Li, Xiaoqian Lv et al.

AAAI 2025 • arXiv:2503.03228
#20353

Multi-view Consistent 3D Panoptic Scene Understanding

Xianzhu Liu, Xin Sun, Haozhe Xie et al.

AAAI 2025
#20354

Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation

Yajie Liu, Guodong Wang, Jinjin Zhang et al.

AAAI 2025
#20355

DoGA: Enhancing Grounded Object Detection via Grouped Pre-Training with Attributes

Yang Liu, Feng Hou, Yunjie Peng et al.

AAAI 2025
#20356

Towards Robust Visual Question Answering via Prompt-Driven Geometric Harmonization

Yishu Liu, Jiawei Zhu, Congcong Wen et al.

AAAI 2025
#20357

See Through Their Minds: Learning Transferable Brain Decoding Models from Cross-Subject fMRI

Yulong Liu, Yongqiang Ma, Guibo Zhu et al.

AAAI 2025
#20358

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning

Yuti Liu, Shice Liu, Junyuan Gao et al.

AAAI 2025 • arXiv:2412.11952
#20359

Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators

Bin Lu, Xinyu Xiao, Changzhou Zhang et al.

AAAI 2025
#20360

DeMo: Deep Motion Field Consensus with Learnable Kernels for Two-view Correspondence Learning

Yifan Lu, Jiajun Le, Zizhuo Li et al.

AAAI 2025
#20361

Beyond Pixel and Object: Part Feature as Reference for Few-Shot Video Object Segmentation

Naisong Luo, Guoxin Xiong, Tianzhu Zhang

AAAI 2025
#20362

Revisiting Change Captioning from Self-supervised Global-Part Alignment

Feixiao Lv, Rui Wang, Lihua Jing

AAAI 2025
#20363

ScaleMatch: Multi-scale Consistency Enhancement for Semi-supervised Semantic Segmentation

Liang Lv, Lefei Zhang

AAAI 2025
#20364

Aligning and Prompting Anything for Zero-Shot Generalized Anomaly Detection

Jitao Ma, Weiying Xie, Hangyu Ye et al.

AAAI 2025
#20365

Instruct Where the Model Fails: Generative Data Augmentation via Guided Self-contrastive Fine-tuning

Weijian Ma, Ruoxin Chen, Keyue Zhang et al.

AAAI 2025
#20366

A Trusted Lesion-assessment Network for Interpretable Diagnosis of Coronary Artery Disease in Coronary CT Angiography

Xinghua Ma, Xinyan Fang, Mingye Zou et al.

AAAI 2025
#20367

Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts

Yue Ma, Yingqing He, Hongfa Wang et al.

AAAI 2025
#20368

Few-Shot Fine-Grained Image Classification with Progressively Feature Refinement and Continuous Relationship Modeling

Zhen-Xiang Ma, Zhen-Duo Chen, Tai Zheng et al.

AAAI 2025
#20369

OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem

Xinji Mai, Haoran Wang, Zeng Tao et al.

AAAI 2025
#20370

Sp3ctralMamba: Physics-Driven Joint State Space Model for Hyperspectral Image Reconstruction

Ge Meng, Jingyan Tu, Jingjia Huang et al.

AAAI 2025
#20371

Qua2SeDiMo: Quantifiable Quantization Sensitivity of Diffusion Models

Keith G. Mills, Mohammad Salameh, Ruichen Chen et al.

AAAI 2025
#20372

Energy vs. Noise: Towards Robust Temporal Action Localization in Open-World

Chenyu Mu, Jiahua Li, Kun Wei et al.

AAAI 2025
#20373

Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space

Linchao Pan, Can Gao, Jie Zhou et al.

AAAI 2025 • arXiv:2501.11053
#20374

Fair Training with Zero Inputs

Wenjie Pan, Jianqing Zhu, Huanqiang Zeng

AAAI 2025
#20375

Procedure Knowledge Decoupled Distillation Strategy for Procedure Planning in Instructional Videos

Xiaotian Pan, Zhaobo Qi, Xin Sun et al.

AAAI 2025
#20376

Point Cloud Semantic Segmentation with Sparse and Inhomogeneous Annotations

Zhiyi Pan, Nan Zhang, Wei Gao et al.

AAAI 2025 • arXiv:2312.06259
#20377

Partially Blinded Unlearning: Class Unlearning for Deep Networks from Bayesian Perspective

Subhodip Panda, Shashwat Sourav, Prathosh A.P.

AAAI 2025
#20378

Beyond Text: Fine-Grained Multi-Modal Fact Verification with Hypergraph Transformers

Hui Pang, Chaozhuo Li, Litian Zhang et al.

AAAI 2025
#20379

SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models

Joon Hyun Park, Kumju Jo, Sungyong Baik

AAAI 2025 • arXiv:2507.19808
#20380

CDE-Learning: Camera Deviation Elimination Learning for Unsupervised Person Re-identification

Jinjia Peng, Songyu Zhang, Huibing Wang

AAAI 2025
#20381

Boosting Image De-Raining via Central-Surrounding Synergistic Convolution

Long Peng, Yang Wang, Xin Di et al.

AAAI 2025
#20382

3D-aware Select, Expand, and Squeeze Token for Aerial Action Recognition

Luying Peng, Xiangbo Shu, Yazhou Yao et al.

AAAI 2025
#20383

OAMaskFlow: Occlusion-Aware Motion Mask for Scene Flow

Xiongfeng Peng, Zhihua Liu, Weiming Li et al.

AAAI 2025
#20384

HVDualformer: Histogram-Vision Dual Transformer for White Balance

Yan-Tsung Peng, Guan-Rong Chen

AAAI 2025
#20385

Leveraging Anatomical Consistency for Multi-Object Detection in Ultrasound Images via Source-free Unsupervised Domain Adaptation

Bin Pu, Xingguo Lv, Jiewen Yang et al.

AAAI 2025
#20386

Dive into Aerial Remote Sensing Underwater Depth Estimation with Hyperspectral Imagery

Jiahao Qi, Xingyue Liu, Chen Chen et al.

AAAI 2025
#20387

PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement

Wei Qian, Gaoji Su, Dan Guo et al.

AAAI 2025
#20388

Holistic Correction with Object Prototype for Video Object Segmentation

Shengye Qiao, Changqun Xia, Yanjie Liang et al.

AAAI 2025
#20389

Integrating Low-Level Visual Cues for Enhanced Unsupervised Semantic Segmentation

Yuhao Qing, Dan Zeng, Shaorong Xie et al.

AAAI 2025
#20390

High-Fidelity Polarimetric Implicit 3D Reconstruction with View-Dependent Physical Representation

Yu Qiu, Sijia Wen, Hainan Zhang et al.

AAAI 2025
#20391

HSOD-BIT-V2: A Challenging Benchmark for Hyperspectral Salient Object Detection

Yuhao Qiu, Shuyan Bai, Tingfa Xu et al.

AAAI 2025
#20392

Universal Features Guided Zero-Shot Category-Level Object Pose Estimation

Wentian Qu, Chenyu Meng, Heng Li et al.

AAAI 2025 • arXiv:2501.02831
#20393

CDTR: Semantic Alignment for Video Moment Retrieval Using Concept Decomposition Transformer

Ran Ran, Jiwei Wei, Xiangyi Cai et al.

AAAI 2025
#20394

GenHMR: Generative Human Mesh Recovery

Muhammad Usama Saleem, Ekkasit Pinyoanuntapong, Pu Wang et al.

AAAI 2025 • arXiv:2412.14444
#20395

In2NeCT: Inter-class and Intra-class Neural Collapse Tuning for Semantic Segmentation of Imbalanced Remote Sensing Images

Junao Shen, Qiyun Hu, Tian Feng et al.

AAAI 2025
#20396

Neural Block Compression: Variable Bitrates Feature Blocks for Texture Representation

Rui Shi, Yishun Dou, Zhong Zheng et al.

AAAI 2025
#20397

OGP-Net: Optical Guidance Meets Pixel-Level Contrastive Distillation for Robust Multi-Modal and Missing Modality Segmentation

Aniruddh Sikdar, Jayant Teotia, Suresh Sundaram

AAAI 2025
#20398

Fine-Grained Perception in Panoramic Scenes: A Novel Task, Dataset, and Method for Object Importance Ranking

Jia Song, Chenglizhao Chen, Xu Yu et al.

AAAI 2025
#20399

CtrlAvatar: Controllable Avatars Generation via Disentangled Invertible Networks

Wenfeng Song, Yang Ding, Fei Hou et al.

AAAI 2025
#20400

Temporal Coherent Object Flow for Multi-Object Tracking

Zikai Song, Run Luo, Lintao Ma et al.

AAAI 2025