AAAI 2025 Papers

3,028 papers found • Page 59 of 61

Unsupervised Domain Adaptive Person Search via Dual Self-Calibration

Linfeng Qi, Huibing Wang, Jiqing Zhang et al.

AAAI 2025paper

Unsupervised Kernel-based Multi-view Feature Selection with Robust Self-representation and Binary Hashing

Rongyao Hu, Jiangzhang Gan, Mengmeng Zhan et al.

AAAI 2025paper

Unsupervised Photometric-Consistent Depth Estimation from Endoscopic Monocular Video

Shijie Li, Weijun Lin, Qingyuan Xiang et al.

AAAI 2025paper

Unsupervised Region-Based Image Editing of Denoising Diffusion Models

Zixiang Li, Yue Song, Renshuai Tao et al.

AAAI 2025paper
1
citations

Unsupervised Self-Prior Embedding Neural Representation for Iterative Sparse-View CT Reconstruction

Xuanyu Tian, Lixuan Chen, Qing Wu et al.

AAAI 2025paper

Unsupervised Translation of Emergent Communication

Ido Levy, Orr Paradise, Boaz Carmeli et al.

AAAI 2025paper

Unveiling Multi-View Anomaly Detection: Intra-view Decoupling and Inter-view Fusion

Kai Mao, Yiyang Lian, Yangyang Wang et al.

AAAI 2025paper
1
citations

Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning

Xinlu Zhang, Zhiyu Zoey Chen, Xi Ye et al.

AAAI 2025paper
30
citations

Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation

Yajie Liu, Guodong Wang, Jinjin Zhang et al.

AAAI 2025paper

Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks Against GNN-Based Fraud Detectors

Jinhyeok Choi, Heehyeon Kim, Joyce Jiyoung Whang

AAAI 2025paper
4
citations

UP-Restorer: When Unrolling Meets Prompts for Unified Image Restoration

Minghao Liu, Wenhan Yang, Jinyi Luo et al.

AAAI 2025paper

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios

Baichuan Zhou, Haote Yang, Dairong Chen et al.

AAAI 2025paper
26
citations

USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation

Wanjiang Weng, Hongsong Wang, Junbo Wang et al.

AAAI 2025paper

User Preference Meets Pareto-Optimality in Multi-Objective Bayesian Optimization

Joshua Hang Sai Ip, Ankush Chakrabarty, Ali Mesbah et al.

AAAI 2025paper
2
citations

Utilize the Flow Before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning

Runchuan Zhu, Zhipeng Ma, Jiang Wu et al.

AAAI 2025paper
6
citations

Utterance-level Emotion Recognition in Conversation with Conversation-level Supervision

Ximing Li, Yuanchao Dai, Zhiyao Yang et al.

AAAI 2025paper

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer

Hangzhou He, Lei Zhu, Xinliang Zhang et al.

AAAI 2025paper
8
citations

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning

Hang Hua, Yunlong Tang, Chenliang Xu et al.

AAAI 2025paper
47
citations

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Jiangning Wei, Lixiong Qin, Bo Yu et al.

AAAI 2025paper
4
citations

VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval

Peng Wu, Wanshun Su, Xiangteng He et al.

AAAI 2025paper

VarDrop: Enhancing Training Efficiency by Reducing Variate Redundancy in Periodic Time Series Forecasting

Junhyeok Kang, Yooju Shin, Jae-Gil Lee

AAAI 2025paper
3
citations

VCR: A “Cone of Experience” Driven Synthetic Data Generation Framework for Mathematical Reasoning

Sannyuya Liu, Jintian Feng, Xiaoxuan Shen et al.

AAAI 2025paper

VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment

Shangkun Sun, Xiaoyu Liang, Songlin Fan et al.

AAAI 2025paper
6
citations

VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence

Hao Li, Hao Fei, Zechao Hu et al.

AAAI 2025paper
4
citations

Verifying Proportionality in Temporal Voting

Edith Elkind, Svetlana Obraztsova, Jannik Peters et al.

AAAI 2025paper

VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool

Chia-Tung Ho, Haoxing Ren, Brucek Khailany

AAAI 2025paper
72
citations

VERO: Verification and Zero-Shot Feedback Acquisition for Few-Shot Multimodal Aspect-Level Sentiment Classification

Kai Sun, Hao Wu, Bin Shi et al.

AAAI 2025paper

VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement

Haocun Ye, Xinlong Jiang, Chenlong Gao et al.

AAAI 2025paper

VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis

Zhipeng Chen, Lan Yang, Yonggang Qi et al.

AAAI 2025paper

VERSE: Verification-based Self-Play for Code Instructions

Hao Jiang, Qi Liu, Rui Li et al.

AAAI 2025paper

VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping

Zheng Chen, Yu Zeng, Zehui Chen et al.

AAAI 2025paper

VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting

Muhammet Furkan Ilaslan, Ali Köksal, Kevin Qinghong Lin et al.

AAAI 2025paper

VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis

Chao Pang, Xingxing Weng, Jiang Wu et al.

AAAI 2025paper

VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning

Ji Soo Lee, Jongha Kim, Jeehye Na et al.

AAAI 2025paper
7
citations

Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model

Hang Zhou, Jiale Cai, Yuteng Ye et al.

AAAI 2025paper
14
citations

Video Diffusion Models Are Strong Video Inpainter

Minhyeok Lee, Suhwan Cho, Chajin Shin et al.

AAAI 2025paper
14
citations

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

Yabo Zhang, Yuxiang Wei, Xianhui Lin et al.

AAAI 2025paper

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.

AAAI 2025paper

Video Summarization Using Denoising Diffusion Probabilistic Model

Zirui Shang, Yubo Zhu, Hongxi Li et al.

AAAI 2025paper

VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos

Baoyu Liang, Qile Su, Shoutai Zhu et al.

AAAI 2025paper
2
citations

Vietnamese Words Are Not Constructed from Syllables: Rethinking the Role of Word Segmentation in Natural Language Processing for Vietnamese Texts

Nghia Hieu Nguyen, Dat Tien Nguyen, Ngan Luu-Thuy Nguyen

AAAI 2025paper

View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection

Qi Zhang, Zhouhang Luo, Tao Yu et al.

AAAI 2025paper
1
citations

ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese

Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.

AAAI 2025paper

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention

Bencheng Liao, Xinggang Wang, Lianghui Zhu et al.

AAAI 2025paper
8
citations

VIoTGPT: Learning to Schedule Vision Tools Towards Intelligent Video Internet of Things

Yaoyao Zhong, Mengshi Qi, Rui Wang et al.

AAAI 2025paper

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning

Taewhan Kim, Soeun Lee, Si-Woo Kim et al.

AAAI 2025paper

ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction

Yi Feng, Yu Han, Xijing Zhang et al.

AAAI 2025paper
5
citations

Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning

Xingbo Fu, Zihan Chen, Yinhan He et al.

AAAI 2025paper

Vision-aware Multimodal Prompt Tuning for Uploadable Multi-source Few-shot Domain Adaptation

Kuanghong Liu, Jin Wang, Kangjian He et al.

AAAI 2025paper
2
citations

Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning

Hao Ma, Shijie Wang, Zhiqiang Pu et al.

AAAI 2025paper