AAAI 2025 Papers
3,028 papers found
Unsupervised Domain Adaptive Person Search via Dual Self-Calibration
Linfeng Qi, Huibing Wang, Jiqing Zhang et al.
Unsupervised Kernel-based Multi-view Feature Selection with Robust Self-representation and Binary Hashing
Rongyao Hu, Jiangzhang Gan, Mengmeng Zhan et al.
Unsupervised Photometric-Consistent Depth Estimation from Endoscopic Monocular Video
Shijie Li, Weijun Lin, Qingyuan Xiang et al.
Unsupervised Region-Based Image Editing of Denoising Diffusion Models
Zixiang Li, Yue Song, Renshuai Tao et al.
Unsupervised Self-Prior Embedding Neural Representation for Iterative Sparse-View CT Reconstruction
Xuanyu Tian, Lixuan Chen, Qing Wu et al.
Unsupervised Translation of Emergent Communication
Ido Levy, Orr Paradise, Boaz Carmeli et al.
Unveiling Multi-View Anomaly Detection: Intra-view Decoupling and Inter-view Fusion
Kai Mao, Yiyang Lian, Yangyang Wang et al.
Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning
Xinlu Zhang, Zhiyu Zoey Chen, Xi Ye et al.
Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation
Yajie Liu, Guodong Wang, Jinjin Zhang et al.
Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks Against GNN-Based Fraud Detectors
Jinhyeok Choi, Heehyeon Kim, Joyce Jiyoung Whang
UP-Restorer: When Unrolling Meets Prompts for Unified Image Restoration
Minghao Liu, Wenhan Yang, Jinyi Luo et al.
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios
Baichuan Zhou, Haote Yang, Dairong Chen et al.
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation
Wanjiang Weng, Hongsong Wang, Junbo Wang et al.
User Preference Meets Pareto-Optimality in Multi-Objective Bayesian Optimization
Joshua Hang Sai Ip, Ankush Chakrabarty, Ali Mesbah et al.
Utilize the Flow Before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning
Runchuan Zhu, Zhipeng Ma, Jiang Wu et al.
Utterance-level Emotion Recognition in Conversation with Conversation-level Supervision
Ximing Li, Yuanchao Dai, Zhiyao Yang et al.
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Hangzhou He, Lei Zhu, Xinliang Zhang et al.
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
Hang Hua, Yunlong Tang, Chenliang Xu et al.
VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention
Jiangning Wei, Lixiong Qin, Bo Yu et al.
VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval
Peng Wu, Wanshun Su, Xiangteng He et al.
VarDrop: Enhancing Training Efficiency by Reducing Variate Redundancy in Periodic Time Series Forecasting
Junhyeok Kang, Yooju Shin, Jae-Gil Lee
VCR: A “Cone of Experience” Driven Synthetic Data Generation Framework for Mathematical Reasoning
Sannyuya Liu, Jintian Feng, Xiaoxuan Shen et al.
VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment
Shangkun Sun, Xiaoyu Liang, Songlin Fan et al.
VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence
Hao Li, Hao Fei, Zechao Hu et al.
Verifying Proportionality in Temporal Voting
Edith Elkind, Svetlana Obraztsova, Jannik Peters et al.
VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool
Chia-Tung Ho, Haoxing Ren, Brucek Khailany
VERO: Verification and Zero-Shot Feedback Acquisition for Few-Shot Multimodal Aspect-Level Sentiment Classification
Kai Sun, Hao Wu, Bin Shi et al.
VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement
Haocun Ye, Xinlong Jiang, Chenlong Gao et al.
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Zhipeng Chen, Lan Yang, Yonggang Qi et al.
VERSE: Verification-based Self-Play for Code Instructions
Hao Jiang, Qi Liu, Rui Li et al.
VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping
Zheng Chen, Yu Zeng, Zehui Chen et al.
VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
Muhammet Furkan Ilaslan, Ali Köksal, Kevin Qinghong Lin et al.
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
Chao Pang, Xingxing Weng, Jiang Wu et al.
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
Ji Soo Lee, Jongha Kim, Jeehye Na et al.
Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model
Hang Zhou, Jiale Cai, Yuteng Ye et al.
Video Diffusion Models Are Strong Video Inpainter
Minhyeok Lee, Suhwan Cho, Chajin Shin et al.
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang, Yuxiang Wei, Xianhui Lin et al.
Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.
Video Summarization Using Denoising Diffusion Probabilistic Model
Zirui Shang, Yubo Zhu, Hongxi Li et al.
VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos
Baoyu Liang, Qile Su, Shoutai Zhu et al.
Vietnamese Words Are Not Constructed from Syllables: Rethinking the Role of Word Segmentation in Natural Language Processing for Vietnamese Texts
Nghia Hieu Nguyen, Dat Tien Nguyen, Ngan Luu-Thuy Nguyen
View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection
Qi Zhang, Zhouhang Luo, Tao Yu et al.
ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese
Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao, Xinggang Wang, Lianghui Zhu et al.
VIoTGPT: Learning to Schedule Vision Tools Towards Intelligent Video Internet of Things
Yaoyao Zhong, Mengshi Qi, Rui Wang et al.
ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning
Taewhan Kim, Soeun Lee, Si-Woo Kim et al.
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction
Yi Feng, Yu Han, Xijing Zhang et al.
Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning
Xingbo Fu, Zihan Chen, Yinhan He et al.
Vision-aware Multimodal Prompt Tuning for Uploadable Multi-source Few-shot Domain Adaptation
Kuanghong Liu, Jin Wang, Kangjian He et al.
Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning
Hao Ma, Shijie Wang, Zhiqiang Pu et al.