Papers

Unsupervised Diffusion-Based Degradation Modeling for Real-World Super-Resolution

Yuying Chen, Mingde Yao, Wenbo Li et al.

AAAI 2025 paper

Unsupervised Domain Adaptive Person Search via Dual Self-Calibration

Linfeng Qi, Huibing Wang, Jiqing Zhang et al.

AAAI 2025 paper • arXiv:2412.16506

Unsupervised Kernel-based Multi-view Feature Selection with Robust Self-representation and Binary Hashing

Rongyao Hu, Jiangzhang Gan, Mengmeng Zhan et al.

AAAI 2025 paper

Unsupervised Photometric-Consistent Depth Estimation from Endoscopic Monocular Video

Shijie Li, Weijun Lin, Qingyuan Xiang et al.

AAAI 2025 paper • 3 citations

Unsupervised Region-Based Image Editing of Denoising Diffusion Models

Zixiang Li, Yue Song, Renshuai Tao et al.

AAAI 2025 paper • arXiv:2412.12912 • 1 citation

Unsupervised Self-Prior Embedding Neural Representation for Iterative Sparse-View CT Reconstruction

Xuanyu Tian, Lixuan Chen, Qing Wu et al.

AAAI 2025 paper • arXiv:2502.05445

Unsupervised Translation of Emergent Communication

Ido Levy, Orr Paradise, Boaz Carmeli et al.

AAAI 2025 paper • arXiv:2502.07552 • 1 citation

Unveiling Multi-View Anomaly Detection: Intra-view Decoupling and Inter-view Fusion

Kai Mao, Yiyang Lian, Yangyang Wang et al.

AAAI 2025 paper • 1 citation

Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning

Xinlu Zhang, Zhiyu Zoey Chen, Xi Ye et al.

AAAI 2025 paper • arXiv:2405.20535 • 30 citations

Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation

Yajie Liu, Guodong Wang, Jinjin Zhang et al.

AAAI 2025 paper

Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks Against GNN-Based Fraud Detectors

Jinhyeok Choi, Heehyeon Kim, Joyce Jiyoung Whang

AAAI 2025 paper • arXiv:2412.18370 • 4 citations

Unwinding Rotations Reduces VR Sickness in Nonsimulated Immersive Telepresence

Filip Kulisiewicz, Basak Sakcak, Evan G. Center et al.

ISMAR 2025 paper • arXiv:2509.26439

UP-Restorer: When Unrolling Meets Prompts for Unified Image Restoration

Minghao Liu, Wenhan Yang, Jinyi Luo et al.

AAAI 2025 paper

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios

Baichuan Zhou, Haote Yang, Dairong Chen et al.

AAAI 2025 paper • arXiv:2408.17267 • 26 citations

USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation

Wanjiang Weng, Hongsong Wang, Junbo Wang et al.

AAAI 2025 paper • arXiv:2412.09220

User Preference Meets Pareto-Optimality in Multi-Objective Bayesian Optimization

Joshua Hang Sai Ip, Ankush Chakrabarty, Ali Mesbah et al.

AAAI 2025 paper • arXiv:2502.06971 • 4 citations

Utilize the Flow Before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning

Runchuan Zhu, Zhipeng Ma, Jiang Wu et al.

AAAI 2025 paper • arXiv:2410.06913 • 6 citations

Utterance-level Emotion Recognition in Conversation with Conversation-level Supervision

Ximing Li, Yuanchao Dai, Zhiyao Yang et al.

AAAI 2025 paper

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer

Hangzhou He, Lei Zhu, Xinliang Zhang et al.

AAAI 2025 paper • arXiv:2501.04975 • 8 citations

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning

Hang Hua, Yunlong Tang, Chenliang Xu et al.

AAAI 2025 paper • arXiv:2404.12353 • 48 citations

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Jiangning Wei, Lixiong Qin, Bo Yu et al.

AAAI 2025 paper • arXiv:2503.11004 • 5 citations

VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval

Peng Wu, Wanshun Su, Xiangteng He et al.

AAAI 2025 paper

VarDrop: Enhancing Training Efficiency by Reducing Variate Redundancy in Periodic Time Series Forecasting

Junhyeok Kang, Yooju Shin, Jae-Gil Lee

AAAI 2025 paper • arXiv:2501.14183 • 3 citations

VCR: A “Cone of Experience” Driven Synthetic Data Generation Framework for Mathematical Reasoning

Sannyuya Liu, Jintian Feng, Xiaoxuan Shen et al.

AAAI 2025 paper

VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment

Shangkun Sun, Xiaoyu Liang, Songlin Fan et al.

AAAI 2025 paper • arXiv:2408.11481 • 6 citations

VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence

Hao Li, Hao Fei, Zechao Hu et al.

AAAI 2025 paper • arXiv:2504.02227 • 4 citations

Verifying Proportionality in Temporal Voting

Edith Elkind, Svetlana Obraztsova, Jannik Peters et al.

AAAI 2025 paper • arXiv:2502.05949

VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool

Chia-Tung Ho, Haoxing Ren, Brucek Khailany

AAAI 2025 paper • arXiv:2408.08927 • 76 citations

VERO: Verification and Zero-Shot Feedback Acquisition for Few-Shot Multimodal Aspect-Level Sentiment Classification

Kai Sun, Hao Wu, Bin Shi et al.

AAAI 2025 paper

VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement

Haocun Ye, Xinlong Jiang, Chenlong Gao et al.

AAAI 2025 paper

VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis

Zhipeng Chen, Lan Yang, Yonggang Qi et al.

AAAI 2025 paper • arXiv:2412.11594

VERSE: Verification-based Self-Play for Code Instructions

Hao Jiang, Qi Liu, Rui Li et al.

AAAI 2025 paper

VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping

Zheng Chen, Yu Zeng, Zehui Chen et al.

AAAI 2025 paper

VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting

Muhammet Furkan Ilaslan, Ali Köksal, Kevin Qinghong Lin et al.

AAAI 2025 paper • arXiv:2412.11621

VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis

Chao Pang, Xingxing Weng, Jiang Wu et al.

AAAI 2025 paper • arXiv:2403.20213 • 53 citations

VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning

Ji Soo Lee, Jongha Kim, Jeehye Na et al.

AAAI 2025 paper • arXiv:2501.06761 • 8 citations

Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model

Hang Zhou, Jiale Cai, Yuteng Ye et al.

AAAI 2025 paper • arXiv:2412.09026 • 14 citations

Video Diffusion Models Are Strong Video Inpainter

Minhyeok Lee, Suhwan Cho, Chajin Shin et al.

AAAI 2025 paper • arXiv:2408.11402 • 14 citations

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

Yabo Zhang, Yuxiang Wei, Xianhui Lin et al.

AAAI 2025 paper • arXiv:2403.05438

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.

AAAI 2025 paper • arXiv:2412.08879

Video Summarization Using Denoising Diffusion Probabilistic Model

Zirui Shang, Yubo Zhu, Hongxi Li et al.

AAAI 2025 paper • arXiv:2412.08357

VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos

Baoyu Liang, Qile Su, Shoutai Zhu et al.

AAAI 2025 paper • arXiv:2506.02448 • 2 citations

Vietnamese Words Are Not Constructed from Syllables: Rethinking the Role of Word Segmentation in Natural Language Processing for Vietnamese Texts

Nghia Hieu Nguyen, Dat Tien Nguyen, Ngan Luu-Thuy Nguyen

AAAI 2025 paper

Viewpoint-Tolerant Depth Perception for Shared Extended Space Experience on Wall-Sized Display

Dooyoung Kim, Jinseok Hong, Heejeong Ko et al.

ISMAR 2025 paper • arXiv:2508.06889

View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection

Qi Zhang, Zhouhang Luo, Tao Yu et al.

AAAI 2025 paper • arXiv:2412.11428 • 1 citation

ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese

Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.

AAAI 2025 paper • arXiv:2412.15308

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention

Bencheng Liao, Xinggang Wang, Lianghui Zhu et al.

AAAI 2025 paper • arXiv:2405.18425 • 8 citations

VIoTGPT: Learning to Schedule Vision Tools Towards Intelligent Video Internet of Things

Yaoyao Zhong, Mengshi Qi, Rui Wang et al.

AAAI 2025 paper • 8 citations

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning

Taewhan Kim, Soeun Lee, Si-Woo Kim et al.

AAAI 2025 paper • arXiv:2412.19289

ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction

Yi Feng, Yu Han, Xijing Zhang et al.

AAAI 2025 paper • arXiv:2412.11210 • 5 citations