Most Cited AAAI "visual tokens" Papers

5,317 papers found • Page 25 of 27

#4801

Expanding the Scope of Negatives: Boosting Image-Text Matching with Negatives Distribution Guided Learning

Zhao Zhou, Weizhong Zhang, Xiangcheng Du et al.

AAAI 2025paper
#4802

Unifying Decision and Function Queries in Stochastic Boolean Satisfiability

Yu-Wei Fan, Jie-Hong Jiang

AAAI 2024paper
#4803

Achieving Ensemble-Like Performance in a Single Model: A Feature Diversification Framework for Image-Text Matching

Zhao Zhou, Yiqun Wang, Weizhong Zhang et al.

AAAI 2025paper
#4804

Improving Generalization of Deep Neural Networks by Optimum Shifting

Yuyan Zhou, Ye Li, Lei Feng et al.

AAAI 2025paperarXiv:2405.14111
#4805

AI-Powered Algorithm-Centric Quantum Processor Topology Design

Tian Li, Xiao-Yue Xu, Chen Ding et al.

AAAI 2025paperarXiv:2412.13805
#4806

Enhancing Training of Spiking Neural Network with Stochastic Latency

Srinivas Anumasa, Bhaskar Mukhoty, Velibor Bojkovic et al.

AAAI 2024paper
#4807

Test-Time Adaptation on Noisy Data via Model-Pruning-Based Filtering and Flatness-Aware Entropy Minimization

Xingzhi Zhou, Zhiliang Tian, Boyang Zhang et al.

AAAI 2025paper
#4808

Spatiotemporal-Aware Neural Fields for Dynamic CT Reconstruction

Qingyang Zhou, Yunfan Ye, Zhiping Cai

AAAI 2025paper
#4809

GLIC: General Format Learned Image Compression

MingSheng Zhou, MingMing Kong

AAAI 2025paper
#4810

SeqRank: Sequential Ranking of Salient Objects

AAAI 2024paper
#4811

SceneX: Procedural Controllable Large-Scale Scene Generation

Mengqi Zhou, Yuxi Wang, Jun Hou et al.

AAAI 2025paperarXiv:2403.15698
#4812

Joint Class-level and Instance-level Relationship Modeling for Novel Class Discovery

Jiaying Zhou, Qingchao Chen

AAAI 2025paper
#4813

Bilateral Gradual Semantics for Weighted Argumentation

AAAI 2024paper
#4814

Core-to-Global Reasoning for Compositional Visual Question Answering

Hao Zhou, Tingjin Luo, Zhangqi Jiang

AAAI 2025paper
#4815

What Makes Quantization for Large Language Model Hard? An Empirical Study from the Lens of Perturbation

Huankang Guan, Rynson W.H. Lau

AAAI 2024paper
#4816

Uncertainty-Aware Yield Prediction with Multimodal Molecular Features

Jiayuan Chen, Kehan Guo, Zhen Liu et al.

AAAI 2024paper
#4817

Preference Aware Dual Contrastive Learning for Item Cold-Start Recommendation

Wenbo Wang, Bingquan Liu, Lili Shan et al.

AAAI 2024paper
#4818

Learning Performance Maximizing Ensembles with Explainability Guarantees

Vincent Pisztora, Jia Li

AAAI 2024paperarXiv:2312.12715
#4819

Communication Efficient Distributed Newton Method over Unreliable Networks

Ming Wen, Chengchang Liu, Yuedong Xu

AAAI 2024paper
#4820

DECIDER: Difference-aware Contrastive Diffusion Model with Adversarial Perturbations for Image Change Captioning

Guojin Zhong, Jinhong Hu, Jiajun Chen et al.

AAAI 2025paper
#4821

MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection

Chuyi Zhong, Dingkang Yang, Peng Zhai et al.

AAAI 2025paper
#4822

Continual Vision-Language Retrieval via Dynamic Knowledge Rectification

Zhenyu Cui, Yuxin Peng, Xun Wang et al.

AAAI 2024paper
#4823

When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data

Rongjia Zheng, Qing Zhang, Yongwei Nie et al.

AAAI 2025paper
#4824

MuST: Robust Image Watermarking for Multi-Source Tracing

Guanjie Wang, Zehua Ma, Chang Liu et al.

AAAI 2024paper
#4825

Supportive Negatives Spectral Augmentation for Source-Free Cross-Domain Segmentation

Kexin Zheng, Haifeng Xia, Siyu Xia et al.

AAAI 2025paper
#4826

PHR-DIFF: Portrait Highlights Removal via Patch-aware Diffusion Model

Hongsheng Zheng, Zhongyun Bao, Gang Fu et al.

AAAI 2025paper
#4827

Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention

Naman Shah, Siddharth Srivastava

AAAI 2024paper
#4828

Optimizing the Optimization of Planning Domains by Automatic Action Schema Splitting

Mojtaba Elahi, Jussi Rintanen

AAAI 2024paper
#4829

Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Domain Learning

Chuangchuang Tan, Yao Zhao, Shikui Wei et al.

AAAI 2024paper
#4830

HFF-Tracker: A Hierarchical Fine-grained Fusion Tracker for Referring Multi-Object Tracking

Zeyong Zhao, Yanchao Hao, Minghao Zhang et al.

AAAI 2025paper
#4831

PMRC: Prompt-Based Machine Reading Comprehension for Few-Shot Named Entity Recognition

Jin Huang, Danfeng Yan, Yuanqiang Cai

AAAI 2024paper
#4832

NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark

Yuxuan Zhao, Weijian Ruan, He Li et al.

AAAI 2025paper
#4833

ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance

Yucheng Zhao, Gengyu Lyu, Ke Li et al.

AAAI 2025paper
#4834

Audio-Visual Adaptive Fusion Network for Question Answering Based on Contrastive Learning

Xujian Zhao, Yixin Wang, Peiquan Jin

AAAI 2025paper
#4835

Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation

Jing Li, Junsong Fan, Yuran Yang et al.

AAAI 2024paper
#4836

From Toxic to Trustworthy: Using Self-Distillation and Semi-supervised Methods to Refine Neural Networks

Xianda Zhang, Baolin Zheng, Jianbao Hu et al.

AAAI 2024paper
#4837

MINES: Message Intercommunication for Inductive Relation Reasoning over Neighbor-Enhanced Subgraphs

Ke Liang, Lingyuan Meng, Sihang Zhou et al.

AAAI 2024paper
#4838

Evolving Parameterized Prompt Memory for Continual Learning

Muhammad Rifki Kurniawan, Xiang Song, Zhiheng Ma et al.

AAAI 2024paper
#4839

ACAMDA: Improving Data Efficiency in Reinforcement Learning through Guided Counterfactual Data Augmentation

Yuewen Sun, Erli Wang, Biwei Huang et al.

AAAI 2024paper
#4840

Towards Safe Policy Learning under Partial Identifiability: A Causal Approach

Shalmali Joshi, Junzhe Zhang, Elias Bareinboim

AAAI 2024paper
#4841

HAGO-Net: Hierarchical Geometric Massage Passing for Molecular Representation Learning

Hongbin Pei, Taile Chen, Chen A et al.

AAAI 2024paper
#4842

Training-free Open-Vocabulary Semantic Segmentation via Diverse Prototype Construction and Sub-region Matching

Xuanpu Zhao, Dianmo Sheng, Zhentao Tan et al.

AAAI 2025paper
#4843

Excluding the Impossible for Open Vocabulary Semantic Segmentation

Shiyuan Zhao, Baodi Liu, Yu Bai et al.

AAAI 2025paper
#4844

Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation

Hongxu Zhao, Zelin Gao, Yue Wang et al.

AAAI 2025paper
#4845

Multi-scale Activation, Refinement, and Aggregation: Exploring Diverse Cues for Fine-Grained Bird Recognition

Zhicheng Zhang, Hao Tang, Jinhui Tang

AAAI 2025paperarXiv:2504.09215
#4846

Computing the Why-Provenance for Datalog Queries via SAT Solvers

Haitong Luo, Xuying Meng, Suhang Wang et al.

AAAI 2024paper
#4847

FairTrade: Achieving Pareto-Optimal Trade-Offs between Balanced Accuracy and Fairness in Federated Learning

Maryam Badar, Sandipan Sikdar, Wolfgang Nejdl et al.

AAAI 2024paper
#4848

Training-Free Image Manipulation Localization Using Diffusion Models

Zhenfei Zhang, Ming-Ching Chang, Xin Li

AAAI 2025paper
#4849

RP-PGD: Boosting Segmentation Robustness with a Region-and-Prototype Based Adversarial Attack

Yuxuan Zhang, Zhenbo Shi, Shuchang Wang et al.

AAAI 2025paper
#4850

Partial Point Cloud Registration with Multi-view 2D Image Learning

Yue Zhang, Yue Wu, Wenping Ma et al.

AAAI 2025paper
#4851

InstantSticker: Realistic Decal Blending via Disentangled Object Reconstruction

Yi Zhang, Xiaoyang Huang, Yishun Dou et al.

AAAI 2025paperarXiv:2504.06620
#4852

PhyCamo: A Robust Physical Camouflage via Contrastive Learning for Multi-View Physical Adversarial Attack

Ximin Zhang, Jinyin Chen, Haibin Zheng et al.

AAAI 2025paper
#4853

Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation

Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan et al.

AAAI 2025paperarXiv:2412.09817
#4854

Iterative Self-Training with Class-Aware Text-to-Image Synthesis for Visual Task Learning

Xiang Zhang, Wanqing Zhao, Pengyang Li et al.

AAAI 2025paper
#4855

Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views

Songchun Zhang, Chunhui Zhao

AAAI 2025paperarXiv:2412.08412
#4856

DiMSOD: A Diffusion-Based Framework for Multi-Modal Salient Object Detection

Shuo Zhang, Jiaming Huang, Wenbing Tang et al.

AAAI 2025paper
#4857

Towards Multi-Mode Outlier Robust Tensor Ring Decomposition

Yuning Qiu, Guoxu Zhou, Andong Wang et al.

AAAI 2024paper
#4858

Visual Perturbation for Text-Based Person Search

Pengcheng Zhang, Xiaohan Yu, Xiao Bai et al.

AAAI 2025paper
#4859

SIGraph: Saliency Image-Graph Network for Retinal Disease Classification in Fundus Image

Peng Zhang, Yuan Li, Haotian Song et al.

AAAI 2025paper
#4860

PanoDiT: Panoramic Videos Generation with Diffusion Transformer

Muyang Zhang, Yuzhi Chen, Rongtao Xu et al.

AAAI 2025paper
#4861

A Brain-Inspired Way of Reducing the Network Complexity via Concept-Regularized Coding for Emotion Recognition

Han Lu, Xiahai Zhuang, Qiang Luo

AAAI 2024paper
#4862

Decoupling Scattering: Pseudo-Label Guided NeRF for Scenes with Scattering Media

Mingyang Zhang, Junkang Zhang, Faming Fang et al.

AAAI 2025paper
#4863

MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection

Mingjin Zhang, Yuanjun Ouyang, Fei Gao et al.

AAAI 2025paper
#4864

IRMamba: Pixel Difference Mamba with Layer Restoration for Infrared Small Target Detection

Mingjin Zhang, Xiaolong Li, Fei Gao et al.

AAAI 2025paper
#4865

Critical Forgetting-Based Multi-Scale Disentanglement for Deepfake Detection

Kai Li, Wenqi Ren, Jianshu Li et al.

AAAI 2025paper
#4866

Cumulative Difference Learning VAE for Time-Series with Temporally Correlated Inflow-Outflow

Tianchun Li, Chengxiang Wu, Pengyi Shi et al.

AAAI 2024paper
#4867

Common Sense Bias Modeling for Classification Tasks

Miao Zhang, Zee Fryer, Ben Colman et al.

AAAI 2025paperarXiv:2401.13213
#4868

R^2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy

Li Zhang, Haonan Jiang, Yukang Huo et al.

AAAI 2025paper
#4869

Frame Semantic Role Labeling Using Arbitrary-Order Conditional Random Fields

AAAI 2024paper
#4870

When Open-Vocabulary Visual Question Answering Meets Causal Adapter: Benchmark and Approach

Feifei Zhang, Zhaoyi Zhang, Xi Zhang et al.

AAAI 2025paper
#4871

Spatio-Temporal Pivotal Graph Neural Networks for Traffic Flow Forecasting

Xiangyang Miao, Guobao Xiao, Shiping Wang et al.

AAAI 2024paper
#4872

Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-based Token Pruning

Evelyn Zhang, Jiayi Tang, Xuefei Ning et al.

AAAI 2025paper
#4873

An Efficient Subgraph-Inferring Framework for Large-Scale Heterogeneous Graphs

Wei Zhou, Hong Huang, Ruize Shi et al.

AAAI 2024paper
#4874

DetRF: Detachable Novel Views Synthesis of Dynamic Scenes Using Backdrop-Driven Neural Radiance Fields

Boyu Zhang, Zheng Zhu, Wenbo Xu

AAAI 2025paper
#4875

TGFormer: Transformer with Track Query Group for Multi-Object Tracking

Rui Zeng, Yuanzhou Huang, Songwei Pei

AAAI 2025paper
#4876

Efficient Neural Network Encoding for 3D Color Lookup Tables

Vahid Zehtab, David B. Lindell, Marcus A. Brubaker et al.

AAAI 2025paperarXiv:2412.15438
#4877

Rectangle Search: An Anytime Beam Search

Sofia Lemons, Wheeler Ruml, Rob Holte et al.

AAAI 2024paperarXiv:2312.12554
#4878

Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark

Sen Pei, Shixiong Xu, Xiaojie Jin

AAAI 2024paperarXiv:2209.05166
#4879

Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language

Xiang Fang, Daizong Liu, Wanlong Fang et al.

AAAI 2024paper
#4880

Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective

Xinmiao Yu, Xiaocheng Feng, Yun Li et al.

AAAI 2025paperarXiv:2412.17787
#4881

SkipDiff: Adaptive Skip Diffusion Model for High-Fidelity Perceptual Image Super-resolution

Xiaotong Luo, Yuan Xie, Yanyun Qu et al.

AAAI 2024paper
#4882

OTPNet: ODE-inspired Tuning-free Proximal Network for Remote Sensing Image Fusion

Wei Yu, Zonglin Li, Qinglin Liu et al.

AAAI 2025paper
#4883

Fine-grained Adaptive Visual Prompt for Generative Medical Visual Question Answering

Ting Yu, Zixuan Tong, Jun Yu et al.

AAAI 2025paper
#4884

STGC-NeRF: Spatial-Temporal Geometric Consistency for LiDAR Neural Radiance Fields in Dynamic Scenes

Shangshu Yu, Xiaotian Sun, Wen Li et al.

AAAI 2025paper
#4885

ReMoGPT: Part-Level Retrieval-Augmented Motion-Language Models

Qing Yu, Mikihiro Tanaka, Kent Fujiwara

AAAI 2025paper
#4886

Separating the Wheat from the Chaff: Spatio-Temporal Transformer with View-interweaved Attention for Photon-Efficient Depth Sensing

Letian Yu, Jiaxi Yang, Bo Dong et al.

AAAI 2025paper
#4887

SGFormer: Semantic-Geometry Fusion Transformer for Multi-modal 3D Panoptic Segmentation

Hongqi Yu, Sixian Chan, Xiaolong Zhou et al.

AAAI 2025paper
#4888

FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications

Ellen Yi-Ge, Leo Shawn

AAAI 2025paper
#4889

ShareBERT: Embeddings Are Capable of Learning Hidden Layers

Jia Cheng Hu, Roberto Cavicchioli, Giulia Berardinelli et al.

AAAI 2024paper
#4890

Spatio-Temporal Fusion for Human Action Recognition via Joint Trajectory Graph

Yaolin Zheng, Hongbo Huang, Xiuying Wang et al.

AAAI 2024paper
#4891

Sparse Enhanced Network: An Adversarial Generation Method for Robust Augmentation in Sequential Recommendation

Junyang Chen, Guoxuan Zou, Pan Zhou et al.

AAAI 2024paper
#4892

PromptHaze: Prompting Real-world Dehazing via Depth Anything Model

Tian Ye, Sixiang Chen, Haoyu Chen et al.

AAAI 2025paper
#4893

VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement

Haocun Ye, Xinlong Jiang, Chenlong Gao et al.

AAAI 2025paper
#4894

Sharpness-Aware Model-Agnostic Long-Tailed Domain Generalization

Houcheng Su, Weihao Luo, Daixian Liu et al.

AAAI 2024paper
#4895

As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection

Shuilian Yao, Yu Liu, Qi Jia et al.

AAAI 2025paper
#4896

MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking

Mufeng Yao, Jinlong Peng, Qingdong He et al.

AAAI 2025paper
#4897

RealPortrait: Realistic Portrait Animation with Diffusion Transformers

Zejun Yang, Huawei Wei, Zhisheng Wang

AAAI 2025paper
#4898

ERF: A Benchmark Dataset for Robust Semantic Segmentation Under Extreme Rainfall Conditions

Xin Yang, Xin Zhang, Xinchao Wang

AAAI 2025paper
#4899

Semantic Segmentation on Raindrop Degraded Images Using Two-Stage Dual Teacher-Student Learning

Xin Yang, Wending Yan, Yuan Yuan et al.

AAAI 2025paper
#4900

DriveGazen: Event-Based Driving Status Recognition Using Conventional Camera

Xiaoyin Yang, Xin Yang

AAAI 2025paperarXiv:2412.11753
#4901

Dual Information Purification for Lightweight SAR Object Detection

Xi Yang, Jiachen Sun, Songsong Duan et al.

AAAI 2025paper
#4902

Asymmetric Hierarchical Difference-aware Interaction Network for Event-guided Motion Deblurring

Wen Yang, Jinjian Wu, Leida Li et al.

AAAI 2025paper
#4903

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection

Enquan Yang, Peng Xing, Hanyang Sun et al.

AAAI 2025paper
#4904

Robust Image Hashing Based on Contrastive Masked Autoencoder with Weak-Strong Augmentation Alignment

Cundian Yang, Guibo Luo, Yuesheng Zhu et al.

AAAI 2025paper
#4905

Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models

YangTian Yan, Jinyu Tian

AAAI 2025paperarXiv:2503.22205
#4906

Physical Marker: Revealing Invisible Hyperlinks Hidden in Printed Trademarks

Yuliang Xue, Lei Tan, Guobiao Li et al.

AAAI 2025paper
#4907

StegaStyleGAN: Towards Generic and Practical Generative Image Steganography

Wenkang Su, Jiangqun Ni, Yiyan Sun

AAAI 2024paper
#4908

RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting

Wen Xue, Chun Ding, Ruotao Xu et al.

AAAI 2025paper
#4909

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP

Zhongxing Xu, Feilong Tang, Zhe Chen et al.

AAAI 2025paperarXiv:2412.19650
#4910

Simple Weak Coresets for Non-decomposable Classification Measures

Jayesh Malaviya, Anirban Dasgupta, Rachit Chhaya

AAAI 2024paperarXiv:2312.09885
#4911

Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article Gronding

Wenjia Geng, Yong Liu, Lei Chen et al.

AAAI 2024paper
#4912

A Surprisingly Simple Continuous-Action POMDP Solver: Lazy Cross-Entropy Search over Policy Trees

Marcus Hoerger, Hanna Kurniawati, Dirk Kroese et al.

AAAI 2024paperarXiv:2305.08049
#4913

FATE: Feature-Adapted Parameter Tuning for Vision-Language Models

Zhengqin Xu, Zelin Peng, Xiaokang Yang et al.

AAAI 2025paper
#4914

Explainable Origin-Destination Crowd Flow Interpolation via Variational Multi-Modal Recurrent Graph Auto-Encoder

Qiang Zhou, Xinjiang Lu, Jingjing Gu et al.

AAAI 2024paper
#4915

HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection

Yongchao Xu, Jiawei Liu, Sen Tao et al.

AAAI 2025paper
#4916

SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection

Ruoyu Xu, Zhiyu Xiang, Chenwei Zhang et al.

AAAI 2025paperarXiv:2412.14571
#4917

Low Category Uncertainty and High Training Potential Instance Learning for Unsupervised Domain Adaptation

Xinyu Zhang, Meng Kang, Shuai Lü

AAAI 2024paper
#4918

Multiple Feature Refining Network for Visual Emotion Distribution Learning

Qinfu Xu, Shaozu Yuan, Yiwei Wei et al.

AAAI 2025paper
#4919

Efficient Learning of PDEs via Taylor Expansion and Sparse Decomposition into Value and Fourier Domains

Md Nasim, Yexiang Xue

AAAI 2024paperarXiv:2309.07344
#4920

Discrepancy and Uncertainty Aware Denoising Knowledge Distillation for Zero-Shot Cross-Lingual Named Entity Recognition

Ling Ge, Chunming Hu, Guanghui Ma et al.

AAAI 2024paper
#4921

Foundations of Reactive Synthesis for Declarative Process Specifications

Andrey Rivkin, Luca Geatti, Marco Montali

AAAI 2024paper
#4922

3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation

FeiFan Xu, Tianyi Chen, Fan Yang et al.

AAAI 2025paper
#4923

Less Is More: Token Context-Aware Learning for Object Tracking

Chenlong Xu, Bineng Zhong, Qihua Liang et al.

AAAI 2025paperarXiv:2501.00758
#4924

FR²Seg: Continual Segmentation Across Multiple Sites via Fourier Style Replay and Adaptive Consistency Regularization

Cheng Xu, Weiwen Zhang, Hongrui Zhang et al.

AAAI 2025paper
#4925

Resource Efficient Deep Learning Hardware Watermarks with Signature Alignment

Joseph Clements, Yingjie Lao

AAAI 2024paper
#4926

DiffScene: Diffusion-Based Safety-Critical Scenario Generation for Autonomous Vehicles

Chejian Xu, Aleksandr Petiushko, Ding Zhao et al.

AAAI 2025paper
#4927

Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation

Jingqiao Xiu, Mengze Li, Zongxin Yang et al.

AAAI 2025paper
#4928

IOFM: Using the Interpolation Technique on the Over-Fitted Models to Identify Clean-Annotated Samples

Dongha Kim, Yongchan Choi, Kunwoong Kim et al.

AAAI 2024paper
#4929

Improving Distinguishability of Class for Graph Neural Networks

Dongxiao He, Shuwei Liu, Zhizhi Yu et al.

AAAI 2024paper
#4930

Discrete Prior-Based Temporal-Coherent Content Prediction for Blind Face Video Restoration

Lianxin Xie, Bingbing Zheng, Wen Xue et al.

AAAI 2025paperarXiv:2501.09960
#4931

Omni-Query Active Learning for Source-Free Domain Adaptive Cross-Modality 3D Semantic Segmentation

Jianxiang Xie, Yao Wu, Yachao Zhang et al.

AAAI 2025paper
#4932

Boosting Vision State Space Model with Fractal Scanning

Haoke Xiao, Lv Tang, Peng-tao Jiang et al.

AAAI 2025paper
#4933

SMR-Net: Semantic-Guided Mutually Reinforcing Network for Cross-Modal Image Fusion and Salient Object Detection

Guobao Xiao, Xinyu Liu, Zebin Lin et al.

AAAI 2025paper
#4934

X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks

Zhipeng Qian, Yiwei Ma, Jiayi Ji et al.

AAAI 2024paper
#4935

Iterative Regularization with K-support Norm: An Important Complement to Sparse Recovery

William de Vazelhes, Bhaskar Mukhoty, Xiaotong Yuan et al.

AAAI 2024paperarXiv:2401.05394
#4936

Learning GAI-Decomposable Utility Models for Multiattribute Decision Making

Margot Herin, Patrice Perny, Nataliya Sokolovska

AAAI 2024paper
#4937

CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing

Xiaole Xian, Xilin He, Zenghao Niu et al.

AAAI 2025paperarXiv:2412.13565
#4938

PlaNet: Learning to Mitigate Atmospheric Turbulence in Planetary Images

Yifei Xia, Chu Zhou, Chengxuan Zhu et al.

AAAI 2025paper
#4939

‘Why Didn’t You Allocate This Task to Them?’ Negotiation-Aware Task Allocation and Contrastive Explanation Generation

Zahra Zahedi, Sailik Sengupta, Subbarao Kambhampati

AAAI 2024paper
#4940

RETRACTED: GEONet: Global Enhancement and Optimization Network for Lane Detection

Suyang Xi, Yunhao Liu, Hong Ding et al.

AAAI 2025paper
#4941

Unified Knowledge Maintenance Pruning and Progressive Recovery with Weight Recalling for Large Vision-Language Models

Zimeng Wu, Jiaxin Chen, Yunhong Wang

AAAI 2025paper
#4942

MUCD: Unsupervised Point Cloud Change Detection via Masked Consistency

Yue Wu, Zhipeng Wang, Yongzhe Yuan et al.

AAAI 2025paper
#4943

Deconfound Semantic Shift and Incompleteness in Incremental Few-shot Semantic Segmentation

Yirui Wu, Yuhang Xia, Hao Li et al.

AAAI 2025paper
#4944

VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval

Peng Wu, Wanshun Su, Xiangteng He et al.

AAAI 2025paper
#4945

SVRMamba: Slice-to-Volume Reconstruction from Multiple MRI Stacks with Slice Sequence Guided Mamba

Jiangjie Wu, Hongjiang Wei, Yuyao Zhang

AAAI 2025paper
#4946

Spin: Diffusion-based Semantic Image Painting Through Independent Information Injection

Dantong Wu, Zhiqiang Chen, Tianjiao Du et al.

AAAI 2025paper
#4947

Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration

Yuanbo Wen, Tao Gao, Jing Zhang et al.

AAAI 2025paper
#4948

Mitigating Idiom Inconsistency: A Multi-Semantic Contrastive Learning Method for Chinese Idiom Reading Comprehension

Mingmin Wu, Yuxue Hu, Yongcheng Zhang et al.

AAAI 2024paper
#4949

Power of Diversity: Enhancing Data-Free Black-Box Attack with Domain-Augmented Learning

Yang Wei, Jingyu Tan, Guowen Xu et al.

AAAI 2025paper
#4950

GlyphSR: A Simple Glyph-Aware Framework for Scene Text Image Super-Resolution

Baole Wei, Yuxuan Zhou, Liangcai Gao et al.

AAAI 2025paper
#4951

MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds

Zihao Wang, Yiming Huang, Gengyu Lyu et al.

AAAI 2025paper
#4952

GOALNET: Interleaving Neural Goal Predicate Inference with Classical Planning for Generalization in Robot Instruction Following

Jigyasa Gupta, Shreya Sharma, Shreshth Tuli et al.

AAAI 2024paper
#4953

Attention-Imperceptible Backdoor Attacks on Vision Transformers

Zhishen Wang, Rui Wang, Lihua Jing

AAAI 2025paper
#4954

Thermal-Aware Low-Light Image Enhancement: A Real-World Benchmark and a New Light-Weight Model

Zhen Wang, Yaozu Wu, Dongyuan Li et al.

AAAI 2025paper
#4955

Style Nursing with Spatial and Semantic Guidance for Zero-Shot Traffic Scene Style Transfer

Zhen Wang, Zihang Lin, Meng Yuan et al.

AAAI 2025paper
#4956

Two-Stage Evolutionary Reinforcement Learning for Enhancing Exploration and Exploitation

AAAI 2024paper
#4957

DualNet: Robust Self-Supervised Stereo Matching with Pseudo-Label Supervision

Yun Wang, Jiahao Zheng, Chenghao Zhang et al.

AAAI 2025paper
#4958

Target Scanpath-Guided 360-Degree Image Enhancement

Yujia Wang, Fang-Lue Zhang, Neil A. Dodgson

AAAI 2025paper
#4959

Manifold Constraints for Imperceptible Adversarial Attacks on Point Clouds

AAAI 2024paper
#4960

SpFormer: Spatio-Temporal Modeling for Scanpaths with Transformer

Zhijie Nie, Richong Zhang, Zhongyuan Wang et al.

AAAI 2024paper
#4961

Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units

Youjia Wang, Yiwen Wu, Hengan Zhou et al.

AAAI 2025paperarXiv:2402.03944
#4962

RefDetector: A Simple Yet Effective Matching-based Method for Referring Expression Comprehension

Yabing Wang, Zhuotao Tian, Zheng Qin et al.

AAAI 2025paper
#4963

From Coarse to Fine: A Matching and Alignment Framework for Unsupervised Cross-View Geo-Localization

Xueyi Wang, Lele Zhang, Zheng Fan et al.

AAAI 2025paper
#4964

Enhancing Neural Radiance Fields with Adaptive Multi-Exposure Fusion: A Bilevel Optimization Approach for Novel View Synthesis

Yang Zou, Xingyuan Li, Zhiying Jiang et al.

AAAI 2024paper
#4965

MIMTrack: In-Context Tracking via Masked Image Modeling

Xingmei Wang, Guohao Nie, Jiaxiang Meng et al.

AAAI 2025paper
#4966

Lifting Scheme-Based Implicit Disentanglement of Emotion-Related Facial Dynamics in the Wild

Xingjian Wang, Li Chai

AAAI 2025paperarXiv:2412.13168
#4967

DCTMamba: Advancing JPEG Image Restoration Through Long-Sequence Modeling and Adaptive Frequency Strategy

Xi Wang, Xueyang Fu, Liang Li et al.

AAAI 2025paper
#4968

FreeGen: Bridging Visual-Linguistic Discrepancies Towards Diffusion-based Pixel-level Data Synthesis

Wenzhuang Wang, Mingcan Ma, Yong Chen et al.

AAAI 2025paper
#4969

Imagine: Image-Guided 3D Part Assembly with Structure Knowledge Graph

Weihao Wang, Yu Lan, Mingyu You et al.

AAAI 2025paper
#4970

The Parables of the Mustard Seed and the Yeast: Extremely Low-Budget, High-Performance Nighttime Semantic Segmentation

Shiqin Wang, Xin Xu, Haoyang Chen et al.

AAAI 2025paper
#4971

Deep Multi-modal Graph Clustering via Graph Transformer Network

Qianqian Wang, Haiming Xu, Zihao Zhang et al.

AAAI 2025paper
#4972

Tracking Everything Everywhere across Multiple Cameras

Li-Heng Wang, YuJu Cheng, Tyng-Luh Liu

AAAI 2025paper
#4973

EMControl: Adding Conditional Control to Text-to-Image Diffusion Models via Expectation-Maximization

He Wang, Longquan Dai, Jinhui Tang

AAAI 2025paper
#4974

msLPCC: A Multimodal-Driven Scalable Framework for Deep LiDAR Point Cloud Compression

Miaohui Wang, Runnan Huang, Hengjin Dong et al.

AAAI 2024paper
#4975

S³-Mamba: Small-Size-Sensitive Mamba for Lesion Segmentation

Gui Wang, Yuexiang Li, Wenting Chen et al.

AAAI 2025paper
#4976

Scene Graph-Grounded Image Generation

Fuyun Wang, Tong Zhang, Yuanzhi Wang et al.

AAAI 2025paper
#4977

A Black-Box Evaluation Framework for Semantic Robustness in Bird’s Eye View Detection

Fu Wang, Yanghao Zhang, Xiangyu Yin et al.

AAAI 2025paperarXiv:2412.13913
#4978

RA-GAR: A Richly Annotated Benchmark for Gait Attribute Recognition

Chenye Wang, Saihui Hou, Aoqi Li et al.

AAAI 2025paper
#4979

Chain-of-Thought Improves Text Generation with Citations in Large Language Models

Bin Ji, Huijun Liu, Mingzhe Du et al.

AAAI 2024paper
#4980

The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models

Jongyeong Lee, Chao-Kai Chiang, Masashi Sugiyama

AAAI 2024paperarXiv:2302.14407
#4981

Hypergraph Neural Architecture Search

Wei Lin, Xu Peng, Zhengtao Yu et al.

AAAI 2024paper
#4982

Box2Poly: Memory

Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text - Xuyang Chen, Dong Wang, Konrad Schindler et al.

AAAI 2024paper
#4983

Machine Learning

Powered Combinatorial Clock Auction - Ermis Nikiforos Soumalias, Jakob Weissteiner, Jakob Heiss et al.

AAAI 2024paperarXiv:2512.11133
#4984

Boosting Few

Shot Learning via Attentive Feature Regularization - Xingyu Zhu, Shuo Wang, Jinda Lu et al.

AAAI 2024paper
#4985

VOILA: Complexity-Aware Universal Segmentation of CT Images by Voxel Interacting with Language

Zishuo Wan, Yu Gao, Wanyuan Pang et al.

AAAI 2025paperarXiv:2501.03482
#4986

Memory-Augmented Re-Completion for 3D Semantic Scene Completion

Yu-Wen Tseng, Sheng-Ping Yang, Jhih-Ciang Wu et al.

AAAI 2025paper
#4987

LSTKC: Long Short

Term Knowledge Consolidation for Lifelong Person Re-identification - Kunlun Xu, Xu Zou, Jiahuan Zhou

AAAI 2024paper
#4988

Interpretable3D: An Ad

Hoc Interpretable Classifier for 3D Point Clouds - Tuo Feng, Ruijie Quan, Xiaohan Wang et al.

AAAI 2024paper
#4989

Stitch, Contrast, and Segment: Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos

Haitao Tian, Pierre Payeur

AAAI 2025paper
#4990

TraceEvader: Making DeepFakes More Untraceable via Evading the Forgery Model Attribution

Mengjie Wu, Jingui Ma, Run Wang et al.

AAAI 2024paper
#4991

3D²-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling

Zichen Tang, Hongyu Yang, Hanchen Zhang et al.

AAAI 2025paper
#4992

From Representation Space to Prognostic Insights: Whole Slide Image Generation with Hierarchical Diffusion Model for Survival Prediction

Zhihao Tang, Xi Zhang, Chaozhuo Li

AAAI 2025paper
#4993

Learning Only When It Matters: Cost

Aware Long-Tailed Classification - Yu-Cheng He, Yao-Xiang Ding, Han-Jia Ye et al.

AAAI 2024paper
#4994

RAGG: Retrieval-Augmented Grasp Generation Model

Zhenhua Tang, Bin Zhu, Yanbin Hao et al.

AAAI 2025paper
#4995

MICA: Towards Explainable Skin Lesion Diagnosis via Multi

Level Image-Concept Alignment - Yequan Bie, Luyang Luo, Hao Chen

AAAI 2024paper
#4996

Talk Funny! A Large

Scale Humor Response Dataset with Chain-of-Humor Interpretation - Yuyan Chen, Yichen Yuan, Panjun Liu et al.

AAAI 2024paper
#4997

NaMa: Neighbor

Aware Multi-Modal Adaptive Learning for Prostate Tumor Segmentation on Anisotropic MR Images - Runqi Meng, Xiao Zhang, Shijie Huang et al.

AAAI 2024paper
#4998

M2Flow: A Motion Information Fusion Framework for Enhanced Unsupervised Optical Flow Estimation in Autonomous Driving

Xunpei Sun, Gang Chen, Zuoxun Hou

AAAI 2025paper
#4999

Transferable Adversarial Attacks for Object Detection Using Object

Aware Significant Feature Distortion - Xinlong Ding, Jiansheng Chen, Hongwei Yu et al.

AAAI 2024paper
#5000

Taxonomy Driven Fast Adversarial Training

Kun Tong, Chengze Jiang, Jie Gui et al.

AAAI 2024paper