Most Cited AAAI "visual tokens" Papers

5,317 papers found • Page 25 of 27

#4801

Expanding the Scope of Negatives: Boosting Image-Text Matching with Negatives Distribution Guided Learning

Zhao Zhou, Weizhong Zhang, Xiangcheng Du et al.

AAAI 2025paper

#4802

Unifying Decision and Function Queries in Stochastic Boolean Satisfiability

Yu-Wei Fan, Jie-Hong Jiang

AAAI 2024paper

#4803

Achieving Ensemble-Like Performance in a Single Model: A Feature Diversification Framework for Image-Text Matching

Zhao Zhou, Yiqun Wang, Weizhong Zhang et al.

AAAI 2025paper

#4804

Improving Generalization of Deep Neural Networks by Optimum Shifting

Yuyan Zhou, Ye Li, Lei Feng et al.

AAAI 2025paperarXiv:2405.14111

#4805

AI-Powered Algorithm-Centric Quantum Processor Topology Design

Tian Li, Xiao-Yue Xu, Chen Ding et al.

AAAI 2025paperarXiv:2412.13805

#4806

Enhancing Training of Spiking Neural Network with Stochastic Latency

Srinivas Anumasa, Bhaskar Mukhoty, Velibor Bojkovic et al.

AAAI 2024paper

#4807

Test-Time Adaptation on Noisy Data via Model-Pruning-Based Filtering and Flatness-Aware Entropy Minimization

Xingzhi Zhou, Zhiliang Tian, Boyang Zhang et al.

AAAI 2025paper

#4808

Spatiotemporal-Aware Neural Fields for Dynamic CT Reconstruction

Qingyang Zhou, Yunfan Ye, Zhiping Cai

AAAI 2025paper

#4809

GLIC: General Format Learned Image Compression

MingSheng Zhou, MingMing Kong

AAAI 2025paper

#4810

SeqRank: Sequential Ranking of Salient Objects

AAAI 2024paper

#4811

SceneX: Procedural Controllable Large-Scale Scene Generation

Mengqi Zhou, Yuxi Wang, Jun Hou et al.

AAAI 2025paperarXiv:2403.15698

#4812

Joint Class-level and Instance-level Relationship Modeling for Novel Class Discovery

Jiaying Zhou, Qingchao Chen

AAAI 2025paper

#4813

Bilateral Gradual Semantics for Weighted Argumentation

AAAI 2024paper

#4814

Core-to-Global Reasoning for Compositional Visual Question Answering

Hao Zhou, Tingjin Luo, Zhangqi Jiang

AAAI 2025paper

#4815

What Makes Quantization for Large Language Model Hard? An Empirical Study from the Lens of Perturbation

Huankang Guan, Rynson W.H. Lau

AAAI 2024paper

#4816

Uncertainty-Aware Yield Prediction with Multimodal Molecular Features

Jiayuan Chen, Kehan Guo, Zhen Liu et al.

AAAI 2024paper

#4817

Preference Aware Dual Contrastive Learning for Item Cold-Start Recommendation

Wenbo Wang, Bingquan Liu, Lili Shan et al.

AAAI 2024paper

#4818

Learning Performance Maximizing Ensembles with Explainability Guarantees

Vincent Pisztora, Jia Li

AAAI 2024paperarXiv:2312.12715

#4819

Communication Efficient Distributed Newton Method over Unreliable Networks

Ming Wen, Chengchang Liu, Yuedong Xu

AAAI 2024paper

#4820

DECIDER: Difference-aware Contrastive Diffusion Model with Adversarial Perturbations for Image Change Captioning

Guojin Zhong, Jinhong Hu, Jiajun Chen et al.

AAAI 2025paper

#4821

MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection

Chuyi Zhong, Dingkang Yang, Peng Zhai et al.

AAAI 2025paper

#4822

Continual Vision-Language Retrieval via Dynamic Knowledge Rectification

Zhenyu Cui, Yuxin Peng, Xun Wang et al.

AAAI 2024paper

#4823

When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data

Rongjia Zheng, Qing Zhang, Yongwei Nie et al.

AAAI 2025paper

#4824

MuST: Robust Image Watermarking for Multi-Source Tracing

Guanjie Wang, Zehua Ma, Chang Liu et al.

AAAI 2024paper

#4825

Supportive Negatives Spectral Augmentation for Source-Free Cross-Domain Segmentation

Kexin Zheng, Haifeng Xia, Siyu Xia et al.

AAAI 2025paper

#4826

PHR-DIFF: Portrait Highlights Removal via Patch-aware Diffusion Model

Hongsheng Zheng, Zhongyun Bao, Gang Fu et al.

AAAI 2025paper

#4827

Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention

Naman Shah, Siddharth Srivastava

AAAI 2024paper

#4828

Optimizing the Optimization of Planning Domains by Automatic Action Schema Splitting

Mojtaba Elahi, Jussi Rintanen

AAAI 2024paper

#4829

Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Domain Learning

Chuangchuang Tan, Yao Zhao, Shikui Wei et al.

AAAI 2024paper

#4830

HFF-Tracker: A Hierarchical Fine-grained Fusion Tracker for Referring Multi-Object Tracking

Zeyong Zhao, Yanchao Hao, Minghao Zhang et al.

AAAI 2025paper

#4831

PMRC: Prompt-Based Machine Reading Comprehension for Few-Shot Named Entity Recognition

Jin Huang, Danfeng Yan, Yuanqiang Cai

AAAI 2024paper

#4832

NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark

Yuxuan Zhao, Weijian Ruan, He Li et al.

AAAI 2025paper

#4833

ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance

Yucheng Zhao, Gengyu Lyu, Ke Li et al.

AAAI 2025paper

#4834

Audio-Visual Adaptive Fusion Network for Question Answering Based on Contrastive Learning

Xujian Zhao, Yixin Wang, Peiquan Jin

AAAI 2025paper

#4835

Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation

Jing Li, Junsong Fan, Yuran Yang et al.

AAAI 2024paper

#4836

From Toxic to Trustworthy: Using Self-Distillation and Semi-supervised Methods to Refine Neural Networks

Xianda Zhang, Baolin Zheng, Jianbao Hu et al.

AAAI 2024paper

#4837

MINES: Message Intercommunication for Inductive Relation Reasoning over Neighbor-Enhanced Subgraphs

Ke Liang, Lingyuan Meng, Sihang Zhou et al.

AAAI 2024paper

#4838

Evolving Parameterized Prompt Memory for Continual Learning

Muhammad Rifki Kurniawan, Xiang Song, Zhiheng Ma et al.

AAAI 2024paper

#4839

ACAMDA: Improving Data Efficiency in Reinforcement Learning through Guided Counterfactual Data Augmentation

Yuewen Sun, Erli Wang, Biwei Huang et al.

AAAI 2024paper

#4840

Towards Safe Policy Learning under Partial Identifiability: A Causal Approach

Shalmali Joshi, Junzhe Zhang, Elias Bareinboim

AAAI 2024paper

#4841

HAGO-Net: Hierarchical Geometric Massage Passing for Molecular Representation Learning

Hongbin Pei, Taile Chen, Chen A et al.

AAAI 2024paper

#4842

Training-free Open-Vocabulary Semantic Segmentation via Diverse Prototype Construction and Sub-region Matching

Xuanpu Zhao, Dianmo Sheng, Zhentao Tan et al.

AAAI 2025paper

#4843

Excluding the Impossible for Open Vocabulary Semantic Segmentation

Shiyuan Zhao, Baodi Liu, Yu Bai et al.

AAAI 2025paper

#4844

Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation

Hongxu Zhao, Zelin Gao, Yue Wang et al.

AAAI 2025paper

#4845

Multi-scale Activation, Refinement, and Aggregation: Exploring Diverse Cues for Fine-Grained Bird Recognition

Zhicheng Zhang, Hao Tang, Jinhui Tang

AAAI 2025paperarXiv:2504.09215

#4846

Computing the Why-Provenance for Datalog Queries via SAT Solvers

Haitong Luo, Xuying Meng, Suhang Wang et al.

AAAI 2024paper

#4847

FairTrade: Achieving Pareto-Optimal Trade-Offs between Balanced Accuracy and Fairness in Federated Learning

Maryam Badar, Sandipan Sikdar, Wolfgang Nejdl et al.

AAAI 2024paper

#4848

Training-Free Image Manipulation Localization Using Diffusion Models

Zhenfei Zhang, Ming-Ching Chang, Xin Li

AAAI 2025paper

#4849

RP-PGD: Boosting Segmentation Robustness with a Region-and-Prototype Based Adversarial Attack

Yuxuan Zhang, Zhenbo Shi, Shuchang Wang et al.

AAAI 2025paper

#4850

Partial Point Cloud Registration with Multi-view 2D Image Learning

Yue Zhang, Yue Wu, Wenping Ma et al.

AAAI 2025paper

#4851

InstantSticker: Realistic Decal Blending via Disentangled Object Reconstruction

Yi Zhang, Xiaoyang Huang, Yishun Dou et al.

AAAI 2025paperarXiv:2504.06620

#4852

PhyCamo: A Robust Physical Camouflage via Contrastive Learning for Multi-View Physical Adversarial Attack

Ximin Zhang, Jinyin Chen, Haibin Zheng et al.

AAAI 2025paper

#4853

Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation

Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan et al.

AAAI 2025paperarXiv:2412.09817

#4854

Iterative Self-Training with Class-Aware Text-to-Image Synthesis for Visual Task Learning

Xiang Zhang, Wanqing Zhao, Pengyang Li et al.

AAAI 2025paper

#4855

Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views

Songchun Zhang, Chunhui Zhao

AAAI 2025paperarXiv:2412.08412

#4856

DiMSOD: A Diffusion-Based Framework for Multi-Modal Salient Object Detection

Shuo Zhang, Jiaming Huang, Wenbing Tang et al.

AAAI 2025paper

#4857

Towards Multi-Mode Outlier Robust Tensor Ring Decomposition

Yuning Qiu, Guoxu Zhou, Andong Wang et al.

AAAI 2024paper

#4858

Visual Perturbation for Text-Based Person Search

Pengcheng Zhang, Xiaohan Yu, Xiao Bai et al.

AAAI 2025paper

#4859

SIGraph: Saliency Image-Graph Network for Retinal Disease Classification in Fundus Image

Peng Zhang, Yuan Li, Haotian Song et al.

AAAI 2025paper

#4860

PanoDiT: Panoramic Videos Generation with Diffusion Transformer

Muyang Zhang, Yuzhi Chen, Rongtao Xu et al.

AAAI 2025paper

#4861

A Brain-Inspired Way of Reducing the Network Complexity via Concept-Regularized Coding for Emotion Recognition

Han Lu, Xiahai Zhuang, Qiang Luo

AAAI 2024paper

#4862

Decoupling Scattering: Pseudo-Label Guided NeRF for Scenes with Scattering Media

Mingyang Zhang, Junkang Zhang, Faming Fang et al.

AAAI 2025paper

#4863

MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection

Mingjin Zhang, Yuanjun Ouyang, Fei Gao et al.

AAAI 2025paper

#4864

IRMamba: Pixel Difference Mamba with Layer Restoration for Infrared Small Target Detection

Mingjin Zhang, Xiaolong Li, Fei Gao et al.

AAAI 2025paper

#4865

Critical Forgetting-Based Multi-Scale Disentanglement for Deepfake Detection

Kai Li, Wenqi Ren, Jianshu Li et al.

AAAI 2025paper

#4866

Cumulative Difference Learning VAE for Time-Series with Temporally Correlated Inflow-Outflow

Tianchun Li, Chengxiang Wu, Pengyi Shi et al.

AAAI 2024paper

#4867

Common Sense Bias Modeling for Classification Tasks

Miao Zhang, Zee Fryer, Ben Colman et al.

AAAI 2025paperarXiv:2401.13213

#4868

R^2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy

Li Zhang, Haonan Jiang, Yukang Huo et al.

AAAI 2025paper

#4869

Frame Semantic Role Labeling Using Arbitrary-Order Conditional Random Fields

AAAI 2024paper

#4870

When Open-Vocabulary Visual Question Answering Meets Causal Adapter: Benchmark and Approach

Feifei Zhang, Zhaoyi Zhang, Xi Zhang et al.

AAAI 2025paper

#4871

Spatio-Temporal Pivotal Graph Neural Networks for Traffic Flow Forecasting

Xiangyang Miao, Guobao Xiao, Shiping Wang et al.

AAAI 2024paper

#4872

Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-based Token Pruning

Evelyn Zhang, Jiayi Tang, Xuefei Ning et al.

AAAI 2025paper

#4873

An Efficient Subgraph-Inferring Framework for Large-Scale Heterogeneous Graphs

Wei Zhou, Hong Huang, Ruize Shi et al.

AAAI 2024paper

#4874

DetRF: Detachable Novel Views Synthesis of Dynamic Scenes Using Backdrop-Driven Neural Radiance Fields

Boyu Zhang, Zheng Zhu, Wenbo Xu

AAAI 2025paper

#4875

TGFormer: Transformer with Track Query Group for Multi-Object Tracking

Rui Zeng, Yuanzhou Huang, Songwei Pei

AAAI 2025paper

#4876

Efficient Neural Network Encoding for 3D Color Lookup Tables

Vahid Zehtab, David B. Lindell, Marcus A. Brubaker et al.

AAAI 2025paperarXiv:2412.15438

#4877

Rectangle Search: An Anytime Beam Search

Sofia Lemons, Wheeler Ruml, Rob Holte et al.

AAAI 2024paperarXiv:2312.12554

#4878

Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark

Sen Pei, Shixiong Xu, Xiaojie Jin

AAAI 2024paperarXiv:2209.05166

#4879

Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language

Xiang Fang, Daizong Liu, Wanlong Fang et al.

AAAI 2024paper

#4880

Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective

Xinmiao Yu, Xiaocheng Feng, Yun Li et al.

AAAI 2025paperarXiv:2412.17787

#4881

SkipDiff: Adaptive Skip Diffusion Model for High-Fidelity Perceptual Image Super-resolution

Xiaotong Luo, Yuan Xie, Yanyun Qu et al.

AAAI 2024paper

#4882

OTPNet: ODE-inspired Tuning-free Proximal Network for Remote Sensing Image Fusion

Wei Yu, Zonglin Li, Qinglin Liu et al.

AAAI 2025paper

#4883

Fine-grained Adaptive Visual Prompt for Generative Medical Visual Question Answering

Ting Yu, Zixuan Tong, Jun Yu et al.

AAAI 2025paper

#4884

STGC-NeRF: Spatial-Temporal Geometric Consistency for LiDAR Neural Radiance Fields in Dynamic Scenes

Shangshu Yu, Xiaotian Sun, Wen Li et al.

AAAI 2025paper

#4885

ReMoGPT: Part-Level Retrieval-Augmented Motion-Language Models

Qing Yu, Mikihiro Tanaka, Kent Fujiwara

AAAI 2025paper

#4886

Separating the Wheat from the Chaff: Spatio-Temporal Transformer with View-interweaved Attention for Photon-Efficient Depth Sensing

Letian Yu, Jiaxi Yang, Bo Dong et al.

AAAI 2025paper

#4887

SGFormer: Semantic-Geometry Fusion Transformer for Multi-modal 3D Panoptic Segmentation

Hongqi Yu, Sixian Chan, Xiaolong Zhou et al.

AAAI 2025paper

#4888

FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications

Ellen Yi-Ge, Leo Shawn

AAAI 2025paper

#4889

ShareBERT: Embeddings Are Capable of Learning Hidden Layers

Jia Cheng Hu, Roberto Cavicchioli, Giulia Berardinelli et al.

AAAI 2024paper

#4890

Spatio-Temporal Fusion for Human Action Recognition via Joint Trajectory Graph

Yaolin Zheng, Hongbo Huang, Xiuying Wang et al.

AAAI 2024paper

#4891

Sparse Enhanced Network: An Adversarial Generation Method for Robust Augmentation in Sequential Recommendation

Junyang Chen, Guoxuan Zou, Pan Zhou et al.

AAAI 2024paper

#4892

PromptHaze: Prompting Real-world Dehazing via Depth Anything Model

Tian Ye, Sixiang Chen, Haoyu Chen et al.

AAAI 2025paper

#4893

VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement

Haocun Ye, Xinlong Jiang, Chenlong Gao et al.

AAAI 2025paper

#4894

Sharpness-Aware Model-Agnostic Long-Tailed Domain Generalization

Houcheng Su, Weihao Luo, Daixian Liu et al.

AAAI 2024paper

#4895

As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection

Shuilian Yao, Yu Liu, Qi Jia et al.

AAAI 2025paper

#4896

MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking

Mufeng Yao, Jinlong Peng, Qingdong He et al.

AAAI 2025paper

#4897

RealPortrait: Realistic Portrait Animation with Diffusion Transformers

Zejun Yang, Huawei Wei, Zhisheng Wang

AAAI 2025paper

#4898

ERF: A Benchmark Dataset for Robust Semantic Segmentation Under Extreme Rainfall Conditions

Xin Yang, Xin Zhang, Xinchao Wang

AAAI 2025paper

#4899

Semantic Segmentation on Raindrop Degraded Images Using Two-Stage Dual Teacher-Student Learning

Xin Yang, Wending Yan, Yuan Yuan et al.

AAAI 2025paper

#4900

DriveGazen: Event-Based Driving Status Recognition Using Conventional Camera

Xiaoyin Yang, Xin Yang

AAAI 2025paperarXiv:2412.11753

#4901

Dual Information Purification for Lightweight SAR Object Detection

Xi Yang, Jiachen Sun, Songsong Duan et al.

AAAI 2025paper

#4902

Asymmetric Hierarchical Difference-aware Interaction Network for Event-guided Motion Deblurring

Wen Yang, Jinjian Wu, Leida Li et al.

AAAI 2025paper

#4903

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection

Enquan Yang, Peng Xing, Hanyang Sun et al.

AAAI 2025paper

#4904

Robust Image Hashing Based on Contrastive Masked Autoencoder with Weak-Strong Augmentation Alignment

Cundian Yang, Guibo Luo, Yuesheng Zhu et al.

AAAI 2025paper

#4905

Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models

YangTian Yan, Jinyu Tian

AAAI 2025paperarXiv:2503.22205

#4906

Physical Marker: Revealing Invisible Hyperlinks Hidden in Printed Trademarks

Yuliang Xue, Lei Tan, Guobiao Li et al.

AAAI 2025paper

#4907

StegaStyleGAN: Towards Generic and Practical Generative Image Steganography

Wenkang Su, Jiangqun Ni, Yiyan Sun

AAAI 2024paper

#4908

RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting

Wen Xue, Chun Ding, Ruotao Xu et al.

AAAI 2025paper

#4909

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP

Zhongxing Xu, Feilong Tang, Zhe Chen et al.

AAAI 2025paperarXiv:2412.19650

#4910

Simple Weak Coresets for Non-decomposable Classification Measures

Jayesh Malaviya, Anirban Dasgupta, Rachit Chhaya

AAAI 2024paperarXiv:2312.09885

#4911

Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article Gronding

Wenjia Geng, Yong Liu, Lei Chen et al.

AAAI 2024paper

#4912

A Surprisingly Simple Continuous-Action POMDP Solver: Lazy Cross-Entropy Search over Policy Trees

Marcus Hoerger, Hanna Kurniawati, Dirk Kroese et al.

AAAI 2024paperarXiv:2305.08049

#4913

FATE: Feature-Adapted Parameter Tuning for Vision-Language Models

Zhengqin Xu, Zelin Peng, Xiaokang Yang et al.

AAAI 2025paper

#4914

Explainable Origin-Destination Crowd Flow Interpolation via Variational Multi-Modal Recurrent Graph Auto-Encoder

Qiang Zhou, Xinjiang Lu, Jingjing Gu et al.

AAAI 2024paper

#4915

HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection

Yongchao Xu, Jiawei Liu, Sen Tao et al.

AAAI 2025paper

#4916

SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection

Ruoyu Xu, Zhiyu Xiang, Chenwei Zhang et al.

AAAI 2025paperarXiv:2412.14571

#4917

Low Category Uncertainty and High Training Potential Instance Learning for Unsupervised Domain Adaptation

Xinyu Zhang, Meng Kang, Shuai Lü

AAAI 2024paper

#4918

Multiple Feature Refining Network for Visual Emotion Distribution Learning

Qinfu Xu, Shaozu Yuan, Yiwei Wei et al.

AAAI 2025paper

#4919

Efficient Learning of PDEs via Taylor Expansion and Sparse Decomposition into Value and Fourier Domains

Md Nasim, Yexiang Xue

AAAI 2024paperarXiv:2309.07344

#4920

Discrepancy and Uncertainty Aware Denoising Knowledge Distillation for Zero-Shot Cross-Lingual Named Entity Recognition

Ling Ge, Chunming Hu, Guanghui Ma et al.

AAAI 2024paper

#4921

Foundations of Reactive Synthesis for Declarative Process Specifications

Andrey Rivkin, Luca Geatti, Marco Montali

AAAI 2024paper

#4922

3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation

FeiFan Xu, Tianyi Chen, Fan Yang et al.

AAAI 2025paper

#4923

Less Is More: Token Context-Aware Learning for Object Tracking

Chenlong Xu, Bineng Zhong, Qihua Liang et al.

AAAI 2025paperarXiv:2501.00758

#4924

FR²Seg: Continual Segmentation Across Multiple Sites via Fourier Style Replay and Adaptive Consistency Regularization

Cheng Xu, Weiwen Zhang, Hongrui Zhang et al.

AAAI 2025paper

#4925

Resource Efficient Deep Learning Hardware Watermarks with Signature Alignment

Joseph Clements, Yingjie Lao

AAAI 2024paper

#4926

DiffScene: Diffusion-Based Safety-Critical Scenario Generation for Autonomous Vehicles

Chejian Xu, Aleksandr Petiushko, Ding Zhao et al.

AAAI 2025paper

#4927

Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation

Jingqiao Xiu, Mengze Li, Zongxin Yang et al.

AAAI 2025paper

#4928

IOFM: Using the Interpolation Technique on the Over-Fitted Models to Identify Clean-Annotated Samples

Dongha Kim, Yongchan Choi, Kunwoong Kim et al.

AAAI 2024paper

#4929

Improving Distinguishability of Class for Graph Neural Networks

Dongxiao He, Shuwei Liu, Zhizhi Yu et al.

AAAI 2024paper

#4930

Discrete Prior-Based Temporal-Coherent Content Prediction for Blind Face Video Restoration

Lianxin Xie, Bingbing Zheng, Wen Xue et al.

AAAI 2025paperarXiv:2501.09960

#4931

Omni-Query Active Learning for Source-Free Domain Adaptive Cross-Modality 3D Semantic Segmentation

Jianxiang Xie, Yao Wu, Yachao Zhang et al.

AAAI 2025paper

#4932

Boosting Vision State Space Model with Fractal Scanning

Haoke Xiao, Lv Tang, Peng-tao Jiang et al.

AAAI 2025paper

#4933

SMR-Net: Semantic-Guided Mutually Reinforcing Network for Cross-Modal Image Fusion and Salient Object Detection

Guobao Xiao, Xinyu Liu, Zebin Lin et al.

AAAI 2025paper

#4934

X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks

Zhipeng Qian, Yiwei Ma, Jiayi Ji et al.

AAAI 2024paper

#4935

Iterative Regularization with K-support Norm: An Important Complement to Sparse Recovery

William de Vazelhes, Bhaskar Mukhoty, Xiaotong Yuan et al.

AAAI 2024paperarXiv:2401.05394

#4936

Learning GAI-Decomposable Utility Models for Multiattribute Decision Making

Margot Herin, Patrice Perny, Nataliya Sokolovska

AAAI 2024paper

#4937

CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing

Xiaole Xian, Xilin He, Zenghao Niu et al.

AAAI 2025paperarXiv:2412.13565

#4938

PlaNet: Learning to Mitigate Atmospheric Turbulence in Planetary Images

Yifei Xia, Chu Zhou, Chengxuan Zhu et al.

AAAI 2025paper

#4939

‘Why Didn’t You Allocate This Task to Them?’ Negotiation-Aware Task Allocation and Contrastive Explanation Generation

Zahra Zahedi, Sailik Sengupta, Subbarao Kambhampati

AAAI 2024paper

#4940

RETRACTED: GEONet: Global Enhancement and Optimization Network for Lane Detection

Suyang Xi, Yunhao Liu, Hong Ding et al.

AAAI 2025paper

#4941

Unified Knowledge Maintenance Pruning and Progressive Recovery with Weight Recalling for Large Vision-Language Models

Zimeng Wu, Jiaxin Chen, Yunhong Wang

AAAI 2025paper

#4942

MUCD: Unsupervised Point Cloud Change Detection via Masked Consistency

Yue Wu, Zhipeng Wang, Yongzhe Yuan et al.

AAAI 2025paper

#4943

Deconfound Semantic Shift and Incompleteness in Incremental Few-shot Semantic Segmentation

Yirui Wu, Yuhang Xia, Hao Li et al.

AAAI 2025paper

#4944

VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval

Peng Wu, Wanshun Su, Xiangteng He et al.

AAAI 2025paper

#4945

SVRMamba: Slice-to-Volume Reconstruction from Multiple MRI Stacks with Slice Sequence Guided Mamba

Jiangjie Wu, Hongjiang Wei, Yuyao Zhang

AAAI 2025paper

#4946

Spin: Diffusion-based Semantic Image Painting Through Independent Information Injection

Dantong Wu, Zhiqiang Chen, Tianjiao Du et al.

AAAI 2025paper

#4947

Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration

Yuanbo Wen, Tao Gao, Jing Zhang et al.

AAAI 2025paper

#4948

Mitigating Idiom Inconsistency: A Multi-Semantic Contrastive Learning Method for Chinese Idiom Reading Comprehension

Mingmin Wu, Yuxue Hu, Yongcheng Zhang et al.

AAAI 2024paper

#4949

Power of Diversity: Enhancing Data-Free Black-Box Attack with Domain-Augmented Learning

Yang Wei, Jingyu Tan, Guowen Xu et al.

AAAI 2025paper

#4950

GlyphSR: A Simple Glyph-Aware Framework for Scene Text Image Super-Resolution

Baole Wei, Yuxuan Zhou, Liangcai Gao et al.

AAAI 2025paper

#4951

MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds

Zihao Wang, Yiming Huang, Gengyu Lyu et al.

AAAI 2025paper

#4952

GOALNET: Interleaving Neural Goal Predicate Inference with Classical Planning for Generalization in Robot Instruction Following

Jigyasa Gupta, Shreya Sharma, Shreshth Tuli et al.

AAAI 2024paper

#4953

Attention-Imperceptible Backdoor Attacks on Vision Transformers

Zhishen Wang, Rui Wang, Lihua Jing

AAAI 2025paper

#4954

Thermal-Aware Low-Light Image Enhancement: A Real-World Benchmark and a New Light-Weight Model

Zhen Wang, Yaozu Wu, Dongyuan Li et al.

AAAI 2025paper

#4955

Style Nursing with Spatial and Semantic Guidance for Zero-Shot Traffic Scene Style Transfer

Zhen Wang, Zihang Lin, Meng Yuan et al.

AAAI 2025paper

#4956

Two-Stage Evolutionary Reinforcement Learning for Enhancing Exploration and Exploitation

AAAI 2024paper

#4957

DualNet: Robust Self-Supervised Stereo Matching with Pseudo-Label Supervision

Yun Wang, Jiahao Zheng, Chenghao Zhang et al.

AAAI 2025paper

#4958

Target Scanpath-Guided 360-Degree Image Enhancement

Yujia Wang, Fang-Lue Zhang, Neil A. Dodgson

AAAI 2025paper

#4959

Manifold Constraints for Imperceptible Adversarial Attacks on Point Clouds

AAAI 2024paper

#4960

SpFormer: Spatio-Temporal Modeling for Scanpaths with Transformer

Zhijie Nie, Richong Zhang, Zhongyuan Wang et al.

AAAI 2024paper

#4961

Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units

Youjia Wang, Yiwen Wu, Hengan Zhou et al.

AAAI 2025paperarXiv:2402.03944

#4962

RefDetector: A Simple Yet Effective Matching-based Method for Referring Expression Comprehension

Yabing Wang, Zhuotao Tian, Zheng Qin et al.

AAAI 2025paper

#4963

From Coarse to Fine: A Matching and Alignment Framework for Unsupervised Cross-View Geo-Localization

Xueyi Wang, Lele Zhang, Zheng Fan et al.

AAAI 2025paper

#4964

Enhancing Neural Radiance Fields with Adaptive Multi-Exposure Fusion: A Bilevel Optimization Approach for Novel View Synthesis

Yang Zou, Xingyuan Li, Zhiying Jiang et al.

AAAI 2024paper

#4965

MIMTrack: In-Context Tracking via Masked Image Modeling

Xingmei Wang, Guohao Nie, Jiaxiang Meng et al.

AAAI 2025paper

#4966

Lifting Scheme-Based Implicit Disentanglement of Emotion-Related Facial Dynamics in the Wild

Xingjian Wang, Li Chai

AAAI 2025paperarXiv:2412.13168

#4967

DCTMamba: Advancing JPEG Image Restoration Through Long-Sequence Modeling and Adaptive Frequency Strategy

Xi Wang, Xueyang Fu, Liang Li et al.

AAAI 2025paper

#4968

FreeGen: Bridging Visual-Linguistic Discrepancies Towards Diffusion-based Pixel-level Data Synthesis

Wenzhuang Wang, Mingcan Ma, Yong Chen et al.

AAAI 2025paper

#4969

Imagine: Image-Guided 3D Part Assembly with Structure Knowledge Graph

Weihao Wang, Yu Lan, Mingyu You et al.

AAAI 2025paper

#4970

The Parables of the Mustard Seed and the Yeast: Extremely Low-Budget, High-Performance Nighttime Semantic Segmentation

Shiqin Wang, Xin Xu, Haoyang Chen et al.

AAAI 2025paper

#4971

Deep Multi-modal Graph Clustering via Graph Transformer Network

Qianqian Wang, Haiming Xu, Zihao Zhang et al.

AAAI 2025paper

#4972

Tracking Everything Everywhere across Multiple Cameras

Li-Heng Wang, YuJu Cheng, Tyng-Luh Liu

AAAI 2025paper

#4973

EMControl: Adding Conditional Control to Text-to-Image Diffusion Models via Expectation-Maximization

He Wang, Longquan Dai, Jinhui Tang

AAAI 2025paper

#4974

msLPCC: A Multimodal-Driven Scalable Framework for Deep LiDAR Point Cloud Compression

Miaohui Wang, Runnan Huang, Hengjin Dong et al.

AAAI 2024paper

#4975

S³-Mamba: Small-Size-Sensitive Mamba for Lesion Segmentation

Gui Wang, Yuexiang Li, Wenting Chen et al.

AAAI 2025paper

#4976

Scene Graph-Grounded Image Generation

Fuyun Wang, Tong Zhang, Yuanzhi Wang et al.

AAAI 2025paper

#4977

A Black-Box Evaluation Framework for Semantic Robustness in Bird’s Eye View Detection

Fu Wang, Yanghao Zhang, Xiangyu Yin et al.

AAAI 2025paperarXiv:2412.13913

#4978

RA-GAR: A Richly Annotated Benchmark for Gait Attribute Recognition

Chenye Wang, Saihui Hou, Aoqi Li et al.

AAAI 2025paper

#4979

Chain-of-Thought Improves Text Generation with Citations in Large Language Models

Bin Ji, Huijun Liu, Mingzhe Du et al.

AAAI 2024paper

#4980

The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models

Jongyeong Lee, Chao-Kai Chiang, Masashi Sugiyama

AAAI 2024paperarXiv:2302.14407

#4981

Hypergraph Neural Architecture Search

Wei Lin, Xu Peng, Zhengtao Yu et al.

AAAI 2024paper

#4982

Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text

Xuyang Chen, Dong Wang, Konrad Schindler et al.

AAAI 2024paper

#4983

Machine Learning-Powered Combinatorial Clock Auction

Ermis Nikiforos Soumalias, Jakob Weissteiner, Jakob Heiss et al.

AAAI 2024paperarXiv:2512.11133

#4984

Boosting Few-Shot Learning via Attentive Feature Regularization

Xingyu Zhu, Shuo Wang, Jinda Lu et al.

AAAI 2024paper

#4985

VOILA: Complexity-Aware Universal Segmentation of CT Images by Voxel Interacting with Language

Zishuo Wan, Yu Gao, Wanyuan Pang et al.

AAAI 2025paperarXiv:2501.03482

#4986

Memory-Augmented Re-Completion for 3D Semantic Scene Completion

Yu-Wen Tseng, Sheng-Ping Yang, Jhih-Ciang Wu et al.

AAAI 2025paper

#4987

LSTKC: Long Short-Term Knowledge Consolidation for Lifelong Person Re-identification

Kunlun Xu, Xu Zou, Jiahuan Zhou

AAAI 2024paper

#4988

Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds

Tuo Feng, Ruijie Quan, Xiaohan Wang et al.

AAAI 2024paper

#4989

Stitch, Contrast, and Segment: Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos

Haitao Tian, Pierre Payeur

AAAI 2025paper

#4990

TraceEvader: Making DeepFakes More Untraceable via Evading the Forgery Model Attribution

Mengjie Wu, Jingui Ma, Run Wang et al.

AAAI 2024paper

#4991

3D²-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling

Zichen Tang, Hongyu Yang, Hanchen Zhang et al.

AAAI 2025paper

#4992

From Representation Space to Prognostic Insights: Whole Slide Image Generation with Hierarchical Diffusion Model for Survival Prediction

Zhihao Tang, Xi Zhang, Chaozhuo Li

AAAI 2025paper

#4993

Learning Only When It Matters: Cost-Aware Long-Tailed Classification

Yu-Cheng He, Yao-Xiang Ding, Han-Jia Ye et al.

AAAI 2024paper

#4994

RAGG: Retrieval-Augmented Grasp Generation Model

Zhenhua Tang, Bin Zhu, Yanbin Hao et al.

AAAI 2025paper

#4995

MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept Alignment

Yequan Bie, Luyang Luo, Hao Chen

AAAI 2024paper

#4996

Talk Funny! A Large-Scale Humor Response Dataset with Chain-of-Humor Interpretation

Yuyan Chen, Yichen Yuan, Panjun Liu et al.

AAAI 2024paper

#4997

NaMa: Neighbor-Aware Multi-Modal Adaptive Learning for Prostate Tumor Segmentation on Anisotropic MR Images

Runqi Meng, Xiao Zhang, Shijie Huang et al.

AAAI 2024paper

#4998

M2Flow: A Motion Information Fusion Framework for Enhanced Unsupervised Optical Flow Estimation in Autonomous Driving

Xunpei Sun, Gang Chen, Zuoxun Hou

AAAI 2025paper

#4999

Transferable Adversarial Attacks for Object Detection Using Object-Aware Significant Feature Distortion

Xinlong Ding, Jiansheng Chen, Hongwei Yu et al.

AAAI 2024paper

#5000

Taxonomy Driven Fast Adversarial Training

Kun Tong, Chengze Jiang, Jie Gui et al.

AAAI 2024paper
