Most Cited AAAI "text embeddings fusion" Papers

5,317 papers found • Page 16 of 27

Filters:Most Cited AAAI text embeddings fusion Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#3001

Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms

Jianhao Ding, Zhaofei Yu, Tiejun Huang et al.

AAAI 2024paper

#3002

A Closer Look at Curriculum Adversarial Training: From an Online Perspective

Lianghe Shi, Weiwei Liu

AAAI 2024paper

#3003

TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning

Yupei Liu, Yanting Wang, Jinyuan Jia

AAAI 2025paperarXiv:2501.04108

#3004

DRF: Improving Certified Robustness via Distributional Robustness Framework

Zekai Wang, Zhengyu Zhou, Weiwei Liu

AAAI 2024paper

#3005

Provably Convergent Federated Trilevel Learning

Yang Jiao, Kai YANG, Tiancheng Wu et al.

AAAI 2024paperarXiv:2312.11835

#3006

Recoverable Facial Identity Protection via Adaptive Makeup Transfer Adversarial Attacks

Xiyao Liu, Junxing Ma, Xinda Wang et al.

AAAI 2025paper

#3007

Dynamic Knowledge Injection for AIXI Agents

Samuel Yang-Zhao, Kee Siong Ng, Marcus Hutter

AAAI 2024paperarXiv:2312.16184

#3008

Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

Tianyuan Jin, Hao-Lun Hsu, William Chang et al.

AAAI 2024paperarXiv:2312.15549

#3009

Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity

Zhufeng Li, Sandeep Suresh Cranganore, Nicholas Youngblut et al.

AAAI 2025paperarXiv:2405.05998

#3010

AI-Powered Algorithm-Centric Quantum Processor Topology Design

Tian Li, Xiao-Yue Xu, Chen Ding et al.

AAAI 2025paperarXiv:2412.13805

#3011

Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo-Labeling

Haoran Li, Xingjian Li, Jiahua Shi et al.

AAAI 2025paperarXiv:2406.18610

#3012

IWRN:A Robust Blind Watermarking Method for Artwork Image Copyright Protection Against Noise Attack

Feifei Kou, Yuhan Yao, Siyuan Yao et al.

AAAI 2025paper

#3013

Learning Generalized Residual Exchange-Correlation-Uncertain Functional for Density Functional Theory

Sizhuo Jin, Shuo Chen, Jianjun Qian et al.

AAAI 2025paperarXiv:2412.18350

#3014

Feature Distribution Matching by Optimal Transport for Effective and Robust Coreset Selection

AAAI 2024paper

#3015

A Unified Self-Distillation Framework for Multimodal Sentiment Analysis with Uncertain Missing Modalities

AAAI 2024paper

#3016

Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic Mining

AAAI 2024paper

#3017

Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation

Zhouhong Gu, Xiaoxuan Zhu, Haoning Ye et al.

AAAI 2024paperarXiv:2306.05783

#3018

Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective

Zhen Qin, Feiyi Chen, Chen Zhi et al.

AAAI 2024paperarXiv:2309.16456

#3019

Transportable Representations for Domain Generalization

Kasra Jalaldoust, Elias Bareinboim

AAAI 2024paper

#3020

Exponential Hardness of Optimization from the Locality in Quantum Neural Networks

Hao-Kai Zhang, Chengkai Zhu, Geng Liu et al.

AAAI 2024paper

#3021

Social Recommendation via Graph-Level Counterfactual Augmentation

Yinxuan Huang, Ke Liang, Yanyi Huang et al.

AAAI 2025paper

#3022

MFOS: Model-Free & One-Shot Object Pose Estimation

JongMin Lee, Yohann Cabon, Romain Brégier et al.

AAAI 2024paper

#3023

Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning

Jiangmeng Li, Yifan Jin, Hang Gao et al.

AAAI 2024paperarXiv:2312.14222

#3024

ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese

Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.

AAAI 2025paperarXiv:2412.15308

#3025

PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion

Yige Yuan, Bingbing Xu, Bo Lin et al.

AAAI 2024paperarXiv:2305.15835

#3026

HHAN: Comprehensive Infectious Disease Source Tracing via Heterogeneous Hypergraph Neural Network

Qiang He, Yunting Bao, Hui Fang et al.

AAAI 2025paper

#3027

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and Von Mises-Fisher Distributions for Online Continual Learning

Nicolas Michel, Giovanni Chierchia, Romain Negrel et al.

AAAI 2024paperarXiv:2306.03364

#3028

Towards Real-World Test-Time Adaptation: Tri-net Self-Training with Balanced Normalization

Yongyi Su, Xun Xu, Kui Jia

AAAI 2024paperarXiv:2309.14949

#3029

A Theoretical Framework for an Efficient Normalizing Flow-Based Solution to the Electronic Schrödinger Equation

Daniel Freedman, Eyal Rozenberg, Alex Bronstein

AAAI 2025paper

#3030

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

Longchao Da, Porter Jenkins, Trevor Schwantes et al.

AAAI 2024paperarXiv:2312.11551

#3031

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

Ruiqian Nai, Zixin Wen, Ji Li et al.

AAAI 2024paperarXiv:2403.00352

#3032

How to Re-enable PDE Loss for Physical Systems Modeling Under Partial Observation

Haodong Feng, Yue Wang, Dixia Fan

AAAI 2025paperarXiv:2412.09116

#3033

Knowledge Is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis

Zhiang Dong, Jingyuan Chen, Fei Wu

AAAI 2025paperarXiv:2502.05556

#3034

Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation

Yuzheng Wang, Zhaoyu Chen, Dingkang Yang et al.

AAAI 2024paperarXiv:2303.11611

#3035

Improving Cancer Gene Prediction by Enhancing Common Information Between the PPI Network and Gene Functional Association

Chao Deng, Hongdong Li, Jianxin Wang

AAAI 2025paper

#3036

HAGO-Net: Hierarchical Geometric Massage Passing for Molecular Representation Learning

Hongbin Pei, Taile Chen, Chen A et al.

AAAI 2024paper

#3037

Path-Adaptive Matting for Efficient Inference Under Various Computational Cost Constraints

Qinglin Liu, Zonglin Li, Xiaoqian Lv et al.

AAAI 2025paperarXiv:2503.03228

#3038

Robust SAM: On the Adversarial Robustness of Vision Foundation Models

Jiahuan Long, Zhengqin Xu, Tingsong Jiang et al.

AAAI 2025paperarXiv:2504.08906

#3039

Generative Video Diffusion for Unseen Novel Semantic Video Moment Retrieval

Dezhao Luo, Shaogang Gong, Jiabo Huang et al.

AAAI 2025paperarXiv:2401.13329

#3040

VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting

Muhammet Furkan Ilaslan, Ali Köksal, Kevin Qinghong Lin et al.

AAAI 2025paperarXiv:2412.11621

#3041

Game4Loc: A UAV Geo-Localization Benchmark from Game Data

Yuxiang Ji, Boyong He, Zhuoyue Tan et al.

AAAI 2025paperarXiv:2409.16925

#3042

Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection

Mingda Jia, Liming Zhao, Ge Li et al.

AAAI 2025paperarXiv:2412.08506

#3043

DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation

Jisoo Kim, Jungbin Cho, Joonho Park et al.

AAAI 2025paperarXiv:2408.06010

#3044

Pedestrian Attribute Recognition: A New Benchmark Dataset and a Large Language Model Augmented Framework

Jiandong Jin, Xiao Wang, Qian Zhu et al.

AAAI 2025paperarXiv:2408.09720

#3045

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning

Taewhan Kim, Soeun Lee, Si-Woo Kim et al.

AAAI 2025paperarXiv:2412.19289

#3046

U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation

Chenxin Li, Xinyu Liu, Wuyang Li et al.

AAAI 2025paperarXiv:2406.02918

#3047

UniDet3D: Multi-dataset Indoor 3D Object Detection

Maksim Kolodiazhnyi, Anna Vorontsova, Matvey Skripkin et al.

AAAI 2025paperarXiv:2409.04234

#3048

Do Not DeepFake Me: Privacy-Preserving Neural 3D Head Reconstruction Without Sensitive Images

Jiayi Kong, Xurui Song, Shuo Huai et al.

AAAI 2025paperarXiv:2312.04106

#3049

Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training

Yunwei Lan, Zhigao Cui, Chang Liu et al.

AAAI 2025paperarXiv:2503.15017

#3050

Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space

Hyunjee Lee, Youngsik Yun, Jeongmin Bae et al.

AAAI 2025paperarXiv:2408.07416

#3051

MaskViM: Domain Generalized Semantic Segmentation with State Space Models

Jiahao Li, Yang Lu, Yuan Xie et al.

AAAI 2025paper

#3052

A Compact Implicit Neural Representation for Efficient Storage of Massive 4D Functional Magnetic Resonance Imaging

Ruoran Li, Runzhao Yang, Wenxin Xiang et al.

AAAI 2025paperarXiv:2312.00082

#3053

Transferable Adversarial Face Attack with Text Controlled Attribute

Wenyun Li, Zheng Zhang, Xiangyuan Lan et al.

AAAI 2025paperarXiv:2412.11735

#3054

ProsodyTalker: 3D Visual Speech Animation via Prosody Decomposition

Zonglin Li, Xiaoqian Lv, Qinglin Liu et al.

AAAI 2025paper

#3055

Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting

Jiaqi Lin, Zhihao Li, Binxiao Huang et al.

AAAI 2025paperarXiv:2501.10788

#3056

Disentangled Motion Modeling for Video Frame Interpolation

Jaihyun Lew, Jooyoung Choi, Chaehun Shin et al.

AAAI 2025paperarXiv:2406.17256

#3057

AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement

Yunlong Lin, Tian Ye, Sixiang Chen et al.

AAAI 2025paperarXiv:2407.14900

#3058

RemDet: Rethinking Efficient Model Design for UAV Object Detection

Chen Li, Rui Zhao, Zeyu Wang et al.

AAAI 2025paperarXiv:2412.10040

#3059

4D Diffusion for Dynamic Protein Structure Prediction with Reference and Motion Guidance

Kaihui Cheng, Ce Liu, Qingkun Su et al.

AAAI 2025paperarXiv:2408.12419

#3060

G2LDetect: A Global-to-Local Approach for Hallucination Detection

Xiaoxia Cheng, Zeqi Tan, Zhe Zheng et al.

AAAI 2025paper

#3061

RingFormer: A Ring-Enhanced Graph Transformer for Organic Solar Cell Property Prediction

Zhihao Ding, Ting Zhang, Yiran Li et al.

AAAI 2025paperarXiv:2412.09030

#3062

HeMeNet: Heterogeneous Multichannel Equivariant Network for Protein Multi-task Learning

Rong Han, Wenbing Huang, Lingxiao Luo et al.

AAAI 2025paper

#3063

Controllable Protein Sequence Generation with LLM Preference Optimization

Xiangyu Liu, Yi Liu, Silei Chen et al.

AAAI 2025paperarXiv:2501.15007

#3064

DAMMFND: Domain-Aware Multimodal Multi-view Fake News Detection

Weihai Lu, Yu Tong, Zhiqiu Ye

AAAI 2025paper

#3065

M²N: A Progressive Macro-to-Micro 3D Modeling Scheme for Unveiling Drug-Target Affinity

Tianxu Lv, Jie Zhu, Jinyi Liu et al.

AAAI 2025paper

#3066

Multi-modal Deepfake Detection via Multi-task Audio-Visual Prompt Learning

Hui Miao, Yuanfang Guo, Zeming Liu et al.

AAAI 2025paper

#3067

SpeHeaTal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis

Yi Shi, Yun-Kai Wang, Xu-Peng Tian et al.

AAAI 2025paperarXiv:2502.13192

#3068

Generalized Implicit Neural Representations for Dynamic Molecular Surface Modeling

Fang Wu, Bozhen Hu, Stan Z. Li

AAAI 2025paper

#3069

MultiSFL: Towards Accurate Split Federated Learning via Multi-Model Aggregation and Knowledge Replay

Zeke Xia, Ming Hu, Dengke Yan et al.

AAAI 2025paper

#3070

Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting

Tong Ye, Yangkai Du, Tengfei Ma et al.

AAAI 2025paperarXiv:2405.16133

#3071

Efficient Traffic Prediction Through Spatio-Temporal Distillation

Qianru Zhang, Xinyi Gao, Haixin Wang et al.

AAAI 2025paperarXiv:2501.10459

#3072

Multi-Perspective Consolidation Enhanced Cognitive Diagnosis via Conditional Diffusion Model

Guanhao Zhao, Zhenya Huang, Cheng Cheng et al.

AAAI 2025paper

#3073

Multi-View Incremental Learning with Structured Hebbian Plasticity for Enhanced Fusion Efficiency

Yuhong Chen, Ailin Song, Huifeng Yin et al.

AAAI 2025paperarXiv:2412.12801

#3074

Symbolic Functional Decomposition: A Reconfiguration Approach

Mateus de Oliveira Oliveira, Wim Van Den Broeck

AAAI 2025paperarXiv:2601.08354

#3075

Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision

Wei Liu, Li Yang, Mingxuan Zhao et al.

AAAI 2025paper

#3076

Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition Through Contrastive Learning

Yan-Kai Liu, Jinyu Cai, Bao-Liang Lu et al.

AAAI 2025paper

#3077

ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind

Kazutoshi Shinoda, Nobukatsu Hojo, Kyosuke Nishida et al.

AAAI 2025paperarXiv:2501.08838

#3078

SalM²: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention

Chunyu Zhao, Wentao Mu, Xian Zhou et al.

AAAI 2025paper

#3079

Progressive Self-Learning for Domain Adaptation on Symbolic Regression of Integer Sequences

Yaohui Zhu, Kaiming Sun, Zhengdong Luo et al.

AAAI 2025paper

#3080

HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models

Kazi Hasan Ibn Arif, JinYi Yoon, Dimitrios S. Nikolopoulos et al.

AAAI 2025paperarXiv:2408.10945

#3081

Can Generative Models Improve Self-Supervised Representation Learning?

Sana Ayromlou, Vahid Reza Khazaie, Fereshteh Forghani et al.

AAAI 2025paperarXiv:2403.05966

#3082

The Master Key Filters Hypothesis: Deep Filters Are General

Zahra Babaiee, Peyman M. Kiasari, Daniela Rus et al.

AAAI 2025paperarXiv:2412.16751

#3083

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing

Lingling Cai, Kang Zhao, Hangjie Yuan et al.

AAAI 2025paperarXiv:2409.20500

#3084

Deep Graph Online Hashing for Multi-Label Image Retrieval

Yuan Cao, Xiangru Chen, Zifan Liu et al.

AAAI 2025paper

#3085

Segment Any 3D Gaussians

Jiazhong Cen, Jiemin Fang, Chen Yang et al.

AAAI 2025paperarXiv:2312.00860

#3086

Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

Qihua Chen, Yue Ma, Hongfa Wang et al.

AAAI 2025paper

#3087

Cross-View Referring Multi-Object Tracking

Sijia Chen, En Yu, Wenbing Tao

AAAI 2025paperarXiv:2412.17807

#3088

M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous Driving

Xuesong Chen, Shaoshuai Shi, Tao Ma et al.

AAAI 2025paperarXiv:2503.18100

#3089

3D Measurement of Complex Textured Objects Based on Bidirectional Fringe Projection

Yuchong Chen, Jian Yu, Shaoyan Gai et al.

AAAI 2025paper

#3090

EvHDR-GS: Event-guided HDR Video Reconstruction with 3D Gaussian Splatting

Zehao Chen, Zhan Lu, De Ma et al.

AAAI 2025paper

#3091

3DPGS: 3D Probabilistic Graph Search for Archaeological Piece Grouping

Junfeng Cheng, Yingkai Yang, Tania Stathaki

AAAI 2025paper

#3092

Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment

Zhixin Cheng, Jiacheng Deng, Xinjun Li et al.

AAAI 2025paperarXiv:2504.01641

#3093

Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting

Dasol Choi, Dongbin Na

AAAI 2025paperarXiv:2409.14747

#3094

SIDL: A Real-World Dataset for Restoring Smartphone Images with Dirty Lenses

Sooyoung Choi, Sungyong Park, Heewon Kim

AAAI 2025paper

#3095

AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples

Antonio Emanuele Cinà, Jérôme Rony, Maura Pintor et al.

AAAI 2025paperarXiv:2404.19460

#3096

PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery

Shristi Das Biswas, Matthew Shreve, Xuelu Li et al.

AAAI 2025paperarXiv:2501.09826

#3097

Boundary-Aware Temporal Dynamic Pseudo-Supervision Pairs Generation for Zero-Shot Natural Language Video Localization

Xiongwen Deng, Haoyu Tang, Han Jiang et al.

AAAI 2025paper

#3098

AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds

Ziheng Ding, Xiaze Zhang, Qi Jing et al.

AAAI 2025paper

#3099

Latent Diffusion-Enhanced Virtual Try-On via Optimized Pseudo-Label Generation

Chenghu Du, Junyin Wang, Feng Yu et al.

AAAI 2025paper

#3100

SSUN-Net: Spatial-Spectral Prior-Aware Unfolding Network for Pan-Sharpening

Shijie Fang, Hongping Gan

AAAI 2025paper

#3101

PNVC: Towards Practical INR-based Video Compression

Ge Gao, Ho Man Kwan, Fan Zhang et al.

AAAI 2025paperarXiv:2409.00953

#3102

EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction

Chengjie Ge, Xueyang Fu, Peng He et al.

AAAI 2025paperarXiv:2503.19721

#3103

Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning

Shiping Ge, Qiang Chen, Zhiwei Jiang et al.

AAAI 2025paperarXiv:2412.12791

#3104

Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resection with Pringle Maneuver

Diandian Guo, Weixin Si, Zhixi Li et al.

AAAI 2025paperarXiv:2408.10538

#3105

PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts

Kun Guo, Qiang Ling

AAAI 2025paperarXiv:2412.12460

#3106

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Yongxin Guo, Jingyu Liu, Mingda Li et al.

AAAI 2025paperarXiv:2405.13382

#3107

LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies

Ameer Hamza, Abdullah, Yong Hyun Ahn et al.

AAAI 2025paperarXiv:2410.04749

#3108

MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement

Xu He, Zhiyong Wu, Xiaoyu Li et al.

AAAI 2025paperarXiv:2408.14211

#3109

Prompt Tuning In a Compact Attribute Space

Shiyu Hou, Tianfei Zhou, Shuai Zhang et al.

AAAI 2025paper

#3110

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Qihan Huang, Siming Fu, Jinlong Liu et al.

AAAI 2025paperarXiv:2409.17920

#3111

PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration

Xiaoshui Huang, Zhou Huang, Yifan Zuo et al.

AAAI 2025paperarXiv:2501.07762

#3112

EGSRAL:An Enhanced 3D Gaussian Splatting Based Renderer with Automated Labeling for Large-Scale Driving Scene

Yixiong Huo, Guangfeng Jiang, Hongyang Wei et al.

AAAI 2025paperarXiv:2412.15550

#3113

High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion

Junhwa Hur, Charles Herrmann, Saurabh Saxena et al.

AAAI 2025paperarXiv:2410.11838

#3114

Few-Shot Fine-Grained Image Classification with Progressively Feature Refinement and Continuous Relationship Modeling

Zhen-Xiang Ma, Zhen-Duo Chen, Tai Zheng et al.

AAAI 2025paper

#3115

SegFace: Face Segmentation of Long-Tail Classes

Kartik Narayan, Vibashan Vs, Vishal M. Patel

AAAI 2025paperarXiv:2412.08647

#3116

HiGDA: Hierarchical Graph of Nodes to Learn Local-to-Global Topology for Semi-Supervised Domain Adaptation

Ba Hung Ngo, Doanh C. Bui, Nhat-Tuong Do-Tran et al.

AAAI 2025paperarXiv:2412.11819

#3117

Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community

Jiancheng Pan, Yanxing Liu, Yuqian Fu et al.

AAAI 2025paperarXiv:2408.09110

#3118

Beyond Text: Fine-Grained Multi-Modal Fact Verification with Hypergraph Transformers

Hui Pang, Chaozhuo Li, Litian Zhang et al.

AAAI 2025paper

#3119

EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba

Xiaohuan Pei, Tao Huang, Chang Xu

AAAI 2025paperarXiv:2403.09977

#3120

IMAGDressing-v1: Customizable Virtual Dressing

Fei Shen, Xin Jiang, Xin He et al.

AAAI 2025paperarXiv:2407.12705

#3121

Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes

Ji Shi, Xianghua Ying, Ruohao Guo et al.

AAAI 2025paperarXiv:2501.09460

#3122

OGP-Net: Optical Guidance Meets Pixel-Level Contrastive Distillation for Robust Multi-Modal and Missing Modality Segmentation

Aniruddh Sikdar, Jayant Teotia, Suresh Sundaram

AAAI 2025paper

#3123

Temporal Coherent Object Flow for Multi-Object Tracking

Zikai Song, Run Luo, Lintao Ma et al.

AAAI 2025paper

#3124

Toward Improving Robustness and Accuracy in Unsupervised Domain Adaptation

Aishwarya Soni, Tanima Dutta

AAAI 2025paper

#3125

Explicit Relational Reasoning Network for Scene Text Detection

Yuchen Su, Zhineng Chen, Yongkun Du et al.

AAAI 2025paperarXiv:2412.14692

#3126

3D Annotation-Free Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving

Boyi Sun, Yuhang Liu, Xingxia Wang et al.

AAAI 2025paperarXiv:2405.15286

#3127

C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection

Chuangchuang Tan, Renshuai Tao, Huan Liu et al.

AAAI 2025paperarXiv:2408.09647

#3128

From Representation Space to Prognostic Insights: Whole Slide Image Generation with Hierarchical Diffusion Model for Survival Prediction

Zhihao Tang, Xi Zhang, Chaozhuo Li

AAAI 2025paper

#3129

Unsupervised Self-Prior Embedding Neural Representation for Iterative Sparse-View CT Reconstruction

Xuanyu Tian, Lixuan Chen, Qing Wu et al.

AAAI 2025paperarXiv:2502.05445

#3130

AI-generated Image Quality Assessment in Visual Communication

Yu Tian, Yixuan Li, Baoliang Chen et al.

AAAI 2025paperarXiv:2412.15677

#3131

G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o

Tony Cheng Tong, Sirui He, Zhiwen Shao et al.

AAAI 2025paperarXiv:2412.13647

#3132

Towards Efficient Object Re-Identification with a Novel Cloud-Edge Collaborative Framework

Chuanming Wang, Yuxin Yang, Mengshi Qi et al.

AAAI 2025paperarXiv:2401.02041

#3133

EMControl: Adding Conditional Control to Text-to-Image Diffusion Models via Expectation-Maximization

He Wang, Longquan Dai, Jinhui Tang

AAAI 2025paper

#3134

MIMTrack: In-Context Tracking via Masked Image Modeling

Xingmei Wang, Guohao Nie, Jiaxiang Meng et al.

AAAI 2025paper

#3135

Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension

Yaxian Wang, Henghui Ding, Shuting He et al.

AAAI 2025paperarXiv:2501.01416

#3136

Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units

Youjia Wang, Yiwen Wu, Hengan Zhou et al.

AAAI 2025paperarXiv:2402.03944

#3137

MambaPro: Multi-Modal Object Re-identification with Mamba Aggregation and Synergistic Prompt

Yuhao Wang, Xuehu Liu, Tianyu Yan et al.

AAAI 2025paperarXiv:2412.10707

#3138

IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis

Yuji Wang, Jingchen Ni, Yong Liu et al.

AAAI 2025paperarXiv:2503.00936

#3139

Thermal-Aware Low-Light Image Enhancement: A Real-World Benchmark and a New Light-Weight Model

Zhen Wang, Yaozu Wu, Dongyuan Li et al.

AAAI 2025paper

#3140

Realistic Noise Synthesis with Diffusion Models

Qi Wu, Mingyan Han, Ting Jiang et al.

AAAI 2025paperarXiv:2305.14022

#3141

Deconfound Semantic Shift and Incompleteness in Incremental Few-shot Semantic Segmentation

Yirui Wu, Yuhang Xia, Hao Li et al.

AAAI 2025paper

#3142

Boosting Vision State Space Model with Fractal Scanning

Haoke Xiao, Lv Tang, Peng-tao Jiang et al.

AAAI 2025paper

#3143

Cross-modulated Attention Transformer for RGBT Tracking

Yun Xiao, Jiacong Zhao, Andong Lu et al.

AAAI 2025paperarXiv:2408.02222

#3144

PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis

Yifan Xie, Tao Feng, Xin Zhang et al.

AAAI 2025paperarXiv:2412.08504

#3145

HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models

Zhifeng Xie, Hao Li, Huiming Ding et al.

AAAI 2025paperarXiv:2401.07450

#3146

FLAME: Learning to Navigate with Multimodal LLM in Urban Environments

Yunzhe Xu, Yiyuan Pan, Zhe Liu et al.

AAAI 2025paperarXiv:2408.11051

#3147

Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution

Jiarui Yang, Tao Dai, Yufei Zhu et al.

AAAI 2025paperarXiv:2412.16552

#3148

Dual Information Purification for Lightweight SAR Object Detection

Xi Yang, Jiachen Sun, Songsong Duan et al.

AAAI 2025paper

#3149

MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation

Zhifei Yang, Keyang Lu, Chao Zhang et al.

AAAI 2025paperarXiv:2502.05874

#3150

MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking

Mufeng Yao, Jinlong Peng, Qingdong He et al.

AAAI 2025paper

#3151

FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications

Ellen Yi-Ge, Leo Shawn

AAAI 2025paper

#3152

ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition

Seungdong Yoa, Seungjun Lee, Hye-Seung Cho et al.

AAAI 2025paperarXiv:2412.16491

#3153

FOCUS: Towards Universal Foreground Segmentation

Zuyao You, Lingyu Kong, Lingchen Meng et al.

AAAI 2025paperarXiv:2501.05238

#3154

Fine-grained Adaptive Visual Prompt for Generative Medical Visual Question Answering

Ting Yu, Zixuan Tong, Jun Yu et al.

AAAI 2025paper

#3155

OTPNet: ODE-inspired Tuning-free Proximal Network for Remote Sensing Image Fusion

Wei Yu, Zonglin Li, Qinglin Liu et al.

AAAI 2025paper

#3156

Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP

Yating Yu, Congqi Cao, Yueran Zhang et al.

AAAI 2025paperarXiv:2412.09895

#3157

OLMD: Orientation-aware Long-term Motion Decoupling for Continuous Sign Language Recognition

Yiheng Yu, Sheng Liu, Yuan Feng et al.

AAAI 2025paperarXiv:2503.08205

#3158

Gaze Label Alignment: Alleviating Domain Shift for Gaze Estimation

Guanzhong Zeng, Jingjing Wang, Zefu Xu et al.

AAAI 2025paperarXiv:2412.15601

#3159

TGFormer: Transformer with Track Query Group for Multi-Object Tracking

Rui Zeng, Yuanzhou Huang, Songwei Pei

AAAI 2025paper

#3160

Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-based Token Pruning

Evelyn Zhang, Jiayi Tang, Xuefei Ning et al.

AAAI 2025paper

#3161

Decoupling Scattering: Pseudo-Label Guided NeRF for Scenes with Scattering Media

Mingyang Zhang, Junkang Zhang, Faming Fang et al.

AAAI 2025paper

#3162

Visual Perturbation for Text-Based Person Search

Pengcheng Zhang, Xiaohan Yu, Xiao Bai et al.

AAAI 2025paper

#3163

CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression

Xinjie Zhang, Shenyuan Gao, Zhening Liu et al.

AAAI 2025paperarXiv:2403.08505

#3164

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

Yan Zhang, Gangyan Zeng, Huawen Shen et al.

AAAI 2025paperarXiv:2412.12502

#3165

InstantSticker: Realistic Decal Blending via Disentangled Object Reconstruction

Yi Zhang, Xiaoyang Huang, Yishun Dou et al.

AAAI 2025paperarXiv:2504.06620

#3166

Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry

Zhaoxing Zhang, Junda Cheng, Gangwei Xu et al.

AAAI 2025paperarXiv:2412.16923

#3167

Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation

Hongxu Zhao, Zelin Gao, Yue Wang et al.

AAAI 2025paper

#3168

NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark

Yuxuan Zhao, Weijian Ruan, He Li et al.

AAAI 2025paper

#3169

Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment

Yuanfan Zheng, Jinlin Wu, Wuyang Li et al.

AAAI 2025paperarXiv:2412.11443

#3170

MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection

Chuyi Zhong, Dingkang Yang, Peng Zhai et al.

AAAI 2025paper

#3171

Core-to-Global Reasoning for Compositional Visual Question Answering

Hao Zhou, Tingjin Luo, Zhangqi Jiang

AAAI 2025paper

#3172

Mitigating Feature Gap for Adversarial Robustness by Feature Disentanglement

Nuoyan Zhou, Dawei Zhou, Decheng Liu et al.

AAAI 2025paperarXiv:2401.14707

#3173

GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions

Ziqi Zhou, Weize Quan, Hailin Shi et al.

AAAI 2025paperarXiv:2412.09296

#3174

A Lottery Ticket Hypothesis Approach with Sparse Fine-tuning and MAE for Image Forgery Detection and Localization

Jiaying Zhu, Dong Li, Xueyang Fu et al.

AAAI 2025paper

#3175

Less Is More: Adaptive Program Repair with Bug Localization and Preference Learning

Zhenlong Dai, Bingrui Chen, Zhuoluo Zhao et al.

AAAI 2025paperarXiv:2503.06510

#3176

Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound

Cătălin E. Brița, Jacobus G. M. van der Linden, Emir Demirović

AAAI 2025paper

#3177

Decentralized Projected Riemannian Stochastic Recursive Momentum Method for Nonconvex Optimization

Kangkang Deng, Jiang Hu

AAAI 2025paperarXiv:2412.02382

#3178

Parameterized Complexity of Caching in Networks

Robert Ganian, Fionn Mc Inerney, Dimitra Tsigkari

AAAI 2025paperarXiv:2412.16585

#3179

DCC: Differentiable Cardinality Constraints for Partial Index Tracking

Wooyeon Jo, Hyunsouk Cho

AAAI 2025paperarXiv:2412.17175

#3180

Designing Specialized Two-Dimensional Graph Spectral Filters for Spatial-Temporal Graph Modeling

Yuxin Chen, Fangru Lin, Jingyi Huo et al.

AAAI 2025paper

#3181

POI-Enhancer: An LLM-based Semantic Enhancement Framework for POI Representation Learning

Jiawei Cheng, Jingyuan Wang, Yichuan Zhang et al.

AAAI 2025paperarXiv:2502.10038

#3182

Descriptive and Discriminative Document Identifiers for Generative Retrieval

Jiehan Cheng, Zhicheng Dou, Yutao Zhu et al.

AAAI 2025paper

#3183

Entire-Space Variational Information Exploitation for Post-Click Conversion Rate Prediction

Ke Fei, Xinyue Zhang, Jingjing Li

AAAI 2025paperarXiv:2502.15687

#3184

Mixed-Curvature Multi-Modal Knowledge Graph Completion

Yuxiao Gao, Fuwei Zhang, Zhao Zhang et al.

AAAI 2025paper

#3185

Multiple Purchase Chains with Negative Transfer Elimination for Multi-Behavior Recommendation

Shuwei Gong, Yuting Liu, Yizhou Dang et al.

AAAI 2025paper

#3186

K-ON: Stacking Knowledge on the Head Layer of Large Language Model

Lingbing Guo, Yichi Zhang, Zhongpu Bo et al.

AAAI 2025paperarXiv:2502.06257

#3187

Decomposed Spatio-Temporal Mamba for Long-Term Traffic Prediction

Sicheng He, Junzhong Ji, Minglong Lei

AAAI 2025paper

#3188

ST-FiT: Inductive Spatial-Temporal Forecasting with Limited Training Data

Zhenyu Lei, Yushun Dong, Jundong Li et al.

AAAI 2025paperarXiv:2412.10912

#3189

Public Opinion Field Effect and Hawkes Process Join Hands for Information Popularity Prediction

Junliang Li, Yajun Yang, Yujia Zhang et al.

AAAI 2025paper

#3190

Self-Explainable Graph Transformer for Link Sign Prediction

Lu Li, Jiale Liu, Xingyu Ji et al.

AAAI 2025paperarXiv:2408.08754

#3191

Context-aware Inductive Knowledge Graph Completion with Latent Type Constraints and Subgraph Reasoning

Muzhi Li, Cehao Yang, Chengjin Xu et al.

AAAI 2025paperarXiv:2410.16803

#3192

Structure Balance and Gradient Matching-Based Signed Graph Condensation

Rong Li, Long Xu, Songbai Liu et al.

AAAI 2025paper

#3193

LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation

Qidong Liu, Xian Wu, Wanyu Wang et al.

AAAI 2025paperarXiv:2409.19925

#3194

EPERM: An Evidence Path Enhanced Reasoning Model for Knowledge Graph Question and Answering

Xiao Long, Liansheng Zhuang, Aodi Li et al.

AAAI 2025paperarXiv:2502.16171

#3195

FairGP: A Scalable and Fair Graph Transformer Using Graph Partitioning

Renqiang Luo, Huafei Huang, Ivan Lee et al.

AAAI 2025paperarXiv:2412.10669

#3196

Sub-Interest-Aware Representation Uniformity for Recommender System

Ruijia Ma, Yahong Lian, Chunyao Song

AAAI 2025paper

#3197

GenAuction: A Generative Auction for Online Advertising

Yuchao Ma, Ruohan Qian, Bingzhe Wang et al.

AAAI 2025paper

#3198

Seeing Beyond Noise: Joint Graph Structure Evaluation and Denoising for Multimodal Recommendation

Yuxin Qi, Quan Zhang, Xi Lin et al.

AAAI 2025paper

#3199

Domain-Level Disentanglement Framework Based on Information Enhancement for Cross-Domain Cold-Start Recommendation

Nian Rong, Fei Xiong, Shirui Pan et al.

AAAI 2025paper

#3200

Language Pre-training Guided Masking Representation Learning for Time Series Classification

Liaoyuan Tang, Zheng Wang, Jie Wang et al.

AAAI 2025paper

← Previous

1...14 15 16 17 18...27

Most Cited AAAI "text embeddings fusion" Papers

Conference

Paper Type

Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms

A Closer Look at Curriculum Adversarial Training: From an Online Perspective

TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning

DRF: Improving Certified Robustness via Distributional Robustness Framework

Provably Convergent Federated Trilevel Learning

Recoverable Facial Identity Protection via Adaptive Makeup Transfer Adversarial Attacks

Dynamic Knowledge Injection for AIXI Agents

Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

Whole Genome Transformer for Gene Interaction Effects in Microbiome Habitat Specificity

AI-Powered Algorithm-Centric Quantum Processor Topology Design

Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo-Labeling

IWRN:A Robust Blind Watermarking Method for Artwork Image Copyright Protection Against Noise Attack

Learning Generalized Residual Exchange-Correlation-Uncertain Functional for Density Functional Theory

Feature Distribution Matching by Optimal Transport for Effective and Robust Coreset Selection

A Unified Self-Distillation Framework for Multimodal Sentiment Analysis with Uncertain Missing Modalities

Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic Mining

Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation

Resisting Backdoor Attacks in Federated Learning via Bidirectional Elections and Individual Perspective

Transportable Representations for Domain Generalization

Exponential Hardness of Optimization from the Locality in Quantum Neural Networks

Social Recommendation via Graph-Level Counterfactual Augmentation

MFOS: Model-Free &#x26; One-Shot Object Pose Estimation

Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning

ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese

PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion

HHAN: Comprehensive Infectious Disease Source Tracing via Heterogeneous Hypergraph Neural Network

Learning Representations on the Unit Sphere: Investigating Angular Gaussian and Von Mises-Fisher Distributions for Online Continual Learning

Towards Real-World Test-Time Adaptation: Tri-net Self-Training with Balanced Normalization

A Theoretical Framework for an Efficient Normalizing Flow-Based Solution to the Electronic Schrödinger Equation

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

How to Re-enable PDE Loss for Physical Systems Modeling Under Partial Observation

Knowledge Is Power: Harnessing Large Language Models for Enhanced Cognitive Diagnosis

Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation

Improving Cancer Gene Prediction by Enhancing Common Information Between the PPI Network and Gene Functional Association

HAGO-Net: Hierarchical Geometric Massage Passing for Molecular Representation Learning

Path-Adaptive Matting for Efficient Inference Under Various Computational Cost Constraints

Robust SAM: On the Adversarial Robustness of Vision Foundation Models

Generative Video Diffusion for Unseen Novel Semantic Video Moment Retrieval

VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting

Game4Loc: A UAV Geo-Localization Benchmark from Game Data

Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection

DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation

Pedestrian Attribute Recognition: A New Benchmark Dataset and a Large Language Model Augmented Framework

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning

U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation

UniDet3D: Multi-dataset Indoor 3D Object Detection

Do Not DeepFake Me: Privacy-Preserving Neural 3D Head Reconstruction Without Sensitive Images

Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training

Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space

MaskViM: Domain Generalized Semantic Segmentation with State Space Models

A Compact Implicit Neural Representation for Efficient Storage of Massive 4D Functional Magnetic Resonance Imaging

Transferable Adversarial Face Attack with Text Controlled Attribute

ProsodyTalker: 3D Visual Speech Animation via Prosody Decomposition

Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting

Disentangled Motion Modeling for Video Frame Interpolation

AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement

RemDet: Rethinking Efficient Model Design for UAV Object Detection

4D Diffusion for Dynamic Protein Structure Prediction with Reference and Motion Guidance

G2LDetect: A Global-to-Local Approach for Hallucination Detection

RingFormer: A Ring-Enhanced Graph Transformer for Organic Solar Cell Property Prediction

HeMeNet: Heterogeneous Multichannel Equivariant Network for Protein Multi-task Learning

Controllable Protein Sequence Generation with LLM Preference Optimization

DAMMFND: Domain-Aware Multimodal Multi-view Fake News Detection

M²N: A Progressive Macro-to-Micro 3D Modeling Scheme for Unveiling Drug-Target Affinity

Multi-modal Deepfake Detection via Multi-task Audio-Visual Prompt Learning

SpeHeaTal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis

Generalized Implicit Neural Representations for Dynamic Molecular Surface Modeling

MultiSFL: Towards Accurate Split Federated Learning via Multi-Model Aggregation and Knowledge Replay

Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting

Efficient Traffic Prediction Through Spatio-Temporal Distillation

Multi-Perspective Consolidation Enhanced Cognitive Diagnosis via Conditional Diffusion Model

Multi-View Incremental Learning with Structured Hebbian Plasticity for Enhanced Fusion Efficiency

Symbolic Functional Decomposition: A Reconfiguration Approach

Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision

Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition Through Contrastive Learning

ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind

MFOS: Model-Free & One-Shot Object Pose Estimation