Most Cited 2025 "physics-guided architecture" Papers

22,274 papers found • Page 99 of 112

#19601

Effective Diffusion Transformer Architecture for Image Super-Resolution

Kun Cheng, Lei Yu, Zhijun Tu et al.

AAAI 2025paperarXiv:2409.19589
#19602

Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation

Zesen Cheng, Kehan Li, Li Hao et al.

AAAI 2025paper
#19603

Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment

Zhixin Cheng, Jiacheng Deng, Xinjun Li et al.

AAAI 2025paperarXiv:2504.01641
#19604

Zero-Shot Scene Change Detection

Kyusik Cho, Dong Yeop Kim, Euntai Kim

AAAI 2025paperarXiv:2406.11210
#19605

Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting

Dasol Choi, Dongbin Na

AAAI 2025paperarXiv:2409.14747
#19606

SIDL: A Real-World Dataset for Restoring Smartphone Images with Dirty Lenses

Sooyoung Choi, Sungyong Park, Heewon Kim

AAAI 2025paper
#19607

Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces

Wonhyeok Choi, Kyumin Hwang, Minwoo Choi et al.

AAAI 2025paperarXiv:2503.22209
#19608

MASS: Overcoming Language Bias in Image-Text Matching

Jiwan Chung, Seungwon Lim, Sangkyu Lee et al.

AAAI 2025paperarXiv:2501.11469
#19609

AttackBench: Evaluating Gradient-based Attacks for Adversarial Examples

Antonio Emanuele Cinà, Jérôme Rony, Maura Pintor et al.

AAAI 2025paperarXiv:2404.19460
#19610

GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud

Tao Dai, Yanzi Wang, Jianyu Xiong et al.

AAAI 2025paper
#19611

Harmonious Music-driven Group Choreography with Trajectory-Controllable Diffusion

Yuqin Dai, Wanlu Zhu, Ronghui Li et al.

AAAI 2025paper
#19612

Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation

Quan Dao, Hao Phung, Trung Tuan Dao et al.

AAAI 2025paper
#19613

PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery

Shristi Das Biswas, Matthew Shreve, Xuelu Li et al.

AAAI 2025paperarXiv:2501.09826
#19614

Single Exposure Quantitative Phase Imaging with a Conventional Microscope Using Diffusion Models

Gabriel della Maggiora, Luis Alberto Croquevielle, Harry Horsley et al.

AAAI 2025paperarXiv:2406.04388
#19615

Deep Non-Rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling

Hui Deng, Jiawei Shi, Zhen Qin et al.

AAAI 2025paperarXiv:2412.07230
#19616

DiffCorr: Conditional Diffusion Model with Reliable Pseudo-Label Guidance for Unsupervised Point Cloud Shape Correspondence

Jiacheng Deng, Jiahao Lu, Zhixin Cheng et al.

AAAI 2025paper
#19617

Adaptive Siamese Masked Autoencoder with Global Optimization for Unsupervised Point Cloud Shape Correspondence

Jiacheng Deng, Jiahao Lu

AAAI 2025paper
#19618

OTIAS: OcTree Implicit Adaptive Sampling for Multispectral and Hyperspectral Image Fusion

Shangqi Deng, Jun Ma, Liang-Jian Deng et al.

AAAI 2025paper
#19619

Boundary-Aware Temporal Dynamic Pseudo-Supervision Pairs Generation for Zero-Shot Natural Language Video Localization

Xiongwen Deng, Haoyu Tang, Han Jiang et al.

AAAI 2025paper
#19620

Occlusion-Insensitive Talking Head Video Generation via Facelet Compensation

Yuhui Deng, Yuqin Lu, Yangyang Xu et al.

AAAI 2025paper
#19621

Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models

Guanqi Ding, Chengyu Yang, Shuhui Wang et al.

AAAI 2025paper
#19622

Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration

Yanbo Ding, Shaobin Zhuang, Kunchang Li et al.

AAAI 2025paperarXiv:2408.10605
#19623

AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds

Ziheng Ding, Xiaze Zhang, Qi Jing et al.

AAAI 2025paper
#19624

GarFast: Realistic and Fast Garment Transfer with a Simplified Parser-Free Approach

Chenghu Du, Junyin Wang, Yi Rong et al.

AAAI 2025paper
#19625

Latent Diffusion-Enhanced Virtual Try-On via Optimized Pseudo-Label Generation

Chenghu Du, Junyin Wang, Feng Yu et al.

AAAI 2025paper
#19626

HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions

Keyu Du, Hao Xu, Haipeng Li et al.

AAAI 2025paperarXiv:2503.07019
#19627

A Diffusion-Based Framework for Occluded Object Movement

Zheng-Peng Duan, Jiawei Zhang, Siyu Liu et al.

AAAI 2025paperarXiv:2504.01873
#19628

IniRetinex: Rethinking Retinex-type Low-Light Image Enhancer via Initialization Perspective

Guodong Fan, Zishu Yao, Guang-Yong Chen et al.

AAAI 2025paper
#19629

Vision-guided Text Mining for Unsupervised Cross-modal Hashing with Community Similarity Quantization

Haozhi Fan, Yuan Cao

AAAI 2025paper
#19630

EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs

Zhen Fan, Peng Dai, Zhuo Su et al.

AAAI 2025paperarXiv:2408.17168
#19631

CoSDA: Enhancing the Robustness of Inversion-based Generative Image Watermarking Framework

Han Fang, Kejiang Chen, Zijin Yang et al.

AAAI 2025paper
#19632

SSUN-Net: Spatial-Spectral Prior-Aware Unfolding Network for Pan-Sharpening

Shijie Fang, Hongping Gan

AAAI 2025paper
#19633

AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scenes

Chaoran Feng, Wangbo Yu, Xinhua Cheng et al.

AAAI 2025paperarXiv:2501.02807
#19634

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering

Chun-Mei Feng, Yang Bai, Tao Luo et al.

AAAI 2025paperarXiv:2312.12273
#19635

Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration

Siyang Feng, Huadeng Wang, Chu Han et al.

AAAI 2025paper
#19636

HDLayout: Hierarchical and Directional Layout Planning for Arbitrary Shaped Visual Text Generation

Tonghui Feng, Chunsheng Yan, Qianru Wang et al.

AAAI 2025paper
#19637

Simplifying Control Mechanism in Text-to-Image Diffusion Models

Zhida Feng, Li Chen, Yuenan Sun et al.

AAAI 2025paper
#19638

BGHR: Bridging the Gap Between HBox-Supervised and RBox-Supervised Oriented Object Detection via Adaptive Fine-Grained Sample Mining

Chenlin Fu, Yingying Zhu

AAAI 2025paper
#19639

Foundation Model Driven Appearance Extraction for Robust Multiple Object Tracking

Teng Fu, Haiyang Yu, Ke Niu et al.

AAAI 2025paper
#19640

MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform Watermark

Keke Gai, Dongjue Wang, Jing Yu et al.

AAAI 2025paper
#19641

DFDNet: Disentangling and Filtering Dynamics for Enhanced Video Prediction

Lianqiang Gan, Junyu Lai, Jingze Ju et al.

AAAI 2025paper
#19642

PNVC: Towards Practical INR-based Video Compression

Ge Gao, Ho Man Kwan, Fan Zhang et al.

AAAI 2025paperarXiv:2409.00953
#19643

AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context Learning

Jun Gao, Qian Qiao, Tianxiang Wu et al.

AAAI 2025paper
#19644

TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations

Mingze Gao, Jingyu Liu, Mingda Li et al.

AAAI 2025paper
#19645

EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction

Chengjie Ge, Xueyang Fu, Peng He et al.

AAAI 2025paperarXiv:2503.19721
#19646

Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning

Shiping Ge, Qiang Chen, Zhiwei Jiang et al.

AAAI 2025paperarXiv:2412.12791
#19647

ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis

Xinyu Geng, Jiaming Wang, Xiaolin Huang et al.

AAAI 2025paperarXiv:2411.01564
#19648

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning

Shengbo Gu, Yu-Kun Qiu, Yu-Ming Tang et al.

AAAI 2025paperarXiv:2502.02372
#19649

OT-StainNet: Optimal Transport Driven Semantic Matching for Weakly Paired H&E-to-IHC Stain Transfer

Xianchao Guan, Yifeng Wang, Ye Zhang et al.

AAAI 2025paper
#19650

You Should Learn to Stop Denoising on Point Clouds in Advance

Chuchen Guo, Weijie Zhou, Zheng Liu et al.

AAAI 2025paper
#19651

Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resection with Pringle Maneuver

Diandian Guo, Weixin Si, Zhixi Li et al.

AAAI 2025paperarXiv:2408.10538
#19652

Enhancing Low-Rank Adaptation with Recoverability-Based Reinforcement Pruning for Object Counting

Haojie Guo, Junyu Gao, Yuan Yuan

AAAI 2025paper
#19653

MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance

Jialong Guo, Ke Liu, Jiangchao Yao et al.

AAAI 2025paperarXiv:2501.02427
#19654

PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts

Kun Guo, Qiang Ling

AAAI 2025paperarXiv:2412.12460
#19655

OpenVIS: Open-vocabulary Video Instance Segmentation

Pinxue Guo, Hao Huang, Peiyang He et al.

AAAI 2025paperarXiv:2305.16835
#19656

SpikeGS: Reconstruct 3D Scene Captured by a Fast-Moving Bio-Inspired Camera

Yijia Guo, Liwen Hu, Yuanxi Bai et al.

AAAI 2025paper
#19657

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Yongxin Guo, Jingyu Liu, Mingda Li et al.

AAAI 2025paperarXiv:2405.13382
#19658

LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies

Ameer Hamza, Abdullah, Yong Hyun Ahn et al.

AAAI 2025paperarXiv:2410.04749
#19659

DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving

Wencheng Han, Dongqian Guo, Cheng-Zhong Xu et al.

AAAI 2025paperarXiv:2401.03641
#19660

ID-Sculpt: ID-aware 3D Head Generation from Single In-the-wild Portrait Image

Jinkun Hao, Junshu Tang, Jiangning Zhang et al.

AAAI 2025paperarXiv:2406.16710
#19661

Efficient Online Training for Zero-Shot Time-Lapse Microscopy Denoising and Super-Resolution

Ruian He, Ri Cheng, Xinkai Lyu et al.

AAAI 2025paper
#19662

MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement

Xu He, Zhiyong Wu, Xiaoyu Li et al.

AAAI 2025paperarXiv:2408.14211
#19663

Long-Tailed Out-of-Distribution Detection: Prioritizing Attention to Tail

Yina He, Lei Peng, Yongcun Zhang et al.

AAAI 2025paperarXiv:2408.06742
#19664

FashionTailor: Controllable Clothing Editing for Human Images with Appearance Preserving

Jie Hou, Jianghong Ma, Xiangyu Mu et al.

AAAI 2025paper
#19665

Prompt Tuning In a Compact Attribute Space

Shiyu Hou, Tianfei Zhou, Shuai Zhang et al.

AAAI 2025paper
#19666

BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation

Xiaolu Hou, Mingcheng Li, Dingkang Yang et al.

AAAI 2025paperarXiv:2501.10462
#19667

Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References

Teng-Fang Hsiao, Bo-Kai Ruan, Hong-Han Shuai

AAAI 2025paperarXiv:2404.12900
#19668

GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution

Jintong Hu, Bin Xia, Bin Chen et al.

AAAI 2025paperarXiv:2407.18046
#19669

VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

Qiang Hu, Houqiang Zhong, Zihan Zheng et al.

AAAI 2025paperarXiv:2412.11362
#19670

Identity-Text Video Corpus Grounding

Bin Huang, Xin Wang, Hong Chen et al.

AAAI 2025paper
#19671

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control

Binyuan Huang, Yuqing Wen, Yucheng Zhao et al.

AAAI 2025paperarXiv:2403.19438
#19672

Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening

Jie Huang, Rui Huang, Jinghao Xu et al.

AAAI 2025paperarXiv:2502.04903
#19673

AUTE: Peer-Alignment and Self-Unlearning Boost Adversarial Robustness for Training Ensemble Models

Lifeng Huang, Tian Su, Chengying Gao et al.

AAAI 2025paper
#19674

EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding

Muye Huang, Han Lai, Xinyu Zhang et al.

AAAI 2025paperarXiv:2409.01577
#19675

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Qihan Huang, Siming Fu, Jinlong Liu et al.

AAAI 2025paperarXiv:2409.17920
#19676

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Shaofei Huang, Rui Ling, Hongyu Li et al.

AAAI 2025paperarXiv:2408.15876
#19677

DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors

Tianyu Huang, Haoze Zhang, Yihan Zeng et al.

AAAI 2025paperarXiv:2406.01476
#19678

Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence

Wenbo Huang, Jinghui Zhang, Guang Li et al.

AAAI 2025paperarXiv:2412.07481
#19679

CLIP-RestoreX: Restore Image Structure and Perception in Exposure Correction

Xiang Huang, Qing Zhang, Jian-Fang Hu et al.

AAAI 2025paper
#19680

Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine

Xiaoshuang Huang, Lingdong Shen, Jia Liu et al.

AAAI 2025paperarXiv:2412.09278
#19681

PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration

Xiaoshui Huang, Zhou Huang, Yifan Zuo et al.

AAAI 2025paperarXiv:2501.07762
#19682

Medical MLLM Is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models

Xijie Huang, Xinyuan Wang, Hantao Zhang et al.

AAAI 2025paperarXiv:2405.20775
#19683

L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection

Xun Huang, Ziyu Xu, Hai Wu et al.

AAAI 2025paperarXiv:2408.03677
#19684

SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization

Yongle Huang, Haodong Chen, Zhenbang Xu et al.

AAAI 2025paperarXiv:2501.01245
#19685

PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model

Yunlong Huang, Junshuo Liu, Ke Xian et al.

AAAI 2025paperarXiv:2408.03540
#19686

EGSRAL:An Enhanced 3D Gaussian Splatting Based Renderer with Automated Labeling for Large-Scale Driving Scene

Yixiong Huo, Guangfeng Jiang, Hongyang Wei et al.

AAAI 2025paperarXiv:2412.15550
#19687

High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion

Junhwa Hur, Charles Herrmann, Saurabh Saxena et al.

AAAI 2025paperarXiv:2410.11838
#19688

Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior

Lee Hyoseok, Kyeong Seon Kim, Kwon Byung-Ki et al.

AAAI 2025paperarXiv:2502.06338
#19689

VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting

Muhammet Furkan Ilaslan, Ali Köksal, Kevin Qinghong Lin et al.

AAAI 2025paperarXiv:2412.11621
#19690

Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks

Alexander Jaus, Constantin Marc Seibold, Simon Reiß et al.

AAAI 2025paperarXiv:2410.18684
#19691

Game4Loc: A UAV Geo-Localization Benchmark from Game Data

Yuxiang Ji, Boyong He, Zhuoyue Tan et al.

AAAI 2025paperarXiv:2409.16925
#19692

Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection

Mingda Jia, Liming Zhao, Ge Li et al.

AAAI 2025paperarXiv:2412.08506
#19693

FlexiTex: Enhancing Texture Generation via Visual Guidance

Dadong Jiang, Xianghui Yang, Zibo Zhao et al.

AAAI 2025paperarXiv:2409.12431
#19694

ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling

Jianan Jiang, Hao Tang, Zhilin Jiang et al.

AAAI 2025paperarXiv:2406.11551
#19695

SCCS: Deep Neural Spectral Clustering for Self-Supervised Subcellular Structure Segmentation

Jimao Jiang, Diya Sun, Tianbing Wang et al.

AAAI 2025paper
#19696

Restabilizing Diffusion Models with Predictive Noise Fusion Strategy for Image Super-Resolution

Luoqian Jiang, Yong Guo, Bingna Xu et al.

AAAI 2025paper
#19697

Query Quantized Neural SLAM

Sijia Jiang, Jing Hua, Zhizhong Han

AAAI 2025paperarXiv:2412.16476
#19698

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

Can Jin, Tianjin Huang, Yihua Zhang et al.

AAAI 2025paperarXiv:2312.01397
#19699

Pedestrian Attribute Recognition: A New Benchmark Dataset and a Large Language Model Augmented Framework

Jiandong Jin, Xiao Wang, Qian Zhu et al.

AAAI 2025paperarXiv:2408.09720
#19700

A Method for Enhancing Generalization of Adam by Multiple Integrations

Long Jin, Han Nong, Liangming Chen et al.

AAAI 2025paperarXiv:2412.12473
#19701

Bridging the Semantic Granularity Gap Between Text and Frame Representations for Partially Relevant Video Retrieval

WooJin Jun, WonJun Moon, Cheol-Ho Cho et al.

AAAI 2025paper
#19702

CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis

Gyeongjin Kang, Younggeun Lee, Seungjun Oh et al.

AAAI 2025paperarXiv:2404.04913
#19703

DiffusionREC: Diffusion Model with Adaptive Condition for Referring Expression Comprehension

Jingcheng Ke, Waikeung Wong, Jia Wang et al.

AAAI 2025paper
#19704

PLATYPUS: Progressive Local Surface Estimator for Arbitrary-Scale Point Cloud Upsampling

Donghyun Kim, Hyeonkyeong Kwon, Yumin Kim et al.

AAAI 2025paperarXiv:2411.00432
#19705

Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration

Hyeonseok Kim, Byeongkeun Kang, Yeejin Lee

AAAI 2025paperarXiv:2509.08280
#19706

APR-RD: Complemental Two Steps for Self-Supervised Real Image Denoising

Hyunjun Kim, Nam Ik Cho

AAAI 2025paper
#19707

DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation

Jisoo Kim, Jungbin Cho, Joonho Park et al.

AAAI 2025paperarXiv:2408.06010
#19708

ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder

Jungho Kim, Changwon Kang, Dongyoung Lee et al.

AAAI 2025paperarXiv:2412.08774
#19709

MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation

Seyeon Kim, Siyoon Jin, Jihye Park et al.

AAAI 2025paperarXiv:2403.19144
#19710

TSDF-Based Efficient Motion-Compensated Temporal Interpolation for 3D Dynamic Sequences

Soowoong Kim, Minseong Kwon, Junho Choi et al.

AAAI 2025paper
#19711

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning

Taewhan Kim, Soeun Lee, Si-Woo Kim et al.

AAAI 2025paperarXiv:2412.19289
#19712

Sequence Matters: Harnessing Video Models in 3D Super-Resolution

Hyun-kyu Ko, Dongheok Park, Youngin Park et al.

AAAI 2025paperarXiv:2412.11525
#19713

UniDet3D: Multi-dataset Indoor 3D Object Detection

Maksim Kolodiazhnyi, Anna Vorontsova, Matvey Skripkin et al.

AAAI 2025paperarXiv:2409.04234
#19714

Do Not DeepFake Me: Privacy-Preserving Neural 3D Head Reconstruction Without Sensitive Images

Jiayi Kong, Xurui Song, Shuo Huai et al.

AAAI 2025paperarXiv:2312.04106
#19715

Real-Time Neural Denoising with Render-Aware Knowledge Distillation

Mengxun Kong, Jie Guo, Chen Wang et al.

AAAI 2025paper
#19716

Stable Mean Teacher for Semi-supervised Video Action Detection

Akash Kumar, Sirshapan Mitra, Yogesh Singh Rawat

AAAI 2025paperarXiv:2412.07072
#19717

A Unified Degradation-Robust Approach to SSL and UDA for 3D Medical Images

Suruchi Kumari, Pravendra Singh

AAAI 2025paper
#19718

SAFIRE: Segment Any Forged Image Region

Myung-Joon Kwon, Wonjun Lee, Seung-Hun Nam et al.

AAAI 2025paperarXiv:2412.08197
#19719

Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training

Yunwei Lan, Zhigao Cui, Chang Liu et al.

AAAI 2025paperarXiv:2503.15017
#19720

Color Transfer with Modulated Flows

Maria Larchenko, Alexander Lobashev, Dmitry Guskov et al.

AAAI 2025paperarXiv:2503.19062
#19721

Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space

Hyunjee Lee, Youngsik Yun, Jeongmin Bae et al.

AAAI 2025paperarXiv:2408.07416
#19722

NBA3D: Neighbor-Based Confidence Adjustment for 3D Rare Object Detection Using LiDAR

Jooyoung Lee, Jaeyoon Lee, Jongwon Choi

AAAI 2025paper
#19723

MAMS: Model-Agnostic Module Selection Framework for Video Captioning

Sangho Lee, Il Yong Chun, Hogun Park

AAAI 2025paperarXiv:2501.18269
#19724

Enabling Region-Specific Control via Lassos in Point-Based Colorization

Sanghyeon Lee, Jooyeol Yun, Jaegul Choo

AAAI 2025paperarXiv:2412.13469
#19725

Concept Matching with Agent for Out-of-Distribution Detection

Yuxiao Lee, Xiaofeng Cao, Jingcai Guo et al.

AAAI 2025paperarXiv:2405.16766
#19726

FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-from-gradients

Jiaqi Leng, Yakun Ju, Yuanxu Duan et al.

AAAI 2025paper
#19727

Disentangled Motion Modeling for Video Frame Interpolation

Jaihyun Lew, Jooyoung Choi, Chaehun Shin et al.

AAAI 2025paperarXiv:2406.17256
#19728

StyO: Stylize Your Face in Only One-Shot

Bonan Li, Zicheng Zhang, Xuecheng Nie et al.

AAAI 2025paperarXiv:2303.03231
#19729

FEAST-Mamba: FEAture and SpaTial Aware Mamba Network with Bidirectional Orthogonal Fusion for Cross-Modal Point Cloud Segmentation

Chade Li, Pengju Zhang, Bo Liu et al.

AAAI 2025paper
#19730

RemDet: Rethinking Efficient Model Design for UAV Object Detection

Chen Li, Rui Zhao, Zeyu Wang et al.

AAAI 2025paperarXiv:2412.10040
#19731

U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation

Chenxin Li, Xinyu Liu, Wuyang Li et al.

AAAI 2025paperarXiv:2406.02918
#19732

Consistency of Compositional Generalization Across Multiple Levels

Chuanhao Li, Zhen Li, Chenchen Jing et al.

AAAI 2025paperarXiv:2412.13636
#19733

An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques

Chunxiao Li, Xiaoxiao Wang, Boming Miao et al.

AAAI 2025paperarXiv:2412.09063
#19734

Cascaded Diffusion Models for Virtual Try-On: Improving Control and Resolution

Guangyuan Li, Yongkang Wang, Junsheng Luan et al.

AAAI 2025paper
#19735

MaskViM: Domain Generalized Semantic Segmentation with State Space Models

Jiahao Li, Yang Lu, Yuan Xie et al.

AAAI 2025paper
#19736

Know Where You Are From: Event-Based Segmentation via Spatio-Temporal Propagation

Ke Li, Gengyu Lyu, Hao Chen et al.

AAAI 2025paper
#19737

Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization

Maodong Li, Chao Zheng, Jian Wang et al.

AAAI 2025paper
#19738

REGNav: Room Expert Guided Image-Goal Navigation

Pengna Li, Kangyi Wu, Jingwen Fu et al.

AAAI 2025paperarXiv:2502.10785
#19739

Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning

Rong Li, Liang Li, Jiehua Zhang et al.

AAAI 2025paper
#19740

Enhancing Generalizability via Utilization of Unlabeled Data for Occupancy Perception

Ruihang Li, Tao Li, Shanding Ye et al.

AAAI 2025paper
#19741

A Compact Implicit Neural Representation for Efficient Storage of Massive 4D Functional Magnetic Resonance Imaging

Ruoran Li, Runzhao Yang, Wenxin Xiang et al.

AAAI 2025paperarXiv:2312.00082
#19742

DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs

Shiyu Li, Pengxu Wei, Pengchong Qiao et al.

AAAI 2025paper
#19743

Transferable Adversarial Face Attack with Text Controlled Attribute

Wenyun Li, Zheng Zhang, Xiangyuan Lan et al.

AAAI 2025paperarXiv:2412.11735
#19744

MambaLCT: Boosting Tracking via Long-term Context State Space Model

Xiaohai Li, Bineng Zhong, Qihua Liang et al.

AAAI 2025paperarXiv:2412.13615
#19745

PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium

Xinzhe Li, Jiahui Zhan, Shengfeng He et al.

AAAI 2025paperarXiv:2412.15674
#19746

Mamba-CAD: State Space Model for 3D Computer-Aided Design Generative Modeling

Xueyang Li, Yunzhong Lou, Yu Song et al.

AAAI 2025paper
#19747

StructSR: Refuse Spurious Details in Real-World Image Super-Resolution

Yachao Li, Dong Liang, Tianyu Ding et al.

AAAI 2025paperarXiv:2501.05777
#19748

Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study

Zhangheng Li, Tianlong Chen, Linyi Li et al.

AAAI 2025paper
#19749

ProsodyTalker: 3D Visual Speech Animation via Prosody Decomposition

Zonglin Li, Xiaoqian Lv, Qinglin Liu et al.

AAAI 2025paper
#19750

Exploring the Potential of Large Vision-Language Models for Unsupervised Text-Based Person Retrieval

Zongyi Li, Li Jianbo, Yuxuan Shi et al.

AAAI 2025paper
#19751

Semantic-guided Masked Mutual Learning for Multi-modal Brain Tumor Segmentation with Arbitrary Missing Modalities

Guoyan Liang, Qin Zhou, Zhe Wang et al.

AAAI 2025paperarXiv:2507.07592
#19752

Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion

Li Liang, Naveed Akhtar, Jordan Vice et al.

AAAI 2025paperarXiv:2501.07260
#19753

S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field

Zixi Liang, Guowei Xu, Haifeng Wu et al.

AAAI 2025paperarXiv:2412.17561
#19754

Progressive Distribution Matching for Federated Semi-Supervised Learning

Dongping Liao, Xitong Gao, Yabo Xu et al.

AAAI 2025paper
#19755

Multi-Granularity Video Object Segmentation

Sangbeom Lim, Seongchan Kim, Seungjun An et al.

AAAI 2025paperarXiv:2412.01471
#19756

DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder

Ente Lin, Xujie Zhang, Fuwei Zhao et al.

AAAI 2025paperarXiv:2412.17644
#19757

Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting

Jiaqi Lin, Zhihao Li, Binxiao Huang et al.

AAAI 2025paperarXiv:2501.10788
#19758

InvSeg: Test-Time Prompt Inversion for Semantic Segmentation

Jiayi Lin, Jiabo Huang, Jian Hu et al.

AAAI 2025paperarXiv:2410.11473
#19759

Memory Efficient Matting with Adaptive Token Routing

Yiheng Lin, Yihan Hu, Chenyi Zhang et al.

AAAI 2025paperarXiv:2412.10702
#19760

AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement

Yunlong Lin, Tian Ye, Sixiang Chen et al.

AAAI 2025paperarXiv:2407.14900
#19761

Deep Hierarchies and Invariant Disease-Indicative Feature Learning for Computer Aided Diagnosis of Multiple Fundus Diseases

Yuxin Lin, Wei Wang, Xiaoling Luo et al.

AAAI 2025paper
#19762

Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference

Zhihang Lin, Mingbao Lin, Luxi Lin et al.

AAAI 2025paperarXiv:2405.05803
#19763

SOVGaussian: Sparse-View 3D Gaussian Splatting for Open-Vocabulary Scene Understanding

Peng Ling, Tiao Tan, Jiaqi Lin et al.

AAAI 2025paper
#19764

Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations

Decheng Liu, Zongqi Wang, Chunlei Peng et al.

AAAI 2025paperarXiv:2407.14367
#19765

UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer

Delong Liu, Zhaohui Hou, Mingjie Zhan et al.

AAAI 2025paperarXiv:2412.09389
#19766

Zero-Shot Noise2Mean: Gap Minimization for Efficient Denoising from a Single Noisy Image

Duo Liu, Yiqi Shi, Guoyin Zhang et al.

AAAI 2025paper
#19767

SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Hongjian Liu, Qingsong Xie, Tianxiang Ye et al.

AAAI 2025paperarXiv:2403.01505
#19768

PEIE: Physics Embedded Illumination Estimation for Adaptive Dehazing

Huaizhuo Liu, Hai-Miao Hu, Yonglong Jiang et al.

AAAI 2025paper
#19769

TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation

Jiajie Liu, Mengyuan Liu, Hong Liu et al.

AAAI 2025paperarXiv:2501.01770
#19770

Union Is Strength! Unite the Power of LLMs and MLLMs for Chart Question Answering

Jiapeng Liu, Liang Li, Shihao Rao et al.

AAAI 2025paper
#19771

UP-Restorer: When Unrolling Meets Prompts for Unified Image Restoration

Minghao Liu, Wenhan Yang, Jinyi Luo et al.

AAAI 2025paper
#19772

Path-Adaptive Matting for Efficient Inference Under Various Computational Cost Constraints

Qinglin Liu, Zonglin Li, Xiaoqian Lv et al.

AAAI 2025paperarXiv:2503.03228
#19773

DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments

Shuhong Liu, Xiang Chen, Hongming Chen et al.

AAAI 2025paperarXiv:2408.11540
#19774

VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization

Tao Liu, Ziyang Ma, Qi Chen et al.

AAAI 2025paperarXiv:2412.09892
#19775

Multi-view Consistent 3D Panoptic Scene Understanding

Xianzhu Liu, Xin Sun, Haozhe Xie et al.

AAAI 2025paper
#19776

Unlocking the Potential of Reverse Distillation for Anomaly Detection

Xinyue Liu, Jianyuan Wang, Biao Leng et al.

AAAI 2025paperarXiv:2412.07579
#19777

Unveiling the Knowledge of CLIP for Training-Free Open-Vocabulary Semantic Segmentation

Yajie Liu, Guodong Wang, Jinjin Zhang et al.

AAAI 2025paper
#19778

DoGA: Enhancing Grounded Object Detection via Grouped Pre-Training with Attributes

Yang Liu, Feng Hou, Yunjie Peng et al.

AAAI 2025paper
#19779

Towards Robust Visual Question Answering via Prompt-Driven Geometric Harmonization

Yishu Liu, Jiawei Zhu, Congcong Wen et al.

AAAI 2025paper
#19780

See Through Their Minds: Learning Transferable Brain Decoding Models from Cross-Subject fMRI

Yulong Liu, Yongqiang Ma, Guibo Zhu et al.

AAAI 2025paper
#19781

SCOPE: Sign Language Contextual Processing with Embedding from LLMs

Yuqi Liu, Wenqian Zhang, Sihan Ren et al.

AAAI 2025paperarXiv:2409.01073
#19782

Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning

Yuti Liu, Shice Liu, Junyuan Gao et al.

AAAI 2025paperarXiv:2412.11952
#19783

Training Verification-Friendly Neural Networks via Neuron Behavior Consistency

Zongxin Liu, Zhe Zhao, Fu Song et al.

AAAI 2025paperarXiv:2412.13229
#19784

Robust SAM: On the Adversarial Robustness of Vision Foundation Models

Jiahuan Long, Zhengqin Xu, Tingsong Jiang et al.

AAAI 2025paperarXiv:2504.08906
#19785

RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba

Andong Lu, Wanyu Wang, Chenglong Li et al.

AAAI 2025paperarXiv:2408.08827
#19786

Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators

Bin Lu, Xinyu Xiao, Changzhou Zhang et al.

AAAI 2025paper
#19787

DeMo: Deep Motion Field Consensus with Learnable Kernels for Two-view Correspondence Learning

Yifan Lu, Jiajun Le, Zizhuo Li et al.

AAAI 2025paper
#19788

Generative Video Diffusion for Unseen Novel Semantic Video Moment Retrieval

Dezhao Luo, Shaogang Gong, Jiabo Huang et al.

AAAI 2025paperarXiv:2401.13329
#19789

Beyond Pixel and Object: Part Feature as Reference for Few-Shot Video Object Segmentation

Naisong Luo, Guoxin Xiong, Tianzhu Zhang

AAAI 2025paper
#19790

Privacy-Preserving Low-Rank Adaptation Against Membership Inference Attacks for Latent Diffusion Models

Zihao Luo, Xilie Xu, Feng Liu et al.

AAAI 2025paperarXiv:2402.11989
#19791

Revisiting Change Captioning from Self-supervised Global-Part Alignment

Feixiao Lv, Rui Wang, Lihua Jing

AAAI 2025paper
#19792

ScaleMatch: Multi-scale Consistency Enhancement for Semi-supervised Semantic Segmentation

Liang Lv, Lefei Zhang

AAAI 2025paper
#19793

Step-Calibrated Diffusion for Biomedical Optical Image Restoration

Yiwei Lyu, Sung Jik Cha, Cheng Jiang et al.

AAAI 2025paperarXiv:2403.13680
#19794

Aligning and Prompting Anything for Zero-Shot Generalized Anomaly Detection

Jitao Ma, Weiying Xie, Hangyu Ye et al.

AAAI 2025paper
#19795

Does VLM Classification Benefit from LLM Description Semantics?

Pingchuan Ma, Lennart Rietdorf, Dmytro Kotovenko et al.

AAAI 2025paperarXiv:2412.11917
#19796

Instruct Where the Model Fails: Generative Data Augmentation via Guided Self-contrastive Fine-tuning

Weijian Ma, Ruoxin Chen, Keyue Zhang et al.

AAAI 2025paper
#19797

A Trusted Lesion-assessment Network for Interpretable Diagnosis of Coronary Artery Disease in Coronary CT Angiography

Xinghua Ma, Xinyan Fang, Mingye Zou et al.

AAAI 2025paper
#19798

Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts

Yue Ma, Yingqing He, Hongfa Wang et al.

AAAI 2025paper
#19799

Few-Shot Fine-Grained Image Classification with Progressively Feature Refinement and Continuous Relationship Modeling

Zhen-Xiang Ma, Zhen-Duo Chen, Tai Zheng et al.

AAAI 2025paper
#19800

OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem

Xinji Mai, Haoran Wang, Zeng Tao et al.

AAAI 2025paper