Most Cited AAAI "neural pde solver" Papers

5,317 papers found • Page 18 of 27

#3401

RefDetector: A Simple Yet Effective Matching-based Method for Referring Expression Comprehension

Yabing Wang, Zhuotao Tian, Zheng Qin et al.

AAAI 2025paper
#3402

Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension

Yaxian Wang, Henghui Ding, Shuting He et al.

AAAI 2025paperarXiv:2501.01416
#3403

Breaking Barriers in Physical-World Adversarial Examples: Improving Robustness and Transferability via Robust Feature

Yichen Wang, Yuxuan Chou, Ziqi Zhou et al.

AAAI 2025paperarXiv:2412.16958
#3404

Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units

Youjia Wang, Yiwen Wu, Hengan Zhou et al.

AAAI 2025paperarXiv:2402.03944
#3405

Re-Attentional Controllable Video Diffusion Editing

Yuanzhi Wang, Yong Li, Mengyi Liu et al.

AAAI 2025paperarXiv:2412.11710
#3406

MambaPro: Multi-Modal Object Re-identification with Mamba Aggregation and Synergistic Prompt

Yuhao Wang, Xuehu Liu, Tianyu Yan et al.

AAAI 2025paperarXiv:2412.10707
#3407

IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis

Yuji Wang, Jingchen Ni, Yong Liu et al.

AAAI 2025paperarXiv:2503.00936
#3408

Target Scanpath-Guided 360-Degree Image Enhancement

Yujia Wang, Fang-Lue Zhang, Neil A. Dodgson

AAAI 2025paper
#3409

DualNet: Robust Self-Supervised Stereo Matching with Pseudo-Label Supervision

Yun Wang, Jiahao Zheng, Chenghao Zhang et al.

AAAI 2025paper
#3410

Mamba YOLO: A Simple Baseline for Object Detection with State Space Model

Zeyu Wang, Chen Li, Huiying Xu et al.

AAAI 2025paperarXiv:2406.05835
#3411

Style Nursing with Spatial and Semantic Guidance for Zero-Shot Traffic Scene Style Transfer

Zhen Wang, Zihang Lin, Meng Yuan et al.

AAAI 2025paper
#3412

Thermal-Aware Low-Light Image Enhancement: A Real-World Benchmark and a New Light-Weight Model

Zhen Wang, Yaozu Wu, Dongyuan Li et al.

AAAI 2025paper
#3413

Attention-Imperceptible Backdoor Attacks on Vision Transformers

Zhishen Wang, Rui Wang, Lihua Jing

AAAI 2025paper
#3414

LLM-RG4: Flexible and Factual Radiology Report Generation Across Diverse Input Contexts

Zhuhao Wang, Yihua Sun, Zihan Li et al.

AAAI 2025paperarXiv:2412.12001
#3415

MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds

Zihao Wang, Yiming Huang, Gengyu Lyu et al.

AAAI 2025paper
#3416

GlyphSR: A Simple Glyph-Aware Framework for Scene Text Image Super-Resolution

Baole Wei, Yuxuan Zhou, Liangcai Gao et al.

AAAI 2025paper
#3417

Power of Diversity: Enhancing Data-Free Black-Box Attack with Domain-Augmented Learning

Yang Wei, Jingyu Tan, Guowen Xu et al.

AAAI 2025paper
#3418

Achieving Lightweight Super-Resolution for Real-Time Computer Graphics

Yu Wen, Chen Zhang, Chenhao Xie et al.

AAAI 2025paper
#3419

Multi-axis Prompt and Multi-dimension Fusion Network for All-in-one Weather-degraded Image Restoration

Yuanbo Wen, Tao Gao, Jing Zhang et al.

AAAI 2025paper
#3420

USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation

Wanjiang Weng, Hongsong Wang, Junbo Wang et al.

AAAI 2025paperarXiv:2412.09220
#3421

Spin: Diffusion-based Semantic Image Painting Through Independent Information Injection

Dantong Wu, Zhiqiang Chen, Tianjiao Du et al.

AAAI 2025paper
#3422

Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation

Dongyue Wu, Zilin Guo, Li Yu et al.

AAAI 2025paperarXiv:2412.12672
#3423

SVRMamba: Slice-to-Volume Reconstruction from Multiple MRI Stacks with Slice Sequence Guided Mamba

Jiangjie Wu, Hongjiang Wei, Yuyao Zhang

AAAI 2025paper
#3424

VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval

Peng Wu, Wanshun Su, Xiangteng He et al.

AAAI 2025paper
#3425

Realistic Noise Synthesis with Diffusion Models

Qi Wu, Mingyan Han, Ting Jiang et al.

AAAI 2025paperarXiv:2305.14022
#3426

PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening

RuoCheng Wu, Zien Zhang, Shangqi Deng et al.

AAAI 2025paperarXiv:2409.06980
#3427

Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning

Shengqiong Wu, Hao Fei, Liangming Pan et al.

AAAI 2025paperarXiv:2412.11124
#3428

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities

Tao Wu, Yong Zhang, Xintao Wang et al.

AAAI 2025paperarXiv:2408.13239
#3429

Deconfound Semantic Shift and Incompleteness in Incremental Few-shot Semantic Segmentation

Yirui Wu, Yuhang Xia, Hao Li et al.

AAAI 2025paper
#3430

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.

AAAI 2025paperarXiv:2412.08879
#3431

MUCD: Unsupervised Point Cloud Change Detection via Masked Consistency

Yue Wu, Zhipeng Wang, Yongzhe Yuan et al.

AAAI 2025paper
#3432

Unified Knowledge Maintenance Pruning and Progressive Recovery with Weight Recalling for Large Vision-Language Models

Zimeng Wu, Jiaxin Chen, Yunhong Wang

AAAI 2025paper
#3433

RETRACTED: GEONet: Global Enhancement and Optimization Network for Lane Detection

Suyang Xi, Yunhao Liu, Hong Ding et al.

AAAI 2025paper
#3434

PlaNet: Learning to Mitigate Atmospheric Turbulence in Planetary Images

Yifei Xia, Chu Zhou, Chengxuan Zhu et al.

AAAI 2025paper
#3435

CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing

Xiaole Xian, Xilin He, Zenghao Niu et al.

AAAI 2025paperarXiv:2412.13565
#3436

ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters

Xunzhi Xiang, Haiwei Xue, Zonghong Dai et al.

AAAI 2025paper
#3437

SMR-Net: Semantic-Guided Mutually Reinforcing Network for Cross-Modal Image Fusion and Salient Object Detection

Guobao Xiao, Xinyu Liu, Zebin Lin et al.

AAAI 2025paper
#3438

Boosting Vision State Space Model with Fractal Scanning

Haoke Xiao, Lv Tang, Peng-tao Jiang et al.

AAAI 2025paper
#3439

Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval

Jian Xiao, Zhenzhen Hu, Jia Li et al.

AAAI 2025paperarXiv:2410.06618
#3440

Cross-modulated Attention Transformer for RGBT Tracking

Yun Xiao, Jiacong Zhao, Andong Lu et al.

AAAI 2025paperarXiv:2408.02222
#3441

Omni-Query Active Learning for Source-Free Domain Adaptive Cross-Modality 3D Semantic Segmentation

Jianxiang Xie, Yao Wu, Yachao Zhang et al.

AAAI 2025paper
#3442

TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning

Jingjing Xie, Yuxin Zhang, Jun Peng et al.

AAAI 2025paperarXiv:2412.08176
#3443

Discrete Prior-Based Temporal-Coherent Content Prediction for Blind Face Video Restoration

Lianxin Xie, Bingbing Zheng, Wen Xue et al.

AAAI 2025paperarXiv:2501.09960
#3444

Expand VSR Benchmark for VLLM to Expertize in Spatial Rules

Peijin Xie, Lin Sun, Bingquan Liu et al.

AAAI 2025paperarXiv:2412.18224
#3445

PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis

Yifan Xie, Tao Feng, Xin Zhang et al.

AAAI 2025paperarXiv:2412.08504
#3446

HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models

Zhifeng Xie, Hao Li, Huiming Ding et al.

AAAI 2025paperarXiv:2401.07450
#3447

Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation

Jingqiao Xiu, Mengze Li, Zongxin Yang et al.

AAAI 2025paper
#3448

DiffScene: Diffusion-Based Safety-Critical Scenario Generation for Autonomous Vehicles

Chejian Xu, Aleksandr Petiushko, Ding Zhao et al.

AAAI 2025paper
#3449

FR²Seg: Continual Segmentation Across Multiple Sites via Fourier Style Replay and Adaptive Consistency Regularization

Cheng Xu, Weiwen Zhang, Hongrui Zhang et al.

AAAI 2025paper
#3450

Less Is More: Token Context-Aware Learning for Object Tracking

Chenlong Xu, Bineng Zhong, Qihua Liang et al.

AAAI 2025paperarXiv:2501.00758
#3451

3DHumanEdit: Multi-modal Body Part-aware Conditioning Information Integration for 3D Human Manipulation

FeiFan Xu, Tianyi Chen, Fan Yang et al.

AAAI 2025paper
#3452

Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model

Jiahua Xu, Dawei Zhou, Lei Hu et al.

AAAI 2025paperarXiv:2412.07590
#3453

OmniSR: Shadow Removal Under Direct and Indirect Lighting

Jiamin Xu, Zelong Li, Yuxin Zheng et al.

AAAI 2025paperarXiv:2410.01719
#3454

Multiple Feature Refining Network for Visual Emotion Distribution Learning

Qinfu Xu, Shaozu Yuan, Yiwei Wei et al.

AAAI 2025paper
#3455

SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection

Ruoyu Xu, Zhiyu Xiang, Chenwei Zhang et al.

AAAI 2025paperarXiv:2412.14571
#3456

LiON: Learning Point-Wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data

Shaocong Xu, Pengfei Li, Qianpu Sun et al.

AAAI 2025paperarXiv:2309.10230
#3457

Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models

Yifang Xu, Yunzhuo Sun, Benxiang Zhai et al.

AAAI 2025paperarXiv:2501.07972
#3458

HOIMamba: Efficient Mamba-based Disentangled Progressive Learning for HOI Detection

Yongchao Xu, Jiawei Liu, Sen Tao et al.

AAAI 2025paper
#3459

OOTDiffusion: Outfitting Fusion Based Latent Diffusion for Controllable Virtual Try-On

Yuhao Xu, Tao Gu, Weifeng Chen et al.

AAAI 2025paperarXiv:2403.01779
#3460

FLAME: Learning to Navigate with Multimodal LLM in Urban Environments

Yunzhe Xu, Yiyuan Pan, Zhe Liu et al.

AAAI 2025paperarXiv:2408.11051
#3461

FATE: Feature-Adapted Parameter Tuning for Vision-Language Models

Zhengqin Xu, Zelin Peng, Xiaokang Yang et al.

AAAI 2025paper
#3462

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP

Zhongxing Xu, Feilong Tang, Zhe Chen et al.

AAAI 2025paperarXiv:2412.19650
#3463

RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting

Wen Xue, Chun Ding, Ruotao Xu et al.

AAAI 2025paper
#3464

Physical Marker: Revealing Invisible Hyperlinks Hidden in Printed Trademarks

Yuliang Xue, Lei Tan, Guobiao Li et al.

AAAI 2025paper
#3465

Towards Universal Rainy Image Restoration: Benchmark and Baseline

Hujie Yan

AAAI 2025paper
#3466

SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation

Ke Yan, Qing Cai, Fan Zhang et al.

AAAI 2025paperarXiv:2412.15526
#3467

Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models

YangTian Yan, Jinyu Tian

AAAI 2025paperarXiv:2503.22205
#3468

Robust Image Hashing Based on Contrastive Masked Autoencoder with Weak-Strong Augmentation Alignment

Cundian Yang, Guibo Luo, Yuesheng Zhu et al.

AAAI 2025paper
#3469

PlanLLM: Video Procedure Planning with Refinable Large Language Models

Dejie Yang, Zijing Zhao, Yang Liu

AAAI 2025paperarXiv:2412.19139
#3470

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection

Enquan Yang, Peng Xing, Hanyang Sun et al.

AAAI 2025paper
#3471

Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution

Jiarui Yang, Tao Dai, Yufei Zhu et al.

AAAI 2025paperarXiv:2412.16552
#3472

SMamba: Sparse Mamba for Event-based Object Detection

Nan Yang, Yang Wang, Zhanwen Liu et al.

AAAI 2025paperarXiv:2501.11971
#3473

One-Shot Reference-based Structure-Aware Image to Sketch Synthesis

Rui Yang, Honghong Yang, Li Zhao et al.

AAAI 2025paper
#3474

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding

Senqiao Yang, Jiaming Liu, Renrui Zhang et al.

AAAI 2025paperarXiv:2312.14074
#3475

Asymmetric Hierarchical Difference-aware Interaction Network for Event-guided Motion Deblurring

Wen Yang, Jinjian Wu, Leida Li et al.

AAAI 2025paper
#3476

Dual Information Purification for Lightweight SAR Object Detection

Xi Yang, Jiachen Sun, Songsong Duan et al.

AAAI 2025paper
#3477

DriveGazen: Event-Based Driving Status Recognition Using Conventional Camera

Xiaoyin Yang, Xin Yang

AAAI 2025paperarXiv:2412.11753
#3478

Semantic Segmentation on Raindrop Degraded Images Using Two-Stage Dual Teacher-Student Learning

Xin Yang, Wending Yan, Yuan Yuan et al.

AAAI 2025paper
#3479

ERF: A Benchmark Dataset for Robust Semantic Segmentation Under Extreme Rainfall Conditions

Xin Yang, Xin Zhang, Xinchao Wang

AAAI 2025paper
#3480

FreqTS: Frequency-Aware Token Selection for Accelerating Diffusion Models

Xinye Yang, Yuxin Yang, Haoran Pang et al.

AAAI 2025paper
#3481

Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving

Yu Yang, Jianbiao Mei, Yukai Ma et al.

AAAI 2025paperarXiv:2408.14197
#3482

UAWTrack: Universal 3D Single Object Tracking in Adverse Weather

Yuxiang Yang, Hongjie Gu, Yingqi Deng et al.

AAAI 2025paper
#3483

RealPortrait: Realistic Portrait Animation with Diffusion Transformers

Zejun Yang, Huawei Wei, Zhisheng Wang

AAAI 2025paper
#3484

Single Image Rolling Shutter Removal with Diffusion Models

Zhanglei Yang, Haipeng Li, Mingbo Hong et al.

AAAI 2025paperarXiv:2407.02906
#3485

MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation

Zhifei Yang, Keyang Lu, Chao Zhang et al.

AAAI 2025paperarXiv:2502.05874
#3486

MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation

Zhiwei Yang, Yucong Meng, Kexue Fu et al.

AAAI 2025paperarXiv:2412.11076
#3487

MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking

Mufeng Yao, Jinlong Peng, Qingdong He et al.

AAAI 2025paper
#3488

As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection

Shuilian Yao, Yu Liu, Qi Jia et al.

AAAI 2025paper
#3489

Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation

Chengyang Ye, Yunzhi Zhuge, Pingping Zhang

AAAI 2025paperarXiv:2412.19492
#3490

VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement

Haocun Ye, Xinlong Jiang, Chenlong Gao et al.

AAAI 2025paper
#3491

PromptHaze: Prompting Real-world Dehazing via Depth Anything Model

Tian Ye, Sixiang Chen, Haoyu Chen et al.

AAAI 2025paper
#3492

Optimized Gradient Clipping for Noisy Label Learning

Xichen Ye, Yifan Wu, Weizhong Zhang et al.

AAAI 2025paperarXiv:2412.08941
#3493

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language

Jeong Hun Yeo, Chae Won Kim, Hyunjun Kim et al.

AAAI 2025paperarXiv:2409.00986
#3494

FlexDataset: Crafting Annotated Dataset Generation for Diverse Applications

Ellen Yi-Ge, Leo Shawn

AAAI 2025paper
#3495

ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition

Seungdong Yoa, Seungjun Lee, Hye-Seung Cho et al.

AAAI 2025paperarXiv:2412.16491
#3496

FOCUS: Towards Universal Foreground Segmentation

Zuyao You, Lingyu Kong, Lingchen Meng et al.

AAAI 2025paperarXiv:2501.05238
#3497

SGFormer: Semantic-Geometry Fusion Transformer for Multi-modal 3D Panoptic Segmentation

Hongqi Yu, Sixian Chan, Xiaolong Zhou et al.

AAAI 2025paper
#3498

Separating the Wheat from the Chaff: Spatio-Temporal Transformer with View-interweaved Attention for Photon-Efficient Depth Sensing

Letian Yu, Jiaxi Yang, Bo Dong et al.

AAAI 2025paper
#3499

ReMoGPT: Part-Level Retrieval-Augmented Motion-Language Models

Qing Yu, Mikihiro Tanaka, Kent Fujiwara

AAAI 2025paper
#3500

STGC-NeRF: Spatial-Temporal Geometric Consistency for LiDAR Neural Radiance Fields in Dynamic Scenes

Shangshu Yu, Xiaotian Sun, Wen Li et al.

AAAI 2025paper
#3501

KeyPose: Category-Level 6D Object Pose Estimation with Self-Adaptive Keypoints

Sheng Yu, Di-Hua Zhai, Yuanqing Xia

AAAI 2025paper
#3502

Fine-grained Adaptive Visual Prompt for Generative Medical Visual Question Answering

Ting Yu, Zixuan Tong, Jun Yu et al.

AAAI 2025paper
#3503

OTPNet: ODE-inspired Tuning-free Proximal Network for Remote Sensing Image Fusion

Wei Yu, Zonglin Li, Qinglin Liu et al.

AAAI 2025paper
#3504

Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective

Xinmiao Yu, Xiaocheng Feng, Yun Li et al.

AAAI 2025paperarXiv:2412.17787
#3505

Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP

Yating Yu, Congqi Cao, Yueran Zhang et al.

AAAI 2025paperarXiv:2412.09895
#3506

OLMD: Orientation-aware Long-term Motion Decoupling for Continuous Sign Language Recognition

Yiheng Yu, Sheng Liu, Yuan Feng et al.

AAAI 2025paperarXiv:2503.08205
#3507

Where Precision Meets Efficiency: Transformation Diffusion Model for Point Cloud Registration

Yongzhe Yuan, Yue Wu, Xiaolong Fan et al.

AAAI 2025paper
#3508

Efficient Neural Network Encoding for 3D Color Lookup Tables

Vahid Zehtab, David B. Lindell, Marcus A. Brubaker et al.

AAAI 2025paperarXiv:2412.15438
#3509

Gaze Label Alignment: Alleviating Domain Shift for Gaze Estimation

Guanzhong Zeng, Jingjing Wang, Zefu Xu et al.

AAAI 2025paperarXiv:2412.15601
#3510

TGFormer: Transformer with Track Query Group for Multi-Object Tracking

Rui Zeng, Yuanzhou Huang, Songwei Pei

AAAI 2025paper
#3511

Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian Splatting

Zhaojie Zeng, Yuesong Wang, Lili Ju et al.

AAAI 2025paperarXiv:2503.07000
#3512

World Knowledge-Enhanced Reasoning Using Instruction-Guided Interactor in Autonomous Driving

Mingliang Zhai, Cheng Li, Zengyuan Guo et al.

AAAI 2025paperarXiv:2412.06324
#3513

DetRF: Detachable Novel Views Synthesis of Dynamic Scenes Using Backdrop-Driven Neural Radiance Fields

Boyu Zhang, Zheng Zhu, Wenbo Xu

AAAI 2025paper
#3514

Training-Free and Hardware-Friendly Acceleration for Diffusion Models via Similarity-based Token Pruning

Evelyn Zhang, Jiayi Tang, Xuefei Ning et al.

AAAI 2025paper
#3515

When Open-Vocabulary Visual Question Answering Meets Causal Adapter: Benchmark and Approach

Feifei Zhang, Zhaoyi Zhang, Xi Zhang et al.

AAAI 2025paper
#3516

DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming

Jiaxin Zhang, Wentao Yang, Songxuan Lai et al.

AAAI 2025paperarXiv:2406.19101
#3517

Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm

Jinrong Zhang, Penghui Wang, Chunxiao Liu et al.

AAAI 2025paperarXiv:2412.10719
#3518

R^2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy

Li Zhang, Haonan Jiang, Yukang Huo et al.

AAAI 2025paper
#3519

Common Sense Bias Modeling for Classification Tasks

Miao Zhang, Zee Fryer, Ben Colman et al.

AAAI 2025paperarXiv:2401.13213
#3520

IRMamba: Pixel Difference Mamba with Layer Restoration for Infrared Small Target Detection

Mingjin Zhang, Xiaolong Li, Fei Gao et al.

AAAI 2025paper
#3521

MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection

Mingjin Zhang, Yuanjun Ouyang, Fei Gao et al.

AAAI 2025paper
#3522

Decoupling Scattering: Pseudo-Label Guided NeRF for Scenes with Scattering Media

Mingyang Zhang, Junkang Zhang, Faming Fang et al.

AAAI 2025paper
#3523

PanoDiT: Panoramic Videos Generation with Diffusion Transformer

Muyang Zhang, Yuzhi Chen, Rongtao Xu et al.

AAAI 2025paper
#3524

SIGraph: Saliency Image-Graph Network for Retinal Disease Classification in Fundus Image

Peng Zhang, Yuan Li, Haotian Song et al.

AAAI 2025paper
#3525

Visual Perturbation for Text-Based Person Search

Pengcheng Zhang, Xiaohan Yu, Xiao Bai et al.

AAAI 2025paper
#3526

Matching While Perceiving: Enhance Image Feature Matching with Applicable Semantic Amalgamation

Shihua Zhang, Zhenjie Zhu, Zizhuo Li et al.

AAAI 2025paper
#3527

DiMSOD: A Diffusion-Based Framework for Multi-Modal Salient Object Detection

Shuo Zhang, Jiaming Huang, Wenbing Tang et al.

AAAI 2025paper
#3528

Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views

Songchun Zhang, Chunhui Zhao

AAAI 2025paperarXiv:2412.08412
#3529

DZAD: Diffusion-based Zero-shot Anomaly Detection

Tianrui Zhang, Liang Gao, Xinyu Li et al.

AAAI 2025paper
#3530

Enhancing Implicit Neural Representations via Symmetric Power Transformation

Weixiang Zhang, Shuzhao Xie, Chengwei Ren et al.

AAAI 2025paperarXiv:2412.09213
#3531

Iterative Self-Training with Class-Aware Text-to-Image Synthesis for Visual Task Learning

Xiang Zhang, Wanqing Zhao, Pengyang Li et al.

AAAI 2025paper
#3532

Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation

Xiaofeng Zhang, Fanshuo Zeng, Yihao Quan et al.

AAAI 2025paperarXiv:2412.09817
#3533

PhyCamo: A Robust Physical Camouflage via Contrastive Learning for Multi-View Physical Adversarial Attack

Ximin Zhang, Jinyin Chen, Haibin Zheng et al.

AAAI 2025paper
#3534

CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression

Xinjie Zhang, Shenyuan Gao, Zhening Liu et al.

AAAI 2025paperarXiv:2403.08505
#3535

Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network

Xinyi Zhang, Qiqi Bao, Qinpeng Cui et al.

AAAI 2025paperarXiv:2408.02922
#3536

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

Yabo Zhang, Yuxiang Wei, Xianhui Lin et al.

AAAI 2025paperarXiv:2403.05438
#3537

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

Yan Zhang, Gangyan Zeng, Huawen Shen et al.

AAAI 2025paperarXiv:2412.12502
#3538

Category Prompt Mamba Network for Nuclei Segmentation and Classification

Ye Zhang, Zijie Fang, Yifeng Wang et al.

AAAI 2025paperarXiv:2503.10422
#3539

Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary Differential Equations

Yi Zhang, Chun-Wun Cheng, Junyi He et al.

AAAI 2025paperarXiv:2412.15813
#3540

InstantSticker: Realistic Decal Blending via Disentangled Object Reconstruction

Yi Zhang, Xiaoyang Huang, Yishun Dou et al.

AAAI 2025paperarXiv:2504.06620
#3541

Partial Point Cloud Registration with Multi-view 2D Image Learning

Yue Zhang, Yue Wu, Wenping Ma et al.

AAAI 2025paper
#3542

RP-PGD: Boosting Segmentation Robustness with a Region-and-Prototype Based Adversarial Attack

Yuxuan Zhang, Zhenbo Shi, Shuchang Wang et al.

AAAI 2025paper
#3543

Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry

Zhaoxing Zhang, Junda Cheng, Gangwei Xu et al.

AAAI 2025paperarXiv:2412.16923
#3544

Training-Free Image Manipulation Localization Using Diffusion Models

Zhenfei Zhang, Ming-Ching Chang, Xin Li

AAAI 2025paper
#3545

Multi-scale Activation, Refinement, and Aggregation: Exploring Diverse Cues for Fine-Grained Bird Recognition

Zhicheng Zhang, Hao Tang, Jinhui Tang

AAAI 2025paperarXiv:2504.09215
#3546

DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation

Guosheng Zhao, Xiaofeng Wang, Zheng Zhu et al.

AAAI 2025paperarXiv:2403.06845
#3547

Adaptive Wavelet-Positional Encoding for High-Frequency Information Learning in Implicit Neural Representation

Hongxu Zhao, Zelin Gao, Yue Wang et al.

AAAI 2025paper
#3548

Excluding the Impossible for Open Vocabulary Semantic Segmentation

Shiyuan Zhao, Baodi Liu, Yu Bai et al.

AAAI 2025paper
#3549

KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing

Shu Zhao, Tan Yu, Xiaoshuai Hao et al.

AAAI 2025paperarXiv:2412.19417
#3550

Training-free Open-Vocabulary Semantic Segmentation via Diverse Prototype Construction and Sub-region Matching

Xuanpu Zhao, Dianmo Sheng, Zhentao Tan et al.

AAAI 2025paper
#3551

Audio-Visual Adaptive Fusion Network for Question Answering Based on Contrastive Learning

Xujian Zhao, Yixin Wang, Peiquan Jin

AAAI 2025paper
#3552

ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance

Yucheng Zhao, Gengyu Lyu, Ke Li et al.

AAAI 2025paper
#3553

NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark

Yuxuan Zhao, Weijian Ruan, He Li et al.

AAAI 2025paper
#3554

HFF-Tracker: A Hierarchical Fine-grained Fusion Tracker for Referring Multi-Object Tracking

Zeyong Zhao, Yanchao Hao, Minghao Zhang et al.

AAAI 2025paper
#3555

PHR-DIFF: Portrait Highlights Removal via Patch-aware Diffusion Model

Hongsheng Zheng, Zhongyun Bao, Gang Fu et al.

AAAI 2025paper
#3556

Breaking Information Isolation: Accelerating MRI via Inter-sequence Mapping and Progressive Masking

Jianwei Zheng, Xiaomin Yao, Guojiang Shen et al.

AAAI 2025paper
#3557

Supportive Negatives Spectral Augmentation for Source-Free Cross-Domain Segmentation

Kexin Zheng, Haifeng Xia, Siyu Xia et al.

AAAI 2025paper
#3558

When Shadow Removal Meets Intrinsic Image Decomposition: A Joint Learning Framework Using Unpaired Data

Rongjia Zheng, Qing Zhang, Yongwei Nie et al.

AAAI 2025paper
#3559

A New Adversarial Perspective for LiDAR-based 3D Object Detection

Shijun Zheng, Weiquan Liu, Yu Guo et al.

AAAI 2025paperarXiv:2412.13017
#3560

Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment

Yuanfan Zheng, Jinlin Wu, Wuyang Li et al.

AAAI 2025paperarXiv:2412.11443
#3561

MMPF: Multi-Modal Perception Framework for Abnormal Medical Condition Detection

Chuyi Zhong, Dingkang Yang, Peng Zhai et al.

AAAI 2025paper
#3562

DECIDER: Difference-aware Contrastive Diffusion Model with Adversarial Perturbations for Image Change Captioning

Guojin Zhong, Jinhong Hu, Jiajun Chen et al.

AAAI 2025paper
#3563

PointCFormer: A Relation-Based Progressive Feature Extraction Network for Point Cloud Completion

Yi Zhong, Weize Quan, Dong-Ming Yan et al.

AAAI 2025paperarXiv:2412.08421
#3564

Controllable Distortion-Perception Tradeoff Through Latent Diffusion for Neural Image Compression

Chuqin Zhou, Guo Lu, Jiangchuan Li et al.

AAAI 2025paperarXiv:2412.11379
#3565

TrackGo: A Flexible and Efficient Method for Controllable Video Generation

Haitao Zhou, Chuang Wang, Rui Nie et al.

AAAI 2025paperarXiv:2408.11475
#3566

Core-to-Global Reasoning for Compositional Visual Question Answering

Hao Zhou, Tingjin Luo, Zhangqi Jiang

AAAI 2025paper
#3567

Joint Class-level and Instance-level Relationship Modeling for Novel Class Discovery

Jiaying Zhou, Qingchao Chen

AAAI 2025paper
#3568

GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance

Jingqiu Zhou, Lue Fan, Xuesong Chen et al.

AAAI 2025paperarXiv:2412.17715
#3569

SceneX: Procedural Controllable Large-Scale Scene Generation

Mengqi Zhou, Yuxi Wang, Jun Hou et al.

AAAI 2025paperarXiv:2403.15698
#3570

GLIC: General Format Learned Image Compression

MingSheng Zhou, MingMing Kong

AAAI 2025paper
#3571

Mitigating Feature Gap for Adversarial Robustness by Feature Disentanglement

Nuoyan Zhou, Dawei Zhou, Decheng Liu et al.

AAAI 2025paperarXiv:2401.14707
#3572

Spatiotemporal-Aware Neural Fields for Dynamic CT Reconstruction

Qingyang Zhou, Yunfan Ye, Zhiping Cai

AAAI 2025paper
#3573

Test-Time Adaptation on Noisy Data via Model-Pruning-Based Filtering and Flatness-Aware Entropy Minimization

Xingzhi Zhou, Zhiliang Tian, Boyang Zhang et al.

AAAI 2025paper
#3574

Improving Generalization of Deep Neural Networks by Optimum Shifting

Yuyan Zhou, Ye Li, Lei Feng et al.

AAAI 2025paperarXiv:2405.14111
#3575

Achieving Ensemble-Like Performance in a Single Model: A Feature Diversification Framework for Image-Text Matching

Zhao Zhou, Yiqun Wang, Weizhong Zhang et al.

AAAI 2025paper
#3576

Expanding the Scope of Negatives: Boosting Image-Text Matching with Negatives Distribution Guided Learning

Zhao Zhou, Weizhong Zhang, Xiangcheng Du et al.

AAAI 2025paper
#3577

An Exemplar-based Framework for Chinese Text Recognition

Zhao Zhou, Xiangcheng Du, Yingbin Zheng et al.

AAAI 2025paper
#3578

GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions

Ziqi Zhou, Weize Quan, Hailin Shi et al.

AAAI 2025paperarXiv:2412.09296
#3579

A Lottery Ticket Hypothesis Approach with Sparse Fine-tuning and MAE for Image Forgery Detection and Localization

Jiaying Zhu, Dong Li, Xueyang Fu et al.

AAAI 2025paper
#3580

Thin-Plate Spline-based Interpolation for Animation Line Inbetweening

Tianyi Zhu, Wei Shang, Dongwei Ren

AAAI 2025paperarXiv:2408.09131
#3581

Mesh Watermark Removal Attack and Mitigation: A Novel Perspective of Function Space

Xingyu Zhu, Guanhui Ye, Chengdong Dong et al.

AAAI 2025paperarXiv:2311.12059
#3582

Mesoscopic Insights: Orchestrating Multi-Scale & Hybrid Architecture for Image Manipulation Localization

Xuekang Zhu, Xiaochen Ma, Lei Su et al.

AAAI 2025paperarXiv:2412.13753
#3583

MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction

Yitao Zhu, Sheng Wang, Mengjie Xu et al.

AAAI 2025paperarXiv:2403.05055
#3584

ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming

Jiedong Zhuang, Lu Lu, Ming Dai et al.

AAAI 2025paper
#3585

Dynamic Entity-Masked Graph Diffusion Model for Histopathology Image Representation Learning

Zhenfeng Zhuang, Min Cen, Yanfeng Li et al.

AAAI 2025paperarXiv:2412.10482
#3586

AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction

Pufan Zou, Shijia Zhao, Weijie Huang et al.

AAAI 2025paperarXiv:2412.18255
#3587

L-Man: A Large Multi-modal Model Unifying Human-centric Tasks

Jialong Zuo, Ying Nie, Tianyu Guo et al.

AAAI 2025paper
#3588

Learning Valid Dual Bounds in Constraint Programming: Boosted Lagrangian Decomposition with Self-Supervised Learning

Swann Bessa, Darius Dabert, Max Bourgeat et al.

AAAI 2025paperarXiv:2408.12695
#3589

Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound

Cătălin E. Brița, Jacobus G. M. van der Linden, Emir Demirović

AAAI 2025paper
#3590

Linear Equations with Min and Max Operators: Computational Complexity

Krishnendu Chatterjee, Ruichen Luo, Raimundo Saona et al.

AAAI 2025paperarXiv:2412.12228
#3591

GPU-Accelerated Parallel Bilevel Optimization for Roubst 6G ISAC

Xingdi Chen, Kai Yang

AAAI 2025paper
#3592

Proof Simulation via Round-based Strategy Extraction for QBF

Leroy Chew

AAAI 2025paper
#3593

Decentralized Projected Riemannian Stochastic Recursive Momentum Method for Nonconvex Optimization

Kangkang Deng, Jiang Hu

AAAI 2025paperarXiv:2412.02382
#3594

Parameterized Complexity of Caching in Networks

Robert Ganian, Fionn Mc Inerney, Dimitra Tsigkari

AAAI 2025paperarXiv:2412.16585
#3595

FFCG: Effective and Fast Family Column Generation for Solving Large-Scale Linear Program

Yi-Xiang Hu, Feng Wu, Shaoang Li et al.

AAAI 2025paperarXiv:2412.19066
#3596

DCC: Differentiable Cardinality Constraints for Partial Index Tracking

Wooyeon Jo, Hyunsouk Cho

AAAI 2025paperarXiv:2412.17175
#3597

Online Prompt Selection for Program Synthesis

Yixuan Li, Lewis Frampton, Federico Mora et al.

AAAI 2025paperarXiv:2501.05247
#3598

Search Strategy Generation for Branch and Bound Using Genetic Programming

Gwen Maudet, Grégoire Danoy

AAAI 2025paperarXiv:2412.09444
#3599

Towards Real-Time Approximate Counting

Yash Pote, Kuldeep S. Meel, Jiong Yang

AAAI 2025paper
#3600

Computationally Hard Problems Are Hard for QBF Proof Systems Too

Agnes Schleitzer, Olaf Beyersdorff

AAAI 2025paper