Most Cited 2025 "forward matrix deduction" Papers

ICLR 2025, arXiv:2502.07384

#11602

SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection

Jingjie Zhang, Hanqun Cao, Zijun Gao et al.

NEURIPS 2025, arXiv:2505.09663

#11603

Analog Foundation Models

Julian Büchel, Iason Chalas, Giovanni Acampa et al.

ICLR 2025, arXiv:2501.15857

#11604

Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?

Yutong Yin, Zhaoran Wang

ICLR 2025, arXiv:2501.15326

#11605

Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data

Jiajie Li, Brian Quaranto, Chenhui Xu et al.

#11606

EcoFace: Audio-Visual Emotional Co-Disentanglement Speech-Driven 3D Talking Face Generation

Jiajian Xie, Shengyu Zhang, Mengze Li et al.

ICLR 2025, arXiv:2406.12082

#11607

Uncertainty modeling for fine-tuned implicit functions

Anna Susmelj, Mael Macuglia, Natasa Tagasovska et al.

ICLR 2025, arXiv:2407.01574

#11608

cryoSPHERE: Single-Particle HEterogeneous REconstruction from cryo EM

Gabriel Claude Jean Ducrocq, Lukas Grunewald, Sebastian Westenhoff et al.

NEURIPS 2025, arXiv:2505.17599

#11609

Dynamic Bundling with Large Language Models for Zero-Shot Inference on Text-Attributed Graphs

Yusheng Zhao, Qixin Zhang, Xiao Luo et al.

ICLR 2025, arXiv:2502.04643

#11610

Confidence Elicitation: A New Attack Vector for Large Language Models

Brian Formento, Chuan Sheng Foo, See-Kiong Ng

ICLR 2025, arXiv:2407.02020

#11611

Decentralized Optimization with Coupled Constraints

Demyan Yarmoshik, Alexander Rogozin, Nikita Kiselev et al.

NEURIPS 2025, arXiv:2506.24000

#11612

The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models

Lijun Sheng, Jian Liang, Ran He et al.

ICLR 2025, arXiv:2410.09181

#11613

Can a Large Language Model be a Gaslighter?

Wei Li, Luyao Zhu, Yang Song et al.

ICLR 2025, arXiv:2410.05938

#11614

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Yifei Xing, Xiangyuan Lan, Ruiping Wang et al.

ICLR 2025, arXiv:2402.05835

#11615

How Much is Unseen Depends Chiefly on Information About the Seen

Seongmin Lee, Marcel Boehme

ICLR 2025, arXiv:2402.05626

#11616

Loss Landscape of Shallow ReLU-like Neural Networks: Stationary Points, Saddle Escape, and Network Embedding

Frank Zhengqing Wu, Berfin Simsek, François Ged

AAAI 2025 (paper), arXiv:2501.03181

#11617

FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles

Tian-Hao Zhang, Jiawei Zhang, Jun Wang et al.

NEURIPS 2025, arXiv:2505.19601

#11618

Preference Optimization by Estimating the Ratio of the Data Distribution

Yeongmin Kim, HeeSun Bae, Byeonghu Na et al.

#11619

Geometry of Long-Tailed Representation Learning: Rebalancing Features for Skewed Distributions

Lingjie Yi, Michael Yao, Weimin Lyu et al.

ICLR 2025, arXiv:2410.05609

#11620

The Breakdown of Gaussian Universality in Classification of High-dimensional Linear Factor Mixtures

Xiaoyi MAI, Zhenyu Liao

ICLR 2025, arXiv:2410.01101

#11621

Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank

Wenhao Zhan, Scott Fujimoto, Zheqing Zhu et al.

ICLR 2025 (oral), arXiv:2505.05691

#11622

Physics-informed Temporal Difference Metric Learning for Robot Motion Planning

Ruiqi Ni, zherong pan, Ahmed Hussain Qureshi

ICLR 2025, arXiv:2410.07502

#11623

Adaptive Batch Size for Privately Finding Second-Order Stationary Points

Daogao Liu, Kunal Talwar

NEURIPS 2025 (oral), arXiv:2510.12422

#11624

VideoLucy: Deep Memory Backtracking for Long Video Understanding

Jialong Zuo, Yongtai Deng, Lingdong Kong et al.

#11625

Measuring And Improving Engagement of Text-to-Image Generation Models

Varun Khurana, Yaman Singla, Jayakumar Subramanian et al.

ICLR 2025 (oral), arXiv:2410.20922

#11626

FACTS: A Factored State-Space Framework for World Modelling

Li Nanbo, Firas Laakom, Yucheng XU et al.

NEURIPS 2025, arXiv:2502.04799

#11627

A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees

Yuhao Zhou, Jintao Xu, Bingrui Li et al.

ICLR 2025 (oral), arXiv:2402.04398

#11628

Learning under Temporal Label Noise

Sujay Nagaraj, Walter Gerych, Sana Tonekaboni et al.

ICLR 2025, arXiv:2408.07245

#11629

$q$-exponential family for policy optimization

Lingwei Zhu, Haseeb Shah, Han Wang et al.

ICLR 2025, arXiv:2410.07500

#11630

Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels

Zhizheng Liu, Joe Lin, Wayne Wu et al.

#11631

Hierarchical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM

Yongqiang Yao, Jingru Tan, Kaihuan Liang et al.

NEURIPS 2025

ICLR 2025, arXiv:2405.16545

#11632

VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation

Kuo-Han Hung, Pang-Chi Lo, Jia-Fong Yeh et al.

ICLR 2025, arXiv:2502.00129

#11633

ProtoSnap: Prototype Alignment For Cuneiform Signs

Rachel Mikulinsky, Morris Alper, Shai Gordin et al.

ICLR 2025, arXiv:2502.08209

#11634

Equivariant Masked Position Prediction for Efficient Molecular Representation

Junyi An, Chao Qu, Yun-Fei Shi et al.

ICLR 2025, arXiv:2410.15557

#11635

How to Find the Exact Pareto Front for Multi-Objective MDPs?

Yining Li, Peizhong Ju, Ness Shroff

#11636

Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning

Bo Yue, Shufan Wang, Ashish Gaurav et al.

ICLR 2025, arXiv:2502.14047

#11637

Towards a learning theory of representation alignment

Francesco Maria Gabriele Insulla, Shuo Huang, Lorenzo Rosasco

NEURIPS 2025, arXiv:2510.25387

#11638

Instance-Level Composed Image Retrieval

Bill Psomas, George Retsinas, Nikos Efthymiadis et al.

ICLR 2025, arXiv:2410.08437

#11639

Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks

Rushang Karia, Daniel Bramblett, Daksh Dobhal et al.

ICLR 2025, arXiv:2410.05021

#11640

DEPT: Decoupled Embeddings for Pre-training Language Models

Alex Iacob, Lorenzo Sani, Meghdad Kurmanji et al.

#11641

ProAdvPrompter: A Two-Stage Journey to Effective Adversarial Prompting for LLMs

Hao Di, Tong He, Haishan Ye et al.

NEURIPS 2025, arXiv:2410.07711

#11642

AdaptGrad: Adaptive Sampling to Reduce Noise

Linjiang Zhou, Chao Ma, Zepeng Wang et al.

NEURIPS 2025 (spotlight), arXiv:2505.13878

#11643

InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models

Yanggan Gu, Yuanyi Wang, Zhaoyi Yan et al.

ICLR 2025, arXiv:2501.14216

#11644

TFG-Flow: Training-free Guidance in Multimodal Generative Flow

Haowei Lin, Shanda Li, Haotian Ye et al.

ICLR 2025, arXiv:2408.13045

#11645

The adaptive complexity of parallelized log-concave sampling

Huanjian Zhou, Baoxiang Wang, Masashi Sugiyama

#11646

Progressive Parameter Efficient Transfer Learning for Semantic Segmentation

Nan Zhou, Huiqun Wang, Yaoyan Zheng et al.

ICLR 2025, arXiv:2503.03989

#11647

Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

Xiangxin Zhou, Yi Xiao, Haowei Lin et al.

ICLR 2025, arXiv:2501.15055

#11648

Group Ligands Docking to Protein Pockets

Jiaqi Guan, Jiahan Li, Xiangxin Zhou et al.

NEURIPS 2025, arXiv:2502.21309

#11649

Reasoning is Periodicity? Improving Large Language Models Through Effective Periodicity Modeling

Yihong Dong, Ge Li, Xue Jiang et al.

NEURIPS 2025, arXiv:2510.03012

#11650

PocketSR: The Super-Resolution Expert in Your Pocket Mobiles

Haoze Sun, Linfeng Jiang, Fan Li et al.

ICLR 2025, arXiv:2502.11729

#11651

On Quantizing Neural Representation for Variable-Rate Video Coding

Junqi Shi, Zhujia Chen, Hanfei Li et al.

#11652

Toward Efficient Multi-Agent Exploration With Trajectory Entropy Maximization

Tianxu Li, Kun Zhu

ICLR 2025, arXiv:2410.01588

#11653

DynFrs: An Efficient Framework for Machine Unlearning in Random Forest

Shurong Wang, Zhuoyang Shen, Xinbao Qiao et al.

ICLR 2025 (oral), arXiv:2411.13056

#11654

Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark

Bing Cao, Quanhao Lu, Jiekang Feng et al.

ICLR 2025, arXiv:2410.10291

#11655

Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective

Xiangru Zhu, Penglei Sun, Yaoxian Song et al.

#11656

INFER: A Neural-symbolic Model For Extrapolation Reasoning on Temporal Knowledge Graph

Ningyuan Li, Haihong E, Tianyu Yao et al.

ICLR 2025 (oral)

ICLR 2025, arXiv:2504.15513

#11657

InstaRevive: One-Step Image Enhancement via Dynamic Score Matching

Yixuan Zhu, Haolin Wang, Ao Li et al.

#11658

On Designing General and Expressive Quantum Graph Neural Networks with Applications to MILP Instance Representation

Xinyu Ye, Hao Xiong, Jianhao Huang et al.

ICLR 2025, arXiv:2502.10988

#11659

OMG: Opacity Matters in Material Modeling with Gaussian Splatting

Silong Yong, Venkata Nagarjun Pudureddiyur Manivannan, Bernhard Kerbl et al.

ICLR 2025, arXiv:2406.02929

#11660

ZeroDiff: Solidified Visual-semantic Correlation in Zero-Shot Learning

Zihan Ye, Shreyank Gowda, Shiming Chen et al.

NEURIPS 2025, arXiv:2505.24870

#11661

GenSpace: Benchmarking Spatially-Aware Image Generation

Zehan Wang, Jiayang Xu, Ziang Zhang et al.

#11662

Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection

Ziqing Fan, Siyuan Du, Shengchao Hu et al.

ICLR 2025, arXiv:2504.11457

#11663

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Ziqi Pang, Xin Xu, Yu-Xiong Wang

ICLR 2025, arXiv:2504.09205

#11664

Query-based Knowledge Transfer for Heterogeneous Learning Environments

Norah Alballa, Wenxuan Zhang, Ziquan Liu et al.

ICLR 2025, arXiv:2501.18277

#11665

SEBRA : Debiasing through Self-Guided Bias Ranking

Adarsh Kappiyath, Abhra Chaudhuri, AJAY JAISWAL et al.

#11666

A Statistical Approach for Controlled Training Data Detection

Zirui Hu, Yingjie Wang, Zheng Zhang et al.

NEURIPS 2025, arXiv:2505.17455

#11667

Towards Evaluating Proactive Risk Awareness of Multimodal Language Models

Youliang Yuan, Wenxiang Jiao, Yuejin Xie et al.

NEURIPS 2025 (spotlight), arXiv:2412.01605

#11668

MedChain: Bridging the Gap Between LLM Agents and Clinical Practice with Interactive Sequence

Jie Liu, Wenxuan Wang, Zizhan Ma et al.

NEURIPS 2025 (oral), arXiv:2501.04184

#11669

MedicalNarratives: Connecting Medical Vision and Language with Localized Narratives

Wisdom Ikezogwo, Kevin M. Zhang, Saygin Seyfioglu

ICLR 2025, arXiv:2504.12637

#11670

Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation

Linda He, Jue Wang, Maurice Weber et al.

ICLR 2025, arXiv:2410.16699

#11671

Graph Transformers Dream of Electric Flow

Xiang Cheng, Lawrence Carin, Suvrit Sra

ICLR 2025, arXiv:2410.09878

#11672

Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning

Yan Scholten, Stephan Günnemann

ICLR 2025, arXiv:2511.16924

#11673

CBMA: Improving Conformal Prediction through Bayesian Model Averaging

Pankaj Bhagwat, Linglong Kong, Bei Jiang

#11674

Efficient Online Pruning and Abstraction for Imperfect Information Extensive-Form Games

Boning Li, Longbo Huang

#11675

Bounds on $L_p$ Errors in Density Ratio Estimation via $f$-Divergence Loss Functions

Yoshiaki Kitazawa

ICLR 2025, arXiv:2410.17547

#11676

Generalizable Motion Planning via Operator Learning

Sharath Matada, Luke Bhan, Yuanyuan Shi et al.

ICLR 2025, arXiv:2406.13075

#11677

Exact Community Recovery under Side Information: Optimality of Spectral Algorithms

Julia Gaudio, Nirmit Joshi

ICLR 2025, arXiv:2411.03755

#11678

Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions

Sagar Shrestha, Xiao Fu

ICLR 2025, arXiv:2410.07476

#11679

Towards a Unified and Verified Understanding of Group-Operation Networks

Wilson Wu, Louis Jaburi, jacob drori et al.

ISMAR 2025 (paper), arXiv:2412.11762

#11680

GS-ProCams: Gaussian Splatting-Based Projector-Camera Systems

Qingyue Deng, Jijiang Li, Haibin Ling et al.

#11681

Toward Exploratory Inverse Constraint Inference with Generative Diffusion Verifiers

Runyi Zhao, Sheng Xu, Bo Yue et al.

ISMAR 2025 (paper), arXiv:2508.04326

#11682

Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research

Ke Li, Mana Masuda, Susanne Schmidt et al.

ICLR 2025, arXiv:2402.05187

#11683

Learning mirror maps in policy mirror descent

Carlo Alfano, Sebastian Towers, Silvia Sapora et al.

ICLR 2025, arXiv:2502.02922

#11684

Elucidating the Preconditioning in Consistency Distillation

Kaiwen Zheng, Guande He, Jianfei Chen et al.

#11685

InstantPortrait: One-Step Portrait Editing via Diffusion Multi-Objective Distillation

Zhixin Lai, Keqiang Sun, Fu-Yun Wang et al.

ICLR 2025, arXiv:2501.04898

#11686

Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression

Juno Kim, Dimitri Meunier, Arthur Gretton et al.

#11687

Probabilistic Verification of Cybersickness in Virtual Reality Through Bayesian Networks

Peng Wu, Nasim Ahmed, Abhiram Sarma et al.

ICLR 2025, arXiv:2410.07081

#11688

JPEG Inspired Deep Learning

Ahmed Hussien Salamah, Kaixiang Zheng, Yiwen Liu et al.

ISMAR 2025 (paper), arXiv:2508.14346

#11689

Exploring Organizational Strategies in Immersive Computational Notebooks

Sungwon In, Ayush Roy, Eric Krokos et al.

ISMAR 2025 (paper), arXiv:2505.03027

#11690

Revisiting Performance Models of Distal Pointing Tasks in Virtual Reality

Logan Lane, Feiyu Lu, Shakiba Davari et al.

#11691

Can People's Brains Synchronize during Remote AR Collaboration?

Jaehwan You, Myeongul Jung, Kwanguk Kim

ICLR 2025 (oral), arXiv:2410.19406

#11692

An Auditing Test to Detect Behavioral Shift in Language Models

Leo Richter, Xuanli He, Pasquale Minervini et al.

ICLR 2025 (oral), arXiv:2504.14805

#11693

Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment

Jinwoo Choi, Seung-Woo Seo

#11694

Exploring and Modeling the Effects of Eye-Tracking Accuracy and Precision on Gaze-Based Steering in Virtual Environments

Xuning Hu, Yichuan Zhang, Yushi Wei et al.

NEURIPS 2025, arXiv:2505.19136

#11695

Uncertainty Quantification for Physics-Informed Neural Networks with Extended Fiducial Inference

Frank Shih, Zhenghao Jiang, Faming Liang

ICLR 2025, arXiv:2405.12069

#11696

Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping

Tianhao Wu, Jing Yang, Zhilin Guo et al.

ICLR 2025, arXiv:2502.10195

#11697

Exploring the Camera Bias of Person Re-identification

Myungseo Song, Jin-Woo Park, Jong-Seok Lee

#11698

Birds of a Feather Augment Together: Exploring Sonic Links Between Real and Virtual Worlds in Audio Augmented Reality

Jacob Bhattacharyya, Alessandro Vinciarelli, Stephen Anthony Brewster

ISMAR 2025 (paper), arXiv:2508.01915

#11699

EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart Glasses

Akshay Paruchuri, Sinan Hersek, Lavisha Aggarwal et al.

ICLR 2025, arXiv:2502.02004

#11700

Wavelet-based Positional Representation for Long Context

Yui Oka, Taku Hasegawa, Kyosuke Nishida et al.

NEURIPS 2025, arXiv:2509.20890

#11701

FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies

Shuqiao Liang, Jian Liu, Chen Renzhang et al.

ICLR 2025, arXiv:2512.06795

#11702

ADAM Optimization with Adaptive Batch Selection

Gyu Yeol Kim, Min-hwan Oh

ICLR 2025 (oral), arXiv:2504.06070

#11703

PINP: Physics-Informed Neural Predictor with latent estimation of fluid flows

Huaguan Chen, Yang Liu, Hao Sun

ICLR 2025, arXiv:2411.01099

#11704

Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement

Bryan Bo Cao, Lawrence OGorman, Michael Coss et al.

ICLR 2025 (oral), arXiv:2412.07188

#11705

Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective

Yushun Dong, Patrick Soga, Yinhan He et al.

AAAI 2025 (paper), arXiv:2412.15308

#11706

ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese

Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.

ICLR 2025, arXiv:2410.05966

#11707

FLOPS: Forward Learning with OPtimal Sampling

Tao Ren, Zishi Zhang, Jinyang Jiang et al.

#11708

IDseq: Decoupled and Sequentially Detecting and Grounding Multi-Modal Media Manipulation

Runxin Liu, Tian Xie, Jiaming Li et al.

ICLR 2025 (oral), arXiv:2506.09526

#11709

Neural Functions for Learning Periodic Signal

Woojin Cho, Minju Jo, Kookjin Lee et al.

#11710

Teaching Human Behavior Improves Content Understanding Abilities Of VLMs

SOMESH SINGH, Harini S I, Yaman Singla et al.

#11711

AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations

Pei Zhou, Ruizhe Liu, Qian Luo et al.

ICLR 2025, arXiv:2409.10773

#11712

Tight Lower Bounds under Asymmetric High-Order Hölder Smoothness and Uniform Convexity

Cedar Site Bai, Brian Bullins

AAAI 2025 (paper), arXiv:2412.18254

#11713

RaCMC: Residual-Aware Compensation Network with Multi-Granularity Constraints for Fake News Detection

Xinquan Yu, Ziqi Sheng, Wei Lu et al.

#11714

KooNPro: A Variance-Aware Koopman Probabilistic Model Enhanced by Neural Process for Time Series Forecasting

Ronghua Zheng, Hanru Bai, Weiyang Ding

ICLR 2025 (oral)

AAAI 2025 (paper), arXiv:2503.04865

#11715

E4: Energy-Efficient DNN Inference for Edge Video Analytics via Early Exiting and DVFS

Ziyang Zhang, Yang Zhao, Ming-Ching Chang et al.

#11716

Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment

Qing Chang, Yao-Xiang Ding, Kun Zhou

ICLR 2025 (oral), arXiv:2502.15370

#11717

Weakly Supervised Video Scene Graph Generation via Natural Language Supervision

Kibum Kim, Kanghoon Yoon, Yeonjun In et al.

AAAI 2025 (paper), arXiv:2502.20858

#11718

EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics

Xiaochuan Liu, Xin Cheng, Yuchong Sun et al.

#11719

Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores

Ryo Masumura, Shota Orihashi, Mana Ihori et al.

ICLR 2025, arXiv:2511.16904

#11720

Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models

Hao-Chien Hsueh, Wen-Hsiao Peng, Ching-Chun Huang

AAAI 2025 (paper), arXiv:2412.16751

#11721

The Master Key Filters Hypothesis: Deep Filters Are General

Zahra Babaiee, Peyman M. Kiasari, Daniela Rus et al.

AAAI 2025 (paper), arXiv:2412.17512

#11722

BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation

Oren Barkan, Yehonatan Elisha, Jonathan Weill et al.

AAAI 2025 (paper), arXiv:2409.20500

#11723

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing

Lingling Cai, Kang Zhao, Hangjie Yuan et al.

AAAI 2025 (paper), arXiv:2412.14837

#11724

ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects

Qihang Cao, Huangxun Chen

AAAI 2025 (paper), arXiv:2412.19720

#11725

Sharpening Neural Implicit Functions with Frequency Consolidation Priors

Chao Chen, Yu-Shen Liu, Zhizhong Han

AAAI 2025 (paper), arXiv:2501.04477

#11726

Rethinking High-speed Image Reconstruction Framework with Spike Camera

Kang Chen, Yajing Zheng, Tiejun Huang et al.

#11727

GLOMA: Global Video Text Spotting with Morphological Association

Han Wang, Yanjie Wang, Yang Li et al.

ICLR 2025 (oral)

AAAI 2025 (paper), arXiv:2412.17800

#11728

Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection

Yitong Chen, Wenhao Yao, Lingchen Meng et al.

AAAI 2025 (paper), arXiv:2412.11820

#11729

Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising

Zikang Chen, Tao Jiang, Xiaowan Hu et al.

AAAI 2025 (paper), arXiv:2412.08975

#11730

Elevating Flow-Guided Video Inpainting with Reference Generation

Suhwan Cho, Seoung Wug Oh, Sangyoun Lee et al.

AAAI 2025 (paper), arXiv:2503.17728

#11731

DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis

Yongjin Choi, Chanhun Park, Seung Jun Baek

AAAI 2025 (paper), arXiv:2501.09826

#11732

PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery

Shristi Das Biswas, Matthew Shreve, Xuelu Li et al.

#11733

Semantic Ambiguity Modeling and Propagation for Fine-Grained Visual Cross View Geo-Localization

Mingtao Feng, Fenghao Tian, Jianqiao Luo et al.

AAAI 2025 (paper), arXiv:2411.01564

#11734

ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis

Xinyu Geng, Jiaming Wang, Xiaolin Huang et al.

#11735

You Should Learn to Stop Denoising on Point Clouds in Advance

Chuchen Guo, Weijie Zhou, Zheng Liu et al.

AAAI 2025 (paper), arXiv:2412.08149

#11736

AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting

Zihao Han, Baoquan Zhang, Lisai Zhang et al.

#11737

Multi-Frame Deformable Look-Up Table for Compressed Video Quality Enhancement

Gang He, Guancheng Quan, Chang Wu et al.

#11738

Achieving Speed-Accuracy Balance in Vision-based 3D Occupancy Prediction via Geometric-Semantic Disentanglement

Yulin He, Wei Chen, Siqi Wang et al.

#11739

Robust and Consistent Online Video Instance Segmentation via Instance Mask Propagation

Miran Heo, Seoung Wug Oh, Seon Joo Kim et al.

AAAI 2025 (paper), arXiv:2502.08149

#11740

Generalized Class Discovery in Instance Segmentation

Cuong Manh Hoang, Yeejin Lee, Byeongkeun Kang

AAAI 2025 (paper), arXiv:2408.14868

#11741

ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning

Wenjin Hou, Dingjie Fu, Kun Li et al.

AAAI 2025 (paper), arXiv:2501.10462

#11742

BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation

Xiaolu Hou, Mingcheng Li, Dingkang Yang et al.

#11743

Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation

Xiao Hu, Libo Long, Jochen Lang

#11744

LPCG: A Self-conditional Architecture for Labeled Point Cloud Generation

Dongshuo Huang, Xiaoshui Huang, Chengdong Zhang et al.

AAAI 2025 (paper), arXiv:2501.07762

#11745

PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration

Xiaoshui Huang, Zhou Huang, Yifan Zuo et al.

AAAI 2025 (paper), arXiv:2502.19769

#11746

QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects

Elkhan Ismayilzada, MD Khalequzzaman Chowdhury Sayem, Yihalem Yimolal Tiruneh et al.

AAAI 2025 (paper), arXiv:2412.09050

#11747

ContextHOI: Spatial Context Learning for Human-Object Interaction Detection

Mingda Jia, Liming Zhao, Ge Li et al.

#11748

SparsyFed: Sparse Adaptive Federated Learning

Adriano Guastella, Lorenzo Sani, Alex Iacob et al.

#11749

Deep Tree Tensor Networks

Chang Nie

NEURIPS 2025

AAAI 2025 (paper), arXiv:2412.16467

#11750

Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions

Sijia Jiang, Tong Wu, Jing Hua et al.

AAAI 2025 (paper), arXiv:2412.17523

#11751

Constructing Fair Latent Space for Intersection of Fairness and Explainability

Hyungjun Joo, Hyeonggeun Han, Sehwan Kim et al.

ICLR 2025, arXiv:2401.00036

#11752

Discrete Distribution Networks

Lei Yang

AAAI 2025 (paper), arXiv:2501.02640

#11753

Multispectral Pedestrian Detection with Sparsely Annotated Label

Chan Lee, Seungho Shin, Gyeong-Moon Park et al.

AAAI 2025 (paper), arXiv:2412.19543

#11754

Diverse Rare Sample Generation with Pretrained GANs

Subeen Lee, Jiyeon Han, Soyeon Kim et al.

AAAI 2025 (paper), arXiv:2504.04687

#11755

Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal

Yicheng Leng, Chaowei Fang, Junye Chen et al.

#11756

M²RL-Net: Multi-View and Multi-Level Relation Learning Network for Weakly-Supervised Image Forgery Detection

Jiafeng Li, Ying Wen, Lianghua He

AAAI 2025 (paper), arXiv:2502.19751

#11757

Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval

Jiaxing Li, Lin Jiang, Zeqi Ma et al.

#11758

Multi-View 3D Human Pose Estimation with Weakly Synchronized Images

Ling Li, Ruiwen Gu, Chongyang Wang et al.

#11759

SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing

Ruihuang Li, Liyi Chen, Zhengqiang Zhang et al.

#11760

Endowing Visual Reprogramming with Adversarial Robustness

Shengjie Zhou, Xin Cheng, Haiyang Xu et al.

AAAI 2025 (paper), arXiv:2506.02448

#11761

VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos

Baoyu Liang, Qile Su, Shoutai Zhu et al.

AAAI 2025 (paper), arXiv:2412.10176

#11762

UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection

HaoMiao Liu, Hao Xu, Chuhuai Yue et al.

AAAI 2025 (paper), arXiv:2408.13226

#11763

D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching

Jingyu Liu, Minquan Wang, Ye Ma et al.

#11764

Efficient Deformable Convolutional Prompt for Continual Test-Time Adaptation in Medical Image Segmentation

Shiyu Liu, Daoqiang Zhang, Xiaoke Hao

#11765

MUN: Image Forgery Localization Based on M³ Encoder and UN Decoder

Yaqi Liu, Shuhuan Chen, Haichao Shi et al.

#11766

Enhancing Low-Light Images: A Synthetic Data Perspective on Practical and Generalizable Solutions

Yu Long, Qinghua Lin, Zhihua Wang et al.

AAAI 2025 (paper), arXiv:2401.13329

#11767

Generative Video Diffusion for Unseen Novel Semantic Video Moment Retrieval

Dezhao Luo, Shaogang Gong, Jiabo Huang et al.

AAAI 2025 (paper), arXiv:2412.11917

#11768

Does VLM Classification Benefit from LLM Description Semantics?

Pingchuan Ma, Lennart Rietdorf, Dmytro Kotovenko et al.

#11769

Novel View Synthesis Under Large-Deviation Viewpoint for Autonomous Driving

Xin Ma, Jiguang Zhang, Peng Lu et al.

#11770

Relaxed Class-consensus Consistency for Semi-supervised Semantic Segmentation

Huayu Mai, Rui Sun, Feng Wu

ICLR 2025, arXiv:2403.15576

#11771

Data-centric Prediction Explanation via Kernelized Stein Discrepancy

Mahtab Sarvmaili, Hassan Sajjad, Ga Wu

NEURIPS 2025, arXiv:2511.10721

#11772

Fast Data Attribution for Text-to-Image Models

Sheng-Yu Wang, Aaron Hertzmann, Alexei Efros et al.

AAAI 2025 (paper), arXiv:2412.18404

#11773

Extract Free Dense Misalignment from CLIP

JeongYeon Nam, Jinbae Im, Wonjae Kim et al.

#11774

Global Convergence of Policy Gradient in Average Reward MDPs

Navdeep Kumar, Yashaswini Murthy, Itai Shufaro et al.

AAAI 2025 (paper), arXiv:2412.13156

#11775

S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging

Yimu Pan, Sitao Zhang, Alison D. Gernand et al.

ICLR 2025, arXiv:2502.17024

#11776

Towards Auto-Regressive Next-Token Prediction: In-context Learning Emerges from Generalization

Zixuan Gong, Xiaolin Hu, Huayi Tang et al.

AAAI 2025 (paper), arXiv:2412.14821

#11777

PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation

Shoumeng Qiu, Xinrun Li, Xiangyang Xue et al.

AAAI 2025 (paper), arXiv:2502.03359

#11778

GHOST: Gaussian Hypothesis Open-Set Technique

Ryan Rabinowitz, Steve Cruz, Manuel Günther et al.

AAAI 2025 (paper), arXiv:2409.08272

#11779

Click2Mask: Local Editing with Dynamic Mask Generation

Omer Regev, Omri Avrahami, Dani Lischinski

AAAI 2025 (paper), arXiv:2503.19283

#11780

ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency

Yang Ren, Hai Jiang, Menglong Yang et al.

AAAI 2025 (paper), arXiv:2412.08357

#11781

Video Summarization Using Denoising Diffusion Probabilistic Model

Zirui Shang, Yubo Zhu, Hongxi Li et al.

AAAI 2025 (paper), arXiv:2502.05902

#11782

Fast Omni-Directional Image Super-Resolution: Adapting the Implicit Image Function with Pixel and Semantic-Wise Spherical Geometric Priors

Xuelin Shen, Yitong Wang, Silin Zheng et al.

AAAI 2025 (paper), arXiv:2405.05858

#11783

Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

Haixin Shi, Yinlin Hu, Daniel Koguciuk et al.

AAAI 2025 (paper), arXiv:2408.11810

#11784

Pixel Is Not a Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models

Chun-Yen Shih, Li-Xuan Peng, Jia-Wei Liao et al.

AAAI 2025 (paper), arXiv:2502.03459

#11785

SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living

Arkaprava Sinha, Dominick Reilly, Francois Bremond et al.

AAAI 2025 (paper), arXiv:2412.13708

#11786

JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts

Taein Son, Soo Won Seo, Jisong Kim et al.

AAAI 2025 (paper), arXiv:2306.05497

#11787

Enhancing Noise-Robust Losses for Large-Scale Noisy Data Learning

Max Staats, Matthias Thamm, Bernd Rosenow

AAAI 2025 (paper), arXiv:2409.04050

#11788

EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution

Xi Su, Xiangfei Shen, Mingyang Wan et al.

#11789

Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation

Yifei Su, Dong An, Kehan Chen et al.

AAAI 2025 (paper), arXiv:2412.14692

#11790

Explicit Relational Reasoning Network for Scene Text Detection

Yuchen Su, Zhineng Chen, Yongkun Du et al.

ICLR 2025 (oral), arXiv:2502.11858

#11791

Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives

Zeliang Zhang, Susan Liang, Daiki Shimada et al.

AAAI 2025 (paper), arXiv:2412.14473

#11792

Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI Analysis

Kunming Tang, Zhiguo Jiang, Jun Shi et al.

AAAI 2025 (paper), arXiv:2502.17766

#11793

Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking

Xin Tong, Shi Peng, Baojie Tian et al.

AAAI 2025 (paper), arXiv:2501.07100

#11794

Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics

Tze Ho Elden Tse, Runyang Feng, Linfang Zheng et al.

#11795

Overcoming Heterogeneous Data in Federated Medical Vision-Language Pre-training: A Triple-Embedding Model Selector Approach

Aowen Wang, Zhiwang Zhang, Dongang Wang et al.

#11796

SSC-VAE: Structured Sparse Coding Based Variational Autoencoder for Detail Preserved Image Reconstruction

Hao Wang, Lu Wang, Zhongyu Wang et al.

#11797

Bright-NeRF: Brightening Neural Radiance Field with Color Restoration from Low-Light RAW Images

Min Wang, Xin Huang, Guoqing Zhou et al.

#11798

HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation

Xiaolong Wang, Lei Yu, Yingying Zhang et al.

ICLR 2025, arXiv:2502.05537

#11799

Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning

Xinsong Feng, Zihan Yu, Yanhai Xiong et al.

#11800

Aligning Composed Query with Image via Discriminative Perception from Negative Correspondences

Yifan Wang, Wuliang Huang, Chun Yuan