Most Cited 2025 &quot;top-p sampling&quot; Papers

ICLR 2025oralarXiv:2504.05075

#12402

PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition

Jie Wang, Tingfa Xu, Lihe Ding et al.

#12403

RFMamba: Frequency-Aware State Space Model for RF-Based Human-Centric Perception

Rui Zhang, Ruixu Geng, Yadong Li et al.

ICLR 2025arXiv:2506.06582

#12404

Demystifying Topological Message-Passing with Relational Structures: A Case Study on Oversquashing in Simplicial Message-Passing

Diaaeldin Taha, James Chapman, Marzieh Eidi et al.

ICLR 2025arXiv:2507.23539

#12405

Improved Algorithms for Kernel Matrix-Vector Multiplication Under Sparsity Assumptions

Piotr Indyk, Michael Kapralov, Kshiteej Jitesh Sheth et al.

ICLR 2025arXiv:2409.18582

#12406

Optimistic Games for Combinatorial Bayesian Optimization with Application to Protein Design

Melis Ilayda Bal, Pier Giuseppe Sessa, Mojmir Mutny et al.

ICLR 2025arXiv:2502.07489

#12407

Physiome-ODE: A Benchmark for Irregularly Sampled Multivariate Time-Series Forecasting Based on Biological ODEs

Christian Klötergens, Vijaya Krishna Yalavarthi, Randolf Scholz et al.

ICLR 2025oralarXiv:2410.04171

#12408

IV-mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis

Shitong Shao, zikai zhou, Lichen Bai et al.

#12409

LLM-based Typed Hyperresolution for Commonsense Reasoning with Knowledge Bases

Armin Toroghi, Ali Pesaranghader, Tanmana Sadhu et al.

ICLR 2025arXiv:2506.00849

#12410

Generalization in VAE and Diffusion Models: A Unified Information-Theoretic Analysis

Qi Chen, Jierui Zhu, Florian Shkurti

#12411

REBIND: Enhancing Ground-state Molecular Conformation Prediction via Force-Based Graph Rewiring

Taewon Kim, Hyunjin Seo, Sungsoo Ahn et al.

ICLR 2025arXiv:2503.12343

#12412

TopoGaussian: Inferring Internal Topology Structures from Visual Clues

Xiaoyu Xiong, Changyu Hu, Chunru Lin et al.

#12413

Differentiable Rule Induction from Raw Sequence Inputs

Kun Gao, Katsumi Inoue, Yongzhi Cao et al.

ICLR 2025arXiv:2405.17049

#12414

Verifying Properties of Binary Neural Networks Using Sparse Polynomial Optimization

Jianting Yang, Srecko Durasinovic, Jean Bernard Lasserre et al.

ICLR 2025arXiv:2504.08840

#12415

Adaptive Shrinkage Estimation for Personalized Deep Kernel Regression in Modeling Brain Trajectories

Vasiliki Tassopoulou, Haochang Shou, Christos Davatzikos

ICLR 2025arXiv:2412.10370

#12416

Computational Explorations of Total Variation Distance

Arnab Bhattacharyya, Sutanu Gayen, Kuldeep S. Meel et al.

ICLR 2025arXiv:2410.22311

#12417

Convex Formulations for Training Two-Layer ReLU Neural Networks

Karthik Prakhya, Tolga Birdal, Alp Yurtsever

ICLR 2025arXiv:2407.01331

#12418

Restyling Unsupervised Concept Based Interpretable Networks with Generative Models

Jayneel Parekh, Quentin Bouniot, Pavlo Mozharovskyi et al.

ICLR 2025arXiv:2406.18450

#12419

Preference Elicitation for Offline Reinforcement Learning

Alizée Pace, Bernhard Schölkopf, Gunnar Ratsch et al.

#12420

Zero-Shot Natural Language Explanations

Fawaz Sammani, Nikos Deligiannis

ICLR 2025arXiv:2501.13810

#12421

Learning to Help in Multi-Class Settings

Yu Wu, Yansong Li, Zeyu Dong et al.

#12422

The Directionality of Optimization Trajectories in Neural Networks

Sidak Pal Singh, Bobby He, Thomas Hofmann et al.

ICLR 2025arXiv:2408.08776

#12423

NEAR: A Training-Free Pre-Estimator of Machine Learning Model Performance

Raphael Husistein, Markus Reiher, Marco Eckhoff

ICLR 2025arXiv:2405.13922

#12424

Towards Certification of Uncertainty Calibration under Adversarial Attacks

Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz et al.

ICLR 2025arXiv:2410.01374

#12425

Newton Meets Marchenko-Pastur: Massively Parallel Second-Order Optimization with Hessian Sketching and Debiasing

Elad Romanov, Fangzhao Zhang, Mert Pilanci

ICLR 2025arXiv:2409.06316

#12426

PharmacoMatch: Efficient 3D Pharmacophore Screening via Neural Subgraph Matching

Daniel Rose, Oliver Wieder, Thomas Seidel et al.

ICLR 2025arXiv:2405.07482

#12427

Towards Marginal Fairness Sliced Wasserstein Barycenter

Khai Nguyen, Hai Nguyen, Nhat Ho

ICLR 2025oralarXiv:2504.12186

#12428

CoMotion: Concurrent Multi-person 3D Motion

Alejandro Newell, Peiyun Hu, Lahav Lipson et al.

#12429

Salvage: Shapley-distribution Approximation Learning Via Attribution Guided Exploration for Explainable Image Classification

Mehdi Naouar, Hanne Raum, Jens Rahnfeld et al.

ICLR 2025oralarXiv:2502.17121

#12430

Adversarial Training for Defense Against Label Poisoning Attacks

Melis Ilayda Bal, Volkan Cevher, Michael Muehlebach

ICLR 2025arXiv:2410.13178

#12431

GeSubNet: Gene Interaction Inference for Disease Subtype Network Generation

Ziwei Yang, Zheng Chen, XIN LIU et al.

ICLR 2025arXiv:2503.07665

#12432

The Computational Complexity of Positive Non-Clashing Teaching in Graphs

Robert Ganian, Liana Khazaliya, Fionn Mc Inerney et al.

#12433

PaLD: Detection of Text Partially Written by Large Language Models

Eric Lei, Hsiang Hsu, Chun-Fu Chen

ICLR 2025arXiv:2411.03349

#12434

RuAG: Learned-rule-augmented Generation for Large Language Models

Yudi Zhang, Pei Xiao, Lu Wang et al.

#12435

Easing Training Process of Rectified Flow Models Via Lengthening Inter-Path Distance

Shifeng Xu, Yanzhu Liu, Adams Kong

ICLR 2025arXiv:2502.02257

#12436

UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation

Tao Zhang, Jinyong Wen, Zhen Chen et al.

ICLR 2025arXiv:2601.10169

#12437

CtD: Composition through Decomposition in Emergent Communication

Boaz Carmeli, Ron Meir, Yonatan Belinkov

ICLR 2025arXiv:2412.07298

#12438

The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model

Jiawei Chen, Wentao Chen, Jing Su et al.

ICLR 2025arXiv:2502.16828

#12439

Predicting the Energy Landscape of Stochastic Dynamical System via Physics-informed Self-supervised Learning

Ruikun Li, Huandong Wang, Qingmin Liao et al.

ICLR 2025arXiv:2411.06055

#12440

Linear Spherical Sliced Optimal Transport: A Fast Metric for Comparing Spherical Data

Xinran Liu, Yikun Bai, Rocio Diaz Martin et al.

#12441

Supervised and Semi-Supervised Diffusion Maps with Label-Driven Diffusion

Harel Mendelman, Ronen Talmon

ICLR 2025arXiv:2402.06855

#12442

For Better or For Worse? Learning Minimum Variance Features With Label Augmentation

Muthu Chidambaram, Rong Ge

ICLR 2025arXiv:2406.07843

#12443

Self-Attention-Based Contextual Modulation Improves Neural System Identification

Isaac Lin, Tianye Wang, Shang Gao et al.

ICLR 2025arXiv:2406.11458

#12444

Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness

Maayan Ehrenberg, Roy Ganz, Nir Rosenfeld

ICLR 2025arXiv:2502.07384

#12445

SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection

Jingjie Zhang, Hanqun Cao, Zijun Gao et al.

ICLR 2025arXiv:2501.15857

#12446

Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?

Yutong Yin, Zhaoran Wang

ICLR 2025arXiv:2501.15326

#12447

Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data

Jiajie Li, Brian Quaranto, Chenhui Xu et al.

#12448

EcoFace: Audio-Visual Emotional Co-Disentanglement Speech-Driven 3D Talking Face Generation

Jiajian Xie, Shengyu Zhang, Mengze Li et al.

ICLR 2025arXiv:2406.12082

#12449

Uncertainty modeling for fine-tuned implicit functions

Anna Susmelj, Mael Macuglia, Natasa Tagasovska et al.

ICLR 2025arXiv:2407.01574

#12450

cryoSPHERE: Single-Particle HEterogeneous REconstruction from cryo EM

Gabriel Claude Jean Ducrocq, Lukas Grunewald, Sebastian Westenhoff et al.

ICLR 2025arXiv:2502.04643

#12451

Confidence Elicitation: A New Attack Vector for Large Language Models

Brian Formento, Chuan Sheng Foo, See-Kiong Ng

ICLR 2025arXiv:2407.02020

#12452

Decentralized Optimization with Coupled Constraints

Demyan Yarmoshik, Alexander Rogozin, Nikita Kiselev et al.

ICLR 2025arXiv:2410.09181

#12453

Can a Large Language Model be a Gaslighter?

Wei Li, Luyao Zhu, Yang Song et al.

ICLR 2025arXiv:2410.05938

#12454

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Yifei Xing, Xiangyuan Lan, Ruiping Wang et al.

ICLR 2025arXiv:2402.05835

#12455

How Much is Unseen Depends Chiefly on Information About the Seen

Seongmin Lee, Marcel Boehme

ICLR 2025arXiv:2402.05626

#12456

Loss Landscape of Shallow ReLU-like Neural Networks: Stationary Points, Saddle Escape, and Network Embedding

Frank Zhengqing Wu, Berfin Simsek, François Ged

ICLR 2025arXiv:2410.05586

#12457

TeaserGen: Generating Teasers for Long Documentaries

Weihan Xu, Paul Pu Liang, Haven Kim et al.

#12458

Geometry of Long-Tailed Representation Learning: Rebalancing Features for Skewed Distributions

Lingjie Yi, Michael Yao, Weimin Lyu et al.

ICLR 2025arXiv:2410.05609

#12459

The Breakdown of Gaussian Universality in Classification of High-dimensional Linear Factor Mixtures

Xiaoyi MAI, Zhenyu Liao

ICLR 2025arXiv:2410.01101

#12460

Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank

Wenhao Zhan, Scott Fujimoto, Zheqing Zhu et al.

ICLR 2025oralarXiv:2505.05691

#12461

Physics-informed Temporal Difference Metric Learning for Robot Motion Planning

Ruiqi Ni, zherong pan, Ahmed Hussain Qureshi

ICLR 2025arXiv:2410.07502

#12462

Adaptive Batch Size for Privately Finding Second-Order Stationary Points

Daogao Liu, Kunal Talwar

#12463

Measuring And Improving Engagement of Text-to-Image Generation Models

Varun Khurana, Yaman Singla, Jayakumar Subramanian et al.

ICLR 2025oralarXiv:2410.20922

#12464

FACTS: A Factored State-Space Framework for World Modelling

Li Nanbo, Firas Laakom, Yucheng XU et al.

ICLR 2025oralarXiv:2402.04398

#12465

Learning under Temporal Label Noise

Sujay Nagaraj, Walter Gerych, Sana Tonekaboni et al.

ICLR 2025arXiv:2408.07245

#12466

$q$-exponential family for policy optimization

Lingwei Zhu, Haseeb Shah, Han Wang et al.

ICLR 2025arXiv:2410.07500

#12467

Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels

Zhizheng Liu, Joe Lin, Wayne Wu et al.

ICLR 2025arXiv:2405.16545

#12468

VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation

Kuo-Han Hung, Pang-Chi Lo, Jia-Fong Yeh et al.

ICLR 2025arXiv:2502.00129

#12469

ProtoSnap: Prototype Alignment For Cuneiform Signs

Rachel Mikulinsky, Morris Alper, Shai Gordin et al.

ICLR 2025arXiv:2502.08209

#12470

Equivariant Masked Position Prediction for Efficient Molecular Representation

Junyi An, Chao Qu, Yun-Fei Shi et al.

ICLR 2025arXiv:2410.15557

#12471

How to Find the Exact Pareto Front for Multi-Objective MDPs?

Yining Li, Peizhong Ju, Ness Shroff

#12472

Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning

Bo Yue, Shufan Wang, Ashish Gaurav et al.

ICLR 2025arXiv:2502.14047

#12473

Towards a learning theory of representation alignment

Francesco Maria Gabriele Insulla, Shuo Huang, Lorenzo Rosasco

ICLR 2025arXiv:2410.08437

#12474

Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks

Rushang Karia, Daniel Bramblett, Daksh Dobhal et al.

ICLR 2025arXiv:2410.05021

#12475

DEPT: Decoupled Embeddings for Pre-training Language Models

Alex Iacob, Lorenzo Sani, Meghdad Kurmanji et al.

#12476

ProAdvPrompter: A Two-Stage Journey to Effective Adversarial Prompting for LLMs

Hao Di, Tong He, Haishan Ye et al.

ICLR 2025arXiv:2501.14216

#12477

TFG-Flow: Training-free Guidance in Multimodal Generative Flow

Haowei Lin, Shanda Li, Haotian Ye et al.

ICLR 2025arXiv:2408.13045

#12478

The adaptive complexity of parallelized log-concave sampling

Huanjian Zhou, Baoxiang Wang, Masashi Sugiyama

#12479

Progressive Parameter Efficient Transfer Learning for Semantic Segmentation

Nan Zhou, Huiqun Wang, Yaoyan Zheng et al.

ICLR 2025arXiv:2503.03989

#12480

Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

Xiangxin Zhou, Yi Xiao, Haowei Lin et al.

ICLR 2025arXiv:2501.15055

#12481

Group Ligands Docking to Protein Pockets

Jiaqi Guan, Jiahan Li, Xiangxin Zhou et al.

ICLR 2025arXiv:2502.11729

#12482

On Quantizing Neural Representation for Variable-Rate Video Coding

Junqi Shi, Zhujia Chen, Hanfei Li et al.

#12483

Toward Efficient Multi-Agent Exploration With Trajectory Entropy Maximization

Tianxu Li, Kun Zhu

ICLR 2025arXiv:2410.01588

#12484

DynFrs: An Efficient Framework for Machine Unlearning in Random Forest

Shurong Wang, Zhuoyang Shen, Xinbao Qiao et al.

ICLR 2025oralarXiv:2411.13056

#12485

Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark

Bing Cao, Quanhao Lu, Jiekang Feng et al.

ICLR 2025arXiv:2410.10291

#12486

Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective

Xiangru Zhu, Penglei Sun, Yaoxian Song et al.

#12487

INFER: A Neural-symbolic Model For Extrapolation Reasoning on Temporal Knowledge Graph

Ningyuan Li, Haihong E, Tianyu Yao et al.

ICLR 2025oral

ICLR 2025arXiv:2504.15513

#12488

InstaRevive: One-Step Image Enhancement via Dynamic Score Matching

Yixuan Zhu, Haolin Wang, Ao Li et al.

#12489

On Designing General and Expressive Quantum Graph Neural Networks with Applications to MILP Instance Representation

Xinyu Ye, Hao Xiong, Jianhao Huang et al.

ICLR 2025arXiv:2502.10988

#12490

OMG: Opacity Matters in Material Modeling with Gaussian Splatting

Silong Yong, Venkata Nagarjun Pudureddiyur Manivannan, Bernhard Kerbl et al.

ICLR 2025arXiv:2406.02929

#12491

ZeroDiff: Solidified Visual-semantic Correlation in Zero-Shot Learning

Zihan Ye, Shreyank Gowda, Shiming Chen et al.

#12492

Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection

Ziqing Fan, Siyuan Du, Shengchao Hu et al.

ICLR 2025arXiv:2504.11457

#12493

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Ziqi Pang, Xin Xu, Yu-Xiong Wang

ICLR 2025arXiv:2504.09205

#12494

Query-based Knowledge Transfer for Heterogeneous Learning Environments

Norah Alballa, Wenxuan Zhang, Ziquan Liu et al.

ICLR 2025arXiv:2501.18277

#12495

SEBRA : Debiasing through Self-Guided Bias Ranking

Adarsh Kappiyath, Abhra Chaudhuri, AJAY JAISWAL et al.

#12496

A Statistical Approach for Controlled Training Data Detection

Zirui Hu, Yingjie Wang, Zheng Zhang et al.

ICLR 2025arXiv:2504.12637

#12497

Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation

Linda He, Jue Wang, Maurice Weber et al.

ICLR 2025arXiv:2410.16699

#12498

Graph Transformers Dream of Electric Flow

Xiang Cheng, Lawrence Carin, Suvrit Sra

ICLR 2025arXiv:2410.09878

#12499

Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning

Yan Scholten, Stephan Günnemann

ICLR 2025arXiv:2511.16924

#12500

CBMA: Improving Conformal Prediction through Bayesian Model Averaging

Pankaj Bhagwat, Linglong Kong, Bei Jiang

#12501

Efficient Online Pruning and Abstraction for Imperfect Information Extensive-Form Games

Boning Li, Longbo Huang

#12502

Bounds on $L_p$ Errors in Density Ratio Estimation via $f$-Divergence Loss Functions

Yoshiaki Kitazawa

ICLR 2025arXiv:2410.17547

#12503

Generalizable Motion Planning via Operator Learning

Sharath Matada, Luke Bhan, Yuanyuan Shi et al.

ICLR 2025arXiv:2406.13075

#12504

Exact Community Recovery under Side Information: Optimality of Spectral Algorithms

Julia Gaudio, Nirmit Joshi

ICLR 2025arXiv:2411.03755

#12505

Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions

Sagar Shrestha, Xiao Fu

ISMAR 2025paperarXiv:2412.11762

#12506

GS-ProCams: Gaussian Splatting-Based Projector-Camera Systems

Qingyue Deng, Jijiang Li, Haibin Ling et al.

ISMAR 2025paperarXiv:2508.04326

#12507

Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research

Ke Li, Mana Masuda, Susanne Schmidt et al.

#12508

Probabilistic Verification of Cybersickness in Virtual Reality Through Bayesian Networks

Peng Wu, Nasim Ahmed, Abhiram Sarma et al.

ISMAR 2025paperarXiv:2508.14346

#12509

Exploring Organizational Strategies in Immersive Computational Notebooks

Sungwon In, Ayush Roy, Eric Krokos et al.

ISMAR 2025paperarXiv:2505.03027

#12510

Revisiting Performance Models of Distal Pointing Tasks in Virtual Reality

Logan Lane, Feiyu Lu, Shakiba Davari et al.

#12511

Can People's Brains Synchronize during Remote AR Collaboration?

Jaehwan You, Myeongul Jung, Kwanguk Kim

#12512

Exploring and Modeling the Effects of Eye-Tracking Accuracy and Precision on Gaze-Based Steering in Virtual Environments

Xuning Hu, Yichuan Zhang, Yushi Wei et al.

#12513

Birds of a Feather Augment Together: Exploring Sonic Links Between Real and Virtual Worlds in Audio Augmented Reality

Jacob Bhattacharyya, Alessandro Vinciarelli, Stephen Anthony Brewster

ISMAR 2025paperarXiv:2508.01915

#12514

EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart Glasses

Akshay Paruchuri, Sinan Hersek, Lavisha Aggarwal et al.

AAAI 2025paperarXiv:2412.15308

#12515

ViFactCheck: A New Benchmark Dataset and Methods for Multi-Domain News Fact-Checking In Vietnamese

Tran Thai Hoa, Tran Quang Duy, Khanh Quoc Tran et al.

#12516

IDseq: Decoupled and Sequentially Detecting and Grounding Multi-Modal Media Manipulation

Runxin Liu, Tian Xie, Jiaming Li et al.

AAAI 2025paperarXiv:2412.18254

#12517

RaCMC: Residual-Aware Compensation Network with Multi-Granularity Constraints for Fake News Detection

Xinquan Yu, Ziqi Sheng, Wei Lu et al.

AAAI 2025paperarXiv:2503.04865

#12518

E4: Energy-Efficient DNN Inference for Edge Video Analytics via Early Exiting and DVFS

Ziyang Zhang, Yang Zhao, Ming-Ching Chang et al.

#12519

Enhancing Identity-Deformation Disentanglement in StyleGAN for One-Shot Face Video Re-Enactment

Qing Chang, Yao-Xiang Ding, Kun Zhou

AAAI 2025paperarXiv:2502.20858

#12520

EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics

Xiaochuan Liu, Xin Cheng, Yuchong Sun et al.

#12521

Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores

Ryo Masumura, Shota Orihashi, Mana Ihori et al.

AAAI 2025paperarXiv:2412.16751

#12522

The Master Key Filters Hypothesis: Deep Filters Are General

Zahra Babaiee, Peyman M. Kiasari, Daniela Rus et al.

AAAI 2025paperarXiv:2412.17512

#12523

BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation

Oren Barkan, Yehonatan Elisha, Jonathan Weill et al.

AAAI 2025paperarXiv:2409.20500

#12524

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing

Lingling Cai, Kang Zhao, Hangjie Yuan et al.

AAAI 2025paperarXiv:2412.14837

#12525

ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects

Qihang Cao, Huangxun Chen

AAAI 2025paperarXiv:2412.19720

#12526

Sharpening Neural Implicit Functions with Frequency Consolidation Priors

Chao Chen, Yu-Shen Liu, Zhizhong Han

AAAI 2025paperarXiv:2501.04477

#12527

Rethinking High-speed Image Reconstruction Framework with Spike Camera

Kang Chen, Yajing Zheng, Tiejun Huang et al.

AAAI 2025paperarXiv:2412.17800

#12528

Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection

Yitong Chen, Wenhao Yao, Lingchen Meng et al.

AAAI 2025paperarXiv:2412.11820

#12529

Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising

Zikang Chen, Tao Jiang, Xiaowan Hu et al.

AAAI 2025paperarXiv:2412.08975

#12530

Elevating Flow-Guided Video Inpainting with Reference Generation

Suhwan Cho, Seoung Wug Oh, Sangyoun Lee et al.

AAAI 2025paperarXiv:2503.17728

#12531

DynASyn: Multi-Subject Personalization Enabling Dynamic Action Synthesis

Yongjin Choi, Chanhun Park, Seung Jun Baek

AAAI 2025paperarXiv:2501.09826

#12532

PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery

Shristi Das Biswas, Matthew Shreve, Xuelu Li et al.

#12533

Semantic Ambiguity Modeling and Propagation for Fine-Grained Visual Cross View Geo-Localization

Mingtao Feng, Fenghao Tian, Jianqiao Luo et al.

AAAI 2025paperarXiv:2411.01564

#12534

ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis

Xinyu Geng, Jiaming Wang, Xiaolin Huang et al.

#12535

You Should Learn to Stop Denoising on Point Clouds in Advance

Chuchen Guo, Weijie Zhou, Zheng Liu et al.

AAAI 2025paperarXiv:2412.08149

#12536

AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting

Zihao Han, Baoquan Zhang, Lisai Zhang et al.

#12537

Multi-Frame Deformable Look-Up Table for Compressed Video Quality Enhancement

Gang He, Guancheng Quan, Chang Wu et al.

#12538

Achieving Speed-Accuracy Balance in Vision-based 3D Occupancy Prediction via Geometric-Semantic Disentanglement

Yulin He, Wei Chen, Siqi Wang et al.

#12539

Robust and Consistent Online Video Instance Segmentation via Instance Mask Propagation

Miran Heo, Seoung Wug Oh, Seon Joo Kim et al.

AAAI 2025paperarXiv:2502.08149

#12540

Generalized Class Discovery in Instance Segmentation

Cuong Manh Hoang, Yeejin Lee, Byeongkeun Kang

AAAI 2025paperarXiv:2408.14868

#12541

ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning

Wenjin Hou, Dingjie Fu, Kun Li et al.

AAAI 2025paperarXiv:2501.10462

#12542

BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation

Xiaolu Hou, Mingcheng Li, Dingkang Yang et al.

#12543

Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation

Xiao Hu, Libo Long, Jochen Lang

#12544

LPCG: A Self-conditional Architecture for Labeled Point Cloud Generation

Dongshuo Huang, Xiaoshui Huang, Chengdong Zhang et al.

AAAI 2025paperarXiv:2501.07762

#12545

PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration

Xiaoshui Huang, Zhou Huang, Yifan Zuo et al.

AAAI 2025paperarXiv:2502.19769

#12546

QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects

Elkhan Ismayilzada, MD Khalequzzaman Chowdhury Sayem, Yihalem Yimolal Tiruneh et al.

AAAI 2025paperarXiv:2412.09050

#12547

ContextHOI: Spatial Context Learning for Human-Object Interaction Detection

Mingda Jia, Liming Zhao, Ge Li et al.

AAAI 2025paperarXiv:2412.16467

#12548

Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions

Sijia Jiang, Tong Wu, Jing Hua et al.

AAAI 2025paperarXiv:2412.17523

#12549

Constructing Fair Latent Space for Intersection of Fairness and Explainability

Hyungjun Joo, Hyeonggeun Han, Sehwan Kim et al.

AAAI 2025paperarXiv:2501.02640

#12550

Multispectral Pedestrian Detection with Sparsely Annotated Label

Chan Lee, Seungho Shin, Gyeong-Moon Park et al.

AAAI 2025paperarXiv:2412.19543

#12551

Diverse Rare Sample Generation with Pretrained GANs

Subeen Lee, Jiyeon Han, Soyeon Kim et al.

AAAI 2025paperarXiv:2504.04687

#12552

Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal

Yicheng Leng, Chaowei Fang, Junye Chen et al.

#12553

M²RL-Net: Multi-View and Multi-Level Relation Learning Network for Weakly-Supervised Image Forgery Detection

Jiafeng Li, Ying Wen, Lianghua He

AAAI 2025paperarXiv:2502.19751

#12554

Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval

Jiaxing Li, Lin Jiang, Zeqi Ma et al.

#12555

Multi-View 3D Human Pose Estimation with Weakly Synchronized Images

Ling Li, Ruiwen Gu, Chongyang Wang et al.

#12556

SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing

Ruihuang Li, Liyi Chen, Zhengqiang Zhang et al.

AAAI 2025paperarXiv:2506.02448

#12557

VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos

Baoyu Liang, Qile Su, Shoutai Zhu et al.

AAAI 2025paperarXiv:2412.10176

#12558

UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection

HaoMiao Liu, Hao Xu, Chuhuai Yue et al.

AAAI 2025paperarXiv:2408.13226

#12559

D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching

Jingyu Liu, Minquan Wang, Ye Ma et al.

#12560

Efficient Deformable Convolutional Prompt for Continual Test-Time Adaptation in Medical Image Segmentation

Shiyu Liu, Daoqiang Zhang, Xiaoke Hao

#12561

MUN: Image Forgery Localization Based on M³ Encoder and UN Decoder

Yaqi Liu, Shuhuan Chen, Haichao Shi et al.

#12562

Enhancing Low-Light Images: A Synthetic Data Perspective on Practical and Generalizable Solutions

Yu Long, Qinghua Lin, Zhihua Wang et al.

AAAI 2025paperarXiv:2401.13329

#12563

Generative Video Diffusion for Unseen Novel Semantic Video Moment Retrieval

Dezhao Luo, Shaogang Gong, Jiabo Huang et al.

AAAI 2025paperarXiv:2412.11917

#12564

Does VLM Classification Benefit from LLM Description Semantics?

Pingchuan Ma, Lennart Rietdorf, Dmytro Kotovenko et al.

#12565

Novel View Synthesis Under Large-Deviation Viewpoint for Autonomous Driving

Xin Ma, Jiguang Zhang, Peng Lu et al.

#12566

Relaxed Class-consensus Consistency for Semi-supervised Semantic Segmentation

Huayu Mai, Rui Sun, Feng Wu

AAAI 2025paperarXiv:2412.18404

#12567

Extract Free Dense Misalignment from CLIP

JeongYeon Nam, Jinbae Im, Wonjae Kim et al.

AAAI 2025paperarXiv:2412.13156

#12568

S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging

Yimu Pan, Sitao Zhang, Alison D. Gernand et al.

AAAI 2025paperarXiv:2412.14821

#12569

PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation

Shoumeng Qiu, Xinrun Li, Xiangyang Xue et al.

AAAI 2025paperarXiv:2502.03359

#12570

GHOST: Gaussian Hypothesis Open-Set Technique

Ryan Rabinowitz, Steve Cruz, Manuel Günther et al.

AAAI 2025paperarXiv:2409.08272

#12571

Click2Mask: Local Editing with Dynamic Mask Generation

Omer Regev, Omri Avrahami, Dani Lischinski

AAAI 2025paperarXiv:2503.19283

#12572

ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency

Yang Ren, Hai Jiang, Menglong Yang et al.

AAAI 2025paperarXiv:2412.08357

#12573

Video Summarization Using Denoising Diffusion Probabilistic Model

Zirui Shang, Yubo Zhu, Hongxi Li et al.

AAAI 2025paperarXiv:2502.05902

#12574

Fast Omni-Directional Image Super-Resolution: Adapting the Implicit Image Function with Pixel and Semantic-Wise Spherical Geometric Priors

Xuelin Shen, Yitong Wang, Silin Zheng et al.

AAAI 2025paperarXiv:2405.05858

#12575

Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera

Haixin Shi, Yinlin Hu, Daniel Koguciuk et al.

AAAI 2025paperarXiv:2408.11810

#12576

Pixel Is Not a Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models

Chun-Yen Shih, Li-Xuan Peng, Jia-Wei Liao et al.

AAAI 2025paperarXiv:2502.03459

#12577

SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living

Arkaprava Sinha, Dominick Reilly, Francois Bremond et al.

AAAI 2025paperarXiv:2412.13708

#12578

JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts

Taein Son, Soo Won Seo, Jisong Kim et al.

AAAI 2025paperarXiv:2306.05497

#12579

Enhancing Noise-Robust Losses for Large-Scale Noisy Data Learning

Max Staats, Matthias Thamm, Bernd Rosenow

AAAI 2025paperarXiv:2409.04050

#12580

EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution

Xi Su, Xiangfei Shen, Mingyang Wan et al.

#12581

Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation

Yifei Su, Dong An, Kehan Chen et al.

AAAI 2025paperarXiv:2412.14692

#12582

Explicit Relational Reasoning Network for Scene Text Detection

Yuchen Su, Zhineng Chen, Yongkun Du et al.

AAAI 2025paperarXiv:2412.14473

#12583

Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI Analysis

Kunming Tang, Zhiguo Jiang, Jun Shi et al.

AAAI 2025paperarXiv:2502.17766

#12584

Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking

Xin Tong, Shi Peng, Baojie Tian et al.

AAAI 2025paperarXiv:2501.07100

#12585

Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics

Tze Ho Elden Tse, Runyang Feng, Linfang Zheng et al.

#12586

Overcoming Heterogeneous Data in Federated Medical Vision-Language Pre-training: A Triple-Embedding Model Selector Approach

Aowen Wang, Zhiwang Zhang, Dongang Wang et al.

#12587

SSC-VAE: Structured Sparse Coding Based Variational Autoencoder for Detail Preserved Image Reconstruction

Hao Wang, Lu Wang, Zhongyu Wang et al.

#12588

Bright-NeRF: Brightening Neural Radiance Field with Color Restoration from Low-Light RAW Images

Min Wang, Xin Huang, Guoqing Zhou et al.

#12589

HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation

Xiaolong Wang, Lei Yu, Yingying Zhang et al.

#12590

Aligning Composed Query with Image via Discriminative Perception from Negative Correspondences

Yifan Wang, Wuliang Huang, Chun Yuan

#12591

AnyTalk: Multi-modal Driven Multi-domain Talking Head Generation

Yu Wang, Yunfei Liu, Fa-Ting Hong et al.

AAAI 2025paperarXiv:2501.15409

#12592

TdAttenMix: Top-Down Attention Guided Mixup

Zhiming Wang, Lin Gu, Feng Lu

AAAI 2025paperarXiv:2412.12672

#12593

Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation

Dongyue Wu, Zilin Guo, Li Yu et al.

#12594

LVPTrack: High Performance Domain Adaptive UAV Tracking with Label Aligned Visual Prompt Tuning

Hongjing Wu, Siyuan Yao, Feng Huang et al.

AAAI 2025paperarXiv:2412.17022

#12595

FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos

Zhengqian Wu, Ruizhe Li, Zijun Xu et al.

#12596

Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis

Zhenhua Wu, Linxuan Jiang, Xiang Li et al.

AAAI 2025paperarXiv:2408.12454

#12597

Relaxed Rotational Equivariance via G-Biases in Vision

Zhiqiang Wu, Yingjie Liu, Licheng Sun et al.

#12598

Exploiting Continuous Motion Clues for Vision-Based Occupancy Prediction

Haoran Xu, Peixi Peng, Xinyi Zhang et al.

#12599

Physical-aware Neural Radiance Fields for Efficient Exposure Correction

Kai Xu, Mingwen Shao, Yuanjian Qiao et al.

AAAI 2025paperarXiv:2412.05596

#12600

TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances

Wenting Xu, Viorela Ila, Luping Zhou et al.