Most Cited 2025 "human-ai interaction" Papers

22,274 papers found • Page 32 of 112

Filters:Most Cited 2025 human-ai interaction Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#6201

Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences

Yunhong Lu, Qichao Wang, Hengyuan Cao et al.

ICML 2025arXiv:2506.02698

citations

#6202

FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction

Yitong Duan, Weiran Wang, Jian Li

AAAI 2025paperarXiv:2502.05218

citations

#6203

MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval

Reno Kriz, Kate Sanders, David Etter et al.

CVPR 2025arXiv:2410.11619

citations

#6204

MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent

Xinyao Liao, Xianfang Zeng, Liao Wang et al.

ICCV 2025arXiv:2502.03207

citations

#6205

Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models

Junyi Li, Hwee Tou Ng

NEURIPS 2025arXiv:2505.24630

citations

#6206

OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images

Ziyue Huang, Yongchao Feng, Ziqi Liu et al.

ICCV 2025arXiv:2503.06146

citations

#6207

Variational Regularized Unbalanced Optimal Transport: Single Network, Least Action

Yuhao Sun, Zhenyi Zhang, Zihan Wang et al.

NEURIPS 2025arXiv:2505.11823

citations

#6208

Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models

Yan Xie, Zequn Zeng, Hao Zhang et al.

CVPR 2025arXiv:2505.07209

citations

#6209

Is Complex Query Answering Really Complex?

Cosimo Gregucci, Bo Xiong, Daniel Hernández et al.

ICML 2025spotlightarXiv:2410.12537

citations

#6210

AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations

Junli Liu, Qizhi Chen, Zhigang Wang et al.

ICCV 2025arXiv:2504.07836

citations

#6211

TVNet: A Novel Time Series Analysis Method Based on Dynamic Convolution and 3D-Variation

Chenghan Li, Mingchen LI, Ruisheng Diao

ICLR 2025arXiv:2503.07674

citations

#6212

Physics-Informed Generative Modeling of Wireless Channels

Benedikt Böck, Andreas Oeldemann, Timo Mayer et al.

ICML 2025arXiv:2502.10137

citations

#6213

Provable and Practical Online Learning Rate Adaptation with Hypergradient Descent

Ya-Chi Chu, Wenzhi Gao, Yinyu Ye et al.

ICML 2025arXiv:2502.11229

citations

#6214

CWNet: Causal Wavelet Network for Low-Light Image Enhancement

Tongshun Zhang, Pingping Liu, Yubing Lu et al.

ICCV 2025arXiv:2507.10689

citations

#6215

SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model

Shuhan Tan, John Wheatley Lambert, Hong Jeon et al.

CVPR 2025arXiv:2506.21976

citations

#6216

Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities

Liuyi Wang, Xinyuan Xia, Hui Zhao et al.

ICCV 2025arXiv:2507.13019

citations

#6217

Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection

Kedi Chen, Qin Chen, Jie Zhou et al.

AAAI 2025paperarXiv:2501.02020

citations

#6218

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn

Hongyao Tang, Johan Obando-Ceron, Pablo Samuel Castro et al.

ICML 2025oralarXiv:2506.00592

citations

#6219

GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill

Jieming Cui, Tengyu Liu, Ziyu Meng et al.

CVPR 2025arXiv:2504.04191

citations

#6220

Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models

Chen Chen, Daochang Liu, Mubarak Shah et al.

CVPR 2025arXiv:2504.18032

citations

#6221

Efficient Active Imitation Learning with Random Network Distillation

Emilien Biré, Anthony Kobanda, Ludovic Denoyer et al.

ICLR 2025arXiv:2411.01894

citations

#6222

Attributing Culture-Conditioned Generations to Pretraining Corpora

Huihan Li, Arnav Goel, Keyu He et al.

ICLR 2025arXiv:2412.20760

citations

#6223

ROPO: Robust Preference Optimization for Large Language Models

Xize Liang, Chao Chen, Shuang Qiu et al.

ICML 2025arXiv:2404.04102

citations

#6224

Privacy Attacks on Image AutoRegressive Models

Antoni Kowalczuk, Jan Dubiński, Franziska Boenisch et al.

ICML 2025arXiv:2502.02514

citations

#6225

Conformalized Interactive Imitation Learning: Handling Expert Shift and Intermittent Feedback

Michelle Zhao, Henny Admoni, Reid Simmons et al.

ICLR 2025arXiv:2410.08852

citations

#6226

The Bandit Whisperer: Communication Learning for Restless Bandits

Yunfan Zhao, Tonghan Wang, Dheeraj Mysore Nagaraj et al.

AAAI 2025paperarXiv:2408.05686

citations

#6227

TruthPrInt: Mitigating Large Vision-Language Models Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Jinhao Duan, Fei Kong, Hao Cheng et al.

ICCV 2025

citations

#6228

Multi-Agent Motion Planning for Differential Drive Robots Through Stationary State Search

Jingtian Yan, Jiaoyang Li

AAAI 2025paperarXiv:2412.13359

citations

#6229

RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation

Kaidong Zhang, Rongtao Xu, Ren Pengzhen et al.

ICCV 2025arXiv:2505.01709

citations

#6230

Fully Test-time Adaptation for Tabular Data

Zhi Zhou, Kun-Yang Yu, Lan-Zhe Guo et al.

AAAI 2025paperarXiv:2412.10871

citations

#6231

Factor Augmented Tensor-on-Tensor Neural Networks

Guanhao Zhou, Yuefeng Han, Xiufan Yu

AAAI 2025paperarXiv:2405.19610

citations

#6232

MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs

Andreas Opedal, Haruki Shirakami, Bernhard Schölkopf et al.

ICLR 2025arXiv:2410.13502

citations

#6233

Detecting Visual Information Manipulation Attacks in Augmented Reality: A Multimodal Semantic Reasoning Approach

Yanming Xiu, Maria Gorlatova

ISMAR 2025paperarXiv:2507.20356

citations

#6234

CO-MOT: Boosting End-to-end Transformer-based Multi-Object Tracking via Coopetition Label Assignment and Shadow Sets

feng yan, Weixin Luo, Yujie Zhong et al.

ICLR 2025

citations

#6235

On the Expressiveness and Length Generalization of Selective State Space Models on Regular Languages

Aleksandar Terzic, Michael Hersche, Giacomo Camposampiero et al.

AAAI 2025paper

citations

#6236

CLIPDrag: Combining Text-based and Drag-based Instructions for Image Editing

Ziqi Jiang, Zhen Wang, Long Chen

ICLR 2025arXiv:2410.03097

citations

#6237

Scene Map-based Prompt Tuning for Navigation Instruction Generation

Sheng Fan, Rui Liu, Wenguan Wang et al.

CVPR 2025

citations

#6238

AgentRecBench: Benchmarking LLM Agent-based Personalized Recommender Systems

Yu Shang, Peijie Liu, Yuwei Yan et al.

NEURIPS 2025spotlightarXiv:2505.19623

citations

#6239

Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

Jun Zhang, Jue Wang, Huan Li et al.

ICLR 2025arXiv:2502.13533

citations

#6240

Depth-Bounds for Neural Networks via the Braid Arrangement

Moritz Grillo, Christoph Hertrich, Georg Loho

NEURIPS 2025oralarXiv:2502.09324

citations

#6241

Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting

Zhining Liu, Ze Yang, Xiao Lin et al.

ICML 2025oralarXiv:2505.18442

citations

#6242

GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation

Weihang Li, Hongli XU, Junwen Huang et al.

CVPR 2025arXiv:2502.04293

citations

#6243

TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference

Jack Min Ong, Matthew Di Ferrante, Aaron Pazdera et al.

ICML 2025arXiv:2501.16007

citations

#6244

Error Bounds for Gaussian Process Regression Under Bounded Support Noise with Applications to Safety Certification

Robert Reed, Luca Laurenti, Morteza Lahijanian

AAAI 2025paperarXiv:2408.09033

citations

#6245

Improving Complex Reasoning with Dynamic Prompt Corruption: A Soft Prompt Optimization Approach

Sinan Fan, Liang Xie, Chen Shen et al.

ICLR 2025arXiv:2503.13208

citations

#6246

Doubly Robust Conformalized Survival Analysis with Right-Censored Data

Matteo Sesia, vladimir svetnik

ICML 2025spotlightarXiv:2412.09729

citations

#6247

Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections

Xiaomeng Xu, Yifan Hou, Zeyi Liu et al.

NEURIPS 2025arXiv:2506.16685

citations

#6248

BrainOOD: Out-of-distribution Generalizable Brain Network Analysis

Jiaxing Xu, Yongqiang Chen, Xia Dong et al.

ICLR 2025arXiv:2502.01688

citations

#6249

DebGCD: Debiased Learning with Distribution Guidance for Generalized Category Discovery

Yuanpei Liu, Kai Han

ICLR 2025arXiv:2504.04804

citations

#6250

Salvaging the Overlooked: Leveraging Class-Aware Contrastive Learning for Multi-Class Anomaly Detection

Lei Fan, Junjie Huang, Donglin Di et al.

ICCV 2025arXiv:2412.04769

citations

#6251

GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting

Yusen XIE, Zhenmin Huang, Jin Wu et al.

ICCV 2025arXiv:2410.17084

citations

#6252

HyGen: Efficient LLM Serving via Elastic Online-Offline Request Co-location

Ting Sun, Penghan Wang, Fan Lai

NEURIPS 2025arXiv:2501.14808

citations

#6253

Can Students Beyond the Teacher? Distilling Knowledge from Teacher’s Bias

Jianhua Zhang, Yi Gao, Ruyu Liu et al.

AAAI 2025paperarXiv:2412.09874

citations

#6254

Adaptive Gradient Clipping for Robust Federated Learning

Youssef Allouah, Rachid Guerraoui, Nirupam Gupta et al.

ICLR 2025arXiv:2405.14432

citations

#6255

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Bingjie Gao, Xinyu Gao, Xiaoxue Wu et al.

CVPR 2025arXiv:2504.11739

citations

#6256

BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models

Xingyu Zheng, Xianglong Liu, Haotong Qin et al.

ICLR 2025arXiv:2404.05662

citations

#6257

Question-Aware Gaussian Experts for Audio-Visual Question Answering

Hongyeob Kim, Inyoung Jung, Dayoon Suh et al.

CVPR 2025highlightarXiv:2503.04459

citations

#6258

Expressivity of Neural Networks with Random Weights and Learned Biases

Ezekiel Williams, Alexandre Payeur, Avery Ryoo et al.

ICLR 2025arXiv:2407.00957

citations

#6259

Continual Learning Using a Kernel-Based Method Over Foundation Models

Saleh Momeni, Sahisnu Mazumder, Bing Liu

AAAI 2025paperarXiv:2412.15571

citations

#6260

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

Jinhong Ni, Chang-Bin Zhang, Qiang Zhang et al.

ICCV 2025arXiv:2505.22129

citations

#6261

AutoSGNN: Automatic Propagation Mechanism Discovery for Spectral Graph Neural Networks

Shibing Mo, Kai Wu, Qixuan Gao et al.

AAAI 2025paperarXiv:2412.12483

citations

#6262

Sequential Conditional Transport on Probabilistic Graphs for Interpretable Counterfactual Fairness

Agathe Fernandes Machado, Arthur Charpentier, Ewen Gallic

AAAI 2025paperarXiv:2408.03425

citations

#6263

LEDiff: Latent Exposure Diffusion for HDR Generation

Chao Wang, Zhihao Xia, Thomas Leimkuehler et al.

CVPR 2025arXiv:2412.14456

citations

#6264

UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation

Huimin LU, Masaru Isonuma, Junichiro Mori et al.

ICLR 2025arXiv:2504.20500

citations

#6265

Kinetic Langevin Diffusion for Crystalline Materials Generation

François Cornet, Federico Bergamin, Arghya Bhowmik et al.

ICML 2025arXiv:2507.03602

citations

#6266

Training Consistent Mixture-of-Experts-Based Prompt Generator for Continual Learning

Yue Lu, Shizhou Zhang, De Cheng et al.

AAAI 2025paper

citations

#6267

Distance-Based Tree-Sliced Wasserstein Distance

Viet-Hoang Tran, Minh-Khoi Nguyen-Nhat, Trang Pham et al.

ICLR 2025arXiv:2503.11050

citations

#6268

Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation

Itamar Zimerman, ameen ali ali, Lior Wolf

ICLR 2025arXiv:2405.16504

citations

#6269

MobileIE: An Extremely Lightweight and Effective ConvNet for Real-Time Image Enhancement on Mobile Devices

HAILONG YAN, Ao Li, Xiangtao Zhang et al.

ICCV 2025arXiv:2507.01838

citations

#6270

RelationField: Relate Anything in Radiance Fields

Sebastian Koch, Johanna Wald, Mirco Colosi et al.

CVPR 2025arXiv:2412.13652

citations

#6271

AnoLLM: Large Language Models for Tabular Anomaly Detection

Che-Ping Tsai, Ganyu Teng, Phillip Wallis et al.

ICLR 2025

citations

#6272

Causal Representation Learning from Multimodal Biomedical Observations

Yuewen Sun, Lingjing Kong, Guangyi Chen et al.

ICLR 2025arXiv:2411.06518

citations

#6273

HotSpot: Signed Distance Function Optimization with an Asymptotically Sufficient Condition

Zimo Wang, Cheng Wang, Taiki Yoshino et al.

CVPR 2025highlightarXiv:2411.14628

citations

#6274

The emergence of sparse attention: impact of data distribution and benefits of repetition

Nicolas Zucchet, Francesco D'Angelo, Andrew Lampinen et al.

NEURIPS 2025oralarXiv:2505.17863

citations

#6275

Assessing Pre-Trained Models for Transfer Learning Through Distribution of Spectral Components

Tengxue Zhang, Yang Shu, Xinyang Chen et al.

AAAI 2025paperarXiv:2412.19085

citations

#6276

Feature Responsiveness Scores: Model-Agnostic Explanations for Recourse

Seung Hyun Cheon, Anneke Wernerfelt, Sorelle Friedler et al.

ICLR 2025arXiv:2410.22598

citations

#6277

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Yi Ding, Ruqi Zhang

NEURIPS 2025arXiv:2505.22651

citations

#6278

Balancing Multimodal Training Through Game-Theoretic Regularization

Konstantinos Kontras, Thomas Strypsteen, Christos Chatzichristos et al.

NEURIPS 2025spotlightarXiv:2411.07335

citations

#6279

AtomSurf: Surface Representation for Learning on Protein Structures

Vincent Mallet, Yangyang Miao, Souhaib Attaiki et al.

ICLR 2025arXiv:2309.16519

citations

#6280

CREIMBO: Cross-Regional Ensemble Interactions in Multi-view Brain Observations

Noga Mudrik, Ryan Ly, Oliver Ruebel et al.

ICLR 2025oralarXiv:2405.17395

citations

#6281

Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization

Hao Ju, Shaofei Huang, Si Liu et al.

ICCV 2025arXiv:2411.13610

citations

#6282

ARIG: Autoregressive Interactive Head Generation for Real-time Conversations

Ying Guo, Xi Liu, Cheng Zhen et al.

ICCV 2025arXiv:2507.00472

citations

#6283

LIRM: Large Inverse Rendering Model for Progressive Reconstruction of Shape, Materials and View-dependent Radiance Fields

Zhengqin Li, Dilin Wang, Ka chen et al.

CVPR 2025arXiv:2504.20026

citations

#6284

Object-centric binding in Contrastive Language-Image Pretraining

Rim Assouel, Pietro Astolfi, Florian Bordes et al.

NEURIPS 2025arXiv:2502.14113

citations

#6285

Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control

Xianghui Ze, Zhenbo Song, Qiwei Wang et al.

ICLR 2025arXiv:2502.03498

citations

#6286

Modeling Cell Dynamics and Interactions with Unbalanced Mean Field Schrödinger Bridge

Zhenyi Zhang, Zihan Wang, Yuhao Sun et al.

NEURIPS 2025arXiv:2505.11197

citations

#6287

Emergent Response Planning in LLMs

Zhichen Dong, Zhanhui Zhou, Zhixuan Liu et al.

ICML 2025arXiv:2502.06258

citations

#6288

EEE-Bench: A Comprehensive Multimodal Electrical And Electronics Engineering Benchmark

Ming Li, Jike Zhong, Tianle Chen et al.

CVPR 2025arXiv:2411.01492

citations

#6289

Small Singular Values Matter: A Random Matrix Analysis of Transformer Models

Max Staats, Matthias Thamm, Bernd Rosenow

NEURIPS 2025arXiv:2410.17770

citations

#6290

Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment

Jinhao Jiang, Junyi Li, Xin Zhao et al.

ICLR 2025arXiv:2407.10804

citations

#6291

VCT: Training Consistency Models with Variational Noise Coupling

Gianluigi Silvestri, Luca Ambrogioni, Chieh-Hsin Lai et al.

ICML 2025arXiv:2502.18197

citations

#6292

Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing

Yudong Liu, Jingwei Sun, Yueqian Lin et al.

ICCV 2025arXiv:2503.10742

citations

#6293

Enhancing 3D Reconstruction for Dynamic Scenes

Jisang Han, Honggyu An, Jaewoo Jung et al.

NEURIPS 2025oralarXiv:2504.06264

citations

#6294

Adaptive Part Learning for Fine-Grained Generalized Category Discovery: A Plug-and-Play Enhancement

Qiyuan Dai, Hanzhuo Huang, Yu Wu et al.

CVPR 2025arXiv:2507.06928

citations

#6295

On the Transfer of Object-Centric Representation Learning

Aniket Rajiv Didolkar, Andrii Zadaianchuk, Anirudh Goyal et al.

ICLR 2025

citations

#6296

Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors

Tianchun Wang, Yuanzhou Chen, Zichuan Liu et al.

ICLR 2025arXiv:2410.19230

citations

#6297

Improving Language Model Distillation through Hidden State Matching

Sayantan Dasgupta, Trevor Cohn

ICLR 2025

citations

#6298

On the Completeness of Invariant Geometric Deep Learning Models

Zian Li, Xiyuan Wang, Shijia Kang et al.

ICLR 2025arXiv:2402.04836

citations

#6299

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

Liang CHEN, Xueting Han, Li Shen et al.

ICML 2025arXiv:2506.03850

citations

#6300

Provable Maximum Entropy Manifold Exploration via Diffusion Models

Riccardo De Santi, Marin Vlastelica, Ya-Ping Hsieh et al.

ICML 2025arXiv:2506.15385

citations

#6301

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Yuze He, Yanning Zhou, Wang Zhao et al.

CVPR 2025arXiv:2411.05738

citations

#6302

It’s a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data

Dominik Schnaus, Nikita Araslanov, Daniel Cremers

CVPR 2025arXiv:2503.24129

citations

#6303

Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models

Hao Cheng, Erjia Xiao, Jiayan Yang et al.

CVPR 2025arXiv:2412.05538

citations

#6304

Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL

Ghada Sokar, Johan S Obando Ceron, Aaron Courville et al.

ICLR 2025arXiv:2410.01930

citations

#6305

FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations

Hmrishav Bandyopadhyay, Yi-Zhe Song

CVPR 2025arXiv:2411.10818

citations

#6306

Aligning Protein Conformation Ensemble Generation with Physical Feedback

Jiarui Lu, Xiaoyin Chen, Stephen Lu et al.

ICML 2025arXiv:2505.24203

citations

#6307

Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation

Chen Dun, Mirian Del Carmen Hipolito Garcia, Guoqing Zheng et al.

AAAI 2025paperarXiv:2310.02842

citations

#6308

Do Computer Vision Foundation Models Learn the Low-level Characteristics of the Human Visual System?

Yancheng Cai, Fei Yin, Dounia Hammou et al.

CVPR 2025highlightarXiv:2502.20256

citations

#6309

Image Quality Assessment: From Human to Machine Preference

Chunyi Li, Yuan Tian, Xiaoyue Ling et al.

CVPR 2025highlightarXiv:2503.10078

citations

#6310

Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models

Zekai Zhao, Qi Liu, Kun Zhou et al.

NEURIPS 2025spotlightarXiv:2505.17697

citations

#6311

Classifier-Free Guidance Inside the Attraction Basin May Cause Memorization

Anubhav Jain, Yuya Kobayashi, Takashi Shibuya et al.

CVPR 2025arXiv:2411.16738

citations

#6312

Training Language Models to Generate Quality Code with Program Analysis Feedback

Feng Yao, Zilong Wang, Liyuan Liu et al.

NEURIPS 2025arXiv:2505.22704

citations

#6313

Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity

Sung Ju Lee, Nam Ik Cho

ICCV 2025arXiv:2509.07647

citations

#6314

Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction

Seungtae Nam, Xiangyu Sun, Gyeongjin Kang et al.

CVPR 2025highlightarXiv:2412.06234

citations

#6315

Pose Priors from Language Models

Sanjay Subramanian, Evonne Ng, Lea Müller et al.

CVPR 2025arXiv:2405.03689

citations

#6316

InstaSHAP: Interpretable Additive Models Explain Shapley Values Instantly

James Enouen, Yan Liu

ICLR 2025arXiv:2502.14177

citations

#6317

A Simple yet Effective Layout Token in Large Language Models for Document Understanding

Zhaoqing Zhu, Chuwei Luo, Zirui Shao et al.

CVPR 2025arXiv:2503.18434

citations

#6318

VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning

Han Lin, Tushar Nagarajan, Nicolas Ballas et al.

ICLR 2025arXiv:2410.03478

citations

#6319

Stochastic Online Instrumental Variable Regression: Regrets for Endogeneity and Bandit Feedback

Riccardo Della Vecchia, Debabrota Basu

AAAI 2025paperarXiv:2302.09357

citations

#6320

Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

Yuejiang Liu, Jubayer Hamid, Annie Xie et al.

ICLR 2025oralarXiv:2408.17355

citations

#6321

Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos

Rundong Luo, Matthew Wallingford, Ali Farhadi et al.

ICCV 2025arXiv:2504.07940

citations

#6322

Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning

Jaehyeon Son, Soochan Lee, Gunhee Kim

ICLR 2025arXiv:2502.19009

citations

#6323

MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity

Kanghyun Choi, Hyeyoon Lee, Dain Kwon et al.

AAAI 2025paperarXiv:2407.20021

citations

#6324

Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification

Yanghao Wang, Long Chen

CVPR 2025arXiv:2408.16266

citations

#6325

Enhancing Target-unspecific Tasks through a Features Matrix

Fangming Cui, Yonggang Zhang, Xuan Wang et al.

ICML 2025arXiv:2505.03414

citations

#6326

Graph Neural Ricci Flow: Evolving Feature from a Curvature Perspective

Jialong Chen, Bowen Deng, Zhen WANG et al.

ICLR 2025

citations

#6327

Generalized Dimension Reduction Using Semi-Relaxed Gromov-Wasserstein Distance

Ranthony A. Clark, Tom Needham, Thomas Weighill

AAAI 2025paperarXiv:2405.15959

citations

#6328

DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows

Mashrur M. Morshed, Vishnu Naresh Boddeti

CVPR 2025arXiv:2504.07894

citations

#6329

Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

Yang Cai, Gabriele Farina, Julien Grand-Clément et al.

ICLR 2025arXiv:2311.00676

citations

#6330

PWM: Policy Learning with Multi-Task World Models

Ignat Georgiev, Varun Giridhar, Nick Hansen et al.

ICLR 2025arXiv:2407.02466

citations

#6331

CAX: Cellular Automata Accelerated in JAX

Maxence Faldor, Antoine Cully

ICLR 2025arXiv:2410.02651

citations

#6332

SVasP: Self-Versatility Adversarial Style Perturbation for Cross-Domain Few-Shot Learning

Wenqian Li, Pengfei Fang, Hui Xue

AAAI 2025paperarXiv:2412.09073

citations

#6333

REDUCIO! Generating 1K Video within 16 Seconds using Extremely Compressed Motion Latents

Rui Tian, Qi Dai, Jianmin Bao et al.

ICCV 2025arXiv:2411.13552

citations

#6334

HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation

Hongwei Zheng, Han Li, Wenrui Dai et al.

CVPR 2025arXiv:2503.23331

citations

#6335

Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection

Romain Thoreau, Valerio Marsocci, Dawa Derksen

ICCV 2025arXiv:2503.09493

citations

#6336

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Hongbo Liu, Jingwen He, Yi Jin et al.

NEURIPS 2025arXiv:2506.21356

citations

#6337

Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment

Haoyuan Wu, Haisheng Zheng, Yuan Pu et al.

ICLR 2025arXiv:2502.12732

citations

#6338

Loosely Synchronized Rule-Based Planning for Multi-Agent Path Finding with Asynchronous Actions

Shuai Zhou, Shizhe Zhao, Zhongqiang Ren

AAAI 2025paperarXiv:2412.11678

citations

#6339

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics

Lee Chae-Yeon, Oh Hyun-Bin, Han EunGi et al.

CVPR 2025highlightarXiv:2503.20308

citations

#6340

Visual Lexicon: Rich Image Features in Language Space

XuDong Wang, Xingyi Zhou, Alireza Fathi et al.

CVPR 2025arXiv:2412.06774

citations

#6341

BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments

Xinghao Wang, Pengyu Wang, Bo Wang et al.

ICLR 2025arXiv:2410.23918

citations

#6342

Adjoint Schrödinger Bridge Sampler

Guan-Horng Liu, Jaemoo Choi, Yongxin Chen et al.

NEURIPS 2025oralarXiv:2506.22565

citations

#6343

Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models

Qiong Wu, Zhaoxi Ke, Yiyi Zhou et al.

ICLR 2025

citations

#6344

DoF: A Diffusion Factorization Framework for Offline Multi-Agent Reinforcement Learning

Chao Li, Ziwei Deng, Chenxing Lin et al.

ICLR 2025

citations

#6345

Uncertainty Quantification with the Empirical Neural Tangent Kernel

Joseph Wilson, Chris van der Heide, Liam Hodgkinson et al.

NEURIPS 2025arXiv:2502.02870

citations

#6346

Time-o1: Time-Series Forecasting Needs Transformed Label Alignment

Hao Wang, Licheng Pan, Zhichao Chen et al.

NEURIPS 2025oralarXiv:2505.17847

citations

#6347

Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation

Fangyuan Wang, Shipeng Lyu, Peng Zhou et al.

AAAI 2025paperarXiv:2503.08084

citations

#6348

Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning

Siyuan Li, Feifan Liu, Lingfei Cui et al.

AAAI 2025paperarXiv:2411.06920

citations

#6349

Can Generative Video Models Help Pose Estimation?

Ruojin Cai, Jason Y. Zhang, Philipp Henzler et al.

CVPR 2025highlightarXiv:2412.16155

citations

#6350

Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning

Jianming Chen, Yawen Wang, Junjie Wang et al.

AAAI 2025paperarXiv:2412.15619

citations

#6351

Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning

Yu Zhang, Jialei Zhou, Xinchen Li et al.

NEURIPS 2025arXiv:2505.19261

citations

#6352

Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model

Shengjun Zhang, Jinzhao Li, Xin Fei et al.

CVPR 2025arXiv:2504.02764

citations

#6353

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Ruichen Shao, Bei Li, Gangao Liu et al.

ICLR 2025oralarXiv:2502.14340

citations

#6354

Dense Video Object Captioning from Disjoint Supervision

Xingyi Zhou, Anurag Arnab, Chen Sun et al.

ICLR 2025oralarXiv:2306.11729

citations

#6355

Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning

Xiaolei Chen, Junchi Yan, Wenlong Liao et al.

AAAI 2025paperarXiv:2501.12799

citations

#6356

Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning

Bardia Safaei, Faizan Siddiqui, Jiacong Xu et al.

CVPR 2025highlightarXiv:2503.07591

citations

#6357

RealEdit: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations

Peter Sushko, Ayana Bharadwaj, Zhi Yang Lim et al.

CVPR 2025arXiv:2502.03629

citations

#6358

ChatHuman: Chatting about 3D Humans with Tools

Jing Lin, Yao Feng, Weiyang Liu et al.

CVPR 2025arXiv:2405.04533

citations

#6359

ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting

Chengyou Jia, Changliang Xia, Zhuohang Dang et al.

CVPR 2025arXiv:2411.17176

citations

#6360

Equivariant Symmetry Breaking Sets

YuQing Xie, Tess Smidt

ICLR 2025arXiv:2402.02681

citations

#6361

Long-Term EEG Partitioning for Seizure Onset Detection

Zheng Chen, Yasuko Matsubara, Yasushi Sakurai et al.

AAAI 2025paperarXiv:2412.15598

citations

#6362

Relative Pose Estimation through Affine Corrections of Monocular Depth Priors

Yifan Yu, Shaohui Liu, Rémi Pautrat et al.

CVPR 2025highlightarXiv:2501.05446

citations

#6363

GPS: A Probabilistic Distributional Similarity with Gumbel Priors for Set-to-Set Matching

Ziming Zhang, Fangzhou Lin, Haotian Liu et al.

ICLR 2025oral

citations

#6364

Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map

Xinyuan Chang, Maixuan Xue, Xinran Liu et al.

CVPR 2025highlightarXiv:2410.23780

citations

#6365

SITE: towards Spatial Intelligence Thorough Evaluation

Wenqi Wang, Reuben Tan, Pengyue Zhu et al.

ICCV 2025arXiv:2505.05456

citations

#6366

Enhancing Language Model Agents using Diversity of Thoughts

Vijay Chandra Lingam, Behrooz Tehrani, sujay sanghavi et al.

ICLR 2025

citations

#6367

Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes

Haotian Wu, Gongpu Chen, Deniz Gunduz

ICLR 2025arXiv:2502.03335

citations

#6368

Learned Image Transmission with Hierarchical Variational Autoencoder

Guangyi Zhang, Hanlei Li, Yunlong Cai et al.

AAAI 2025paperarXiv:2408.16340

citations

#6369

Learning Distances from Data with Normalizing Flows and Score Matching

Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr et al.

ICML 2025arXiv:2407.09297

citations

#6370

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning

Siqi Luo, Haoran Yang, Yi Xin et al.

ICCV 2025arXiv:2507.22872

citations

#6371

Better autoregressive regression with LLMs via regression-aware fine-tuning

Michal Lukasik, Zhao Meng, Harikrishna Narasimhan et al.

ICLR 2025

citations

#6372

SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing

Yingying Zhang, Lixiang Ru, Kang Wu et al.

ICCV 2025arXiv:2507.13812

citations

#6373

TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting

Liangbin Xie, Daniil Pakhomov, Zhonghao Wang et al.

CVPR 2025arXiv:2504.00996

citations

#6374

Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation

Chenyu Zhang, Xu Chen, Xuan Di

ICLR 2025arXiv:2408.08192

citations

#6375

AdaDPCC: Adaptive Rate Control and Rate-Distortion-Complexity Optimization for Dynamic Point Cloud Compression

Chenhao Zhang, Wei Gao

AAAI 2025paperarXiv:2508.20741

citations

#6376

MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines

Dongzhi Jiang, Renrui Zhang, Ziyu Guo et al.

ICLR 2025

citations

#6377

Event-based Tiny Object Detection: A Benchmark Dataset and Baselines

Nuo Chen, Chao Xiao, Yimian Dai et al.

ICCV 2025arXiv:2506.23575

citations

#6378

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

Vitor Guizilini, Muhammad Zubair Irshad, Dian Chen et al.

CVPR 2025arXiv:2501.18804

citations

#6379

Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild

Damien Teney, Liangze Jiang, Florin Gogianu et al.

CVPR 2025arXiv:2503.10065

citations

#6380

Stochastic Process Learning via Operator Flow Matching

Yaozhong Shi, Zachary Ross, Domniki Asimaki et al.

NEURIPS 2025spotlightarXiv:2501.04126

citations

#6381

Agentic Plan Caching: Test-Time Memory for Fast and Cost-Efficient LLM Agents

Qizheng Zhang, Michael Wornow, Kunle Olukotun

NEURIPS 2025arXiv:2506.14852

citations

#6382

Exact Expressive Power of Transformers with Padding

Will Merrill, Ashish Sabharwal

NEURIPS 2025arXiv:2505.18948

citations

#6383

Object-centric Video Question Answering with Visual Grounding and Referring

Haochen Wang, Qirui Chen, Cilin Yan et al.

ICCV 2025arXiv:2507.19599

citations

#6384

StableCodec: Taming One-Step Diffusion for Extreme Image Compression

Tianyu Zhang, Xin Luo, Li Li et al.

ICCV 2025arXiv:2506.21977

citations

#6385

Implicit Neural Surface Deformation with Explicit Velocity Fields

Lu Sang, Zehranaz Canfes, Dongliang Cao et al.

ICLR 2025arXiv:2501.14038

citations

#6386

Reasoning Elicitation in Language Models via Counterfactual Feedback

Alihan Hüyük, Xinnuo Xu, Jacqueline Maasch et al.

ICLR 2025arXiv:2410.03767

citations

#6387

Discrete GCBF Proximal Policy Optimization for Multi-agent Safe Optimal Control

Songyuan Zhang, Oswin So, Mitchell Black et al.

ICLR 2025arXiv:2502.03640

citations

#6388

On the Robustness of Reward Models for Language Model Alignment

Jiwoo Hong, Noah Lee, Eunki Kim et al.

ICML 2025arXiv:2505.07271

citations

#6389

Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

Michael Kirchhof, James Thornton, Louis Béthune et al.

ICML 2025arXiv:2410.06025

citations

#6390

CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching

Leying Zhang, Yao Qian, Xiaofei Wang et al.

NEURIPS 2025arXiv:2506.00885

citations

#6391

Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding

Ziyao Wang, Muneeza Azmat, Ang Li et al.

ICML 2025arXiv:2502.08020

citations

#6392

Rethinking Fair Representation Learning for Performance-Sensitive Tasks

Charles Jones, Fabio De Sousa Ribeiro, Mélanie Roschewitz et al.

ICLR 2025arXiv:2410.04120

citations

#6393

GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion Generation

Shengyin Sun, Wenhao Yu, Yuxiang Ren et al.

AAAI 2025paperarXiv:2501.08001

citations

#6394

Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

Zichen Liu, Yihao Meng, Hao Ouyang et al.

ICCV 2025arXiv:2404.11614

citations

#6395

Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting

Yilun Zheng, Xiang Li, Sitao Luan et al.

ICLR 2025

citations

#6396

Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs

Xiaqiang Tang, Jian Li, Nan Du et al.

AAAI 2025paperarXiv:2412.07618

citations

#6397

SEAL: Semantic Attention Learning for Long Video Representation

Lan Wang, Yujia Chen, Wen-Sheng Chu et al.

CVPR 2025arXiv:2412.01798

citations

#6398

A Theory for Token-Level Harmonization in Retrieval-Augmented Generation

Shicheng Xu, Liang Pang, Huawei Shen et al.

ICLR 2025arXiv:2406.00944

citations

#6399

MotiF: Making Text Count in Image Animation with Motion Focal Loss

Shijie Wang, Samaneh Azadi, Rohit Girdhar et al.

CVPR 2025arXiv:2412.16153

citations

#6400

Scaling Laws for Task-Optimized Models of the Primate Visual Ventral Stream

Abdulkadir Gokce, Martin Schrimpf

ICML 2025oralarXiv:2411.05712

citations

← Previous

1...30 31 32 33 34...112