Most Cited 2025 "experiment design" Papers

22,274 papers found • Page 18 of 112

#3401

Learning Safety Constraints for Large Language Models

Xin Chen, Yarden As, Andreas Krause

ICML 2025 • spotlight • arXiv:2505.24445
7 citations
#3402

Learning from negative feedback, or positive feedback or both

Abbas Abdolmaleki, Bilal Piot, Bobak Shahriari et al.

ICLR 2025 • poster • arXiv:2410.04166
7 citations
#3403

Improved Finite-Particle Convergence Rates for Stein Variational Gradient Descent

Sayan Banerjee, Krishna Balasubramanian, Promit Ghosal

ICLR 2025 • poster • arXiv:2409.08469
7 citations
#3404

Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization

Chenbei Lu, Laixi Shi, Zaiwei Chen et al.

ICML 2025 • poster • arXiv:2411.07591
7 citations
#3405

LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living

Dominick Reilly, Rajatsubhra Chakraborty, Arkaprava Sinha et al.

CVPR 2025 • poster • arXiv:2406.09390
7 citations
#3406

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

Peihao Wang, Ruisi Cai, Yuehao Wang et al.

ICLR 2025 • poster • arXiv:2501.00658
7 citations
#3407

Revisiting Random Walks for Learning on Graphs

Jinwoo Kim, Olga Zaghen, Ayhan Suleymanzade et al.

ICLR 2025 • poster • arXiv:2407.01214
7 citations
#3408

3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations

Yating Wang, Xuan Wang, Ran Yi et al.

CVPR 2025 • poster • arXiv:2504.14967
7 citations
#3409

Training Language Models on Synthetic Edit Sequences Improves Code Synthesis

Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

ICLR 2025 • poster • arXiv:2410.02749
7 citations
#3410

Vision-centric Token Compression in Large Language Model

Ling Xing, Alex Jinpeng Wang, Rui Yan et al.

NEURIPS 2025 • spotlight • arXiv:2502.00791
7 citations
#3411

SMITE: Segment Me In TimE

Amirhossein Alimohammadi, Sauradip Nag, Saeid Asgari et al.

ICLR 2025 • poster • arXiv:2410.18538
7 citations
#3412

Among Us: A Sandbox for Measuring and Detecting Agentic Deception

Satvik Golechha, Adrià Garriga-Alonso

NEURIPS 2025 • spotlight • arXiv:2504.04072
7 citations
#3413

Towards Understanding Why Label Smoothing Degrades Selective Classification and How to Fix It

Guoxuan Xia, Olivier Laurent, Gianni Franchi et al.

ICLR 2025 • poster • arXiv:2403.14715
7 citations
#3414

Segment Any 3D Object with Language

Seungjun Lee, Yuyang Zhao, Gim H Lee

ICLR 2025 • poster • arXiv:2404.02157
7 citations
#3415

Boosting the visual interpretability of CLIP via adversarial fine-tuning

Shizhan Gong, Haoyu Lei, Qi Dou et al.

ICLR 2025 • poster
7 citations
#3416

ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting

Chengyou Jia, Changliang Xia, Zhuohang Dang et al.

CVPR 2025 • poster • arXiv:2411.17176
7 citations
#3417

Attributing Culture-Conditioned Generations to Pretraining Corpora

Huihan Li, Arnav Goel, Keyu He et al.

ICLR 2025 • poster • arXiv:2412.20760
7 citations
#3418

In-context Time Series Predictor

Jiecheng Lu, Yan Sun, Shihao Yang

ICLR 2025 • poster • arXiv:2405.14982
7 citations
#3419

ROICtrl: Boosting Instance Control for Visual Generation

Yuchao Gu, Yipin Zhou, Yunfan Ye et al.

CVPR 2025 • poster • arXiv:2411.17949
7 citations
#3420

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

Max Wilcoxson, Qiyang Li, Kevin Frans et al.

ICML 2025 • poster • arXiv:2410.18076
7 citations
#3421

Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen

Alessandro Palma, Till Richter, Hanyi Zhang et al.

ICLR 2025 • poster • arXiv:2407.11734
7 citations
#3422

ARM: Appearance Reconstruction Model for Relightable 3D Generation

Xiang Feng, Chang Yu, Zoubin Bi et al.

CVPR 2025 • highlight • arXiv:2411.10825
7 citations
#3423

Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos

Rundong Luo, Matthew Wallingford, Ali Farhadi et al.

ICCV 2025 • poster • arXiv:2504.07940
7 citations
#3424

FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation

Ariel Shaulov, Itay Hazan, Lior Wolf et al.

NEURIPS 2025 • oral • arXiv:2506.01144
7 citations
#3425

SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes

Yuji Wang, Haoran Xu, Yong Liu et al.

CVPR 2025 • poster • arXiv:2506.01558
7 citations
#3426

SplatFormer: Point Transformer for Robust 3D Gaussian Splatting

Yutong Chen, Marko Mihajlovic, Xiyi Chen et al.

ICLR 2025 • poster • arXiv:2411.06390
7 citations
#3427

POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality

Joey Wilson, Marcelino M. de Almeida, Sachit Mahajan et al.

CVPR 2025 • poster • arXiv:2503.07819
7 citations
#3428

AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Alexander Capstick, Rahul G. Krishnan, Payam Barnaghi

ICML 2025 • poster • arXiv:2411.17284
7 citations
#3429

Generating Freeform Endoskeletal Robots

Muhan Li, Lingji Kong, Sam Kriegman

ICLR 2025 • poster • arXiv:2412.01036
7 citations
#3430

Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models

Zhaoyi Liu, Huan Zhang

CVPR 2025 • poster • arXiv:2502.18290
7 citations
#3431

DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction

Rudy Morel, Jiequn Han, Edouard Oyallon

ICML 2025 • oral • arXiv:2504.19496
7 citations
#3432

DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation

Chun-Hung Wu, Shih-Hong Chen, Chih Yao Hu et al.

CVPR 2025 • poster • arXiv:2406.01591
7 citations
#3433

DebGCD: Debiased Learning with Distribution Guidance for Generalized Category Discovery

Yuanpei Liu, Kai Han

ICLR 2025 • poster • arXiv:2504.04804
7 citations
#3434

The Computer Vision Foundation

Yancheng Cai, Fei Yin, Dounia Hammou et al.

CVPR 2025 • arXiv:2502.20256
7 citations
#3435

Dynamic Updates for Language Adaptation in Visual-Language Tracking

Xiaohai Li, Bineng Zhong, Qihua Liang et al.

CVPR 2025 • poster • arXiv:2503.06621
7 citations
#3436

Symbolic regression via MDLformer-guided search: from minimizing prediction error to minimizing description length

Zihan Yu, Jingtao Ding, Yong Li et al.

ICLR 2025 • poster • arXiv:2411.03753
7 citations
#3437

Cross-modal Causal Relation Alignment for Video Question Grounding

Weixing Chen, Yang Liu, Binglin Chen et al.

CVPR 2025 • highlight • arXiv:2503.07635
7 citations
#3438

Evaluating Neuron Explanations: A Unified Framework with Sanity Checks

Tuomas Oikarinen, Ge Yan, Lily Weng

ICML 2025 • poster • arXiv:2506.05774
7 citations
#3439

Amortized Sampling with Transferable Normalizing Flows

Charlie Tan, Majdi Hassan, Leon Klein et al.

NEURIPS 2025 • poster • arXiv:2508.18175
7 citations
#3440

On the Transfer of Object-Centric Representation Learning

Aniket Rajiv Didolkar, Andrii Zadaianchuk, Anirudh Goyal et al.

ICLR 2025 • poster
7 citations
#3441

Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning

Yeoreum Lee, Jinwook Jung, Sungyong Baik

ICLR 2025 • poster • arXiv:2504.14662
7 citations
#3442

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Lunhao Duan, Shanshan Zhao, Wenjun Yan et al.

CVPR 2025 • poster • arXiv:2412.18928
7 citations
#3443

Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels

Pierre Vuillecard, Jean-Marc Odobez

CVPR 2025 • poster • arXiv:2502.20249
7 citations
#3444

Towards Stable and Storage-efficient Dataset Distillation: Matching Convexified Trajectory

Wenliang Zhong, Haoyu Tang, Qinghai Zheng et al.

CVPR 2025 • poster • arXiv:2406.19827
7 citations
#3445

VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning

Han Lin, Tushar Nagarajan, Nicolas Ballas et al.

ICLR 2025 • poster • arXiv:2410.03478
7 citations
#3446

Towards Generalizable Scene Change Detection

Jae-Woo Kim, Ue-Hwan Kim

CVPR 2025 • poster • arXiv:2409.06214
7 citations
#3447

Data-Juicer Sandbox: A Feedback-Driven Suite for Multimodal Data-Model Co-development

Daoyuan Chen, Haibin Wang, Yilun Huang et al.

ICML 2025 • spotlight • arXiv:2407.11784
7 citations
#3448

Distributional Associations vs In-Context Reasoning: A Study of Feed-forward and Attention Layers

Lei Chen, Joan Bruna, Alberto Bietti

ICLR 2025 • poster • arXiv:2406.03068
7 citations
#3449

CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment

Qinfeng Li, Tianyue Luo, Xuhong Zhang et al.

NEURIPS 2025 • poster • arXiv:2410.13903
7 citations
#3450

PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation

Pablo Lemos, Sammy Sharief, Nikolay Malkin et al.

ICLR 2025 • poster • arXiv:2402.04355
7 citations
#3451

CausalRivers - Scaling up benchmarking of causal discovery for real-world time-series

Gideon Stein, Maha Shadaydeh, Jan Blunk et al.

ICLR 2025 • oral • arXiv:2503.17452
7 citations
#3452

ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer

Jiayi Gao, Zijin Yin, Changcheng Hua et al.

CVPR 2025 • poster • arXiv:2504.02451
7 citations
#3453

Ringmaster ASGD: The First Asynchronous SGD with Optimal Time Complexity

Artavazd Maranjyan, Alexander Tyurin, Peter Richtarik

ICML 2025 • poster • arXiv:2501.16168
7 citations
#3454

Position: We Need An Algorithmic Understanding of Generative AI

Oliver Eberle, Thomas McGee, Hamza Giaffar et al.

ICML 2025 • spotlight • arXiv:2507.07544
7 citations
#3455

M3amba: Memory Mamba is All You Need for Whole Slide Image Classification

Tingting Zheng, Kui Jiang, Yi Xiao et al.

CVPR 2025 • poster
7 citations
#3456

ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping

Youxin Pang, Ruizhi Shao, Jiajun Zhang et al.

CVPR 2025 • highlight • arXiv:2412.16212
7 citations
#3457

CrossOver: 3D Scene Cross-Modal Alignment

Sayan Deb Sarkar, Ondrej Miksik, Marc Pollefeys et al.

CVPR 2025 • highlight • arXiv:2502.15011
7 citations
#3458

EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events

Shuoyan Wei, Feng Li, Shengeng Tang et al.

CVPR 2025 • highlight • arXiv:2505.04657
7 citations
#3459

Empowering LLM Agents with Zero-Shot Optimal Decision-Making through Q-learning

Jiajun Chai, Sicheng Li, Yuqian Fu et al.

ICLR 2025 • poster
7 citations
#3460

GraphLand: Evaluating Graph Machine Learning Models on Diverse Industrial Data

Gleb Bazhenov, Oleg Platonov, Liudmila Prokhorenkova

NEURIPS 2025 • oral • arXiv:2409.14500
7 citations
#3461

Conformal Linguistic Calibration: Trading-off between Factuality and Specificity

Zhengping Jiang, Anqi Liu, Ben Van Durme

NEURIPS 2025 • poster • arXiv:2502.19110
7 citations
#3462

Valid Conformal Prediction for Dynamic GNNs

Ed Davis, Ian Gallagher, Daniel Lawson et al.

ICLR 2025 • poster • arXiv:2405.19230
7 citations
#3463

System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts

Xiaoqiang Wang, Suyuchen Wang, Yun Zhu et al.

NEURIPS 2025 • poster • arXiv:2505.18962
7 citations
#3464

Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models

Bingdong Li, Zixiang Di, Yongfan Lu et al.

AAAI 2025 • paper • arXiv:2405.08674
7 citations
#3465

Decision Tree Induction Through LLMs via Semantically-Aware Evolution

Tennison Liu, Nicolas Huynh, Mihaela van der Schaar

ICLR 2025 • poster • arXiv:2503.14217
7 citations
#3466

COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training

Sanghwan Kim, Rui Xiao, Iuliana Georgescu et al.

CVPR 2025 • poster • arXiv:2412.01814
7 citations
#3467

Scene Map-based Prompt Tuning for Navigation Instruction Generation

Sheng Fan, Rui Liu, Wenguan Wang et al.

CVPR 2025 • poster
7 citations
#3468

Mitigating Social Bias in Large Language Models: A Multi-Objective Approach Within a Multi-Agent Framework

Zhenjie Xu, Wenqing Chen, Yi Tang et al.

AAAI 2025 • paper • arXiv:2412.15504
7 citations
#3469

Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning

Jaehyeon Son, Soochan Lee, Gunhee Kim

ICLR 2025 • poster • arXiv:2502.19009
7 citations
#3470

DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models

Hyogon Ryu, NaHyeon Park, Hyunjung Shim

ICLR 2025 • poster • arXiv:2501.04304
7 citations
#3471

A Comprehensive Evaluation on Event Reasoning of Large Language Models

Zhengwei Tao, Zhi Jin, Yifan Zhang et al.

AAAI 2025 • paper • arXiv:2404.17513
7 citations
#3472

Dynamic-Width Speculative Beam Decoding for LLM Inference

Zongyue Qin, Zifan He, Neha Prakriya et al.

AAAI 2025 • paper
7 citations
#3473

Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference

Nadav Timor, Jonathan Mamou, Daniel Korat et al.

ICLR 2025 • poster • arXiv:2405.14105
7 citations
#3474

Gradient-Guided Annealing for Domain Generalization

Aristotelis Ballas, Christos Diou

CVPR 2025 • highlight • arXiv:2502.20162
7 citations
#3475

Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment

Jinhao Jiang, Junyi Li, Xin Zhao et al.

ICLR 2025 • poster • arXiv:2407.10804
7 citations
#3476

Adaptive Draft-Verification for Efficient Large Language Model Decoding

Xukun Liu, Bowen Lei, Ruqi Zhang et al.

AAAI 2025 • paper • arXiv:2407.12021
7 citations
#3477

SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering

Xiaopeng Li, Shasha Li, Shezheng Song et al.

AAAI 2025 • paper • arXiv:2401.17809
7 citations
#3478

ShEPhERD: Diffusing shape, electrostatics, and pharmacophores for bioisosteric drug design

Keir Adams, Kento Abeywardane, Jenna Fromer et al.

ICLR 2025 • poster • arXiv:2411.04130
7 citations
#3479

Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph

Xujian Liang, Zhaoquan Gu

AAAI 2025 • paper • arXiv:2501.14300
7 citations
#3480

S4M: S4 for multivariate time series forecasting with Missing values

Jing Peng, Meiqi Yang, Qiong Zhang et al.

ICLR 2025 • oral • arXiv:2503.00900
7 citations
#3481

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

Yaxin Luo, Zhaoyi Li, Jiacheng Liu et al.

NEURIPS 2025 • poster • arXiv:2505.24878
7 citations
#3482

Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization

Zefeng Zhang, Hengzhu Tang, Jiawei Sheng et al.

CVPR 2025 • poster • arXiv:2503.17928
7 citations
#3483

Against All Odds: Overcoming Typology, Script, and Language Confusion in Multilingual Embedding Inversion Attacks

Yiyi Chen, Russa Biswas, Heather Lent et al.

AAAI 2025 • paper • arXiv:2408.11749
7 citations
#3484

A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

Liang Chen, Sinan Tan, Zefan Cai et al.

ICLR 2025 • poster • arXiv:2410.01912
7 citations
#3485

The Bandit Whisperer: Communication Learning for Restless Bandits

Yunfan Zhao, Tonghan Wang, Dheeraj Mysore Nagaraj et al.

AAAI 2025 • paper • arXiv:2408.05686
7 citations
#3486

DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models

Radu Alexandru Rosu, Keyu Wu, Yao Feng et al.

CVPR 2025 • poster • arXiv:2505.06166
7 citations
#3487

CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception

Senkang Hu, Yihang Tao, Guowen Xu et al.

AAAI 2025 • paper • arXiv:2412.12000
7 citations
#3488

VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models

Chi-Pin Huang, Yen-Siang Wu, Hung-Kai Chung et al.

CVPR 2025 • poster • arXiv:2503.21781
7 citations
#3489

ESE: Espresso Sentence Embeddings

Xianming Li, Zongxi Li, Jing Li et al.

ICLR 2025 • poster
7 citations
#3490

Factor Augmented Tensor-on-Tensor Neural Networks

Guanhao Zhou, Yuefeng Han, Xiufan Yu

AAAI 2025 • paper • arXiv:2405.19610
7 citations
#3491

ModeSeq: Taming Sparse Multimodal Motion Prediction with Sequential Mode Modeling

Zikang Zhou, Hengjian Zhou, Haibo Hu et al.

CVPR 2025 • poster • arXiv:2411.11911
7 citations
#3492

Transformers Learn Low Sensitivity Functions: Investigations and Implications

Bhavya Vasudeva, Deqing Fu, Tianyi Zhou et al.

ICLR 2025 • poster • arXiv:2403.06925
7 citations
#3493

HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization

Zitang Zhou, Ke Mei, Yu Lu et al.

CVPR 2025 • poster • arXiv:2503.01725
7 citations
#3494

Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations

Xiang Xu, Lingdong Kong, Song Wang et al.

ICCV 2025 • poster • arXiv:2507.05260
7 citations
#3495

Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

Zichen Liu, Yihao Meng, Hao Ouyang et al.

ICCV 2025 • poster • arXiv:2404.11614
7 citations
#3496

Second Order Bounds for Contextual Bandits with Function Approximation

Aldo Pacchiano

ICLR 2025 • poster • arXiv:2409.16197
7 citations
#3497

Extrapolated Urban View Synthesis Benchmark

Xiangyu Han, Zhen Jia, Boyi Li et al.

ICCV 2025 • poster • arXiv:2412.05256
7 citations
#3498

Sculpting Features from Noise: Reward-Guided Hierarchical Diffusion for Task-Optimal Feature Transformation

Nanxu Gong, Zijun Li, Sixun Dong et al.

NEURIPS 2025 • poster • arXiv:2505.15152
7 citations
#3499

Error Bounds for Gaussian Process Regression Under Bounded Support Noise with Applications to Safety Certification

Robert Reed, Luca Laurenti, Morteza Lahijanian

AAAI 2025 • paper • arXiv:2408.09033
7 citations
#3500

RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors

Avinash Paliwal, Xilong Zhou, Wei Ye et al.

ICCV 2025 • poster • arXiv:2503.10860
7 citations
#3501

A Generalist Intracortical Motor Decoder

Joel Ye, Fabio Rizzoglio, Xuan Ma et al.

NEURIPS 2025 • poster
7 citations
#3502

Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs

Youyi Zhan, Tianjia Shao, Yin Yang et al.

CVPR 2025 • highlight • arXiv:2504.12909
7 citations
#3503

Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences

Hyojin Bahng, Caroline Chan, Fredo Durand et al.

ICCV 2025 • poster • arXiv:2506.02095
7 citations
#3504

BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation

Haotian Peng, Jiawei Liu, Jinsong Du et al.

AAAI 2025 • paper • arXiv:2408.11281
7 citations
#3505

Continual Learning Using a Kernel-Based Method Over Foundation Models

Saleh Momeni, Sahisnu Mazumder, Bing Liu

AAAI 2025 • paper • arXiv:2412.15571
7 citations
#3506

SEMU: Singular Value Decomposition for Efficient Machine Unlearning

Marcin Sendera, Łukasz Struski, Kamil Książek et al.

ICML 2025 • poster • arXiv:2502.07587
7 citations
#3507

Error Feedback under $(L_0,L_1)$-Smoothness: Normalization and Momentum

Sarit Khirirat, Abdurakhmon Sadiev, Artem Riabinin et al.

NEURIPS 2025 • poster • arXiv:2410.16871
7 citations
#3508

Progress-Aware Video Frame Captioning

Zihui Xue, Joungbin An, Xitong Yang et al.

CVPR 2025 • poster • arXiv:2412.02071
7 citations
#3509

Training-Free Text-Guided Image Editing with Visual Autoregressive Model

Yufei Wang, Lanqing Guo, Zhihao Li et al.

ICCV 2025 • poster • arXiv:2503.23897
7 citations
#3510

On the Expressiveness and Length Generalization of Selective State Space Models on Regular Languages

Aleksandar Terzic, Michael Hersche, Giacomo Camposampiero et al.

AAAI 2025 • paper
7 citations
#3511

BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models

Xingyu Zheng, Xianglong Liu, Haotong Qin et al.

ICLR 2025 • poster • arXiv:2404.05662
7 citations
#3512

CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning

Qiwei Li, Jiahuan Zhou

AAAI 2025 • paper • arXiv:2412.08929
7 citations
#3513

Destroy and Repair Using Hyper-Graphs for Routing

Ke Li, Fei Liu, Zhenkun Wang et al.

AAAI 2025 • paper • arXiv:2502.16170
7 citations
#3514

NeSyC: A Neuro-symbolic Continual Learner For Complex Embodied Tasks in Open Domains

Wonje Choi, Jinwoo Park, Sanghyun Ahn et al.

ICLR 2025 • poster • arXiv:2503.00870
7 citations
#3515

AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding

Xue Zhucun, Jiangning Zhang, Xie Xurong et al.

NEURIPS 2025 • poster • arXiv:2506.13589
7 citations
#3516

FlexSelect: Flexible Token Selection for Efficient Long Video Understanding

Yunzhu Zhang, Yu Lu, Tianyi Wang et al.

NEURIPS 2025 • oral • arXiv:2506.00993
7 citations
#3517

Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs

Xuannan Liu, Zekun Li, Zheqi He et al.

NEURIPS 2025 • oral • arXiv:2505.11842
7 citations
#3518

DIVE: Taming DINO for Subject-Driven Video Editing

Yi Huang, Wei Xiong, He Zhang et al.

ICCV 2025 • poster • arXiv:2412.03347
7 citations
#3519

CTSyn: A Foundation Model for Cross Tabular Data Generation

Xiaofeng Lin, Chenheng Xu, Matthew Yang et al.

ICLR 2025 • poster • arXiv:2406.04619
7 citations
#3520

HQGS: High-Quality Novel View Synthesis with Gaussian Splatting in Degraded Scenes

Xin Lin, Shi Luo, Xiaojun Shan et al.

ICLR 2025 • poster
7 citations
#3521

Generative RLHF-V: Learning Principles from Multi-modal Human Preference

Jiayi Zhou, Jiaming Ji, Boyuan Chen et al.

NEURIPS 2025 • poster • arXiv:2505.18531
7 citations
#3522

Beyond Verifiable Rewards: Scaling Reinforcement Learning in Language Models to Unverifiable Data

Yunhao Tang, Sid Wang, Lovish Madaan et al.

NEURIPS 2025 • poster • arXiv:2503.19618
7 citations
#3523

Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning

Jingjing Jiang, Chao Ma, Xurui Song et al.

ICCV 2025 • highlight • arXiv:2507.07424
7 citations
#3524

MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity

Kanghyun Choi, Hyeyoon Lee, Dain Kwon et al.

AAAI 2025 • paper • arXiv:2407.20021
7 citations
#3525

Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment

Ziteng Cui, Xuangeng Chu, Tatsuya Harada

CVPR 2025 • poster • arXiv:2504.01503
7 citations
#3526

Capture Global Feature Statistics for One-Shot Federated Learning

Zenghao Guan, Yucan Zhou, Xiaoyan Gu

AAAI 2025 • paper • arXiv:2503.06962
7 citations
#3527

Stable Port-Hamiltonian Neural Networks

Fabian J. Roth, Dominik K. Klein, Maximilian Kannapinn et al.

NEURIPS 2025 • poster • arXiv:2502.02480
7 citations
#3528

Don't Just Chase “Highlighted Tokens” in MLLMs: Revisiting Visual Holistic Context Retention

Xin Zou, Di Lu, Yizhou Wang et al.

NEURIPS 2025 • poster • arXiv:2510.02912
7 citations
#3529

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Ruichen Shao, Bei Li, Gangao Liu et al.

ICLR 2025 • oral • arXiv:2502.14340
7 citations
#3530

Object-centric binding in Contrastive Language-Image Pretraining

Rim Assouel, Pietro Astolfi, Florian Bordes et al.

NEURIPS 2025 • poster • arXiv:2502.14113
7 citations
#3531

MUNBa: Machine Unlearning via Nash Bargaining

Jing Wu, Mehrtash Harandi

ICCV 2025 • poster • arXiv:2411.15537
7 citations
#3532

Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning

Siyuan Li, Feifan Liu, Lingfei Cui et al.

AAAI 2025 • paper • arXiv:2411.06920
7 citations
#3533

Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models

Kartik Thakral, Tamar Glaser, Tal Hassner et al.

CVPR 2025 • poster • arXiv:2503.19783
7 citations
#3534

DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding

Weihao Xuan, Junjue Wang, Heli Qi et al.

NEURIPS 2025 • oral • arXiv:2505.21076
7 citations
#3535

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

Hongrui Jia, Chaoya Jiang, Haiyang Xu et al.

CVPR 2025 • poster • arXiv:2411.11909
7 citations
#3536

Stochastic Online Instrumental Variable Regression: Regrets for Endogeneity and Bandit Feedback

Riccardo Della Vecchia, Debabrota Basu

AAAI 2025 • paper • arXiv:2302.09357
7 citations
#3537

Int2Planner: An Intention-based Multi-modal Motion Planner for Integrated Prediction and Planning

Xiaolei Chen, Junchi Yan, Wenlong Liao et al.

AAAI 2025 • paper • arXiv:2501.12799
7 citations
#3538

Forward KL Regularized Preference Optimization for Aligning Diffusion Policies

Zhao Shan, Chenyou Fan, Shuang Qiu et al.

AAAI 2025 • paper • arXiv:2409.05622
7 citations
#3539

Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs

Zeyi Huang, Yuyang Ji, Xiaofang Wang et al.

CVPR 2025 • poster • arXiv:2501.04336
7 citations
#3540

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

Rohit Gandikota, Zongze Wu, Richard Zhang et al.

ICCV 2025 • poster • arXiv:2502.01639
7 citations
#3541

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Jie Cheng, Ruixi Qiao, Ma Yingwei et al.

ICLR 2025 • oral • arXiv:2410.00564
7 citations
#3542

A General Adaptive Dual-level Weighting Mechanism for Remote Sensing Pansharpening

Jie Huang, Haorui Chen, Jiaxuan Ren et al.

CVPR 2025 • poster • arXiv:2503.13214
7 citations
#3543

Erase Then Rectify: A Training-Free Parameter Editing Approach for Cost-Effective Graph Unlearning

Zhe-Rui Yang, Jindong Han, Chang-Dong Wang et al.

AAAI 2025 • paper • arXiv:2409.16684
7 citations
#3544

DiC: Rethinking Conv3x3 Designs in Diffusion Models

Yuchuan Tian, Jing Han, Chengcheng Wang et al.

CVPR 2025 • poster • arXiv:2501.00603
7 citations
#3545

LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

Wanhua Li, Yujie Zhao, Minghan Qin et al.

NEURIPS 2025 • poster • arXiv:2507.07136
7 citations
#3546

InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation

Sirui Xu, Dongting Li, Yucheng Zhang et al.

CVPR 2025 • poster • arXiv:2509.09555
7 citations
#3547

Depth-Bounds for Neural Networks via the Braid Arrangement

Moritz Grillo, Christoph Hertrich, Georg Loho

NEURIPS 2025 • oral • arXiv:2502.09324
7 citations
#3548

Better autoregressive regression with LLMs via regression-aware fine-tuning

Michal Lukasik, Zhao Meng, Harikrishna Narasimhan et al.

ICLR 2025 • poster
7 citations
#3549

Poison-splat: Computation Cost Attack on 3D Gaussian Splatting

Jiahao Lu, Yifan Zhang, Qiuhong Shen et al.

ICLR 2025 • poster • arXiv:2410.08190
7 citations
#3550

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Hongbo Liu, Jingwen He, Yi Jin et al.

NEURIPS 2025 • poster • arXiv:2506.21356
7 citations
#3551

ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering

Yuki Imajuku, Kohki Horie, Yoichi Iwata et al.

NEURIPS 2025 • poster • arXiv:2506.09050
7 citations
#3552

Long-Term EEG Partitioning for Seizure Onset Detection

Zheng Chen, Yasuko Matsubara, Yasushi Sakurai et al.

AAAI 2025 • paper • arXiv:2412.15598
7 citations
#3553

Enhancing 3D Reconstruction for Dynamic Scenes

Jisang Han, Honggyu An, Jaewoo Jung et al.

NEURIPS 2025 • oral • arXiv:2504.06264
7 citations
#3554

Adaptive Part Learning for Fine-Grained Generalized Category Discovery: A Plug-and-Play Enhancement

Qiyuan Dai, Hanzhuo Huang, Yu Wu et al.

CVPR 2025 • poster • arXiv:2507.06928
7 citations
#3555

Contextual AD Narration with Interleaved Multimodal Sequence

Hanlin Wang, Zhan Tong, Kecheng Zheng et al.

CVPR 2025 • poster • arXiv:2403.12922
7 citations
#3556

Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection

Yingwen Wu, Ruiji Yu, Xinwen Cheng et al.

ICLR 2025 • poster • arXiv:2405.17816
7 citations
#3557

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

Mengchen Zhang, Tong Wu, Jing Tan et al.

ICCV 2025 • poster • arXiv:2504.07083
7 citations
#3558

Agentic Plan Caching: Test-Time Memory for Fast and Cost-Efficient LLM Agents

Qizheng Zhang, Michael Wornow, Kunle Olukotun

NEURIPS 2025 • poster • arXiv:2506.14852
7 citations
#3559

A Simple yet Effective Layout Token in Large Language Models for Document Understanding

Zhaoqing Zhu, Chuwei Luo, Zirui Shao et al.

CVPR 2025 • poster • arXiv:2503.18434
7 citations
#3560

Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Jingjing Ren, Wenbo Li, Zhongdao Wang et al.

ICCV 2025 • poster • arXiv:2504.14470
7 citations
#3561

DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry

Jing Li, Yihang Fu, Falai Chen

CVPR 2025 • poster • arXiv:2503.13110
7 citations
#3562

Emergence and Evolution of Interpretable Concepts in Diffusion Models

Berk Tinaz, Zalan Fabian, Mahdi Soltanolkotabi

NEURIPS 2025 • spotlight • arXiv:2504.15473
7 citations
#3563

Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map

Xinyuan Chang, Maixuan Xue, Xinran Liu et al.

CVPR 2025 • highlight • arXiv:2410.23780
7 citations
#3564

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

Vitor Guizilini, Muhammad Zubair Irshad, Dian Chen et al.

CVPR 2025 • poster • arXiv:2501.18804
7 citations
#3565

Directional Gradient Projection for Robust Fine-Tuning of Foundation Models

Chengyue Huang, Junjiao Tian, Brisa Maneechotesuwan et al.

ICLR 2025 • poster • arXiv:2502.15895
7 citations
#3566

Locality in Image Diffusion Models Emerges from Data Statistics

Artem Lukoianov, Chenyang Yuan, Justin Solomon et al.

NEURIPS 2025 • spotlight • arXiv:2509.09672
7 citations
#3567

HAIF-GS: Hierarchical and Induced Flow-Guided Gaussian Splatting for Dynamic Scene

Jianing Chen, Zehao Li, Yujun Cai et al.

NEURIPS 2025 • oral • arXiv:2506.09518
7 citations
#3568

LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh

Jing Wen, Alex Schwing, Shenlong Wang

ICLR 2025 • poster • arXiv:2502.09617
7 citations
#3569

SSL-STMFormer: Self-Supervised Learning Spatio-Temporal Entanglement Transformer for Traffic Flow Prediction

Zetao Li, Zheng Hu, Peng Han et al.

AAAI 2025 • paper
7 citations
#3570

DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

Jungbin Cho, Junwan Kim, Jisoo Kim et al.

ICCV 2025 • highlight • arXiv:2411.19527
7 citations
#3571

Mitra: Mixed Synthetic Priors for Enhancing Tabular Foundation Models

Xiyuan Zhang, Danielle Maddix Robinson, Junming Yin et al.

NEURIPS 2025 • poster • arXiv:2510.21204
7 citations
#3572

VORTA: Efficient Video Diffusion via Routing Sparse Attention

Wenhao Sun, Rong-Cheng Tu, Yifu Ding et al.

NEURIPS 2025 • poster • arXiv:2505.18809
7 citations
#3573

GauSTAR: Gaussian Surface Tracking and Reconstruction

Chengwei Zheng, Lixin Xue, Juan Jose Zarate et al.

CVPR 2025 • poster • arXiv:2501.10283
7 citations
#3574

Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators

Wentao Zhang, Junliang Guo, Tianyu He et al.

ICLR 2025 • poster • arXiv:2407.07356
7 citations
#3575

Hyperbolic Category Discovery

Yuanpei Liu, Zhenqi He, Kai Han

CVPR 2025 • poster • arXiv:2504.06120
7 citations
#3576

Privacy amplification by random allocation

Moshe Shenfeld, Vitaly Feldman

NEURIPS 2025 • spotlight • arXiv:2502.08202
7 citations
#3577

SnapMoGen: Human Motion Generation from Expressive Texts

Chuan Guo, Inwoo Hwang, Jian Wang et al.

NEURIPS 2025 • oral • arXiv:2507.09122
7 citations
#3578

Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization

Yamato Arai, Yuma Ichikawa

NEURIPS 2025 • poster • arXiv:2504.09629
7 citations
#3579

MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models

Hengzhi Li, Megan Tjandrasuwita, Yi R. (May) Fung et al.

NEURIPS 2025 • poster • arXiv:2502.16671
7 citations
#3580

Learning Bijective Surface Parameterization for Inferring Signed Distance Functions from Sparse Point Clouds with Grid Deformation

Takeshi Noda, Chao Chen, Junsheng Zhou et al.

CVPR 2025 • poster • arXiv:2503.23670
7 citations
#3581

Details Enhancement in Unsigned Distance Field Learning for High-fidelity 3D Surface Reconstruction

Cheng Xu, Fei Hou, Wencheng Wang et al.

AAAI 2025 • paper • arXiv:2406.00346
7 citations
#3582

FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification

Zhengrui Guo, Conghao Xiong, Jiabo Ma et al.

CVPR 2025 • poster • arXiv:2411.14743
7 citations
#3583

From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring

Yang Li, Qiang Sheng, Yehan Yang et al.

NEURIPS 2025 • poster • arXiv:2506.09996
7 citations
#3584

CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation

Yuxuan Wang, Yijun Liu, Fei Yu et al.

AAAI 2025 • paper • arXiv:2407.01081
7 citations
#3585

Generative Pre-trained Autoregressive Diffusion Transformer

Yuan Zhang, Jiacheng Jiang, Guoqing Ma et al.

NEURIPS 2025 • poster • arXiv:2505.07344
7 citations
#3586

FlashMD: long-stride, universal prediction of molecular dynamics

Filippo Bigi, Sanggyu Chong, Agustinus Kristiadi et al.

NEURIPS 2025 • spotlight • arXiv:2505.19350
7 citations
#3587

Monocular and Generalizable Gaussian Talking Head Animation

Shengjie Gong, Haojie Li, Jiapeng Tang et al.

CVPR 2025 • poster • arXiv:2504.00665
7 citations
#3588

HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation

Trong-Thuan Nguyen, Pha Nguyen, Jackson Cothren et al.

CVPR 2025 • poster • arXiv:2411.18042
7 citations
#3589

Online Experimental Design With Estimation-Regret Trade-off Under Network Interference

Zhiheng Zhang, Zichen Wang

NEURIPS 2025 • poster • arXiv:2412.03727
7 citations
#3590

Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs

Shuo Li, Tao Ji, Xiaoran Fan et al.

ICLR 2025 • poster • arXiv:2410.11302
7 citations
#3591

Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation

Xie Tianyidan, Rui Ma, Qian Wang et al.

AAAI 2025 • paper • arXiv:2404.18598
7 citations
#3592

HAMoBE: Hierarchical and Adaptive Mixture of Biometric Experts for Video-based Person ReID

Yiyang Su, Yunping Shi, Feng Liu et al.

ICCV 2025 • poster • arXiv:2508.05038
7 citations
#3593

DiffGrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model

Yonghao Zhang, Qiang He, Yanguang Wan et al.

AAAI 2025 • paper • arXiv:2412.20657
7 citations
#3594

Panorama Generation From NFoV Image Done Right

Dian Zheng, Cheng Zhang, Xiao-Ming Wu et al.

CVPR 2025 • highlight • arXiv:2503.18420
7 citations
#3595

Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution

Wentao Tan, Qiong Cao, Yibing Zhan et al.

AAAI 2025 • paper • arXiv:2412.15650
7 citations
#3596

Relieving Universal Label Noise for Unsupervised Visible-Infrared Person Re-Identification by Inferring from Neighbors

Xiao Teng, Long Lan, Dingyao Chen et al.

AAAI 2025 • paper • arXiv:2412.12220
7 citations
#3597

Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction

Quan Zhang, Yuxin Qi, Xi Tang et al.

AAAI 2025 • paper • arXiv:2501.11124
7 citations
#3598

Hand1000: Generating Realistic Hands from Text with Only 1,000 Images

Haozhuo Zhang, Bin Zhu, Yu Cao et al.

AAAI 2025 • paper • arXiv:2408.15461
7 citations
#3599

InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing

Jinlu Zhang, Yixin Chen, Zan Wang et al.

CVPR 2025 • highlight • arXiv:2505.24315
7 citations
#3600

Activation-Informed Merging of Large Language Models

Amin Heyrani Nobari, Kaveh Alimohammadi, Ali ArjomandBigdeli et al.

NEURIPS 2025 • poster • arXiv:2502.02421
7 citations