Most Cited 2025 "key-value pair reuse" Papers

22,274 papers found • Page 22 of 112

#4201

QCS: Feature Refining from Quadruplet Cross Similarity for Facial Expression Recognition

Chengpeng Wang, Li Chen, Lili Wang et al.

AAAI 2025 • paper • arXiv:2411.01988

#4202

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

Enshu Liu, Junyi Zhu, Zinan Lin et al.

ICLR 2025 • poster • arXiv:2404.02241

#4203

LotteryCodec: Searching the Implicit Representation in a Random Network for Low-Complexity Image Compression

Haotian Wu, Gongpu Chen, Pier Luigi Dragotti et al.

ICML 2025 • spotlight • arXiv:2507.01204

#4204

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models

Kim Sung-Bin, Jeongsoo Choi, Puyuan Peng et al.

ICCV 2025 • poster • arXiv:2504.02386

#4205

EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Kaizhi Zheng, Xiaotong Chen, Xuehai He et al.

ICLR 2025 • poster • arXiv:2410.12836

#4206

Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection

Hongru Yan, Yu Zheng, Yueqi Duan

ICLR 2025 • poster • arXiv:2410.01404

#4207

BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals

Qinfan Xiao, Ziyun Cui, Chi Zhang et al.

NEURIPS 2025 • oral • arXiv:2505.18185

#4208

Learning-Augmented Search Data Structures

Chunkai Fu, Brandon G. Nguyen, Jung Seo et al.

ICLR 2025 • poster • arXiv:2402.10457

#4209

Reference-Based 3D-Aware Image Editing with Triplanes

Bahri Batuhan Bilecen, Yiğit Yalın, Ning Yu et al.

CVPR 2025 • highlight • arXiv:2404.03632

#4210

VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment

Shangkun Sun, Xiaoyu Liang, Songlin Fan et al.

AAAI 2025 • paper • arXiv:2408.11481

#4211

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

Xuanming Zhang, Yuxuan Chen, Samuel (Min-Hsuan) Yeh et al.

NEURIPS 2025 • oral • arXiv:2505.18943

#4212

Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation Models

Daoyuan Chen, Yilun Huang, Xuchen Pan et al.

NEURIPS 2025 • spotlight • arXiv:2501.14755

#4213

HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning

Zhi Jing, Siyuan Yang, Jicong Ao et al.

NEURIPS 2025 • poster • arXiv:2507.00833

#4214

StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces

Kyeongmin Yeo, Jaihoon Kim, Minhyuk Sung

ICLR 2025 • poster • arXiv:2501.15445

#4215

Precedence-Constrained Winter Value for Effective Graph Data Valuation

Hongliang Chi, Wei Jin, Charu Aggarwal et al.

ICLR 2025 • poster • arXiv:2402.01943

#4216

Týr-the-Pruner: Structural Pruning LLMs via Global Sparsity Distribution Optimization

Guanchen Li, Yixing Xu, Zeping Li et al.

NEURIPS 2025 • poster • arXiv:2503.09657

#4217

Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection

Hongsong Wang, Andi Xu, Pinle Ding et al.

AAAI 2025 • paper • arXiv:2412.17210

#4218

LDP: Generalizing to Multilingual Visual Information Extraction by Language Decoupled Pretraining

Huawen Shen, Gengluo Li, Jinwen Zhong et al.

AAAI 2025 • paper • arXiv:2412.14596

#4219

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data

Yufan Shen, Chuwei Luo, Zhaoqing Zhu et al.

AAAI 2025 • paper • arXiv:2407.12358

#4220

Student-Informed Teacher Training

Nico Messikommer, Jiaxu Xing, Elie Aljalbout et al.

ICLR 2025 • poster • arXiv:2412.09149

#4221

Hearing Anywhere in Any Environment

Xiulong Liu, Anurag Kumar, Paul Calamia et al.

CVPR 2025 • poster • arXiv:2504.10746

#4222

TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

Yunheng Li, Jing Cheng, Shaoyong Jia et al.

NEURIPS 2025 • oral • arXiv:2509.18056

#4223

ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding

Guangda Ji, Silvan Weder, Francis Engelmann et al.

CVPR 2025 • poster • arXiv:2410.13924

#4224

DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation

Jiashuo Sun, Xianrui Zhong, Sizhe Zhou et al.

NEURIPS 2025 • poster • arXiv:2505.07233

#4225

StreamForest: Efficient Online Video Understanding with Persistent Event Memory

Xiangyu Zeng, Kefan Qiu, Qingyu Zhang et al.

NEURIPS 2025 • oral • arXiv:2509.24871

#4226

Dense SAE Latents Are Features, Not Bugs

Xiaoqing Sun, Alessandro Stolfo, Joshua Engels et al.

NEURIPS 2025 • poster • arXiv:2506.15679

#4227

Learning Distances from Data with Normalizing Flows and Score Matching

Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr et al.

ICML 2025 • poster • arXiv:2407.09297

#4228

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025 • spotlight • arXiv:2502.18147

#4229

Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework

Jian-Jian Jiang, Xiao-Ming Wu, Yi-Xiang He et al.

ICCV 2025 • poster • arXiv:2503.09186

#4230

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Tomas Soucek, Prajwal Gatti, Michael Wray et al.

CVPR 2025 • poster • arXiv:2412.01987

#4231

Language-Guided Audio-Visual Learning for Long-Term Sports Assessment

Huangbiao Xu, Xiao Ke, Huanqi Wu et al.

CVPR 2025 • poster

#4232

ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish

Jan-Matthis Lueckmann, Alexander Immer, Alex Chen et al.

ICLR 2025 • poster • arXiv:2503.02618

#4233

Spreading Out-of-Distribution Detection on Graphs

Daeho Um, Jongin Lim, Sunoh Kim et al.

ICLR 2025 • poster

#4234

EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation

Hongwei Niu, Jie Hu, Jianghang Lin et al.

AAAI 2025 • paper • arXiv:2412.08628

#4235

Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation

Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.

AAAI 2025 • paper • arXiv:2412.01857

#4236

Denoising Functional Maps: Diffusion Models for Shape Correspondence

Aleksei Zhuravlev, Zorah Lähner, Vladislav Golyanik

CVPR 2025 • poster • arXiv:2503.01845

#4237

Parameter Efficient Fine-tuning via Explained Variance Adaptation

Fabian Paischer, Lukas Hauzenberger, Thomas Schmied et al.

NEURIPS 2025 • poster • arXiv:2410.07170

#4238

Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification

Yucong Meng, Zhiwei Yang, Yonghong Shi et al.

AAAI 2025 • paper • arXiv:2412.10776

#4239

Binarized Mamba-Transformer for Lightweight Quad Bayer HybridEVS Demosaicing

Shiyang Zhou, Haijin Zeng, Yunfan Lu et al.

CVPR 2025 • poster • arXiv:2503.16134

#4240

NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

Lingen Li, Zhaoyang Zhang, Yaowei Li et al.

CVPR 2025 • poster • arXiv:2412.03517

#4241

Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer

Haopeng Sun, Yingwei Zhang, Lumin Xu et al.

AAAI 2025 • paper • arXiv:2412.10181

#4242

Functionality Understanding and Segmentation in 3D Scenes

Jaime Corsetti, Francesco Giuliari, Alice Fasoli et al.

CVPR 2025 • highlight • arXiv:2411.16310

#4243

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

Yudong Jin, Sida Peng, Xuan Wang et al.

ICCV 2025 • poster • arXiv:2507.13344

#4244

Decouple and Track: Benchmarking and Improving Video Diffusion Transformers For Motion Transfer

Qingyu Shi, Jianzong Wu, Jinbin Bai et al.

ICCV 2025 • poster • arXiv:2503.17350

#4245

SpotActor: Training-Free Layout-Controlled Consistent Image Generation

Jiahao Wang, Caixia Yan, Weizhan Zhang et al.

AAAI 2025 • paper • arXiv:2409.04801

#4246

Language Models over Canonical Byte-Pair Encodings

Tim Vieira, Tianyu Liu, Clemente Pasti et al.

ICML 2025 • poster • arXiv:2506.07956

#4247

DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy

Yuran Wang, Ruihai Wu, Yue Chen et al.

NEURIPS 2025 • spotlight • arXiv:2505.11032

#4248

SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction

Zhengyuan Li, Kai Cheng, Anindita Ghosh et al.

CVPR 2025 • poster • arXiv:2503.18211

#4249

AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios

Ziming Huang, Xurui Li, Haotian Liu et al.

CVPR 2025 • poster • arXiv:2410.14379

#4250

Causal Discovery from Conditionally Stationary Time Series

Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.

ICML 2025 • poster • arXiv:2110.06257

#4251

Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations

Lorenzo Basile, Santiago Acevedo, Luca Bortolussi et al.

ICLR 2025 • poster • arXiv:2406.15812

#4252

Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness

Rongzhe Wei, Peizhi Niu, Hans Hao-Hsun Hsu et al.

NEURIPS 2025 • poster • arXiv:2506.05735

#4253

Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models

Shuyang Hao, Bryan Hooi, Jun Liu et al.

CVPR 2025 • poster • arXiv:2411.18000

#4254

FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution

Gene Chou, Wenqi Xian, Guandao Yang et al.

ICCV 2025 • highlight • arXiv:2504.07093

#4255

Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding

Andong Deng, Zhongpai Gao, Anwesa Choudhuri et al.

CVPR 2025 • poster • arXiv:2411.16932

#4256

RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection

Yiheng Li, Yang Yang, Zhen Lei

AAAI 2025 • paper • arXiv:2412.12799

#4257

Exploring Simple Open-Vocabulary Semantic Segmentation

Zihang Lai

CVPR 2025 • poster • arXiv:2401.12217

#4258

Tracing the Representation Geometry of Language Models from Pretraining to Post-training

Melody Li, Kumar Krishna Agrawal, Arna Ghosh et al.

NEURIPS 2025 • poster • arXiv:2509.23024

#4259

The Persistence of Neural Collapse Despite Low-Rank Bias

Connall Garrod, Jonathan Keating

NEURIPS 2025 • poster • arXiv:2410.23169

#4260

Probing Equivariance and Symmetry Breaking in Convolutional Networks

Sharvaree Vadgama, Mohammad Islam, Domas Buracas et al.

NEURIPS 2025 • poster • arXiv:2501.01999

#4261

Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative Search

Haoran Sun, Yankai Jiang, Wenjie Lou et al.

NEURIPS 2025 • poster • arXiv:2506.16962

#4262

No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization

Martino Bernasconi, Matteo Castiglioni, Andrea Celli

ICML 2025 • poster • arXiv:2405.06575

#4263

Pamba: Enhancing Global Interaction in Point Clouds via State Space Model

Zhuoyuan Li, Yubo Ai, Jiahao Lu et al.

AAAI 2025 • paper • arXiv:2406.17442

#4264

Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment

Yang Liu, Mengyuan Liu, Shudong Huang et al.

AAAI 2025 • paper • arXiv:2503.06974

#4265

NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

Yanrui Bin, Wenbo Hu, Haoyuan Wang et al.

ICCV 2025 • poster • arXiv:2504.11427

#4266

TimePro: Efficient Multivariate Long-term Time Series Forecasting with Variable- and Time-Aware Hyper-state

Xiaowen Ma, Zhen-Liang Ni, Shuai Xiao et al.

ICML 2025 • oral • arXiv:2505.20774

#4267

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

Guozheng Ma, Lu Li, Zilin Wang et al.

ICML 2025 • oral • arXiv:2506.17204

#4268

Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation

Yueru Jia, Jiaming Liu, Sixiang Chen et al.

CVPR 2025 • poster

#4269

Prediction-Feedback DETR for Temporal Action Detection

Jihwan Kim, Miso Lee, Cheol-Ho Cho et al.

AAAI 2025 • paper • arXiv:2408.16729

#4270

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Han Lin, Jaemin Cho, Amir Zadeh et al.

NEURIPS 2025 • poster • arXiv:2508.05954

#4271

AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration

Javier Tirado-Garín, Javier Civera

ICCV 2025 • poster • arXiv:2503.12701

#4272

Event-Enhanced Blurry Video Super-Resolution

Dachun Kai, Yueyi Zhang, Jin Wang et al.

AAAI 2025 • paper • arXiv:2504.13042

#4273

Jailbreak-AudioBench: In-Depth Evaluation and Analysis of Jailbreak Threats for Large Audio Language Models

Hao Cheng, Erjia Xiao, Jing Shao et al.

NEURIPS 2025 • poster • arXiv:2501.13772

#4274

CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization

Jan Ackermann, Jonas Kulhanek, Shengqu Cai et al.

ICCV 2025 • poster • arXiv:2506.21117

#4275

Active Fine-Tuning of Multi-Task Policies

Marco Bagatella, Jonas Hübotter, Georg Martius et al.

ICML 2025 • oral • arXiv:2410.05026

#4276

Position: The Artificial Intelligence and Machine Learning Community Should Adopt a More Transparent and Regulated Peer Review Process

Jing Yang

ICML 2025 • poster • arXiv:2502.00874

#4277

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping

Guannan Lai, Yujie Li, Xiangkun Wang et al.

CVPR 2025 • poster • arXiv:2502.20032

#4278

EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization

Mujin Cheon, Jay Lee, Dong-Yeun Koh et al.

ICML 2025 • poster • arXiv:2411.00171

#4279

AutoURDF: Unsupervised Robot Modeling from Point Cloud Frames Using Cluster Registration

Jiong Lin, Lechen Zhang, Kwansoo Lee et al.

CVPR 2025 • poster • arXiv:2412.05507

#4280

Linear combinations of latents in generative models: subspaces and beyond

Erik Bodin, Alexandru Stere, Dragos Margineantu et al.

ICLR 2025 • poster • arXiv:2408.08558

#4281

LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation

Mufei Li, Viraj Shitole, Eli Chien et al.

ICLR 2025 • poster • arXiv:2411.02322

#4282

Towards Doctor-Like Reasoning: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients

Yuxing Lu, Gecheng Fu, Wei Wu et al.

NEURIPS 2025 • poster

#4283

Volume Optimality in Conformal Prediction with Structured Prediction Sets

Chao Gao, Liren Shan, Vaidehi Srinivas et al.

ICML 2025 • poster • arXiv:2502.16658

#4284

Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning

Yang You, Yixin Li, Congyue Deng et al.

ICLR 2025 • poster • arXiv:2411.19458

#4285

CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation

Reza Abbasi, Ali Nazari, Aminreza Sefid et al.

CVPR 2025 • poster • arXiv:2502.19842

#4286

SteerConf: Steering LLMs for Confidence Elicitation

Ziang Zhou, Tianyuan Jin, Jieming Shi et al.

NEURIPS 2025 • poster • arXiv:2503.02863

#4287

Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs

Yaniv Nikankin, Dana Arad, Yossi Gandelsman et al.

NEURIPS 2025 • poster • arXiv:2506.09047

#4288

From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting

Zhiwei Huang, Hailin Yu, Yichun Shentu et al.

CVPR 2025 • poster • arXiv:2503.19358

#4289

Toward Efficient Kernel-Based Solvers for Nonlinear PDEs

Zhitong Xu, Da Long, Yiming Xu et al.

ICML 2025 • poster • arXiv:2410.11165

#4290

QT-DoG: Quantization-Aware Training for Domain Generalization

Saqib Javed, Hieu Le, Mathieu Salzmann

ICML 2025 • poster • arXiv:2410.06020

#4291

Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs

Hao Fang, Changle Zhou, Jiawei Kong et al.

NEURIPS 2025 • poster • arXiv:2505.19678

#4292

LibriBrain: Over 50 Hours of Within-Subject MEG to Improve Speech Decoding Methods at Scale

Miran Özdogan, Gilad Landau, Gereon Elvers et al.

NEURIPS 2025 • poster • arXiv:2506.02098

#4293

On scalable and efficient training of diffusion samplers

Minkyu Kim, Kiyoung Seong, Dongyeop Woo et al.

NEURIPS 2025 • poster • arXiv:2505.19552

#4294

T2V-OptJail: Discrete Prompt Optimization for Text-to-Video Jailbreak Attacks

Jiayang Liu, Siyuan Liang, Shiqian Zhao et al.

NEURIPS 2025 • poster • arXiv:2505.06679

#4295

Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.

ICML 2025 • poster • arXiv:2412.11044

#4296

Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training

Will Merrill, Shane Arora, Dirk Groeneveld et al.

NEURIPS 2025 • spotlight • arXiv:2505.23971

#4297

MOS: Modeling Object-Scene Associations in Generalized Category Discovery

Zhengyuan Peng, Jinpeng Ma, Zhimin Sun et al.

CVPR 2025 • poster • arXiv:2503.12035

#4298

Flowing Datasets with Wasserstein over Wasserstein Gradient Flows

Clément Bonet, Christophe Vauthier, Anna Korba

ICML 2025 • oral • arXiv:2506.07534

#4299

BrainACTIV: Identifying visuo-semantic properties driving cortical selectivity using diffusion-based image manipulation

Diego García Cerdas, Christina Sartzetaki, Magnus Petersen et al.

ICLR 2025 • poster

#4300

Auto-Regressive Diffusion for Generating 3D Human-Object Interactions

Zichen Geng, Zeeshan Hayder, Wei Liu et al.

AAAI 2025 • paper • arXiv:2503.16801

#4301

ELICIT: LLM Augmentation Via External In-context Capability

Futing Wang, Jianhao (Elliott) Yan, Yue Zhang et al.

ICLR 2025 • poster • arXiv:2410.09343

#4302

Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?

Amirhesam Abedsoltan, Huaqing Zhang, Kaiyue Wen et al.

ICML 2025 • poster • arXiv:2502.08991

#4303

LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers

Yusuf Dalva, Hidir Yesiltepe, Pinar Yanardag

NEURIPS 2025 • spotlight • arXiv:2505.23758

#4304

Probabilistic Learning to Defer: Handling Missing Expert Annotations and Controlling Workload Distribution

Cuong Nguyen, Thanh-Toan Do, Gustavo Carneiro

ICLR 2025 • poster

#4305

Towards Understanding the Robustness of Diffusion-Based Purification: A Stochastic Perspective

Yiming Liu, Kezhao Liu, Yao Xiao et al.

ICLR 2025 • poster • arXiv:2404.14309

#4306

Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations

Brian Zheng, Alisa Liu, Orevaoghene Ahia et al.

NEURIPS 2025 • spotlight • arXiv:2506.19004

#4307

Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs

Rui Dai, Sile Hu, Xu Shen et al.

ICLR 2025 • poster • arXiv:2504.10902

#4308

DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts

Zheng-Peng Duan, Jiawei Zhang, Zheng Lin et al.

AAAI 2025 • paper • arXiv:2407.03757

#4309

ProxSparse: Regularized Learning of Semi-Structured Sparsity Masks for Pretrained LLMs

Hongyi Liu, Rajarshi Saha, Zhen Jia et al.

ICML 2025 • poster • arXiv:2502.00258

#4310

Split Gibbs Discrete Diffusion Posterior Sampling

Wenda Chu, Zihui Wu, Yifan Chen et al.

NEURIPS 2025 • poster • arXiv:2503.01161

#4311

Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence

Shaopeng Fu, Liang Ding, Jingfeng Zhang et al.

NEURIPS 2025 • poster • arXiv:2502.04204

#4312

Uncertain Multimodal Intention and Emotion Understanding in the Wild

Qu Yang, QingHongYa Shi, Tongxin Wang et al.

CVPR 2025 • poster

#4313

H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving

Siran Chen, Yuxiao Luo, Yue Ma et al.

AAAI 2025 • paper • arXiv:2501.04302

#4314

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

Yucheng Shi, Quanzheng Li, Jin Sun et al.

ICLR 2025 • poster • arXiv:2502.14044

#4315

WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images

Yansong Guo, Jie Hu, Yansong Qu et al.

ICCV 2025 • poster • arXiv:2503.08407

#4316

Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

Xingzhuo Guo, Yu Zhang, Baixu Chen et al.

ICLR 2025 • oral • arXiv:2503.00951

#4317

VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment

Darshana Saravanan, Varun Gupta, Darshan Singh S et al.

CVPR 2025 • poster • arXiv:2406.10889

#4318

Active Task Disambiguation with LLMs

Katarzyna Kobalczyk, Nicolás Astorga, Tennison Liu et al.

ICLR 2025 • poster • arXiv:2502.04485

#4319

Mask in the Mirror: Implicit Sparsification

Tom Jacobs, Rebekka Burkholz

ICLR 2025 • poster • arXiv:2408.09966

#4320

Structured Linear CDEs: Maximally Expressive and Parallel-in-Time Sequence Models

Benjamin Walker, Lingyi Yang, Nicola Muca Cirone et al.

NEURIPS 2025 • spotlight • arXiv:2505.17761

#4321

Video Perception Models for 3D Scene Synthesis

Rui Huang, Guangyao Zhai, Zuria Bauer et al.

NEURIPS 2025 • poster • arXiv:2506.20601

#4322

Text2Relight: Creative Portrait Relighting with Text Guidance

Junuk Cha, Mengwei Ren, Krishna Kumar Singh et al.

AAAI 2025 • paper • arXiv:2412.13734

#4323

WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration

Laibin Chang, Yunke Wang, Longxiang Deng et al.

AAAI 2025 • paper

#4324

Understanding the Limits of Deep Tabular Methods with Temporal Shift

Haorun Cai, Han-Jia Ye

ICML 2025 • oral • arXiv:2502.20260

#4325

DEALing with Image Reconstruction: Deep Attentive Least Squares

Mehrsa Pourya, Erich Kobler, Michael Unser et al.

ICML 2025 • poster • arXiv:2502.04079

#4326

Estimating Model Performance Under Covariate Shift Without Labels

Jakub Białek, Juhani Kivimäki, Wojciech Kuberski et al.

NEURIPS 2025 • poster • arXiv:2401.08348

#4327

PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection

Xiaoran Xu, Jiangang Yang, Wenhui Shi et al.

AAAI 2025 • paper • arXiv:2412.11807

#4328

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Ali Taghibakhshi, Sharath Turuvekere Sreenivas, Saurav Muralidharan et al.

NEURIPS 2025 • poster • arXiv:2504.11409

#4329

Scaffolding Dexterous Manipulation with Vision-Language Models

Vincent de Bakker, Joey Hejna, Tyler Lum et al.

NEURIPS 2025 • poster • arXiv:2506.19212

#4330

Improving Gaussian Splatting with Localized Points Management

Haosen Yang, Chenhao Zhang, Wenqing Wang et al.

CVPR 2025 • highlight • arXiv:2406.04251

#4331

CSformer: Combining Channel Independence and Mixing for Robust Multivariate Time Series Forecasting

Haoxin Wang, Yipeng Mo, Kunlan Xiang et al.

AAAI 2025 • paper • arXiv:2312.06220

#4332

TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types

Jiankang Chen, Tianke Zhang, Changyi Liu et al.

ICLR 2025 • poster • arXiv:2502.09925

#4333

Dissecting Generalized Category Discovery: Multiplex Consensus under Self-Deconstruction

Luyao Tang, Kunze Huang, Yuxuan Yuan et al.

ICCV 2025 • highlight • arXiv:2508.10731

#4334

Exploit Your Latents: Coarse-Grained Protein Backmapping with Latent Diffusion Models

Rongchao Zhang, Yu Huang, Yiwei Lou et al.

AAAI 2025 • paper

#4335

DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype

Qiang Wang, Yuhang He, Songlin Dong et al.

AAAI 2025 • paper • arXiv:2503.18042

#4336

Are Expressive Models Truly Necessary for Offline RL?

Guan Wang, Haoyi Niu, Jianxiong Li et al.

AAAI 2025 • paper • arXiv:2412.11253

#4337

Generating Multimodal Driving Scenes via Next-Scene Prediction

Yanhao Wu, Haoyang Zhang, Tianwei Lin et al.

CVPR 2025 • poster • arXiv:2503.14945

#4338

ProtoArgNet: Interpretable Image Classification with Super-Prototypes and Argumentation

Hamed Ayoobi, Nico Potyka, Francesca Toni

AAAI 2025 • paper • arXiv:2311.15438

#4339

Bayesian Experimental Design Via Contrastive Diffusions

Jacopo Iollo, Christophe Heinkelé, Pierre Alliez et al.

ICLR 2025 • poster • arXiv:2410.11826

#4340

Attention Mechanism, Max-Affine Partition, and Universal Approximation

Hude Liu, Jerry Yao-Chieh Hu, Zhao Song et al.

NEURIPS 2025 • poster • arXiv:2504.19901

#4341

Audio Super-Resolution with Latent Bridge Models

Chang Li, Zehua Chen, Liyuan Wang et al.

NEURIPS 2025 • poster • arXiv:2509.17609

#4342

IGL-Bench: Establishing the Comprehensive Benchmark for Imbalanced Graph Learning

Jiawen Qin, Haonan Yuan, Qingyun Sun et al.

ICLR 2025 • poster • arXiv:2406.09870

#4343

FreqDebias: Towards Generalizable Deepfake Detection via Consistency-Driven Frequency Debiasing

Hossein Kashiani, Niloufar Alipour Talemi, Fatemeh Afghah

CVPR 2025 • poster • arXiv:2509.22412

#4344

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Siyuan Li, Luyuan Zhang, Zedong Wang et al.

CVPR 2025 • poster • arXiv:2504.00999

#4345

Runtime Analysis for Multi-Objective Evolutionary Algorithms in Unbounded Integer Spaces

Benjamin Doerr, Martin S. Krejca, Günter Rudolph

AAAI 2025 • paper • arXiv:2412.11684

#4346

Locally Convex Global Loss Network for Decision-Focused Learning

Haeun Jeon, Hyunglip Bae, Minsu Park et al.

AAAI 2025 • paper • arXiv:2403.01875

#4347

Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models

Fusheng Liu, Qianxiao Li

ICLR 2025 • oral • arXiv:2411.19455

#4348

MindSimulator: Exploring Brain Concept Localization via Synthetic fMRI

Qi Zhang, Qi Zhang, Zixuan Gong et al.

ICLR 2025 • poster • arXiv:2503.02351

#4349

IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning

Quan Zhang, Yuxin Qi, Xi Tang et al.

ICLR 2025 • poster • arXiv:2502.02454

#4350

Revisiting a Design Choice in Gradient Temporal Difference Learning

Xiaochi Qian, Shangtong Zhang

ICLR 2025 • oral • arXiv:2308.01170

#4351

Decompile-Bench: Million-Scale Binary-Source Function Pairs for Real-World Binary Decompilation

Hanzhuo Tan, Xiaolong Tian, Hanrui Qi et al.

NEURIPS 2025 • poster • arXiv:2505.12668

#4352

SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints

Ziqi Sheng, Wei Lu, Xiangyang Luo et al.

AAAI 2025 • paper • arXiv:2412.09981

#4353

Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context

Ge Zheng, Jiaye Qian, Jiajin Tang et al.

ICCV 2025 • poster • arXiv:2510.20229

#4354

Understanding Adam Requires Better Rotation Dependent Assumptions

Tianyue Zhang, Lucas Maes, Alan Milligan et al.

NEURIPS 2025 • poster • arXiv:2410.19964

#4355

Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning

Roger Creus Castanyer, Johan Obando Ceron, Lu Li et al.

NEURIPS 2025 • spotlight • arXiv:2506.15544

#4356

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Xianhang Li, Yanqing Liu, Haoqin Tu et al.

ICCV 2025 • poster • arXiv:2505.04601

#4357

Learning a Neural Solver for Parametric PDEs to Enhance Physics-Informed Methods

Lise Le Boudec, Emmanuel de Bézenac, Louis Serrano et al.

ICLR 2025 • poster • arXiv:2410.06820

#4358

HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models

Yu Zhou, Xingyu Wu, Jibin Wu et al.

NEURIPS 2025 • spotlight • arXiv:2409.18893

#4359

SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization

Xiaofeng Tan, Hongsong Wang, Xin Geng et al.

NEURIPS 2025 • poster • arXiv:2412.05095

#4360

Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis

Letian Zhang, Quan Cui, Bingchen Zhao et al.

ICCV 2025 • poster • arXiv:2503.08741

#4361

Unisolver: PDE-Conditional Transformers Towards Universal Neural PDE Solvers

Hang Zhou, Yuezhou Ma, Haixu Wu et al.

ICML 2025 • poster • arXiv:2405.17527

#4362

Hypergraph Attacks via Injecting Homogeneous Nodes into Elite Hyperedges

Meixia He, Peican Zhu, Keke Tang et al.

AAAI 2025 • paper • arXiv:2412.18365

#4363

MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights

Jingjing Hu, Dan Guo, Zhan Si et al.

AAAI 2025 • paper • arXiv:2412.16483

#4364

Aligning Language Models Using Follow-up Likelihood as Reward Signal

Chen Zhang, Dading Chong, Feng Jiang et al.

AAAI 2025 • paper • arXiv:2409.13948

#4365

Prediction-Powered Causal Inferences

Riccardo Cadei, Ilker Demirel, Piersilvio De Bartolomeis et al.

NEURIPS 2025 • poster • arXiv:2502.06343

#4366

Motion Modes: What Could Happen Next?

Karran Pandey, Yannick Hold-Geoffroy, Matheus Gadelha et al.

CVPR 2025 • poster • arXiv:2412.00148

#4367

Logits DeConfusion with CLIP for Few-Shot Learning

Shuo Li, Fang Liu, Zehua Hao et al.

CVPR 2025 • poster • arXiv:2504.12104

#4368

PICD: Versatile Perceptual Image Compression with Diffusion Rendering

Tongda Xu, Jiahao Li, Bin Li et al.

CVPR 2025 • poster • arXiv:2505.05853

#4369

Provable Scaling Laws for the Test-Time Compute of Large Language Models

Yanxi Chen, Xuchen Pan, Yaliang Li et al.

NEURIPS 2025 • poster • arXiv:2411.19477

#4370

FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation

Zhuguanyu Wu, Shihe Wang, Jiayi Zhang et al.

CVPR 2025 • highlight • arXiv:2506.11543

#4371

ConStellaration: A dataset of QI-like stellarator plasma boundaries and optimization benchmarks

Santiago Cadena, Andrea Merlo, Emanuel Laude et al.

NEURIPS 2025 • poster • arXiv:2506.19583

#4372

Large Language Models Think Too Fast To Explore Effectively

Lan Pan, Hanbo Xie, Robert Wilson

NEURIPS 2025 • poster • arXiv:2501.18009

#4373

Golden Cudgel Network for Real-Time Semantic Segmentation

Guoyu Yang, Yuan Wang, Daming Shi et al.

CVPR 2025 • poster • arXiv:2503.03325

#4374

6D Object Pose Tracking in Internet Videos for Robotic Manipulation

Georgy Ponimatkin, Martin Cífka, Tomas Soucek et al.

ICLR 2025 • oral • arXiv:2503.10307

#4375

DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning

Fucai Ke, Vijay Kumar B G, Xingjian Leng et al.

ICCV 2025 • poster • arXiv:2503.19263

#4376

SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training

Yehonathan Refael, Guy Smorodinsky, Tom Tirer et al.

NEURIPS 2025 • poster • arXiv:2505.24749

#4377

GNNs Getting ComFy: Community and Feature Similarity Guided Rewiring

Celia Rubio-Madrigal, Adarsh Jamadandi, Rebekka Burkholz

ICLR 2025 • poster • arXiv:2502.04891

#4378

Momentum Multi-Marginal Schrödinger Bridge Matching

Panagiotis Theodoropoulos, Augustinos Saravanos, Evangelos Theodorou et al.

NEURIPS 2025 • oral • arXiv:2506.10168

#4379

Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling

Xiao Li, Zekai Zhang, Xiang Li et al.

NEURIPS 2025 • poster • arXiv:2502.05743

#4380

Truth over Tricks: Measuring and Mitigating Shortcut Learning in Misinformation Detection

Herun Wan, Jiaying Wu, Minnan Luo et al.

NEURIPS 2025 • poster • arXiv:2506.02350

#4381

Multimodal Tabular Reasoning with Privileged Structured Information

Jun-Peng Jiang, Yu Xia, Hai-Long Sun et al.

NEURIPS 2025 • poster • arXiv:2506.04088

#4382

The emergence of sparse attention: impact of data distribution and benefits of repetition

Nicolas Zucchet, Francesco D'Angelo, Andrew Lampinen et al.

NEURIPS 2025 • oral • arXiv:2505.17863

#4383

EVOS: Efficient Implicit Neural Training via EVOlutionary Selector

Weixiang Zhang, Shuzhao Xie, Chengwei Ren et al.

CVPR 2025 • poster • arXiv:2412.10153

#4384

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

Yuqian Yuan, Ronghao Dang, long li et al.

NEURIPS 2025 • oral • arXiv:2506.05287

#4385

Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data

Zhenqing Ling, Daoyuan Chen, Liuyi Yao et al.

NEURIPS 2025 • poster • arXiv:2502.04380

#4386

Where, What, Why: Towards Explainable Driver Attention Prediction

Yuchen Zhou, Jiayu Tang, Xiaoyan Xiao et al.

ICCV 2025 • highlight • arXiv:2506.23088

#4387

End-to-end Learning of Gaussian Mixture Priors for Diffusion Sampler

Denis Blessing, Xiaogang Jia, Gerhard Neumann

ICLR 2025 • poster • arXiv:2503.00524

#4388

HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting

Jingyu Lin, Jiaqi Gu, Lubin Fan et al.

CVPR 2025 • poster • arXiv:2412.03844

#4389

Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

Jun Zhang, Jue Wang, Huan Li et al.

ICLR 2025 • poster • arXiv:2502.13533

#4390

Cached Multi-Lora Composition for Multi-Concept Image Generation

Xiandong Zou, Mingzhu Shen, Christos-Savvas Bouganis et al.

ICLR 2025 • poster • arXiv:2502.04923

#4391

CLIPDrag: Combining Text-based and Drag-based Instructions for Image Editing

Ziqi Jiang, Zhen Wang, Long Chen

ICLR 2025 • poster • arXiv:2410.03097

#4392

BrainOOD: Out-of-distribution Generalizable Brain Network Analysis

Jiaxing Xu, Yongqiang Chen, Xia Dong et al.

ICLR 2025 • poster • arXiv:2502.01688

#4393

Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

Guangyuan Ma, Yongliang Ma, Xing Wu et al.

AAAI 2025 • paper • arXiv:2408.10613

#4394

Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models

Itay Benou, Tammy Riklin Raviv

CVPR 2025 • highlight • arXiv:2502.20134

#4395

VALLR: Visual ASR Language Model for Lip Reading

Marshall Thomas, Edward Fish, Richard Bowden

ICCV 2025 • poster • arXiv:2503.21408

#4396

Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection

Marc-Antoine Lavoie, Anas Mahmoud, Steven L. Waslander

CVPR 2025 • poster • arXiv:2503.23220

#4397

Dynamical Low-Rank Compression of Neural Networks with Robustness under Adversarial Attacks

Steffen Schotthöfer, Lexie Yang, Stefan Schnake

NEURIPS 2025 • oral • arXiv:2505.08022

#4398

MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception

Wenzhuo Liu, Wenshuo Wang, Yicheng Qiao et al.

CVPR 2025 • poster • arXiv:2504.02264

#4399

Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning

Yonghao Liu, Mengyu Li, Wei Pang et al.

AAAI 2025 • paper • arXiv:2501.09214

#4400

Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis

Arpita Chowdhury, Dipanjyoti Paul, Zheda Mai et al.

CVPR 2025 • poster • arXiv:2501.09333
