Most Cited 2025 Poster Papers

22,274 papers found • Page 52 of 112

#10201

Efficient Federated Learning against Byzantine Attacks and Data Heterogeneity via Aggregating Normalized Gradients

Shiyuan Zuo, Xingrun Yan, Rongfei Fan et al.

NEURIPS 2025arXiv:2408.09539
3
citations
#10202

SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting

Jiahui Zhang, Fangneng Zhan, Ling Shao et al.

CVPR 2025arXiv:2503.07476
3
citations
#10203

PAC Bench: Do Foundation Models Understand Prerequisites for Executing Manipulation Policies?

Atharva Gundawar, Som Sagar, Ransalu Senanayake

NEURIPS 2025arXiv:2506.23725
3
citations
#10204

Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation

Rohith Peddi, Saurabh ., Ayush Abhay Shrivastava et al.

CVPR 2025highlightarXiv:2411.13059
3
citations
#10205

One-shot 3D Object Canonicalization based on Geometric and Semantic Consistency

Li Jin, Yujie Wang, Wenzheng Chen et al.

CVPR 2025highlight
3
citations
#10206

Visual Instruction Bottleneck Tuning

Changdae Oh, Jiatong Li, Shawn Im et al.

NEURIPS 2025arXiv:2505.13946
3
citations
#10207

On Fairness of Unified Multimodal Large Language Model for Image Generation

Ming Liu, Hao Chen, Jindong Wang et al.

NEURIPS 2025arXiv:2502.03429
3
citations
#10208

Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator

Beier Luo, Shuoyuan Wang, Sharon Li et al.

NEURIPS 2025arXiv:2505.16690
3
citations
#10209

Bringing SAM to new heights: leveraging elevation data for tree crown segmentation from drone imagery

Mélisande Teng, Arthur Ouaknine, Etienne Laliberté et al.

NEURIPS 2025arXiv:2506.04970
3
citations
#10210

Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs

Zhehao Li, Zhehao Li, Kangbo Lyu et al.

NEURIPS 2025arXiv:2510.27517
3
citations
#10211

Latte: Collaborative Test-Time Adaptation of Vision-Language Models in Federated Learning

Wenxuan Bao, Ruxi Deng, Ruizhong Qiu et al.

ICCV 2025arXiv:2507.21494
3
citations
#10212

Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations

Hai Huang, Yan Xia, Sashuai Zhou et al.

ICCV 2025arXiv:2507.03304
3
citations
#10213

TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features

Dana Cohen-Bar, Daniel Cohen-Or, Gal Chechik et al.

CVPR 2025arXiv:2503.16630
3
citations
#10214

Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation

Jiaer Xia, Bingkui Tong, Yuhang Zang et al.

ICCV 2025highlightarXiv:2507.02859
3
citations
#10215

Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery

Xiao Liu, Nan Pu, Haiyang Zheng et al.

ICCV 2025arXiv:2507.04051
3
citations
#10216

SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism

Beitao Chen, Xinyu Lyu, shengming yuan et al.

NEURIPS 2025arXiv:2507.01513
3
citations
#10217

MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation

Fu Rong, Meng Lan, Qian Zhang et al.

ICCV 2025arXiv:2501.13667
3
citations
#10218

A Statistical Theory of Contrastive Learning via Approximate Sufficient Statistics

Licong Lin, Song Mei

NEURIPS 2025arXiv:2503.17538
3
citations
#10219

LLM-Driven Treatment Effect Estimation Under Inference Time Text Confounding

Yuchen Ma, Dennis Frauen, Jonas Schweisthal et al.

NEURIPS 2025arXiv:2507.02843
3
citations
#10220

CAPability: A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness

Zhihang Liu, Chen-Wei Xie, Bin Wen et al.

NEURIPS 2025arXiv:2502.14914
3
citations
#10221

MVGBench: a Comprehensive Benchmark for Multi-view Generation Models

Xianghui Xie, Jan Lenssen, Gerard Pons-Moll

ICCV 2025
3
citations
#10222

Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search

Sebastian Bruch, Aditya Krishnan, Franco Maria Nardini

NEURIPS 2025arXiv:2405.12207
3
citations
#10223

REOBench: Benchmarking Robustness of Earth Observation Foundation Models

Xiang Li, Yong Tao, Siyuan Zhang et al.

NEURIPS 2025arXiv:2505.16793
3
citations
#10224

Efficient Video Super-Resolution for Real-time Rendering with Decoupled G-buffer Guidance

Mingjun Zheng, Long Sun, Jiangxin Dong et al.

CVPR 2025
3
citations
#10225

Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints

Guanjie Chen, Xinyu Zhao, Yucheng Zhou et al.

ICCV 2025arXiv:2411.17616
3
citations
#10226

A Hubness Perspective on Representation Learning for Graph-Based Multi-View Clustering

Zheming Xu, He Liu, Congyan Lang et al.

CVPR 2025
3
citations
#10227

EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation

Daikun Liu, Lei Cheng, Teng Wang et al.

CVPR 2025arXiv:2506.03512
3
citations
#10228

Convergent Functions, Divergent Forms

Hyeonseong Jeon, Ainaz Eftekhar, Aaron Walsman et al.

NEURIPS 2025arXiv:2505.21665
3
citations
#10229

Can Agent Fix Agent Issues?

Alfin Wijaya Rahardja, Junwei Liu, Weitong Chen et al.

NEURIPS 2025arXiv:2505.20749
3
citations
#10230

PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination

Ming Dai, Wenxuan Cheng, Jiedong Zhuang et al.

ICCV 2025arXiv:2509.04833
3
citations
#10231

Cross-Subject Mind Decoding from Inaccurate Representations

Yangyang Xu, Bangzhen Liu, Wenqi Shao et al.

ICCV 2025arXiv:2507.19071
3
citations
#10232

FedCALM: Conflict-aware Layer-wise Mitigation for Selective Aggregation in Deeper Personalized Federated Learning

Hao Zheng, Zhigang Hu, Boyu Wang et al.

CVPR 2025
3
citations
#10233

ODG: Occupancy Prediction Using Dual Gaussians

Yunxiao Shi, Yinhao Zhu, Herbert Cai et al.

NEURIPS 2025arXiv:2506.09417
3
citations
#10234

Escaping the SpuriVerse: Can Large Vision-Language Models Generalize Beyond Seen Spurious Correlations?

Yiwei Yang, Chung Peng Lee, Shangbin Feng et al.

NEURIPS 2025arXiv:2506.18322
3
citations
#10235

LLM Safety Alignment is Divergence Estimation in Disguise

Rajdeep Haldar, Ziyi Wang, Guang Lin et al.

NEURIPS 2025arXiv:2502.00657
3
citations
#10236

VAFlow: Video-to-Audio Generation with Cross-Modality Flow Matching

Xihua Wang, Xin Cheng, Yuyue Wang et al.

ICCV 2025
3
citations
#10237

Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees

Sourav Ganguly, Kishan Panaganti, Arnob Ghosh et al.

NEURIPS 2025arXiv:2505.19238
3
citations
#10238

From Imitation to Innovation: The Emergence of AI's Unique Artistic Styles and the Challenge of Copyright Protection

Zexi Jia, Chuanwei Huang, Hongyan Fei et al.

ICCV 2025arXiv:2507.04769
3
citations
#10239

LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models

Qianyue Hao, Yiwen Song, Qingmin Liao et al.

NEURIPS 2025spotlightarXiv:2505.15293
3
citations
#10240

A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning

Yuzheng Hu, Fan Wu, Haotian Ye et al.

NEURIPS 2025oralarXiv:2505.19281
3
citations
#10241

General Compression Framework for Efficient Transformer Object Tracking

Lingyi Hong, Jinglun Li, Xinyu Zhou et al.

ICCV 2025arXiv:2409.17564
3
citations
#10242

Analyzing Fine-Grained Alignment and Enhancing Vision Understanding in Multimodal Language Models

Jiachen Jiang, Jinxin Zhou, Bo Peng et al.

NEURIPS 2025arXiv:2505.17316
3
citations
#10243

ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos

Zetong Zhang, Manuel Kaufmann, Lixin Xue et al.

CVPR 2025arXiv:2504.13167
3
citations
#10244

TemCoCo: Temporally Consistent Multi-modal Video Fusion with Visual-Semantic Collaboration

Gong Meiqi, Hao Zhang, Xunpeng Yi et al.

ICCV 2025arXiv:2508.17817
3
citations
#10245

SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts

Shijia Zhao, Qiming Xia, Xusheng Guo et al.

CVPR 2025highlightarXiv:2503.06467
3
citations
#10246

DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection

Yingli Shen, Wen Lai, Shuo Wang et al.

NEURIPS 2025arXiv:2502.11546
3
citations
#10247

Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression

Yuning Shen, Lihao Wang, Huizhuo Yuan et al.

NEURIPS 2025oralarXiv:2505.17478
3
citations
#10248

AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees

Hongyi Zhou, Jin Zhu, Pingfan Su et al.

NEURIPS 2025arXiv:2510.01268
3
citations
#10249

Reading Recognition in the Wild

Charig Yang, Samiul Alam, Shakhrul Iman Siam et al.

NEURIPS 2025arXiv:2505.24848
3
citations
#10250

Charm: The Missing Piece in ViT Fine-Tuning for Image Aesthetic Assessment

Fatemeh Behrad, Tinne Tuytelaars, Johan Wagemans

CVPR 2025arXiv:2504.02522
3
citations
#10251

EA-KD: Entropy-based Adaptive Knowledge Distillation

Chi-Ping Su, Ching-Hsun Tseng, Bin Pu et al.

ICCV 2025arXiv:2311.13621
3
citations
#10252

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

Jiaqi Cao, Jiarui Wang, Rubin Wei et al.

NEURIPS 2025arXiv:2508.09874
3
citations
#10253

How Can Objects Help Video-Language Understanding?

Zitian Tang, Shijie Wang, Junho Cho et al.

ICCV 2025arXiv:2504.07454
3
citations
#10254

HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting

Xinpeng Liu, Zeyi Huang, Fumio Okura et al.

CVPR 2025arXiv:2503.19232
3
citations
#10255

Scaling Language-centric Omnimodal Representation Learning

Chenghao Xiao, Hou Pong (Ken) Chan, Hao Zhang et al.

NEURIPS 2025arXiv:2510.11693
3
citations
#10256

FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation

Yunpeng Bai, Qixing Huang

ICCV 2025arXiv:2412.00671
3
citations
#10257

CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays

Hyungyung Lee, Geon Choi, Jung-Oh Lee et al.

NEURIPS 2025spotlightarXiv:2505.18087
3
citations
#10258

NADER: Neural Architecture Design via Multi-Agent Collaboration

Zekang Yang, Wang ZENG, Sheng Jin et al.

CVPR 2025arXiv:2412.19206
3
citations
#10259

ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering

Kaisi Guan, Zhengfeng Lai, Yuchong Sun et al.

ICCV 2025arXiv:2503.16867
3
citations
#10260

Position: Machine Learning Conferences Should Establish a "Refutations and Critiques" Track

Rylan Schaeffer, Joshua Kazdan, Yegor Denisov-Blanch et al.

NEURIPS 2025oralarXiv:2506.19882
3
citations
#10261

Weakly Supervised Visible-Infrared Person Re-Identification via Heterogeneous Expert Collaborative Consistency Learning

Yafei Zhang, Lingqi Kong, Huafeng Li et al.

ICCV 2025arXiv:2507.12942
3
citations
#10262

Latent Mixture of Symmetries for Sample-Efficient Dynamic Learning

Haoran Li, CHENHAN XIAO, Muhao Guo et al.

NEURIPS 2025oralarXiv:2510.03578
3
citations
#10263

Detecting Adversarial Data Using Perturbation Forgery

Qian Wang, Chen Li, Yuchen Luo et al.

CVPR 2025arXiv:2405.16226
3
citations
#10264

Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It

Yulu Qin, Dheeraj Varghese, Adam Dahlgren Lindström et al.

NEURIPS 2025oralarXiv:2507.13328
3
citations
#10265

Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations

Vivek Myers, Bill Zheng, Benjamin Eysenbach et al.

NEURIPS 2025oralarXiv:2509.20478
3
citations
#10266

Anytime-valid, Bayes-assisted, Prediction-Powered Inference

Valentin Kilian, Stefano Cortinovis, Francois Caron

NEURIPS 2025arXiv:2505.18000
3
citations
#10267

Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens

Qihang Fan, Huaibo Huang, Mingrui Chen et al.

ICCV 2025arXiv:2405.13337
3
citations
#10268

SLVR: Super-Light Visual Reconstruction via Blueprint Controllable Convolutions and Exploring Feature Diversity Representation

Ning Ni, Libao Zhang

CVPR 2025
3
citations
#10269

Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution

Hang Xu, Jie Huang, Wei Yu et al.

CVPR 2025arXiv:2506.12738
3
citations
#10270

A Flag Decomposition for Hierarchical Datasets

Nathan Mankovich, Ignacio Santamaria, Gustau Camps-Valls et al.

CVPR 2025arXiv:2502.07782
3
citations
#10271

CATransformers: Carbon Aware Transformers Through Joint Model-Hardware Optimization

Irene Wang, Mostafa Elhoushi, H Ekin Sumbul et al.

NEURIPS 2025arXiv:2505.01386
3
citations
#10272

Reparameterized LLM Training via Orthogonal Equivalence Transformation

Zeju Qiu, Simon Buchholz, Tim Xiao et al.

NEURIPS 2025arXiv:2506.08001
3
citations
#10273

Annotation Ambiguity Aware Semi-Supervised Medical Image Segmentation

Suruchi Kumari, Pravendra Singh

CVPR 2025highlight
3
citations
#10274

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Mingyang Song, Xiaoye Qu, Jiawei Zhou et al.

CVPR 2025arXiv:2503.12821
3
citations
#10275

ORIDa: Object-centric Real-world Image Composition Dataset

Jinwoo Kim, Sangmin Han, Jinho Jeong et al.

CVPR 2025arXiv:2506.08964
3
citations
#10276

Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification

Kunlun Xu, Fan Zhuo, Jiangmeng Li et al.

ICCV 2025arXiv:2507.01884
3
citations
#10277

Synthesize Privacy-Preserving High-Resolution Images via Private Textual Intermediaries

Haoxiang Wang, Zinan Lin, Da Yu et al.

NEURIPS 2025arXiv:2506.07555
3
citations
#10278

Inference-Time Reward Hacking in Large Language Models

Hadi Khalaf, Claudio Mayrink Verdun, Alex Oesterling et al.

NEURIPS 2025spotlightarXiv:2506.19248
3
citations
#10279

Optimal Control for Transformer Architectures: Enhancing Generalization, Robustness and Efficiency

Kelvin Kan, Xingjian Li, Benjamin Zhang et al.

NEURIPS 2025arXiv:2505.13499
3
citations
#10280

BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with Transformer

Yuzhou Liu, Lingjie Zhu, Hanqiao Ye et al.

CVPR 2025highlight
3
citations
#10281

Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision

Jinneyong Kim, Seung-Hwan Baek

CVPR 2025arXiv:2411.18025
3
citations
#10282

Dynamic View Synthesis as an Inverse Problem

Hidir Yesiltepe, Pinar Yanardag

NEURIPS 2025arXiv:2506.08004
3
citations
#10283

ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models

Guoyizhe Wei, Rama Chellappa

ICCV 2025arXiv:2504.00037
3
citations
#10284

EigenGS Representation: From Eigenspace to Gaussian Image Space

LO-WEI TAI, Ching-En Ching En, Li et al.

CVPR 2025arXiv:2503.07446
3
citations
#10285

MOVE: Motion-Guided Few-Shot Video Object Segmentation

Kaining Ying, Hengrui Hu, Henghui Ding

ICCV 2025arXiv:2507.22061
3
citations
#10286

Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?

Jianyang Xie, Yitian Zhao, Yanda Meng et al.

CVPR 2025arXiv:2505.10679
3
citations
#10287

Learning long range dependencies through time reversal symmetry breaking

Guillaume Pourcel, Maxence Ernoult

NEURIPS 2025oralarXiv:2506.05259
3
citations
#10288

CARE: Decoding-Time Safety Alignment via Rollback and Introspection Intervention

Xiaomeng Hu, Fei Huang, Chenhan Yuan et al.

NEURIPS 2025arXiv:2509.06982
3
citations
#10289

Diffusion-Based Hierarchical Graph Neural Networks for Simulating Nonlinear Solid Mechanics

Tobias Würth, Niklas Freymuth, Gerhard Neumann et al.

NEURIPS 2025oralarXiv:2506.06045
3
citations
#10290

Masking meets Supervision: A Strong Learning Alliance

Byeongho Heo, Taekyung Kim, Sangdoo Yun et al.

CVPR 2025arXiv:2306.11339
3
citations
#10291

MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework

Qirui Mi, Mengyue Yang, Xiangning Yu et al.

NEURIPS 2025arXiv:2504.21582
3
citations
#10292

Optimal Neural Compressors for the Rate-Distortion-Perception Tradeoff

Eric Lei, Hamed Hassani, Shirin Saeedi Bidokhti

NEURIPS 2025spotlightarXiv:2503.17558
3
citations
#10293

SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders

Jiahui Geng, Qing Li

ICCV 2025arXiv:2503.14530
3
citations
#10294

IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner

Yuyang Huang, Yabo Chen, Li Ding et al.

CVPR 2025
3
citations
#10295

ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models

Yahan Tu, Rui Hu, Jitao Sang

CVPR 2025arXiv:2409.09318
3
citations
#10296

SocialMOIF: Multi-Order Intention Fusion for Pedestrian Trajectory Prediction

Kai Chen, Xiaodong Zhao, Yujie Huang et al.

CVPR 2025arXiv:2504.15616
3
citations
#10297

On the Coexistence and Ensembling of Watermarks

Aleksandar Petrov, Shruti Agarwal, Philip Torr et al.

NEURIPS 2025arXiv:2501.17356
3
citations
#10298

O-MaMa: Learning Object Mask Matching between Egocentric and Exocentric Views

Lorenzo Mur-Labadia, Maria Santos-Villafranca, Jesus Bermudez-cameo et al.

ICCV 2025arXiv:2506.06026
3
citations
#10299

Details Matter for Indoor Open-vocabulary 3D Instance Segmentation

Sanghun Jung, Jingjing Zheng, Ke Zhang et al.

ICCV 2025arXiv:2507.23134
3
citations
#10300

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

Tianyi Bai, Yuxuan Fan, Qiu Jiantao et al.

NEURIPS 2025arXiv:2506.07227
3
citations
#10301

Shallow Diffuse: Robust and Invisible Watermarking through Low-Dim Subspaces in Diffusion Models

Wenda Li, Huijie Zhang, Qing Qu

NEURIPS 2025spotlight
3
citations
#10302

Wide-Horizon Thinking and Simulation-Based Evaluation for Real-World LLM Planning with Multifaceted Constraints

Dongjie Yang, Chengqiang Lu, Qimeng Wang et al.

NEURIPS 2025spotlightarXiv:2506.12421
3
citations
#10303

EA-Vit: Efficient Adaptation for Elastic Vision Transformer

Chen Zhu, Wangbo Zhao, Huiwen Zhang et al.

ICCV 2025arXiv:2507.19360
3
citations
#10304

TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species Generation

Amin Karimi Monsefi, Mridul Khurana, Rajiv Ramnath et al.

ICCV 2025arXiv:2506.01923
3
citations
#10305

Tightening Robustness Verification of MaxPool-based Neural Networks via Minimizing the Over-Approximation Zone

Yuan Xiao, Yuchen Chen, Shiqing Ma et al.

CVPR 2025arXiv:2211.09810
3
citations
#10306

Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning

Zihua Zhao, Feng Hong, Mengxi Chen et al.

ICCV 2025arXiv:2507.12998
3
citations
#10307

Improved Regret Bounds for Gaussian Process Upper Confidence Bound in Bayesian Optimization

Shogo Iwazaki

NEURIPS 2025oralarXiv:2506.01393
3
citations
#10308

When Lighting Deceives: Exposing Vision-Language Models' Illumination Vulnerability Through Illumination Transformation Attack

Hanqing Liu, Shouwei Ruan, Yao Huang et al.

ICCV 2025arXiv:2503.06903
3
citations
#10309

An Analytical Theory of Spectral Bias in the Learning Dynamics of Diffusion Models

Binxu Wang, Cengiz Pehlevan

NEURIPS 2025spotlightarXiv:2503.03206
3
citations
#10310

Olympus: A Universal Task Router for Computer Vision Tasks

Yuanze Lin, Yunsheng Li, Dongdong Chen et al.

CVPR 2025highlightarXiv:2412.09612
3
citations
#10311

FeedEdit: Text-Based Image Editing with Dynamic Feedback Regulation

Fengyi Fu, Lei Zhang, Mengqi Huang et al.

CVPR 2025
3
citations
#10312

Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion

Qing-Yuan Jiang, Longfei Huang, Yang Yang

NEURIPS 2025oralarXiv:2502.20120
3
citations
#10313

Self-Refining Language Model Anonymizers via Adversarial Distillation

Kyuyoung Kim, Hyunjun Jeon, Jinwoo Shin

NEURIPS 2025arXiv:2506.01420
3
citations
#10314

Boosting Adversarial Transferability via Residual Perturbation Attack

Jinjia Peng, Zeze Tao, Huibing Wang et al.

ICCV 2025arXiv:2508.05689
3
citations
#10315

Watermarking One for All: A Robust Watermarking Scheme Against Partial Image Theft

Gaozhi Liu, Silu Cao, Zhenxing Qian et al.

CVPR 2025
3
citations
#10316

PVChat: Personalized Video Chat with One-Shot Learning

YUFEI SHI, Weilong Yan, Gang Xu et al.

ICCV 2025arXiv:2503.17069
3
citations
#10317

Zero-shot RGB-D Point Cloud Registration with Pre-trained Large Vision Model

Haobo Jiang, Jin Xie, Jian Yang et al.

CVPR 2025
3
citations
#10318

Deep learning for continuous-time stochastic control with jumps

Patrick Cheridito, Jean-Loup Dupret, Donatien Hainaut

NEURIPS 2025arXiv:2505.15602
3
citations
#10319

RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing

Zhipeng Huang, Wangbo Yu, Xinhua Cheng et al.

CVPR 2025arXiv:2412.16778
3
citations
#10320

Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation

Nadav Z. Cohen, Oron Nir, Ariel Shamir

CVPR 2025arXiv:2412.19853
3
citations
#10321

Predict-Optimize-Distill: A Self-Improving Cycle for 4D Object Understanding

Mingxuan Wu, Huang Huang, Justin Kerr et al.

ICCV 2025arXiv:2504.17441
3
citations
#10322

Progressive Test Time Energy Adaptation for Medical Image Segmentation

Xiaoran Zhang, Byung-Woo Hong, Hyoungseob Park et al.

ICCV 2025highlightarXiv:2503.16616
3
citations
#10323

DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos

Zijia Lu, ASM Iftekhar, Gaurav Mittal et al.

CVPR 2025arXiv:2505.16376
3
citations
#10324

PolyGuard: Massive Multi-Domain Safety Policy-Grounded Guardrail Dataset

Mintong Kang, Zhaorun Chen, Chejian Xu et al.

NEURIPS 2025
3
citations
#10325

Who Reasons in the Large Language Models?

Jie Shao, Jianxin Wu

NEURIPS 2025arXiv:2505.20993
3
citations
#10326

FlowSeek: Optical Flow Made Easier with Depth Foundation Models and Motion Bases

Matteo Poggi, Fabio Tosi

ICCV 2025arXiv:2509.05297
3
citations
#10327

OW-OVD: Unified Open World and Open Vocabulary Object Detection

Xing Xi, Yangyang Huang, Ronghua Luo et al.

CVPR 2025
3
citations
#10328

HyperMARL: Adaptive Hypernetworks for Multi-Agent RL

Kale-ab Tessera, Muhammad Arrasy Rahman, Amos Storkey et al.

NEURIPS 2025arXiv:2412.04233
3
citations
#10329

ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking

Xiaokun Feng, Shiyu Hu, Xuchen Li et al.

ICCV 2025highlightarXiv:2507.19875
3
citations
#10330

AdaLRS: Loss-Guided Adaptive Learning Rate Search for Efficient Foundation Model Pretraining

Hongyuan Dong, Dingkang Yang, Xiao Liang et al.

NEURIPS 2025arXiv:2506.13274
3
citations
#10331

Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention

Arya Honarpisheh, Mustafa Bozdag, Octavia Camps et al.

NEURIPS 2025arXiv:2502.01473
3
citations
#10332

Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features

Chancharik Mitra, Brandon Huang, Tianning Chai et al.

ICCV 2025arXiv:2412.00142
3
citations
#10333

Conformal Prediction Beyond the Seen: A Missing Mass Perspective for Uncertainty Quantification in Generative Models

Sima Noorani, Shayan Kiyani, George J. Pappas et al.

NEURIPS 2025arXiv:2506.05497
3
citations
#10334

PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View

Longliang Liu, Miaojie Feng, Junda Cheng et al.

ICCV 2025highlightarXiv:2506.23897
3
citations
#10335

InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion

Yuanyi Wang, Zhaoyi Yan, Yiming Zhang et al.

NEURIPS 2025arXiv:2505.13893
3
citations
#10336

BRACE: A Benchmark for Robust Audio Caption Quality Evaluation

Tianyu Guo, Hongyu Chen, Hao Liang et al.

NEURIPS 2025arXiv:2512.10403
3
citations
#10337

Large-scale Pre-training for Grounded Video Caption Generation

Evangelos Kazakos, Cordelia Schmid, Josef Sivic

ICCV 2025arXiv:2503.10781
3
citations
#10338

Shading Meets Motion: Self-supervised Indoor 3D Reconstruction Via Simultaneous Shape-from-Shading and Structure-from-Motion

Guoyu Lu

CVPR 2025
3
citations
#10339

Dynamic Reconstruction of Hand-Object Interaction with Distributed Force-aware Contact Representation

Zhenjun Yu, Wenqiang Xu, Pengfei Xie et al.

ICCV 2025arXiv:2411.09572
3
citations
#10340

ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding

Qihang Peng, Henry Zheng, Gao Huang

CVPR 2025arXiv:2502.19247
3
citations
#10341

AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive Decoding

Chaeyoung Jung, Youngjoon Jang, Joon Son Chung

NEURIPS 2025arXiv:2505.20862
3
citations
#10342

Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator

Peiwen Yuan, Yiwei Li, Shaoxiong Feng et al.

NEURIPS 2025arXiv:2505.20738
3
citations
#10343

Benford’s Curse: Tracing Digit Bias to Numerical Hallucination in LLMs

Jiandong Shao, Yao Lu, Jianfei Yang

NEURIPS 2025arXiv:2506.01734
3
citations
#10344

Towards Understanding How Knowledge Evolves in Large Vision-Language Models

Sudong Wang, Yunjian Zhang, Yao Zhu et al.

CVPR 2025arXiv:2504.02862
3
citations
#10345

ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges

Jiaxin Ai, Pengfei Zhou, xu Pan et al.

ICCV 2025arXiv:2503.06553
3
citations
#10346

Open-Insect: Benchmarking Open-Set Recognition of Novel Species in Biodiversity Monitoring

Yuyan Chen, Nico Lang, B. Schmidt et al.

NEURIPS 2025spotlightarXiv:2503.01691
3
citations
#10347

SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

Yinhan He, Wendy Zheng, Yaochen Zhu et al.

NEURIPS 2025arXiv:2510.24940
3
citations
#10348

RePO: Understanding Preference Learning Through ReLU-Based Optimization

Junkang Wu, Kexin Huang, xue wang et al.

NEURIPS 2025arXiv:2503.07426
3
citations
#10349

Resona: Improving Context Copying in Linear Recurrence Models with Retrieval

Xinyu Wang, Linrui Ma, Jerry Huang et al.

COLM 2025paperarXiv:2503.22913
3
citations
#10350

POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization

Batuhan K. Karaman, ishmam zabir, Alon Benhaim et al.

ICML 2025arXiv:2410.12999
3
citations
#10351

An End-to-End Model for Logits-Based Large Language Models Watermarking

KA HIM WONG, Jicheng Zhou, Jiantao Zhou et al.

ICML 2025arXiv:2505.02344
3
citations
#10352

Stuffed Mamba: Oversized States Lead to the Inability to Forget

Yingfa Chen, Xinrong Zhang, Shengding Hu et al.

COLM 2025paper
3
citations
#10353

Targeted Unlearning with Single Layer Unlearning Gradient

Zikui Cai, Yaoteng Tan, M. Salman Asif

ICML 2025arXiv:2407.11867
3
citations
#10354

ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback

Taewon Yun, Jihwan Oh, Hyangsuk Min et al.

COLM 2025paperarXiv:2503.21332
3
citations
#10355

G-Adaptivity: optimised graph-based mesh relocation for finite element methods

James Rowbottom, Georg Maierhofer, Teo Deveney et al.

ICML 2025spotlightarXiv:2407.04516
3
citations
#10356

AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs

Feiyang Kang, Yifan Sun, Bingbing Wen et al.

COLM 2025paperarXiv:2407.20177
3
citations
#10357

Improving Fisher Information Estimation and Efficiency for LoRA-based LLM Unlearning

Yejin Kim, Eunwon Kim, Buru Chang et al.

COLM 2025paperarXiv:2508.21300
3
citations
#10358

Complex Wavelet Mutual Information Loss: A Multi-Scale Loss Function for Semantic Segmentation

Renhao Lu

ICML 2025arXiv:2502.00563
3
citations
#10359

Distributed Event-Based Learning via ADMM

Guner Dilsad ER, Sebastian Trimpe, Michael Muehlebach

ICML 2025arXiv:2405.10618
3
citations
#10360

On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions

Dang Nguyen, Chenhao Tan

COLM 2025paperarXiv:2504.06303
3
citations
#10361

Rethink the Role of Deep Learning towards Large-scale Quantum Systems

Yusheng Zhao, Chi Zhang, Yuxuan Du

ICML 2025arXiv:2505.13852
3
citations
#10362

A Variational Information Theoretic Approach to Out-of-Distribution Detection

Sudeepta Mondal, Zhuolin Jiang, Ganesh Sundaramoorthi

ICML 2025arXiv:2506.14194
3
citations
#10363

Random Feature Representation Boosting

Nikita Zozoulenko, Thomas Cass, Lukas Gonon

ICML 2025arXiv:2501.18283
3
citations
#10364

Decision Making under the Exponential Family: Distributionally Robust Optimisation with Bayesian Ambiguity Sets

Charita Dellaporta, Patrick O'Hara, Theodoros Damoulas

ICML 2025spotlightarXiv:2411.16829
3
citations
#10365

ParaPO: Aligning Language Models to Reduce Verbatim Reproduction of Pre-training Data

Tong Chen, Faeze Brahman, Jiacheng Liu et al.

COLM 2025paperarXiv:2504.14452
3
citations
#10366

Energy-Based Reward Models for Robust Language Model Alignment

Anamika Lochab, Ruqi Zhang

COLM 2025paperarXiv:2504.13134
3
citations
#10367

Compositional Flows for 3D Molecule and Synthesis Pathway Co-design

Tony Shen, Seonghwan Seo, Ross Irwin et al.

ICML 2025arXiv:2504.08051
3
citations
#10368

Accelerating Quantum Reinforcement Learning with a Quantum Natural Policy Gradient Based Approach

Yang Xu, Vaneet Aggarwal

ICML 2025arXiv:2501.16243
3
citations
#10369

Beyond the Reported Cutoff: Where Large Language Models Fall Short on Financial Knowledge

Agam Shah, Liqin Ye, Sebastian Jaskowski et al.

COLM 2025paperarXiv:2504.00042
3
citations
#10370

Lightweight Online Adaption for Time Series Foundation Model Forecasts

Thomas Lee, William Toner, Rajkarn Singh et al.

ICML 2025arXiv:2502.12920
3
citations
#10371

De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks

Wei Fan, Kejiang Chen, Chang Liu et al.

ICML 2025arXiv:2507.02606
3
citations
#10372

Enhancing Graph Contrastive Learning for Protein Graphs from Perspective of Invariance

YUSONG WANG, Shiyin Tan, Jialun Shen et al.

ICML 2025
3
citations
#10373

Kandinsky Conformal Prediction: Beyond Class- and Covariate-Conditional Coverage

Konstantina Bairaktari, Jiayun Wu, Steven Wu

ICML 2025arXiv:2502.17264
3
citations
#10374

Learning Cascade Ranking as One Network

Yunli Wang, ZhenZhang, Zhiqiang Wang et al.

ICML 2025arXiv:2503.09492
3
citations
#10375

Maximum Total Correlation Reinforcement Learning

Bang You, Puze Liu, Huaping Liu et al.

ICML 2025arXiv:2505.16734
3
citations
#10376

It's My Data Too: Private ML for Datasets with Multi-User Training Examples

Arun Ganesh, Ryan McKenna, Hugh B McMahan et al.

ICML 2025arXiv:2503.03622
3
citations
#10377

MuSeD: A Multimodal Spanish Dataset for Sexism Detection in Social Media Videos

Laura De Grazia, Pol Pastells, Mauro Vázquez Chas et al.

COLM 2025paperarXiv:2504.11169
3
citations
#10378

Post-training for Efficient Communication via Convention Formation

Yilun Hua, Evan Wang, Yoav Artzi

COLM 2025paperarXiv:2508.06482
3
citations
#10379

Gandalf the Red: Adaptive Security for LLMs

Niklas Pfister, Václav Volhejn, Manuel Knott et al.

ICML 2025arXiv:2501.07927
3
citations
#10380

Safety Alignment Can Be Not Superficial With Explicit Safety Signals

Jianwei Li, Jung-Eun Kim

ICML 2025arXiv:2505.17072
3
citations
#10381

Learning In-context $n$-grams with Transformers: Sub-$n$-grams Are Near-Stationary Points

Aditya Vardhan Varre, Gizem Yüce, Nicolas Flammarion

ICML 2025
3
citations
#10382

Non-Asymptotic Length Generalization

Thomas Chen, Tengyu Ma, Zhiyuan Li

ICML 2025arXiv:2506.03085
3
citations
#10383

Navigating Conflicting Views: Harnessing Trust for Learning

Jueqing Lu, Wray Buntine, Yuanyuan Qi et al.

ICML 2025arXiv:2406.00958
3
citations
#10384

AdaptMI: Adaptive Skill-based In-context Math Instructions for Small Language Models

Yinghui He, Abhishek Panigrahi, Yong Lin et al.

COLM 2025paperarXiv:2505.00147
3
citations
#10385

CUPID: Evaluating Personalized and Contextualized Alignment of LLMs from Interactions

Tae Soo Kim, Yoonjoo Lee, Yoonah Park et al.

COLM 2025paperarXiv:2508.01674
3
citations
#10386

Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers

Wooseok Seo, Seungju Han, Jaehun Jung et al.

COLM 2025paperarXiv:2506.13342
3
citations
#10387

From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection

Lincan Cai, Jingxuan Kang, Shuang Li et al.

ICML 2025arXiv:2505.13233
3
citations
#10388

Intersectional Fairness in Reinforcement Learning with Large State and Constraint Spaces

ERIC EATON, Marcel Hussing, Michael Kearns et al.

ICML 2025arXiv:2502.11828
3
citations
#10389

O-MAPL: Offline Multi-agent Preference Learning

The Viet Bui, Tien Mai, Thanh Nguyen

ICML 2025arXiv:2501.18944
3
citations
#10390

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Ziqiao Ma, Jing Ding, Xuejun Zhang et al.

COLM 2025paperarXiv:2504.16060
3
citations
#10391

Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models

Zhaochen Wang, Bryan Hooi, Yiwei Wang et al.

COLM 2025paperarXiv:2504.01589
3
citations
#10392

BCE vs. CE in Deep Feature Learning

Qiufu Li, Huibin Xiao, Linlin Shen

ICML 2025arXiv:2505.05813
3
citations
#10393

Gradient-based Explanations for Deep Learning Survival Models

Sophie Hanna Langbein, Niklas Koenen, Marvin N. Wright

ICML 2025oralarXiv:2502.04970
3
citations
#10394

Probing then Editing Response Personality of Large Language Models

Tianjie Ju, Zhenyu Shao, Bowen Wang et al.

COLM 2025paperarXiv:2504.10227
3
citations
#10395

Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory

Liangyu Wang, Jie Ren, Hang Xu et al.

COLM 2025paperarXiv:2503.12668
3
citations
#10396

IterKey: Iterative Keyword Generation with LLMs for Enhanced Retrieval Augmented Generation

Kazuki Hayashi, Hidetaka Kamigaito, Shinya Kouda et al.

COLM 2025paperarXiv:2505.08450
3
citations
#10397

EvoMesh: Adaptive Physical Simulation with Hierarchical Graph Evolutions

Huayu Deng, Xiangming Zhu, Yunbo Wang et al.

ICML 2025arXiv:2410.03779
3
citations
#10398

Pareto-Optimality, Smoothness, and Stochasticity in Learning-Augmented One-Max-Search

Ziyad Benomar, Lorenzo Croissant, Vianney Perchet et al.

ICML 2025arXiv:2502.05720
3
citations
#10399

DocVXQA: Context-Aware Visual Explanations for Document Question Answering

Mohamed Ali Souibgui, Changkyu Choi, Andrey Barsky et al.

ICML 2025arXiv:2505.07496
3
citations
#10400

MixMin: Finding Data Mixtures via Convex Minimization

Anvith Thudi, Evianne Rovers, Yangjun Ruan et al.

ICML 2025arXiv:2502.10510
3
citations