Most Cited 2025 "top-p sampling" Papers

22,274 papers found • Page 72 of 112

#14201

OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving

Mingqian Ji, Jian Yang, Shanshan Zhang

ICCV 2025 arXiv:2506.23565
1
citations
#14202

DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection

Chiara Cappellino, Gianluca Mancusi, Matteo Mosconi et al.

NEURIPS 2025 arXiv:2503.09271
1
citations
#14203

Seeing 3D Through 2D Lenses: 3D Few-Shot Class-Incremental Learning via Cross-Modal Geometric Rectification

Tuo Xiang, Xuemiao Xu, Bangzhen Liu et al.

ICCV 2025 arXiv:2509.14958
1
citations
#14204

Enhancing Zero-shot Object Counting via Text-guided Local Ranking and Number-evoked Global Attention

Shiwei Zhang, Qi Zhou, Wei Ke

ICCV 2025
1
citations
#14205

Systems with Switching Causal Relations: A Meta-Causal Perspective

Moritz Willig, Tim Tobiasch, Florian Busch et al.

ICLR 2025 arXiv:2410.13054
1
citations
#14206

ConsNoTrainLoRA: Data-driven Weight Initialization of Low-rank Adapters using Constraints

Debasmit Das, Hyoungwoo Park, Munawar Hayat et al.

ICCV 2025 arXiv:2507.08044
1
citations
#14207

SINGER: Stochastic Network Graph Evolving Operator for High Dimensional PDEs

Mingquan Feng, Yixin Huang, Weixin Liao et al.

ICLR 2025
1
citations
#14208

Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs

Mohammad Shahab Sepehri, Berk Tinaz, Zalan Fabian et al.

NEURIPS 2025 arXiv:2507.11932
1
citations
#14209

RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors

Sicong Du, Jiarun Liu, Qifeng Chen et al.

ICCV 2025 arXiv:2506.22800
1
citations
#14210

Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space

Yingping Liang, Yutao Hu, Wenqi Shao et al.

ICCV 2025 arXiv:2507.00392
1
citations
#14211

Pose-Guided Temporal Enhancement for Robust Low-Resolution Hand Reconstruction

Kaixin Fan, Pengfei Ren, Jingyu Wang et al.

CVPR 2025
1
citations
#14212

Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics

Indrashis Das, Mahmoud Safari, Steven Adriaensen et al.

NEURIPS 2025 arXiv:2502.03654
1
citations
#14213

Scene Coordinate Reconstruction Priors

Wenjing Bian, Axel Barroso-Laguna, Tommaso Cavallari et al.

ICCV 2025 arXiv:2510.12387
1
citations
#14214

Revisiting Large-Scale Non-convex Distributionally Robust Optimization

Qi Zhang, Yi Zhou, Simon Khan et al.

ICLR 2025
1
citations
#14215

Controllable and Expressive One-Shot Video Head Swapping

Chaonan Ji, Jinwei Qi, Peng Zhang et al.

ICCV 2025 arXiv:2506.16852
1
citations
#14216

Exponential Dynamic Energy Network for High Capacity Sequence Memory

Arjun Karuvally, Pichsinee Lertsaroj, Terrence Sejnowski et al.

NEURIPS 2025 oral arXiv:2510.24965
1
citations
#14217

Attribute-Missing Multi-view Graph Clustering

Bowen Zhao, Qianqian Wang, Zhengming Ding et al.

CVPR 2025
1
citations
#14218

Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility

Yidi Li, Jun Xiao, Zhengda Lu et al.

CVPR 2025 highlight arXiv:2505.21377
1
citations
#14219

VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image

Sicheng Xu, Guojun Chen, Jiaolong Yang et al.

NEURIPS 2025 arXiv:2512.14677
1
citations
#14220

MANGO: Multimodal Attention-based Normalizing Flow Approach to Fusion Learning

Thanh-Dat Truong, Christophe Bobda, Nitin Agarwal et al.

NEURIPS 2025 arXiv:2508.10133
1
citations
#14221

Three Mechanisms of Feature Learning in a Linear Network

Yizhou Xu, Liu Ziyin

ICLR 2025 arXiv:2401.07085
1
citations
#14222

ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning

Quanxing Zha, Xin Liu, Shu-Juan Peng et al.

CVPR 2025 arXiv:2502.19962
1
citations
#14223

Latent Expression Generation for Referring Image Segmentation and Grounding

Seonghoon Yu, Junbeom Hong, Joonseok Lee et al.

ICCV 2025 arXiv:2508.05123
1
citations
#14224

Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection

Zihao Zhang, Aming Wu, Yahong Han

CVPR 2025 highlight arXiv:2503.09968
1
citations
#14225

SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation

Jiayuan Zhu, Junde Wu, Cheng Ouyang et al.

ICCV 2025 arXiv:2411.15513
1
citations
#14226

Uncertainty Estimation on Graphs with Structure Informed Stochastic Partial Differential Equations

Fred Xu, Thomas Markovich

NEURIPS 2025 oral arXiv:2506.06907
1
citations
#14227

SVFR: A Unified Framework for Generalized Video Face Restoration

Zhiyao Wang, Xu Chen, Chengming Xu et al.

CVPR 2025 arXiv:2501.01235
1
citations
#14228

Cross-fluctuation phase transitions reveal sampling dynamics in diffusion models

Sai Niranjan Ramachandran, Manish Krishan Lal, Suvrit Sra

NEURIPS 2025 arXiv:2511.00124
1
citations
#14229

DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover

Youzhuo Wang, Jiayi Ye, Chuyang Xiao et al.

ICCV 2025 arXiv:2506.23152
1
citations
#14230

Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models

Jisung Hwang, Jaihoon Kim, Minhyuk Sung

NEURIPS 2025 arXiv:2509.07027
1
citations
#14231

FRET: Feature Redundancy Elimination for Test Time Adaptation

Linjing You, Jiabao Lu, Xiayuan Huang et al.

ICCV 2025 arXiv:2505.10641
1
citations
#14232

Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping

Pu Yang, Yunzhen Feng, Ziyuan Chen et al.

NEURIPS 2025 spotlight arXiv:2501.18962
1
citations
#14233

Continuous Subspace Optimization for Continual Learning

Quan Cheng, Yuanyu Wan, Lingyu Wu et al.

NEURIPS 2025 arXiv:2505.11816
1
citations
#14234

TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning

Seungmin Baek, Soyul Lee, Hayeon Jo et al.

CVPR 2025 arXiv:2501.04293
1
citations
#14235

When Can Model-Free Reinforcement Learning be Enough for Thinking?

Josiah Hanna, Nicholas Corrado

NEURIPS 2025 arXiv:2506.17124
1
citations
#14236

TurboVSR: Fantastic Video Upscalers and Where to Find Them

Zhongdao Wang, Guodongfang Zhao, Jingjing Ren et al.

ICCV 2025 highlight arXiv:2506.23618
1
citations
#14237

DDB: Diffusion Driven Balancing to Address Spurious Correlations

Aryan Yazdan Parast, Basim Azam, Naveed Akhtar

ICCV 2025 arXiv:2503.17226
1
citations
#14238

SpecGuard: Spectral Projection-based Advanced Invisible Watermarking

Inzamamul Alam, Md Islam, Simon Woo et al.

ICCV 2025 arXiv:2510.07302
1
citations
#14239

Towards Scalable Topological Regularizers

Wong Hiu-Tung, Darrick Lee, Hong Yan

ICLR 2025 arXiv:2501.14641
1
citations
#14240

Recognizing Actions from Robotic View for Natural Human-Robot Interaction

Ziyi Wang, Peiming Li, Hong Liu et al.

ICCV 2025 arXiv:2507.22522
1
citations
#14241

Inductive Domain Transfer In Misspecified Simulation-Based Inference

Ortal Senouf, Antoine Wehenkel, Cédric Vincent-Cuaz et al.

NEURIPS 2025 arXiv:2508.15593
1
citations
#14242

Attention (as Discrete-Time Markov) Chains

Yotam Erel, Olaf Dünkel, Rishabh Dabral et al.

NEURIPS 2025 arXiv:2507.17657
1
citations
#14243

IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos

Yuan Li, Ziqian Bai, Feitong Tan et al.

CVPR 2025
1
citations
#14244

TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation

Changsong Lei, Yaqian Liang, Shaofeng Wang et al.

ICCV 2025 arXiv:2507.04685
1
citations
#14245

Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws

Lin Guo, Xiaoqing Luo, Wei Xie et al.

NEURIPS 2025 spotlight arXiv:2510.26268
1
citations
#14246

Multi-modal Contrastive Learning with Negative Sampling Calibration for Phenotypic Drug Discovery

Jiahua Rao, Hanjing Lin, Leyu Chen et al.

CVPR 2025
1
citations
#14247

Proximal Mapping Loss: Understanding Loss Functions in Crowd Counting & Localization

Wei Lin, Jia Wan, Antoni Chan

ICLR 2025
1
citations
#14248

Mind the Cost of Scaffold! Benign Clients May Even Become Accomplices of Backdoor Attack

Xingshuo Han, Xuanye Zhang, Xiang Lan et al.

ICCV 2025 arXiv:2411.16167
1
citations
#14249

Leader360V: A Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment

Weiming Zhang, Dingwen Xiao, Aobotao Dai et al.

NEURIPS 2025 arXiv:2506.14271
1
citations
#14250

Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models

Samuel Lavoie, Michael Noukhovitch, Aaron Courville

NEURIPS 2025 arXiv:2507.12318
1
citations
#14251

MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction

Xiaohao Xu, Feng Xue, Shibo Zhao et al.

CVPR 2025 arXiv:2412.09723
1
citations
#14252

Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge

Linshen Liu, Boyan Su, Junyue Jiang et al.

ICCV 2025 arXiv:2507.04123
1
citations
#14253

Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory

Alexander Levine, Peter Stone, Amy Zhang

ICLR 2025 arXiv:2410.03016
1
citations
#14254

REP: Resource-Efficient Prompting for Rehearsal-Free Continual Learning

Sungho Jeon, Xinyue Ma, Kwang In Kim et al.

NEURIPS 2025 arXiv:2406.04772
1
citations
#14255

MIR-Bench: Can Your LLM Recognize Complicated Patterns via Many-Shot In-Context Reasoning?

Kai Yan, Zhan Ling, Kang Liu et al.

NEURIPS 2025 arXiv:2502.09933
1
citations
#14256

Reinforcement Learning-Guided Data Selection via Redundancy Assessment

Suorong Yang, Peijia Li, Furao Shen et al.

ICCV 2025 arXiv:2506.21037
1
citations
#14257

Time-Masked Transformers with Lightweight Test-Time Adaptation for Neural Speech Decoding

Ebrahim Feghhi, Shreyas Kaasyap, Nima Hadidi et al.

NEURIPS 2025 arXiv:2507.02800
1
citations
#14258

Feedback Schrödinger Bridge Matching

Panagiotis Theodoropoulos, Nikolaos Komianos, Vincent Pacelli et al.

ICLR 2025 arXiv:2410.14055
1
citations
#14259

Additive Models Explained: A Computational Complexity Approach

Shahaf Bassan, Michal Moshkovitz, Guy Katz

NEURIPS 2025 arXiv:2510.21292
1
citations
#14260

Activation Subspaces for Out-of-Distribution Detection

Barış Zöngür, Robin Hesse, Stefan Roth

ICCV 2025 arXiv:2508.21695
1
citations
#14261

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Zedong Wang, Siyuan Li, Dan Xu

ICCV 2025 highlight arXiv:2507.21049
1
citations
#14262

Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions

Marc Brooks, Gabriel Durham, Kihyuk Hong et al.

NEURIPS 2025 arXiv:2505.16311
1
citations
#14263

Sound Logical Explanations for Mean Aggregation Graph Neural Networks

Matthew Morris, Ian Horrocks

NEURIPS 2025 arXiv:2511.11593
1
citations
#14264

AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering

Jing Wang, Songhe Feng, Kristoffer Knutsen Wickstrøm et al.

CVPR 2025
1
citations
#14265

Confounding Robust Deep Reinforcement Learning: A Causal Approach

Mingxuan Li, Junzhe Zhang, Elias Bareinboim

NEURIPS 2025 oral arXiv:2510.21110
1
citations
#14266

DynPose: Largely Improving the Efficiency of Human Pose Estimation by a Simple Dynamic Framework

Yalong Xu, Lin Zhao, Chen Gong et al.

CVPR 2025
1
citations
#14267

Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning

Zhengxuan Wei, Jiajin Tang, Sibei Yang

ICCV 2025 arXiv:2510.19622
1
citations
#14268

PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation

Xinting Hu, Haoran Wang, Jan Lenssen et al.

CVPR 2025
1
citations
#14269

Decomposing stimulus-specific sensory neural information via diffusion models

Steeve Laquitaine, Simone Azeglio, Carlo Paris et al.

NEURIPS 2025 spotlight arXiv:2505.11309
1
citations
#14270

TRENDy: Temporal Regression of Effective Nonlinear Dynamics

Matthew Ricci, Guy Pelc, Zoe Piran et al.

ICLR 2025 oral arXiv:2412.03496
1
citations
#14271

ARIA: Training Language Agents with Intention-driven Reward Aggregation

Ruihan Yang, Yikai Zhang, Aili Chen et al.

NEURIPS 2025 spotlight arXiv:2506.00539
1
citations
#14272

MODEM: A Morton-Order Degradation Estimation Mechanism for Adverse Weather Image Recovery

Hainuo Wang, Qiming Hu, Xiaojie Guo

NEURIPS 2025 arXiv:2505.17581
1
citations
#14273

Efficient Causal Decision Making with One-sided Feedback

Jianing Chu, Shu Yang, Wenbin Lu et al.

ICLR 2025
1
citations
#14274

Nonlinearly Preconditioned Gradient Methods: Momentum and Stochastic Analysis

Konstantinos Oikonomidis, Jan Quan, Panagiotis Patrinos

NEURIPS 2025 arXiv:2510.11312
1
citations
#14275

PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching

Hengzhe Jin, Lang Nie, Chunyu Lin et al.

ICCV 2025
1
citations
#14276

SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM

Yannick Burkhardt, Simon Schaefer, Stefan Leutenegger

ICCV 2025 highlight arXiv:2504.00139
1
citations
#14277

LTD-Bench: Evaluating Large Language Models by Letting Them Draw

Liuhao Lin, Ke Li, Zihan Xu et al.

NEURIPS 2025 arXiv:2511.02347
1
citations
#14278

Energy-based generator matching: A neural sampler for general state space

Dongyeop Woo, Minsu Kim, Minkyu Kim et al.

NEURIPS 2025 arXiv:2505.19646
1
citations
#14279

Mamba-Adaptor: State Space Model Adaptor for Visual Recognition

Fei Xie, Jiahao Nie, Yujin Tang et al.

CVPR 2025 arXiv:2505.12685
1
citations
#14280

$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

Rahul Vaze, Abhishek Sinha

NEURIPS 2025 arXiv:2502.05019
1
citations
#14281

The Logical Expressiveness of Temporal GNNs via Two-Dimensional Product Logics

Marco Sälzer, Przemyslaw Walega, Martin Lange

NEURIPS 2025 oral arXiv:2505.11930
1
citations
#14282

Serialization based Point Cloud Oversegmentation

Chenghui Lu, Dilong Li, Jianlong Kwan et al.

ICCV 2025
1
citations
#14283

PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction

Manahil Raza, Ayesha Azam, Talha Qaiser et al.

ICCV 2025 arXiv:2509.20022
1
citations
#14284

ClaraVid: A Holistic Scene Reconstruction Benchmark From Aerial Perspective With Delentropy-Based Complexity Profiling

Radu Beche, Sergiu Nedevschi

ICCV 2025 arXiv:2503.17856
1
citations
#14285

Web-Scale Collection of Video Data for 4D Animal Reconstruction

Brian Nlong Zhao, Jiajun Wu, Shangzhe Wu

NEURIPS 2025 arXiv:2511.01169
1
citations
#14286

RNNs perform task computations by dynamically warping neural representations

Arthur Pellegrino, Angus Chadwick

NEURIPS 2025 arXiv:2512.04310
1
citations
#14287

Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection

Qi Chen, Hu Ding

CVPR 2025
1
citations
#14288

Linear Mode Connectivity in Differentiable Tree Ensembles

Ryuichi Kanoh, Mahito Sugiyama

ICLR 2025 arXiv:2405.14596
1
citations
#14289

Discrete Latent Plans via Semantic Skill Abstractions

Haobin Jiang, Wang, Zongqing Lu

ICLR 2025
1
citations
#14290

Few-shot Implicit Function Generation via Equivariance

Suizhi Huang, Xingyi Yang, Hongtao Lu et al.

CVPR 2025 highlight arXiv:2501.01601
1
citations
#14291

Adaptive Energy Alignment for Accelerating Test-Time Adaptation

Wonjeong Choi, Do-Yeon Kim, Jungwuk Park et al.

ICLR 2025
1
citations
#14292

Seemingly Redundant Modules Enhance Robust Odor Learning in Fruit Flies

HaiYang Li, Liao Yu, Qiang Yu et al.

NEURIPS 2025 arXiv:2510.21315
1
citations
#14293

Discontinuity-aware Normal Integration for Generic Central Camera Models

Francesco Milano, Manuel Lopez-Antequera, Naina Dhingra et al.

ICCV 2025 highlight arXiv:2507.06075
1
citations
#14294

Preserve Anything: Controllable Image Synthesis with Object Preservation

Prasen Kumar Sharma, Neeraj Matiyali, Siddharth Srivastava et al.

ICCV 2025 arXiv:2506.22531
1
citations
#14295

ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains

Guillaume Vray, Devavrat Tomar, Xufeng Gao et al.

NEURIPS 2025 arXiv:2505.14511
1
citations
#14296

Black Hole-Driven Identity Absorbing in Diffusion Models

Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung

CVPR 2025
1
citations
#14297

MCOP: Multi-UAV Collaborative Occupancy Prediction

Zefu Lin, Wenbo Chen, Xiaojuan Jin et al.

ICCV 2025 arXiv:2510.12679
1
citations
#14298

IGD: Instructional Graphic Design with Multimodal Layer Generation

Yadong Qu, Shancheng Fang, Yuxin Wang et al.

ICCV 2025 arXiv:2507.09910
1
citations
#14299

SL2A-INR: Single-Layer Learnable Activation for Implicit Neural Representation

Reza Rezaeian, Moein Heidari, Reza Azad et al.

ICCV 2025
1
citations
#14300

Is This Tracker On? A Benchmark Protocol for Dynamic Tracking

Ilona Demler, Saumya Chauhan, Georgia Gkioxari

NEURIPS 2025 arXiv:2510.19819
1
citations
#14301

Practical do-Shapley Explanations with Estimand-Agnostic Causal Inference

Álvaro Parafita, Tomas Garriga, Axel Brando et al.

NEURIPS 2025 spotlight arXiv:2509.20211
1
citations
#14302

3DIS: Depth-Driven Decoupled Image Synthesis for Universal Multi-Instance Generation

Dewei Zhou, Ji Xie, Zongxin Yang et al.

ICLR 2025
1
citations
#14303

Link to the Past: Temporal Propagation for Fast 3D Human Reconstruction from Monocular Video

Marchellus Matthew, Nadhira Noor, In Kyu Park

CVPR 2025 arXiv:2505.07333
1
citations
#14304

HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion

Lin Wu, Zhixiang Chen, Jianglin Lan

NEURIPS 2025 arXiv:2507.01737
1
citations
#14305

No-Regret Online Autobidding Algorithms in First-price Auctions

Yilin Li, Yuan Deng, Wei Tang et al.

NEURIPS 2025 arXiv:2510.16869
1
citations
#14306

Sampling Innovation-Based Adaptive Compressive Sensing

Zhifu Tian, Tao Hu, Chaoyang Niu et al.

CVPR 2025 arXiv:2503.13241
1
citations
#14307

Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction

Li Fang, Hao Zhu, Longlong Chen et al.

CVPR 2025 arXiv:2505.19793
1
citations
#14308

R2Det: Exploring Relaxed Rotation Equivariance in 2D Object Detection

Zhiqiang Wu, Yingjie Liu, Hanlin Dong et al.

ICLR 2025 arXiv:2408.11760
1
citations
#14309

Balancing Conservatism and Aggressiveness: Prototype-Affinity Hybrid Network for Few-Shot Segmentation

Tianyu Zou, Shengwu Xiong, Ruilin Yao et al.

ICCV 2025 arXiv:2507.19140
1
citations
#14310

miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward

Azim Ospanov, Farzan Farnia, Roozbeh Yousefzadeh

NEURIPS 2025 arXiv:2511.03108
1
citations
#14311

Text-IRSTD: Leveraging Semantic Text to Promote Infrared Small Target Detection in Complex Scenes

Feng Huang, Shuyuan Zheng, Zhaobing Qiu et al.

ICCV 2025 arXiv:2503.07249
1
citations
#14312

Balanced Ranking with Relative Centrality: A multi-core periphery perspective

Chandra Sekhar Mukherjee, Jiapeng Zhang

ICLR 2025
1
citations
#14313

DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection

Yuval Haitman, Oded Bialer

ICCV 2025 arXiv:2508.12330
1
citations
#14314

Practical Solutions to the Relative Pose of Three Calibrated Cameras

Charalambos Tzamos, Viktor Kocur, Yaqing Ding et al.

CVPR 2025 arXiv:2303.16078
1
citations
#14315

Foveated Instance Segmentation

Hongyi Zeng, Wenxuan Liu, Tianhua Xia et al.

CVPR 2025 arXiv:2503.21854
1
citations
#14316

Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics

Josiah Kratz, Jacob Adamczyk

ICLR 2025 oral arXiv:2410.08439
1
citations
#14317

BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning

Shengao Wang, Arjun Chandra, Aoming Liu et al.

ICCV 2025 arXiv:2504.09426
1
citations
#14318

Modeling Dynamic Neural Activity by combining Naturalistic Video Stimuli and Stimulus-independent Latent Factors

Finn Schmidt, Polina Turishcheva, Suhas Shrinivasan et al.

NEURIPS 2025 oral arXiv:2410.16136
1
citations
#14319

Beyond Benign Overfitting in Nadaraya-Watson Interpolators

Daniel Barzilai, Guy Kornowski, Ohad Shamir

NEURIPS 2025 arXiv:2502.07480
1
citations
#14320

One Last Attention for Your Vision-Language Model

Liang Chen, Ghazi Shazan Ahmad, Tianjun Yao et al.

ICCV 2025 arXiv:2507.15480
1
citations
#14321

Why 1 + 1 < 1 in Visual Token Pruning: Beyond Naive Integration via Multi-Objective Balanced Covering

Yangfu Li, Hongjian Zhan, Tianyi Chen et al.

NEURIPS 2025 arXiv:2505.10118
1
citations
#14322

Token Bottleneck: One Token to Remember Dynamics

Taekyung Kim, Dongyoon Han, Byeongho Heo et al.

NEURIPS 2025 oral arXiv:2507.06543
1
citations
#14323

Explaining Domain Shifts in Language: Concept Erasing for Interpretable Image Classification

Zequn Zeng, Yudi Su, Jianqiao Sun et al.

CVPR 2025 arXiv:2503.18483
1
citations
#14324

Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation

Hyunsoo Kim, Donghyun Kim, Suhyun Kim

CVPR 2025 arXiv:2506.07750
1
citations
#14325

Hessian-guided Perturbed Wasserstein Gradient Flows for Escaping Saddle Points

Naoya Yamamoto, Juno Kim, Taiji Suzuki

NEURIPS 2025 arXiv:2509.16974
1
citations
#14326

AIM: Amending Inherent Interpretability via Self-Supervised Masking

Eyad Alshami, Shashank Agnihotri, Bernt Schiele et al.

ICCV 2025 highlight arXiv:2508.11502
1
citations
#14327

S⁴M: Boosting Semi-Supervised Instance Segmentation with SAM

Heeji Yoon, Heeseong Shin, Eunbeen Hong et al.

ICCV 2025
1
citations
#14328

CompleteMe: Reference-based Human Image Completion

Yu-Ju Tsai, Brian Price, Qing Liu et al.

ICCV 2025 arXiv:2504.20042
1
citations
#14329

Bubbleformer: Forecasting Boiling with Transformers

Sheikh Md Shakeel Hassan, Xianwei Zou, Akash Dhruv et al.

NEURIPS 2025 oral arXiv:2507.21244
1
citations
#14330

Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights

Junhao Zheng, Jiahao Sun, Chenhao Lin et al.

ICCV 2025 arXiv:2508.00649
1
citations
#14331

Beyond Blur: A Fluid Perspective on Generative Diffusion Models

Grzegorz Gruszczynski, Jakub Meixner, Michał Włodarczyk et al.

ICCV 2025 arXiv:2506.16827
1
citations
#14332

Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning

Claire Chen, Shuze Liu, Shangtong Zhang

ICLR 2025 arXiv:2410.05655
1
citations
#14333

DiffBreak: Is Diffusion-Based Purification Robust?

Andre Kassis, Urs Hengartner, Yaoliang Yu

NEURIPS 2025 arXiv:2411.16598
1
citations
#14334

GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion

Karlo Koledic, Luka Petrovic, Ivan Marković et al.

ICCV 2025 arXiv:2412.06080
1
citations
#14335

Risk-Sensitive Variational Actor-Critic: A Model-Based Approach

Alonso Granados, Mohammadreza Ebrahimi, Jason Pacheco

ICLR 2025
1
citations
#14336

CoSER: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation

Bonan Li, Zicheng Zhang, Xingyi Yang et al.

CVPR 2025 highlight
1
citations
#14337

Aligning Moments in Time using Video Queries

Yogesh Kumar, Uday Agarwal, Manish Gupta et al.

ICCV 2025 arXiv:2508.15439
1
citations
#14338

DIMO: Diverse 3D Motion Generation for Arbitrary Objects

Linzhan Mou, Jiahui Lei, Chen Wang et al.

ICCV 2025 highlight arXiv:2511.07409
1
citations
#14339

Sparse Gaussian Processes: Structured Approximations and Power-EP Revisited

Thang Bui, Michalis Titsias

NEURIPS 2025 arXiv:2507.02377
1
citations
#14340

Global-Aware Monocular Semantic Scene Completion with State Space Models

Shijie Li, Zhongyao Cheng, Rong Li et al.

ICCV 2025 arXiv:2503.06569
1
citations
#14341

Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT

Guy Bar-Shalom, Fabrizio Frasca, Yaniv Galron et al.

NEURIPS 2025 arXiv:2510.00296
1
citations
#14342

A Markov Decision Process for Variable Selection in Branch & Bound

Paul Strang, Zacharie Ales, Côme Bissuel et al.

NEURIPS 2025 arXiv:2510.19348
1
citations
#14343

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing

Shengdong Han, Shangdong Yang, Yuxuan Li et al.

ICCV 2025 arXiv:2505.19148
1
citations
#14344

Improving Personalized Search with Regularized Low-Rank Parameter Updates

Fiona Ryan, Josef Sivic, Fabian Caba Heilbron et al.

CVPR 2025 highlight arXiv:2506.10182
1
citations
#14345

COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts

Jiansheng Li, Xingxuan Zhang, Hao Zou et al.

CVPR 2025 highlight arXiv:2504.10158
1
citations
#14346

LC-Opt: Benchmarking Reinforcement Learning and Agentic AI for End-to-End Liquid Cooling Optimization in Data Centers

Avisek Naug, Antonio Guillen-Perez, Vineet Kumar et al.

NEURIPS 2025 arXiv:2511.00116
1
citations
#14347

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

Hongyu Sun, Qiuhong Ke, Ming Cheng et al.

CVPR 2025 arXiv:2503.12150
1
citations
#14348

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi

CVPR 2025 highlight arXiv:2411.16788
1
citations
#14349

Look-Ahead Reasoning on Learning Platforms

Haiqing Zhu, Tijana Zrnic, Celestine Mendler-Dünner

NEURIPS 2025 oral arXiv:2511.14745
1
citations
#14350

SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Jiahao Zhu, Zixuan Chen, Guangcong Wang et al.

ICCV 2025 arXiv:2507.05256
1
citations
#14351

Improving Progressive Generation with Decomposable Flow Matching

Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov et al.

NEURIPS 2025 arXiv:2506.19839
1
citations
#14352

Poly-Autoregressive Prediction for Modeling Interactions

Neerja Thakkar, Tara Sadjadpour, Jathushan Rajasegaran et al.

CVPR 2025 arXiv:2502.08646
1
citations
#14353

Comprehensive Assessment and Analysis for NSFW Content Erasure in Text-to-Image Diffusion models

Die Chen, Zhiwen Li, Cen Chen et al.

NEURIPS 2025 arXiv:2502.12527
1
citations
#14354

4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding

Wenxuan Zhu, Bing Li, Cheng Zheng et al.

ICCV 2025 arXiv:2503.17827
1
citations
#14355

Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection

Ting Li, Mao Ye, Tianwen Wu et al.

CVPR 2025
1
citations
#14356

Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes

Hossein Zakerinia, Christoph Lampert

NEURIPS 2025 arXiv:2505.15496
1
citations
#14357

Separating the 'what' and 'how' of compositional computation to enable reuse and continual learning

Haozhe Shan, Sun Minni, Lea Duncker

NEURIPS 2025 arXiv:2510.20709
1
citations
#14358

Structured Spectral Reasoning for Frequency-Adaptive Multimodal Recommendation

Wei Yang, Rui Zhong, Yiqun Chen et al.

NEURIPS 2025 arXiv:2512.01372
1
citations
#14359

Principles of Visual Tokens for Efficient Video Understanding

Xinyue Hao, Li, Shreyank Gowda et al.

ICCV 2025 arXiv:2411.13626
1
citations
#14360

Graph Neural Network Combining Event Stream and Periodic Aggregation for Low-Latency Event-based Vision

Manon Dampfhoffer, Thomas Mesquida, Damien Joubert et al.

CVPR 2025 highlight
1
citations
#14361

Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization

Dongkwan Lee, Kyomin Hwang, Nojun Kwak

CVPR 2025 arXiv:2503.13915
1
citations
#14362

Towards Provable Emergence of In-Context Reinforcement Learning

Jiuqi Wang, Rohan Chandra, Shangtong Zhang

NEURIPS 2025 oral arXiv:2509.18389
1
citations
#14363

MetricGrids: Arbitrary Nonlinear Approximation with Elementary Metric Grids based Implicit Neural Representation

Shu Wang, Yanbo Gao, Shuai Li et al.

CVPR 2025 highlight arXiv:2503.10000
1
citations
#14364

Revisiting Bi-Linear State Transitions in Recurrent Neural Networks

Reza Ebrahimi, Roland Memisevic

NEURIPS 2025 arXiv:2505.21749
1
citations
#14365

FORLA: Federated Object-centric Representation Learning with Slot Attention

Guiqiu Liao, Matjaz Jogan, Eric Eaton et al.

NEURIPS 2025 arXiv:2506.02964
1
citations
#14366

Unlearning the Noisy Correspondence Makes CLIP More Robust

Haochen Han, Alex Jinpeng Wang, Peijun Ye et al.

ICCV 2025 arXiv:2507.03434
1
citations
#14367

Subgraph Federated Learning via Spectral Methods

Javad Aliakbari, Johan Oestman, Ashkan Panahi et al.

NEURIPS 2025 arXiv:2510.25657
1
citations
#14368

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Yung-Hsuan Lai, Janek Ebbers, Yu-Chiang Frank Wang et al.

CVPR 2025 arXiv:2505.09615
1
citations
#14369

ArchPower: Dataset for Architecture-Level Power Modeling of Modern CPU Design

Qijun Zhang, Yao Lu, Mengming Li et al.

NEURIPS 2025 arXiv:2512.06854
1
citations
#14370

UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images

Jiamin Wu, Kenkun Liu, Xiaoke Jiang et al.

ICCV 2025 arXiv:2410.13195
1
citations
#14371

Attention IoU: Examining Biases in CelebA using Attention Maps

Aaron Serianni, Tyler Zhu, Olga Russakovsky et al.

CVPR 2025 arXiv:2503.19846
1
citations
#14372

Exploring Landscapes for Better Minima along Valleys

Tong Zhao, Jiacheng Li, Yuanchang Zhou et al.

NEURIPS 2025 arXiv:2510.27153
1
citations
#14373

Bridging Critical Gaps in Convergent Learning: How Representational Alignment Evolves Across Layers, Training, and Distribution Shifts

Chaitanya Kapoor, Sudhanshu Srivastava, Meenakshi Khosla

NEURIPS 2025 arXiv:2502.18710
1
citations
#14374

Online Segment Any 3D Thing as Instance Tracking

Hanshi Wang, Cai Zijian, Jin Gao et al.

NEURIPS 2025 oral arXiv:2512.07599
1
citations
#14375

Deterministic Certification of Graph Neural Networks against Graph Poisoning Attacks with Arbitrary Perturbations

Jiate Li, Meng Pang, Yun Dong et al.

CVPR 2025 arXiv:2503.18503
1
citations
#14376

Recursive Inference Scaling: A Winning Path to Scalable Inference in Language and Multimodal Systems

Ibrahim Alabdulmohsin, Xiaohua Zhai

NEURIPS 2025 arXiv:2502.07503
1
citations
#14377

Trans-Adapter: A Plug-and-Play Framework for Transparent Image Inpainting

Yuekun Dai, Haitian Li, Shangchen Zhou et al.

ICCV 2025 arXiv:2508.01098
1
citations
#14378

Variance-Based Pruning for Accelerating and Compressing Trained Networks

Uranik Berisha, Jens Mehnert, Alexandru Condurache

ICCV 2025 arXiv:2507.12988
1
citations
#14379

Eluder dimension: localise it!

Alireza Bakhtiari, Alex Ayoub, Samuel Robertson et al.

NEURIPS 2025 spotlight arXiv:2601.09825
1
citations
#14380

Risk Bounds For Distributional Regression

Carlos Misael Madrid Padilla, Oscar Hernan Madrid Padilla, Sabyasachi Chatterjee

NEURIPS 2025 arXiv:2505.09075
1
citations
#14381

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Yuchen Liu, Yaoming Wang, Bowen Shi et al.

ICCV 2025 arXiv:2507.20842
1
citations
#14382

ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning

Xiefan Guo, Miaomiao Cui, Liefeng Bo et al.

ICCV 2025 arXiv:2507.22604
1
citations
#14383

UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields

Fabian Perez, Sara Rojas Martinez, Carlos Hinojosa et al.

ICCV 2025 arXiv:2506.21884
1
citations
#14384

Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation

Jiaxin Cai, Jingze Su, Qi Li et al.

CVPR 2025
1
citations
#14385

MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query

Wei Chow, Yuan Gao, Linfeng Li et al.

NEURIPS 2025 oral arXiv:2506.03144
1
citations
#14386

On Rollouts in Model-Based Reinforcement Learning

Bernd Frauenknecht, Devdutt Subhasish, Friedrich Solowjow et al.

ICLR 2025 arXiv:2501.16918
1
citations
#14387

Universal Domain Adaptation for Semantic Segmentation

Seun-An Choe, Keon Hee Park, Jinwoo Choi et al.

CVPR 2025 arXiv:2505.22458
1
citations
#14388

When GNNs meet symmetry in ILPs: an orbit-based feature augmentation approach

Qian Chen, Lei Li, Qian Li et al.

ICLR 2025 arXiv:2501.14211
1
citations
#14389

Non-Markovian Discrete Diffusion with Causal Language Models

Yangtian Zhang, Sizhuang He, Daniel Levine et al.

NEURIPS 2025 oral arXiv:2502.09767
1
citations
#14390

Stochastically Dominant Peer Prediction

Yichi Zhang, Shengwei Xu, Grant Schoenebeck et al.

NEURIPS 2025 arXiv:2506.02259
1
citations
#14391

Stochastic variance-reduced Gaussian variational inference on the Bures-Wasserstein manifold

Hoang Phuc Hau Luu, Hanlin Yu, Bernardo Williams et al.

ICLR 2025 arXiv:2410.02490
1
citations
#14392

Training Large Language Models for Retrieval-Augmented Question Answering through Backtracking Correction

Huawen Feng, Zekun Yao, Junhao Zheng et al.

ICLR 2025
1
citations
#14393

Learning to Condition: A Neural Heuristic for Scalable MPE Inference

Brij Malhotra, Shivvrat Arya, Tahrima Rahman et al.

NEURIPS 2025 arXiv:2509.25217
1
citations
#14394

Training the Untrainable: Introducing Inductive Bias via Representational Alignment

Vighnesh Subramaniam, David Mayo, Colin Conwell et al.

NEURIPS 2025 arXiv:2410.20035
1
citations
#14395

NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks

Chenyi Zhang, Ting Liu, Xiaochao Qu et al.

CVPR 2025 highlight
1
citations
#14396

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Jiayi Guo, Chuanhao Yan, Xingqian Xu et al.

ICCV 2025 arXiv:2509.26231
1
citations
#14397

Private Hyperparameter Tuning with Ex-Post Guarantee

Badih Ghazi, Pritish Kamath, Alexander Knop et al.

NEURIPS 2025 spotlight arXiv:2508.15183
1
citations
#14398

One-Shot Knowledge Transfer for Scalable Person Re-Identification

Longhua Li, Lei Qi, Xin Geng

ICCV 2025 arXiv:2511.06016
1
citations
#14399

AugGen: Synthetic Augmentation using Diffusion Models Can Improve Recognition

Parsa Rahimi, Damien Teney, Sébastien Marcel

NEURIPS 2025 arXiv:2503.11544
1
citations
#14400

HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss

Ke Zhang, Yi Huang, Wei Liu et al.

ICCV 2025 arXiv:2504.07827
1
citations