Most Cited 2025 "noisy speech processing" Papers

22,274 papers found • Page 71 of 112

#14001

Differentially Private Boxplots

Kelly Ramsay, Jairo Diaz-Rodriguez

ICML 2025 • arXiv:2405.20415
1 citation
#14002

ELBOing Stein: Variational Bayes with Stein Mixture Inference

Ola Rønning, Eric Nalisnick, Christophe Ley et al.

ICLR 2025 • arXiv:2410.22948
1 citation
#14003

COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning

Chamika Sudusinghe, Gerasimos Gerogiannis, Damitha Lenadora et al.

ICML 2025 • arXiv:2506.00424
1 citation
#14004

Understanding Sharpness Dynamics in NN Training with a Minimalist Example: The Effects of Dataset Difficulty, Depth, Stochasticity, and More

Geonhui Yoo, Minhak Song, Chulhee Yun

ICML 2025 • arXiv:2506.06940
1 citation
#14005

Incremental Gradient Descent with Small Epoch Counts is Surprisingly Slow on Ill-Conditioned Problems

Yujun Kim, Jaeyoung Cha, Chulhee Yun

ICML 2025 • arXiv:2506.04126
1 citation
#14006

SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models

Zhenwei Tang, Difan Jiao, Blair Yang et al.

COLM 2025 • arXiv:2508.18179
1 citation
#14007

High Dynamic Range Novel View Synthesis with Single Exposure

Kaixuan Zhang, Hu Wang, Minxian Li et al.

ICML 2025 • arXiv:2505.01212
1 citation
#14008

Dynamic Syntactic Feature Filtering and Injecting Networks for Cross-lingual Dependency Parsing

Jianjian Liu, Zhengtao Yu, Ying Li et al.

AAAI 2025
1 citation
#14009

Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences

Shuchen Wu, Mirko Thalmann, Peter Dayan et al.

ICLR 2025 • arXiv:2410.21332
1 citation
#14010

LAMA-UT: Language Agnostic Multilingual ASR Through Orthography Unification and Language-Specific Transliteration

Sangmin Lee, Woojin Chung, Hong-Goo Kang

AAAI 2025 • arXiv:2412.15299
1 citation
#14011

InstaTrain: Adaptive Training via Ultra-Fast Natural Annealing within Dynamical Systems

Chuan Liu, Ruibing Song, Chunshu Wu et al.

ICLR 2025
1 citation
#14012

Uncertainty-Aware Self-Training for CTC-Based Automatic Speech Recognition

Eungbeom Kim, Kyogu Lee

AAAI 2025
1 citation
#14013

Catoni Contextual Bandits are Robust to Heavy-tailed Rewards

Chenlu Ye, Yujia Jin, Alekh Agarwal et al.

ICML 2025 • spotlight • arXiv:2502.02486
1 citation
#14014

Efficient Multivariate Robust Mean Estimation Under Mean-Shift Contamination

Ilias Diakonikolas, Giannis Iakovidis, Daniel Kane et al.

ICML 2025 • arXiv:2502.14772
1 citation
#14015

An Optimistic Algorithm for Online CMDPs with Anytime Adversarial Constraints

Jiahui Zhu, Kihyun Yu, Dabeen Lee et al.

ICML 2025 • arXiv:2505.21841
1 citation
#14016

KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors

Benson Chen, Tomasz Danel, Gabriel Dreiman et al.

ICML 2025 • arXiv:2410.08938
1 citation
#14017

Deep Submodular Optimization and LLM for Multimodal Content Extraction and Automatic Poster Generation from Long Document

Vijay Jaisankar, Sambaran Bandyopadhyay, Kalp Vyas et al.

AAAI 2025
1 citation
#14018

Empowering Self-Learning of LLMs: Inner Knowledge Explicitation as a Catalyst

Shijue Huang, Wanjun Zhong, Deng Cai et al.

AAAI 2025
1 citation
#14019

Logic Induced High-Order Reasoning Network for Event-Event Relation Extraction

Peixin Huang, Xiang Zhao, Minghao Hu et al.

AAAI 2025 • arXiv:2412.14688
1 citation
#14020

Learning from Noisy Labels via Self-Taught On-the-Fly Meta Loss Rescaling

Michael Heck, Christian Geishauser, Nurul Lubis et al.

AAAI 2025 • arXiv:2412.12955
1 citation
#14021

ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis

Xiangheng He, Junjie Chen, Zixing Zhang et al.

AAAI 2025 • arXiv:2412.11795
1 citation
#14022

QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration

HamidReza Imani, Jiaxin Peng, Peiman Mohseni et al.

ICML 2025 • arXiv:2505.06481
1 citation
#14023

CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder

Jianwei Cui, Yu Gu, Shihao Chen et al.

AAAI 2025 • arXiv:2412.08918
1 citation
#14024

Small Language Model Makes an Effective Long Text Extractor

Yelin Chen, Fanjin Zhang, Jie Tang

AAAI 2025 • arXiv:2502.07286
1 citation
#14025

CSL-L2M: Controllable Song-Level Lyric-to-Melody Generation Based on Conditional Transformer with Fine-Grained Lyric and Musical Controls

Li Chai, Donglin Wang

AAAI 2025 • arXiv:2412.09887
1 citation
#14026

Implicit In-Context Learning: Evidence from Artificial Language Experiments

Xiaomeng Ma, Qihui Xu

COLM 2025 • arXiv:2503.24190
1 citation
#14027

On the Learnability of Distribution Classes with Adaptive Adversaries

Tosca Lechner, Alex Bie, Gautam Kamath

ICML 2025 • arXiv:2509.05137
1 citation
#14028

SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning

Xu Wan, Chao Yang, Cheng Yang et al.

AAAI 2025 • arXiv:2503.01458
1 citation
#14029

A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation

Redha Taguelmimt, Samir Aknine, Djamila Boukredera et al.

AAAI 2025 • arXiv:2502.10226
1 citation
#14030

Towards Attributions of Input Variables in a Coalition

Xinhao Zheng, Huiqi Deng, Quanshi Zhang

ICML 2025 • arXiv:2309.13411
1 citation
#14031

Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach

Johan Peralez, Aurélien Delage, Jacopo Castellini et al.

AAAI 2025 • arXiv:2408.13139
1 citation
#14032

NTK-DFL: Enhancing Decentralized Federated Learning in Heterogeneous Settings via Neural Tangent Kernel

Gabriel Thompson, Kai Yue, Chau-Wai Wong et al.

ICML 2025 • arXiv:2410.01922
1 citation
#14033

Unsupervised Translation of Emergent Communication

Ido Levy, Orr Paradise, Boaz Carmeli et al.

AAAI 2025 • arXiv:2502.07552
1 citation
#14034

Discovering Spoofing Attempts on Language Model Watermarks

Thibaud Gloaguen, Nikola Jovanović, Robin Staab et al.

ICML 2025 • arXiv:2410.02693
1 citation
#14035

Craftium: Bridging Flexibility and Efficiency for Rich 3D Single- and Multi-Agent Environments

Mikel Malagón, Josu Ceberio, Jose A Lozano

ICML 2025 • arXiv:2407.03969
1 citation
#14036

PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation

Chikai Shang, Mengke Li, Yiqun Zhang et al.

ICCV 2025 • arXiv:2503.06901
1 citation
#14037

FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed

Jiaqi Zhang, Juntuo Wang, Zhixin Sun et al.

NeurIPS 2025 • arXiv:2507.03779
1 citation
#14038

Targeted Forgetting of Image Subgroups in CLIP Models

Zeliang Zhang, Gaowen Liu, Charles Fleming et al.

CVPR 2025 • arXiv:2506.03117
1 citation
#14039

Resolution of Simpson's paradox via the common cause principle

Arshak Hovhannisyan, Armen Allahverdyan

NeurIPS 2025 • arXiv:2403.00957
1 citation
#14040

Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration

Aocheng Li, James R. Zimmer-Dauphinee, Rajesh Kalyanam et al.

CVPR 2025 • arXiv:2503.04030
1 citation
#14041

BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models

Jianting Tang, Yubo Wang, Haoyu Cao et al.

ICCV 2025 • arXiv:2508.06895
1 citation
#14042

Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models

Donghoon Ahn, Jiwon Kang, Sanghyun Lee et al.

NeurIPS 2025 • arXiv:2506.10978
1 citation
#14043

SpEx: A Spectral Approach to Explainable Clustering

Tal Argov, Tal Wagner

NeurIPS 2025 • arXiv:2511.00885
1 citation
#14044

Non-Stationary Lipschitz Bandits

Nicolas Nguyen, Solenne Gaucher, Claire Vernade

NeurIPS 2025 • arXiv:2505.18871
1 citation
#14045

ESC: Erasing Space Concept for Knowledge Deletion

Tae-Young Lee, Sundong Park, Minwoo Jeon et al.

CVPR 2025 • highlight • arXiv:2504.02199
1 citation
#14046

MMPB: It’s Time for Multi-Modal Personalization

Jaeik Kim, Woojin Kim, Woohyeon Park et al.

NeurIPS 2025 • arXiv:2509.22820
1 citation
#14047

Taxonomy of reduction matrices for Graph Coarsening

Antonin Joly, Nicolas Keriven, Aline Roumy

NeurIPS 2025 • arXiv:2506.11743
1 citation
#14048

DONUT: A Decoder-Only Model for Trajectory Prediction

Markus Knoche, Daan de Geus, Bastian Leibe

ICCV 2025 • arXiv:2506.06854
1 citation
#14049

Flattening Hierarchies with Policy Bootstrapping

John Zhou, Jonathan Kao

NeurIPS 2025 • spotlight • arXiv:2505.14975
1 citation
#14050

FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers

Yanbing Zhang, Zhe Wang, Qin Zhou et al.

ICCV 2025 • arXiv:2507.15249
1 citation
#14051

Understanding Generalization in Physics Informed Models through Affine Variety Dimensions

Takeshi Koshizuka, Issei Sato

NeurIPS 2025 • arXiv:2501.18879
1 citation
#14052

Simulation-Based Inference for Adaptive Experiments

Brian Cho, Aurelien Bibaut, Nathan Kallus

NeurIPS 2025 • arXiv:2506.02881
1 citation
#14053

Military AI Needs Technically-Informed Regulation to Safeguard AI Research and its Applications

Riley Simmons-Edler, Jean Dong, Paul Lushenko et al.

NeurIPS 2025 • arXiv:2505.18371
1 citation
#14054

CF3: Compact and Fast 3D Feature Fields

Hyunjoon Lee, Joonkyu Min, Jaesik Park

ICCV 2025 • arXiv:2508.05254
1 citation
#14055

From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers

Praneet Suresh, Jack Stanley, Sonia Joseph et al.

NeurIPS 2025 • arXiv:2509.06938
1 citation
#14056

MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization

Chenglong Wang, Yang Gan, Hang Zhou et al.

NeurIPS 2025 • arXiv:2510.21473
1 citation
#14057

Benchmarking Egocentric Visual-Inertial SLAM at City Scale

Anusha Krishnan, Shaohui Liu, Paul-Edouard Sarlin et al.

ICCV 2025 • highlight • arXiv:2509.26639
1 citation
#14058

Fitted Neural Lossless Image Compression

Zhe Zhang, Zhenzhong Chen, Shan Liu

CVPR 2025
1 citation
#14059

Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval

Zhichuan Wang, Yang Zhou, Zhe Liu et al.

ICCV 2025 • arXiv:2507.21489
1 citation
#14060

Towards Unsupervised Training of Matching-based Graph Edit Distance Solver via Preference-aware GAN

Wei Huang, Hanchen Wang, Dong Wen et al.

NeurIPS 2025 • arXiv:2506.01977
1 citation
#14061

Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules

John Vastola, Samuel J Gershman, Kanaka Rajan

NeurIPS 2025 • arXiv:2510.26997
1 citation
#14062

Neural Tangent Knowledge Distillation for Optical Convolutional Networks

Jinlin Xiang, Minho Choi, Yubo Zhang et al.

NeurIPS 2025 • arXiv:2508.08421
1 citation
#14063

Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering

Zhen Yang, Zhuo Tao, Qi Chen et al.

CVPR 2025
1 citation
#14064

Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability

Boyong He, Yuxiang Ji, Zhuoyue Tan et al.

ICCV 2025 • highlight • arXiv:2506.21042
1 citation
#14065

Planning and Learning in Average Risk-aware MDPs

Weikai Wang, Erick Delage

NeurIPS 2025 • arXiv:2503.17629
1 citation
#14066

Adam Reduces a Unique Form of Sharpness: Theoretical Insights Near the Minimizer Manifold

Xinghan Li, Haodong Wen, Kaifeng Lyu

NeurIPS 2025 • arXiv:2511.02773
1 citation
#14067

TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes

Yan Xia, Yunxiang Lu, Rui Song et al.

ICCV 2025 • arXiv:2412.10308
1 citation
#14068

Active Test-time Vision-Language Navigation

Heeju Ko, Sung June Kim, Gyeongrok Oh et al.

NeurIPS 2025 • arXiv:2506.06630
1 citation
#14069

One Token Embedding Is Enough to Deadlock Your Large Reasoning Model

Mohan Zhang, Yihua Zhang, Jinghan Jia et al.

NeurIPS 2025 • arXiv:2510.15965
1 citation
#14070

LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation

Xinyu Yan, Meijun Sun, Ge-Peng Ji et al.

ICCV 2025 • arXiv:2508.01152
1 citation
#14071

Grids Often Outperform Implicit Neural Representation at Compressing Dense Signals

Namhoon Kim, Sara Fridovich-Keil

NeurIPS 2025 • arXiv:2506.11139
1 citation
#14072

EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization

Yize Wu, Ke Gao, Ling Li et al.

NeurIPS 2025 • arXiv:2502.02493
1 citation
#14073

ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods

Michal Kmicikiewicz, Vincent Fortuin, Ewa Szczurek

NeurIPS 2025 • arXiv:2505.22494
1 citation
#14074

UniZyme: A Unified Protein Cleavage Site Predictor Enhanced with Enzyme Active-Site Knowledge

Chenao Li, Shuo Yan, Enyan Dai

NeurIPS 2025 • arXiv:2502.06914
1 citation
#14075

FaCT: Faithful Concept Traces for Explaining Neural Network Decisions

Amin Parchami-Araghi, Sukrut Rao, Jonas Fischer et al.

NeurIPS 2025 • arXiv:2510.25512
1 citation
#14076

Improving Large Vision and Language Models by Learning from a Panel of Peers

Jefferson Hernandez, Jing Shi, Simon Jenni et al.

ICCV 2025 • arXiv:2509.01610
1 citation
#14077

CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction

Yiyi Liu, Chunyang Liu, Bohan Wang et al.

NeurIPS 2025 • arXiv:2509.15459
1 citation
#14078

Learning Generalizable Shape Completion with SIM(3) Equivariance

Yuqing Wang, Zhaiyu Chen, Xiaoxiang Zhu

NeurIPS 2025 • arXiv:2509.26631
1 citation
#14079

Sinusoidal Initialization, Time for a New Start

Alberto Fernandez-Hernandez, Jose Mestre, Manuel F. Dolz et al.

NeurIPS 2025 • arXiv:2505.12909
1 citation
#14080

Learning Cocoercive Conservative Denoisers via Helmholtz Decomposition for Poisson Imaging Inverse Problems

Deliang Wei, Peng Chen, Haobo Xu et al.

NeurIPS 2025
1 citation
#14081

UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation

Jiyu Guo, Shuo Yang, Yiming Huang et al.

NeurIPS 2025 • arXiv:2510.24262
1 citation
#14082

Borrowing Eyes for the Blind Spot: Overcoming Data Scarcity in Malicious Video Detection via Cross-Domain Retrieval Augmentation

Rongpei Hong, Jian Lang, Ting Zhong et al.

ICCV 2025
1 citation
#14083

Pinpointing Attention-Causal Communication in Language Models

Gabriel Franco, Mark Crovella

NeurIPS 2025
1 citation
#14084

Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation

Sayak Nag, Udita Ghosh, Calvin-Khang Ta et al.

CVPR 2025 • arXiv:2503.13947
1 citation
#14085

Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning

Jongchan Park, Mingyu Park, Donghwan Lee

NeurIPS 2025 • arXiv:2505.05701
1 citation
#14086

Not All Data are Good Labels: On the Self-supervised Labeling for Time Series Forecasting

Yuxuan Yang, Dalin Zhang, Yuxuan Liang et al.

NeurIPS 2025 • spotlight • arXiv:2502.14704
1 citation
#14087

Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining

Ping Guo, Yubing Ren, Binbin Liu et al.

NeurIPS 2025 • arXiv:2509.15556
1 citation
#14088

SSTAG: Structure-Aware Self-Supervised Learning Method for Text-Attributed Graphs

Ruyue Liu, Rong Yin, Xiangzhen Bo et al.

NeurIPS 2025 • arXiv:2510.01248
1 citation
#14089

MigGPT: Harnessing Large Language Models for Automated Migration of Out-of-Tree Linux Kernel Patches Across Versions

Pucheng Dang, Di Huang, Dong Li et al.

NeurIPS 2025 • spotlight • arXiv:2504.09474
1 citation
#14090

A Real-world Display Inverse Rendering Dataset

Seokjun Choi, Hoon-Gyu Chung, Yujin Jeon et al.

ICCV 2025 • arXiv:2508.14411
1 citation
#14091

Amortized Variational Transdimensional Inference

Laurence Davies, Daniel MacKinlay, Rafael Oliveira et al.

NeurIPS 2025 • spotlight • arXiv:2506.04749
1 citation
#14092

What’s in Common? Multimodal Models Hallucinate When Reasoning Across Scenes

Candace Ross, Florian Bordes, Adina Williams et al.

NeurIPS 2025 • arXiv:2511.03768
1 citation
#14093

Position: Biology is the Challenge Physics-Informed ML Needs to Evolve

Julien Martinelli

NeurIPS 2025 • arXiv:2510.25368
1 citation
#14094

Feature-Based Instance Neighbor Discovery: Advanced Stable Test-Time Adaptation in Dynamic World

Qinting Jiang, Chuyang Ye, Dongyan Wei et al.

NeurIPS 2025 • arXiv:2506.06782
1 citation
#14095

Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models

Wei Suo, Ji Ma, Mengyang Sun et al.

ICCV 2025 • arXiv:2412.06458
1 citation
#14096

Evidential Knowledge Distillation

Liangyu Xiang, Junyu Gao, Changsheng Xu

ICCV 2025 • arXiv:2507.18366
1 citation
#14097

Dynamical modeling of nonlinear latent factors in multiscale neural activity with real-time inference

Eray Erturk, Maryam Shanechi

NeurIPS 2025 • oral • arXiv:2512.12462
1 citation
#14098

SpecMER: Fast Protein Generation with K-mer Guided Speculative Decoding

Thomas Walton, Darin Tsui, Aryan Musharaf et al.

NeurIPS 2025 • spotlight • arXiv:2509.21689
1 citation
#14099

Aggregation Hides Out-of-Distribution Generalization Failures from Spurious Correlations

Olawale Salaudeen, Haoran Zhang, Kumail Alhamoud et al.

NeurIPS 2025 • spotlight • arXiv:2510.24884
1 citation
#14100

TEMPO: Temporal Multi-scale Autoregressive Generation of Protein Conformational Ensembles

Yaoyao Xu, Di Wang, Zihan Zhou et al.

NeurIPS 2025 • oral • arXiv:2511.05510
1 citation
#14101

Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?

Yuechen Xie, Jie Song, Huiqiong Wang et al.

CVPR 2025 • arXiv:2503.09122
1 citation
#14102

Towards Fine-grained Interactive Segmentation in Images and Videos

Yuan Yao, Qiushi Yang, Miaomiao Cui et al.

ICCV 2025 • arXiv:2502.09660
1 citation
#14103

Restoring Pruned Large Language Models via Lost Component Compensation

Zijian Feng, Hanzhang Zhou, Zixiao Zhu et al.

NeurIPS 2025 • spotlight • arXiv:2510.21834
1 citation
#14104

Enhancing LLM Watermark Resilience Against Both Scrubbing and Spoofing Attacks

Huanming Shen, Baizhou Huang, Xiaojun Wan

NeurIPS 2025 • spotlight • arXiv:2507.06274
1 citation
#14105

Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies

Yankai Chen, Xinni Zhang, Yifei Zhang et al.

NeurIPS 2025 • arXiv:2510.22095
1 citation
#14106

DyFlow: Dynamic Workflow Framework for Agentic Reasoning

Yanbo Wang, Zixiang Xu, Yue Huang et al.

NeurIPS 2025 • arXiv:2509.26062
1 citation
#14107

Position: Benchmarking is Broken - Don't Let AI be Its Own Judge

Zerui Cheng, Stella Wohnig, Ruchika Gupta et al.

NeurIPS 2025 • arXiv:2510.07575
1 citation
#14108

PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer

Zhiwei Yang, Chen Gao, Mike Zheng Shou

NeurIPS 2025 • arXiv:2509.26386
1 citation
#14109

Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks

Bhishma Dedhia, David Bourgin, Krishna Kumar Singh et al.

ICCV 2025 • arXiv:2503.17539
1 citation
#14110

RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions

Shihang Du, Sanqing Qu, Tianhang Wang et al.

CVPR 2025
1 citation
#14111

DrivAerStar: An Industrial-Grade CFD Dataset for Vehicle Aerodynamic Optimization

Jiyan Qiu, Lyulin Kuang, Guan Wang et al.

NeurIPS 2025 • arXiv:2510.16857
1 citation
#14112

VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions

Haoang Lu, Yuanqi Su, Xiaoning Zhang et al.

ICCV 2025 • arXiv:2507.19188
1 citation
#14113

DIO: Decomposable Implicit 4D Occupancy-Flow World Model

Christopher Diehl, Quinlan Sykora, Ben Agro et al.

CVPR 2025
1 citation
#14114

MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents

Ziming Wei, Bingqian Lin, Zijian Jiao et al.

NeurIPS 2025 • arXiv:2505.20148
1 citation
#14115

GlobalTomo: A global dataset for physics-ML seismic wavefield modeling and FWI

Shiqian Li, Zhi Li, Zhancun Mu et al.

NeurIPS 2025 • arXiv:2406.18202
1 citation
#14116

Gaussian Splatting Feature Fields for (Privacy-Preserving) Visual Localization

Maxime Pietrantoni, Gabriela Csurka, Torsten Sattler

CVPR 2025 • arXiv:2507.23569
1 citation
#14117

Accelerating data-driven algorithm selection for combinatorial partitioning problems

Vaggos Chatziafratis, Ishani Karmarkar, Yingxi Li et al.

NeurIPS 2025 • spotlight • arXiv:2402.14332
1 citation
#14118

DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction

Junjie Zhou, Shouju Wang, Yuxia Tang et al.

CVPR 2025 • highlight • arXiv:2503.09491
1 citation
#14119

The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models

Alessandro Serra, Francesco Ortu, Emanuele Panizon et al.

NeurIPS 2025 • arXiv:2412.06646
1 citation
#14120

Geminio: Language-Guided Gradient Inversion Attacks in Federated Learning

Junjie Shan, Ziqi Zhao, Jialin Lu et al.

ICCV 2025 • arXiv:2411.14937
1 citation
#14121

Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation

Nguyen Do, Bach Ngo, Youval Kashuv et al.

NeurIPS 2025 • arXiv:2510.17036
1 citation
#14122

TRIM: Scalable 3D Gaussian Diffusion Inference with Temporal and Spatial Trimming

Zeyuan Yin, Xiaoming Liu

NeurIPS 2025 • oral • arXiv:2511.16642
1 citation
#14123

Information-Bottleneck Driven Binary Neural Network for Change Detection

Kaijie Yin, Zhiyuan Zhang, Shu Kong et al.

ICCV 2025 • arXiv:2507.03504
1 citation
#14124

GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection

Wenxue Li, Tian Ye, Xinyu Xiong et al.

ICCV 2025
1 citation
#14125

Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control

Seongmin Park, Hyungmin Kim, Sangwoo Kim et al.

ICCV 2025 • arXiv:2505.15304
1 citation
#14126

Value Improved Actor Critic Algorithms

Yaniv Oren, Moritz Zanger, Pascal van der Vaart et al.

NeurIPS 2025 • arXiv:2406.01423
1 citation
#14127

SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound

Yunke Ao, Masoud Moghani, Mayank Mittal et al.

NeurIPS 2025 • arXiv:2507.01152
1 citation
#14128

Open Ad-hoc Categorization with Contextualized Feature Learning

Zilin Wang, Sangwoo Mo, Stella X. Yu et al.

CVPR 2025 • arXiv:2512.16202
1 citation
#14129

Stepsize anything: A unified learning rate schedule for budgeted-iteration training

Anda Tang, Yiming Dong, Yutao Zeng et al.

NeurIPS 2025 • arXiv:2505.24452
1 citation
#14130

Visual Diversity and Region-aware Prompt Learning for Zero-shot HOI Detection

Chanhyeong Yang, Taehoon Song, Jihwan Park et al.

NeurIPS 2025 • arXiv:2510.25094
1 citation
#14131

Benefit From Seen: Enhancing Open-Vocabulary Object Detection by Bridging Visual and Textual Co-Occurrence Knowledge

Yanqi Li, Jianwei Niu, Tao Ren

ICCV 2025
1 citation
#14132

Understanding and Enhancing Mask-Based Pretraining towards Universal Representations

Mingze Dong, Leda Wang, Yuval Kluger

NeurIPS 2025 • arXiv:2509.21650
1 citation
#14133

From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics

Zheng-An Chen, Tao Luo

NeurIPS 2025 • oral • arXiv:2510.06954
1 citation
#14134

InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention

Qiang Xiang, Shuang Sun, Binglei Li et al.

NeurIPS 2025 • arXiv:2509.16691
1 citation
#14135

Wasserstein Style Distribution Analysis and Transform for Stylized Image Generation

Xi Yu, Xiang Gu, Zhihao Shi et al.

ICCV 2025 • highlight
1 citation
#14136

VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization

Xinye Cao, Hongcan Guo, Jiawen Qian et al.

ICCV 2025 • arXiv:2510.06040
1 citation
#14137

Linearly Constrained Diffusion Implicit Models

Vivek Jayaram, Ira Kemelmacher-Shlizerman, Steve Seitz et al.

NeurIPS 2025 • arXiv:2411.00359
1 citation
#14138

IRGPT: Understanding Real-world Infrared Image with Bi-cross-modal Curriculum on Large-scale Benchmark

Zhe Cao, Jin Zhang, Ruiheng Zhang

ICCV 2025 • arXiv:2507.14449
1 citation
#14139

To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable RL

Yuda Song, Dhruv Rohatgi, Aarti Singh et al.

NeurIPS 2025 • spotlight
1 citation
#14140

Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning

Zihao Jing, Yan Sun, Yan Yi Li et al.

NeurIPS 2025 • arXiv:2510.23640
1 citation
#14141

Valid Inference with Imperfect Synthetic Data

Yewon Byun, Shantanu Gupta, Zachary Lipton et al.

NeurIPS 2025 • arXiv:2508.06635
1 citation
#14142

SyncSDE: A Probabilistic Framework for Diffusion Synchronization

Hyunjun Lee, Hyunsoo Lee, Sookwan Han

CVPR 2025 • arXiv:2503.21555
1 citation
#14143

Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications

Agam Shah, Siddhant Sukhani, Huzaifa Pardawala et al.

NeurIPS 2025 • oral • arXiv:2505.17048
1 citation
#14144

Learning Robust Representations with Long-Term Information for Generalization in Visual Reinforcement Learning

Rui Yang, Jie Wang, Qijie Peng et al.

ICLR 2025
1 citation
#14145

ModuLM: Enabling Modular and Multimodal Molecular Relational Learning with Large Language Models

Zhuo Chen, Yizhen Zheng, Huan Yee Koh et al.

NeurIPS 2025 • arXiv:2506.00880
1 citation
#14146

CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering

Xinyi Zheng, Steve Zhang, Weizhe Lin et al.

ICCV 2025 • arXiv:2501.06927
1 citation
#14147

Memory-Efficient Generative Models via Product Quantization

Jie Shao, Hanxiao Zhang, Hao Yu et al.

ICCV 2025
1 citation
#14148

SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting

Shuaiting Li, Juncan Deng, Chengxuan Wang et al.

ICCV 2025 • arXiv:2503.08668
1 citation
#14149

FlowFeat: Pixel-Dense Embedding of Motion Profiles

Nikita Araslanov, Anna Sonnweber, Daniel Cremers

NeurIPS 2025 • oral • arXiv:2511.07696
1 citation
#14150

Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering

Imad Eddine Marouf, Enzo Tartaglione, Stéphane Lathuilière et al.

ICCV 2025 • arXiv:2502.04469
1 citation
#14151

VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations

Qianqian Qiao, DanDan Zheng, Yihang Bo et al.

NeurIPS 2025 • oral • arXiv:2510.25238
1 citation
#14152

Adapting to Observation Length of Trajectory Prediction via Contrastive Learning

Ruiqi Qiu, Jun Gong, Xinyu Zhang et al.

CVPR 2025
1 citation
#14153

How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?

Tuan Tran Anh, Duy M. H. Nguyen, Hoai-Chau Tran et al.

NeurIPS 2025 • arXiv:2511.05449
1 citation
#14154

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning

Haoran Chen, Ping Wang, Zihan Zhou et al.

ICCV 2025 • arXiv:2503.07979
1 citation
#14155

Prediction-Powered Semi-Supervised Learning with Online Power Tuning

Noa Shoham, Ron Dorfman, Shalev Shaer et al.

NeurIPS 2025 • arXiv:2510.22586
1 citation
#14156

Curious Causality-Seeking Agents Learn Meta Causal World

Zhiyu Zhao, Haoxuan Li, Haifeng Zhang et al.

NeurIPS 2025 • arXiv:2506.23068
1 citation
#14157

Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference

Kuo Wang, Quanlong Zheng, Junlin Xie et al.

ICCV 2025 • arXiv:2508.02134
1 citation
#14158

RoFt-Mol: Benchmarking Robust Fine-tuning with Molecular Graph Foundation Models

Shikun Liu, Deyu Zou, Nima Shoghi et al.

NeurIPS 2025 • spotlight • arXiv:2509.00614
1 citation
#14159

Correspondence-Free Fast and Robust Spherical Point Pattern Registration

Anik Sarker, Alan Asbeck

ICCV 2025 • arXiv:2508.02339
1 citation
#14160

Intrinsic Goals for Autonomous Agents: Model-Based Exploration in Virtual Zebrafish Predicts Ethological Behavior and Whole-Brain Dynamics

Reece Keller, Alyn Kirsch, Felix Pei et al.

NeurIPS 2025 • oral • arXiv:2506.00138
1 citation
#14161

Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization

Milad Sefidgaran, Kimia Nadjahi, Abdellatif Zaidi

NeurIPS 2025 • oral • arXiv:2510.23485
1 citation
#14162

Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement

Junyu Lou, Xiaorui Zhao, Kexuan Shi et al.

ICCV 2025 • arXiv:2507.12135
1 citation
#14163

Instance-wise Supervision-level Optimization in Active Learning

Shinnosuke Matsuo, Riku Togashi, Ryoma Bise et al.

CVPR 2025 • arXiv:2503.06517
1 citation
#14164

Reducing Class-wise Confusion for Incremental Learning with Disentangled Manifolds

Huitong Chen, Yu Wang, Yan Fan et al.

CVPR 2025 • arXiv:2503.17677
1 citation
#14165

Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Hongjae Lee, Myungjun Son, Dongjea Kang et al.

ICCV 2025 • arXiv:2507.10340
1 citation
#14166

Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape

Ruichen Chen, Keith Mills, Liyao Jiang et al.

NeurIPS 2025 • oral • arXiv:2505.22918
1 citation
#14167

Tight Bounds on the Distortion of Randomized and Deterministic Distributed Voting

Mohammad Abam, Davoud Kareshki, Marzieh Nilipour et al.

NeurIPS 2025 • arXiv:2509.17134
1 citation
#14168

LiSu: A Dataset and Method for LiDAR Surface Normal Estimation

Dušan Malić, Christian Fruhwirth-Reisinger, Samuel Schulter et al.

CVPR 2025 • arXiv:2503.08601
1 citation
#14169

RGNMR: A Gauss-Newton method for robust matrix completion with theoretical guarantees

Eilon Vaknin Laufer, Boaz Nadler

NeurIPS 2025 • arXiv:2505.12919
1 citation
#14170

DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images

Kazuma Nagata, Naoshi Kaneko

ICCV 2025 • arXiv:2509.14685
1 citation
#14171

ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation

Tao Tan, Qiulei Dong

CVPR 2025
1 citation
#14172

RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis

Yang Songxiao, Haolin Wang, Yao Fu et al.

NeurIPS 2025 • arXiv:2507.05193
1 citation
#14173

Measuring the Impact of Rotation Equivariance on Aerial Object Detection

Xiuyu Wu, Xinhao Wang, Xiubin Zhu et al.

ICCV 2025 • arXiv:2507.09896
1 citation
#14174

Injecting Frame-Event Complementary Fusion into Diffusion for Optical Flow in Challenging Scenes

Haonan Wang, Hanyu Zhou, Haoyue Liu et al.

NeurIPS 2025 • spotlight • arXiv:2510.10577
1 citation
#14175

MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World

Ankit Dhiman, Manan Shah, R. Venkatesh Babu

CVPR 2025 • arXiv:2504.15397
1 citation
#14176

VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs

Shmuel Berman, Jia Deng

NeurIPS 2025 • spotlight • arXiv:2507.13361
1 citation
#14177

DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding

Xiaoyi Bao, Chen-Wei Xie, Hao Tang et al.

ICCV 2025 • arXiv:2507.15569
1 citation
#14178

Seeing the Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation

Peng Ren, Tian Bai, Jing Sun et al.

ICCV 2025
1 citation
#14179

Decoupled Subgraph Federated Learning

Javad Aliakbari, Johan Östman, Alexandre Graell i Amat

ICLR 2025 • arXiv:2402.19163
1 citation
#14180

Improved Diffusion-based Generative Model with Better Adversarial Robustness

Zekun Wang, Mingyang Yi, Shuchen Xue et al.

ICLR 2025 • arXiv:2502.17099
1 citation
#14181

DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation

Xiaoliang Ju, Hongsheng Li

CVPR 2025 • arXiv:2503.06900
1 citation
#14182

GeoDepth: From Point-to-Depth to Plane-to-Depth Modeling for Self-Supervised Monocular Depth Estimation

Haifeng Wu, Shuhang Gu, Lixin Duan et al.

CVPR 2025
1 citation
#14183

On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization

Jincheng Cao, Ruichen Jiang, Erfan Yazdandoost Hamedani et al.

NeurIPS 2025 • arXiv:2507.23155
1 citation
#14184

SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors

Yufan Wu, Xuanhong Chen, Wen Li et al.

CVPR 2025
1 citation
#14185

Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data

Rui Miao, Babak Shahbaba, Annie Qu

NeurIPS 2025 • arXiv:2505.09496
1 citation
#14186

Active Event-based Stereo Vision

Jianing Li, Yunjian Zhang, Haiqian Han et al.

CVPR 2025
1 citation
#14187

LiFT: Learning to Fine-Tune via Bayesian Parameter Efficient Meta Fine-Tuning

Minyoung Kim, Timothy Hospedales

ICLR 2025
1 citation
#14188

Visual Structures Help Visual Reasoning: Addressing the Binding Problem in LVLMs

Amirmohammad Izadi, Mohammadali Banayeeanzade, Fatemeh Askari et al.

NeurIPS 2025
1 citation
#14189

Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency

Jun Yang, Cheng-Chi Wang, Bogdan Stoica et al.

NeurIPS 2025 • arXiv:2505.23471
1 citation
#14190

Continuity and Isolation Lead to Doubts or Dilemmas in Large Language Models

Hector Pasten, Felipe Urrutia, Hector Orellana et al.

NeurIPS 2025 • arXiv:2505.10606
1 citation
#14191

MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects

Kevin Zhang, Jia-Bin Huang, Jose Echevarria et al.

CVPR 2025
1 citation
#14192

Fast exact recovery of noisy matrix from few entries: the infinity norm approach

BaoLinh Tran, Van Vu

NeurIPS 2025 • arXiv:2501.19224
1 citation
#14193

AutoScape: Geometry-Consistent Long-Horizon Scene Generation

Jiacheng Chen, Ziyu Jiang, Mingfu Liang et al.

ICCV 2025 • arXiv:2510.20726
1 citation
#14194

ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model

Jialong Zuo, Yongtai Deng, Mengdan Tan et al.

NeurIPS 2025 • arXiv:2506.09385
1 citation
#14195

DICE: Staleness-Centric Optimizations for Parallel Diffusion MoE Inference

Jiajun Luo, Lizhuo Luo, Jianru Xu et al.

ICCV 2025
1 citation
#14196

RayGaussX: Accelerating Gaussian-Based Ray Marching for Real-Time and High-Quality Novel View Synthesis

Hugo Blanc, Jean-Emmanuel Deschaud, Alexis Paljic

ICCV 2025 • arXiv:2509.07782
1 citation
#14197

BlinkTrack: Feature Tracking over 80 FPS via Events and Images

Yichen Shen, Yijin Li, Shuo Chen et al.

ICCV 2025 • arXiv:2409.17981
1 citation
#14198

ZeCO: Zero-Communication Overhead Sequence Parallelism for Linear Attention

Yuhong Chou, Zehao Liu, Rui-Jie Zhu et al.

NeurIPS 2025 • arXiv:2507.01004
1 citation
#14199

Towards Learning High-Precision Least Squares Algorithms with Sequence Models

Jerry Liu, Jessica Grogan, Owen Dugan et al.

ICLR 2025 • arXiv:2503.12295
1 citation
#14200

C-LoRA: Contextual Low-Rank Adaptation for Uncertainty Estimation in Large Language Models

Amir Hossein Rahmati, Sanket Jantre, Weifeng Zhang et al.

NeurIPS 2025 • arXiv:2505.17773
1 citation