Most Cited 2025 "data corruptions" Papers

22,274 papers found

#4801

Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation

Yiwei Shi, Muning Wen, Qi Zhang et al.

AAAI 2025 • paper • arXiv:2409.09541
5 citations
#4802

StressPrompt: Does Stress Impact Large Language Models and Human Performance Similarly?

Guobin Shen, Dongcheng Zhao, Aorigele Bao et al.

AAAI 2025 • paper • arXiv:2409.17167
5 citations
#4803

Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving

Alexey Nekrasov, Malcolm Burdorf, Stewart Worrall et al.

CVPR 2025 • arXiv:2505.02148
5 citations
#4804

Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes

Kuiyuan Zhang, Zhongyun Hua, Rushi Lan et al.

AAAI 2025 • paper • arXiv:2412.12619
5 citations
#4805

SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis

Junho Kim, Hyunjun Kim, Hosu Lee et al.

CVPR 2025 • arXiv:2411.16173
5 citations
#4806

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Wenqi Zhang, Hang Zhang, Xin Li et al.

ICCV 2025 • highlight • arXiv:2501.00958
5 citations
#4807

ReNeg: Learning Negative Embedding with Reward Guidance

Xiaomin Li, Yixuan Liu, Takashi Isobe et al.

CVPR 2025 • highlight • arXiv:2412.19637
5 citations
#4808

ZETA: Leveraging $Z$-order Curves for Efficient Top-$k$ Attention

Qiuhao Zeng, Jierui Huang, Peng Lu et al.

ICLR 2025 • arXiv:2501.14577
5 citations
#4809

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classifications

Yutong Xia, Runpeng Yu, Yuxuan Liang et al.

AAAI 2025 • paper • arXiv:2401.09953
5 citations
#4810

Cluster Based Heterogeneous Federated Foundation Model Adaptation and Fine-Tuning

Xianda Wang, Yaqi Qiao, Duo Wu et al.

AAAI 2025 • paper
5 citations
#4811

Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval

Bingjun Luo, Jinpeng Wang, Zewen Wang et al.

AAAI 2025 • paper • arXiv:2501.15052
5 citations
#4812

Bridging Molecular Graphs and Large Language Models

Runze Wang, Mingqi Yang, Yanming Shen

AAAI 2025 • paper • arXiv:2503.03135
5 citations
#4813

Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning

Fengyu Gao, Ruida Zhou, Tianhao Wang et al.

ICLR 2025 • arXiv:2410.12085
5 citations
#4814

Towards Learnable Anchor for Deep Multi-View Clustering

Bocheng Wang, Chusheng Zeng, Mulin Chen et al.

AAAI 2025 • paper • arXiv:2503.12427
5 citations
#4815

CLOC: Contrastive Learning for Ordinal Classification with Multi-Margin N-pair Loss

Dileepa Pitawela, Gustavo Carneiro, Hsiang-Ting Chen

CVPR 2025 • arXiv:2504.17813
5 citations
#4816

A Thorough Comparison Between Independent Cascade and Susceptible-Infected-Recovered Models

Panfeng Liu, Guoliang Qiu, Biaoshuai Tao et al.

AAAI 2025 • paper • arXiv:2408.11470
5 citations
#4817

PSMGD: Periodic Stochastic Multi-Gradient Descent for Fast Multi-Objective Optimization

Mingjing Xu, Peizhong Ju, Jia Liu et al.

AAAI 2025 • paper • arXiv:2412.10961
5 citations
#4818

Learning Interpretable Queries for Explainable Image Classification with Information Pursuit

Stefan Kolek, Aditya Chattopadhyay, Kwan Ho Ryan Chan et al.

ICCV 2025 • arXiv:2312.11548
5 citations
#4819

SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation

Pengfei Chen, Lingxi Xie, Xinyue Huo et al.

ICLR 2025 • arXiv:2407.16682
5 citations
#4820

HandOS: 3D Hand Reconstruction in One Stage

Xingyu Chen, Zhuheng Song, Xiaoke Jiang et al.

CVPR 2025 • arXiv:2412.01537
5 citations
#4821

BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes

Minkyun Seo, Hyungtae Lim, Kanghee Lee et al.

ICCV 2025 • highlight • arXiv:2503.07940
5 citations
#4822

Multi-View Collaborative Learning Network for Speech Deepfake Detection

Kuiyuan Zhang, Zhongyun Hua, Rushi Lan et al.

AAAI 2025 • paper
5 citations
#4823

DF-MIA: A Distribution-Free Membership Inference Attack on Fine-Tuned Large Language Models

Zhiheng Huang, Yannan Liu, Daojing He et al.

AAAI 2025 • paper
5 citations
#4824

Accelerating Training with Neuron Interaction and Nowcasting Networks

Boris Knyazev, Abhinav Moudgil, Guillaume Lajoie et al.

ICLR 2025 • arXiv:2409.04434
5 citations
#4825

InteractionMap: Improving Online Vectorized HDMap Construction with Interaction

Kuang Wu, Chuan Yang, Zhanbin Li

CVPR 2025 • arXiv:2503.21659
5 citations
#4826

FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction

Yitong Duan, Weiran Wang, Jian Li

AAAI 2025 • paper • arXiv:2502.05218
5 citations
#4827

RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments

Haisheng Su, Feixiang Song, Cong Ma et al.

CVPR 2025 • arXiv:2408.15503
5 citations
#4828

mmFAS: Multimodal Face Anti-Spoofing Using Multi-Level Alignment and Switch-Attention Fusion

Geng Chen, Wuyuan Xie, Di Lin et al.

AAAI 2025 • paper
5 citations
#4829

LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits

Duy Nguyen, Archiki Prasad, Elias Stengel-Eskin et al.

NEURIPS 2025 • arXiv:2410.01735
5 citations
#4830

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

Rui Hu, Yuxuan Zhang, Lianghui Zhu et al.

ICCV 2025 • arXiv:2503.10596
5 citations
#4831

EgoBridge: Domain Adaptation for Generalizable Imitation from Egocentric Human Data

Ryan Punamiya, Dhruv Patel, Patcharapong Aphiwetsa et al.

NEURIPS 2025 • arXiv:2509.19626
5 citations
#4832

Neural Interactive Proofs

Lewis Hammond, Sam Adam-Day

ICLR 2025 • arXiv:2412.08897
5 citations
#4833

Lightweight Predictive 3D Gaussian Splats

Junli Cao, Vidit Goel, Chaoyang Wang et al.

ICLR 2025 • arXiv:2406.19434
5 citations
#4834

On Extending Direct Preference Optimization to Accommodate Ties

Jinghong Chen, Guangyu Yang, Weizhe Lin et al.

NEURIPS 2025 • arXiv:2409.17431
5 citations
#4835

Strategic Classification With Externalities

Safwan Hossain, Evi Micha, Yiling Chen et al.

ICLR 2025 • arXiv:2410.08032
5 citations
#4836

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

Yu Chen, Jiatai Huang, Yan Dai et al.

ICLR 2025 • arXiv:2410.03284
5 citations
#4837

Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs

Kejia Zhang, Keda Tao, Jiasheng Tang et al.

NEURIPS 2025 • arXiv:2501.19164
5 citations
#4838

V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents

Zhengrong Yue, Shaobin Zhuang, Kunchang Li et al.

CVPR 2025 • arXiv:2503.12077
5 citations
#4839

NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval

Sepanta Zeighami, Zac Wellmer, Aditya Parameswaran

ICLR 2025 • arXiv:2409.02343
5 citations
#4840

DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Model

Junjia Huang, Pengxiang Yan, Jinhang Cai et al.

ICCV 2025 • highlight
5 citations
#4841

WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception

Zhiheng Liu, Xueqing Deng, Shoufa Chen et al.

NEURIPS 2025 • oral • arXiv:2508.15720
5 citations
#4842

Binarized Neural Network for Multi-spectral Image Fusion

Junming Hou, Xiaoyu Chen, Ran Ran et al.

CVPR 2025
5 citations
#4843

Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent

Tong Yang, Yu Huang, Yingbin Liang et al.

NEURIPS 2025 • arXiv:2508.08222
5 citations
#4844

How to Verify Any (Reasonable) Distribution Property: Computationally Sound Argument Systems for Distributions

Tal Herman, Guy Rothblum

ICLR 2025 • arXiv:2409.06594
5 citations
#4845

Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization

Vladimir Boza, Vladimir Macko

ICLR 2025
5 citations
#4846

Leveraging Attention to Effectively Compress Prompts for Long-Context LLMs

Yunlong Zhao, Haoran Wu, Bo Xu

AAAI 2025 • paper
5 citations
#4847

FlexEvent: Towards Flexible Event-Frame Object Detection at Varying Operational Frequencies

Dongyue Lu, Lingdong Kong, Gim Hee Lee et al.

NEURIPS 2025 • oral • arXiv:2412.06708
5 citations
#4848

RaSA: Rank-Sharing Low-Rank Adaptation

Zhiwei He, Zhaopeng Tu, Xing Wang et al.

ICLR 2025 • arXiv:2503.12576
5 citations
#4849

Severing Spurious Correlations with Data Pruning

Varun Mulchandani, Jung-Eun Kim

ICLR 2025 • arXiv:2503.18258
5 citations
#4850

Fine-grained Spatiotemporal Grounding on Egocentric Videos

Shuo Liang, Yiwu Zhong, Zi-Yuan Hu et al.

ICCV 2025 • arXiv:2508.00518
5 citations
#4851

Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs

ChangHao Li, Yuchen Zhuang, Rushi Qiang et al.

NEURIPS 2025 • arXiv:2410.20749
5 citations
#4852

VeriThoughts: Enabling Automated Verilog Code Generation using Reasoning and Formal Verification

Patrick Yubeaton, Andre Nakkab, Weihua Xiao et al.

NEURIPS 2025 • arXiv:2505.20302
5 citations
#4853

Latent Radiance Fields with 3D-aware 2D Representations

Chaoyi Zhou, Xi Liu, Feng Luo et al.

ICLR 2025 • arXiv:2502.09613
5 citations
#4854

A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking

Zixiang Zhao, Haowen Bai, Bingxin Ke et al.

NEURIPS 2025 • oral • arXiv:2505.19858
5 citations
#4855

CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity Prediction

Rong Han, Xiaohong Liu, Tong Pan et al.

AAAI 2025 • paper • arXiv:2409.03773
5 citations
#4856

LuxDiT: Lighting Estimation with Video Diffusion Transformer

Ruofan Liang, Kai He, Zan Gojcic et al.

NEURIPS 2025 • arXiv:2509.03680
5 citations
#4857

CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion

Kai He, Chin-Hsuan Wu, Igor Gilitschenski

CVPR 2025 • arXiv:2412.01792
5 citations
#4858

SQLens: An End-to-End Framework for Error Detection and Correction in Text-to-SQL

Yue Gong, Chuan Lei, Xiao Qin et al.

NEURIPS 2025 • arXiv:2506.04494
5 citations
#4859

Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection

Chuhan Zhang, Chaoyang Zhu, Pingcheng Dong et al.

ICLR 2025 • arXiv:2503.11005
5 citations
#4860

On Speeding Up Language Model Evaluation

Jin Zhou, Christian Belardi, Ruihan Wu et al.

ICLR 2025 • arXiv:2407.06172
5 citations
#4861

ECHOPulse: ECG Controlled Echocardio-gram Video Generation

Yiwei Li, Sekeun Kim, Zihao Wu et al.

ICLR 2025 • arXiv:2410.03143
5 citations
#4862

RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs

Jiaxing Wu, Lin Ning, Luyang Liu et al.

AAAI 2025 • paper • arXiv:2409.04421
5 citations
#4863

Omnidirectional Multi-Object Tracking

Kai Luo, Hao Shi, Sheng Wu et al.

CVPR 2025 • arXiv:2503.04565
5 citations
#4864

Bayesian WeakS-to-Strong from Text Classification to Generation

Ziyun Cui, Ziyang Zhang, Guangzhi Sun et al.

ICLR 2025 • arXiv:2406.03199
5 citations
#4865

Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner

Runa Eschenhagen, Aaron Defazio, Tsung-Hsien Lee et al.

NEURIPS 2025 • spotlight • arXiv:2506.03595
5 citations
#4866

LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning

Ante Wang, Linfeng Song, Ye Tian et al.

AAAI 2025 • paper
5 citations
#4867

MrSteve: Instruction-Following Agents in Minecraft with What-Where-When Memory

Junyeong Park, Junmo Cho, Sungjin Ahn

ICLR 2025 • arXiv:2411.06736
5 citations
#4868

Multi-identity Human Image Animation with Structural Video Diffusion

Zhenzhi Wang, Yixuan Li, Yanhong Zeng et al.

ICCV 2025 • arXiv:2504.04126
5 citations
#4869

Self-Evolutionary Large Language Models Through Uncertainty-Enhanced Preference Optimization

Jianing Wang, Yang Zhou, Xiaocheng Zhang et al.

AAAI 2025 • paper • arXiv:2409.11212
5 citations
#4870

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Ning Gao, Yilun Chen, Shuai Yang et al.

CVPR 2025 • arXiv:2506.10966
5 citations
#4871

Calibrating Expressions of Certainty

Peiqi Wang, Barbara Lam, Yingcheng Liu et al.

ICLR 2025 • arXiv:2410.04315
5 citations
#4872

On Generalization Across Environments In Multi-Objective Reinforcement Learning

Jayden Teoh, Pradeep Varakantham, Peter Vamplew

ICLR 2025 • arXiv:2503.00799
5 citations
#4873

Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers

Efstathios Karypidis, Ioannis Kakogeorgiou, Spyros Gidaris et al.

CVPR 2025 • arXiv:2501.08303
5 citations
#4874

Enhancing Testing-Time Robustness for Trusted Multi-View Classification in the Wild

Wei Liu, Yufei Chen, Xiaodong Yue

CVPR 2025
5 citations
#4875

Few-Shot, No Problem: Descriptive Continual Relation Extraction

Nguyen Xuan Thanh, Anh Duc Le, Quyen Tran et al.

AAAI 2025 • paper • arXiv:2502.20596
5 citations
#4876

Reinforcement learning with combinatorial actions for coupled restless bandits

Lily Xu, Bryan Wilder, Elias Khalil et al.

ICLR 2025 • arXiv:2503.01919
5 citations
#4877

Causal LLM Routing: End-to-End Regret Minimization from Observational Data

Asterios Tsiourvas, Wei Sun, Georgia Perakis

NEURIPS 2025 • arXiv:2505.16037
5 citations
#4878

GARLIC: GPT-Augmented Reinforcement Learning with Intelligent Control for Vehicle Dispatching

Xiao Han, Zijian Zhang, Xiangyu Zhao et al.

AAAI 2025 • paper • arXiv:2408.10286
5 citations
#4879

Pos3R: 6D Pose Estimation for Unseen Objects Made Easy

Weijian Deng, Dylan Campbell, Chunyi Sun et al.

CVPR 2025
5 citations
#4880

Learning to Communicate Through Implicit Communication Channels

Han Wang, Binbin Chen, zhang et al.

ICLR 2025 • arXiv:2411.01553
5 citations
#4881

Language Models Can Predict Their Own Behavior

Dhananjay Ashok, Jonathan May

NEURIPS 2025 • arXiv:2502.13329
5 citations
#4882

Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection

Boyong He, Yuxiang Ji, Qianwen Ye et al.

CVPR 2025 • arXiv:2503.02101
5 citations
#4883

OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad

Luyao Tang, Chaoqi Chen, Yuxuan Yuan et al.

CVPR 2025 • arXiv:2503.18695
5 citations
#4884

Multi-Modal View Enhanced Large Vision Models for Long-Term Time Series Forecasting

ChengAo Shen, Wenchao Yu, Ziming Zhao et al.

NEURIPS 2025 • arXiv:2505.24003
5 citations
#4885

Infer Human’s Intentions Before Following Natural Language Instructions

Yanming Wan, Yue Wu, Yiping Wang et al.

AAAI 2025 • paper • arXiv:2409.18073
5 citations
#4886

Learning Visual Generative Priors without Text

Shuailei Ma, Kecheng Zheng, Ying Wei et al.

CVPR 2025 • arXiv:2412.07767
5 citations
#4887

STAR: Stability-Inducing Weight Perturbation for Continual Learning

Masih Eskandar, Tooba Imtiaz, Davin Hill et al.

ICLR 2025 • arXiv:2503.01595
5 citations
#4888

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning

Xiaochuan Li, Zichun Yu, Chenyan Xiong

ICLR 2025 • arXiv:2410.14208
5 citations
#4889

MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views

Antoine Guédon, Tomoki Ichikawa, Kohei Yamashita et al.

CVPR 2025 • highlight • arXiv:2412.06767
5 citations
#4890

ProbeSDF: Light Field Probes For Neural Surface Reconstruction

Briac Toussaint, Diego Thomas, Jean-Sébastien Franco

CVPR 2025 • arXiv:2412.10084
5 citations
#4891

HUSH: Holistic Panoramic 3D Scene Understanding using Spherical Harmonics

Jongsung Lee, Harin Park, Byeong-Uk Lee et al.

CVPR 2025
5 citations
#4892

Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation

Seyedreza Mohseni, Seyedali Mohammadi, Deepa Tilwani et al.

AAAI 2025 • paper • arXiv:2412.16135
5 citations
#4893

BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks

Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi et al.

ICLR 2025 • arXiv:2412.04626
5 citations
#4894

Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology

Wenhao Tang, Rong Qin, Heng Fang et al.

NEURIPS 2025 • arXiv:2506.02408
5 citations
#4895

Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity

Wentao Guo, Jikai Long, Yimeng Zeng et al.

ICLR 2025
5 citations
#4896

Dynamic Stereotype Theory Induced Micro-expression Recognition with Oriented Deformation

Bohao Zhang, Xuejiao Wang, Changbo Wang et al.

CVPR 2025
5 citations
#4897

Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction

Mingyu Derek Ma, Xiaoxuan Wang, Yijia Xiao et al.

AAAI 2025 • paper • arXiv:2501.17326
5 citations
#4898

LUCAS: Layered Universal Codec Avatars

Di Liu, Teng Deng, Giljoo Nam et al.

CVPR 2025 • arXiv:2502.19739
5 citations
#4899

FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing

Yufan Ren, Zicong Jiang, Tong Zhang et al.

CVPR 2025 • arXiv:2503.19191
5 citations
#4900

Rethinking Fair Representation Learning for Performance-Sensitive Tasks

Charles Jones, Fabio De Sousa Ribeiro, Mélanie Roschewitz et al.

ICLR 2025 • arXiv:2410.04120
5 citations
#4901

Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement

Yuchen Ren, Zhengyu Zhao, Chenhao Lin et al.

CVPR 2025 • arXiv:2503.15404
5 citations
#4902

DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?

Tianhong Zhou, Xu Yin, Yingtao Zhu et al.

NEURIPS 2025 • arXiv:2505.24173
5 citations
#4903

Accelerating 3D Molecule Generation via Jointly Geometric Optimal Transport

Haokai Hong, Wanyu Lin, KC Tan

ICLR 2025 • arXiv:2405.15252
5 citations
#4904

Learning Orthogonal Multi-Index Models: A Fine-Grained Information Exponent Analysis

Yunwei Ren, Jason Lee

NEURIPS 2025 • arXiv:2410.09678
5 citations
#4905

Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Xingxuan Zhang, Haoran Wang, Jiansheng Li et al.

ICLR 2025 • arXiv:2503.15579
5 citations
#4906

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Dongfang Li, Zetian Sun, Xinshuo Hu et al.

AAAI 2025 • paper • arXiv:2412.07393
5 citations
#4907

HeGTa: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding

Rihui Jin, Yu Li, Guilin Qi et al.

AAAI 2025 • paper • arXiv:2403.19723
5 citations
#4908

Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAV Target Detection

Houzhang Fang, Xiaolin Wang, Zengyang Li et al.

CVPR 2025 • highlight
5 citations
#4909

CAT: Content-Adaptive Image Tokenization

Junhong Shen, Kushal Tirumala, Michihiro Yasunaga et al.

NEURIPS 2025 • arXiv:2501.03120
5 citations
#4910

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Bingquan Dai, Luo Li, Qihong Tang et al.

NEURIPS 2025 • arXiv:2508.14879
5 citations
#4911

Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections

Xiaomeng Xu, Yifan Hou, Zeyi Liu et al.

NEURIPS 2025 • arXiv:2506.16685
5 citations
#4912

From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy

Julian Dörfler, Benito van der Zander, Markus Bläser et al.

ICLR 2025 • arXiv:2405.07373
5 citations
#4913

Pushing the Limits of All-Atom Geometric Graph Neural Networks: Pre-Training, Scaling, and Zero-Shot Transfer

Zihan Pengmei, Zhengyuan Shen, Zichen Wang et al.

ICLR 2025 • arXiv:2410.21683
5 citations
#4914

Context-Enhanced Memory-Refined Transformer for Online Action Detection

Zhanzhong Pang, Fadime Sener, Angela Yao

CVPR 2025 • arXiv:2503.18359
5 citations
#4915

Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification

Jiayu Jiang, Changxing Ding, Wentao Tan et al.

CVPR 2025 • highlight • arXiv:2503.09962
5 citations
#4916

Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing

Zhedong Zhang, Liang Li, Chenggang Yan et al.

CVPR 2025 • arXiv:2503.12042
5 citations
#4917

CITI: Enhancing Tool Utilizing Ability in Large Language Models Without Sacrificing General Performance

Yupu Hao, Pengfei Cao, Zhuoran Jin et al.

AAAI 2025 • paper • arXiv:2409.13202
5 citations
#4918

On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach

Baoshun Tong, Hanjiang Lai, Yan Pan et al.

CVPR 2025
5 citations
#4919

3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement

Yihang Luo, Shangchen Zhou, Yushi Lan et al.

CVPR 2025 • arXiv:2412.18565
5 citations
#4920

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models

Yuchen Fan, Yuzhong Hong, Qiushi Wang et al.

AAAI 2025 • paper • arXiv:2412.12865
5 citations
#4921

VLMaterial: Procedural Material Generation with Large Vision-Language Models

Beichen Li, Rundi Wu, Armando Solar-Lezama et al.

ICLR 2025 • arXiv:2501.18623
5 citations
#4922

Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization

Lingyun Zhang, Yu Xie, Yanwei Fu et al.

CVPR 2025 • arXiv:2412.01244
5 citations
#4923

SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision

Kangjie Zheng, Siyue Liang, Junwei Yang et al.

ICLR 2025 • arXiv:2412.05569
5 citations
#4924

Predicting Empirical AI Research Outcomes with Language Models

Jiaxin Wen, Chenglei Si, Yueh-Han Chen et al.

NEURIPS 2025 • arXiv:2506.00794
5 citations
#4925

Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space

Zelin Peng, Zhengqin Xu, Zhilin Zeng et al.

CVPR 2025
5 citations
#4926

OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging

Yijie Tang, Jiazhao Zhang, Yuqing Lan et al.

CVPR 2025 • arXiv:2503.01309
5 citations
#4927

BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments

Xinghao Wang, Pengyu Wang, Bo Wang et al.

ICLR 2025 • arXiv:2410.23918
5 citations
#4928

GLoRa: A Benchmark to Evaluate the Ability to Learn Long-Range Dependencies in Graphs

Dongzhuoran Zhou, Evgeny Kharlamov, Egor Kostylev

ICLR 2025
5 citations
#4929

CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation

Kai Fang, Anqi Zhang, Guangyu Gao et al.

CVPR 2025 • arXiv:2504.04156
5 citations
#4930

Question-Aware Gaussian Experts for Audio-Visual Question Answering

Hongyeob Kim, Inyoung Jung, Dayoon Suh et al.

CVPR 2025 • highlight • arXiv:2503.04459
5 citations
#4931

Revisiting Source-Free Domain Adaptation: a New Perspective via Uncertainty Control

Gezheng Xu, Hui Guo, Li Yi et al.

ICLR 2025
5 citations
#4932

Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network

Xingyu Qiu, Mengying Yang, Xinghua Ma et al.

CVPR 2025 • arXiv:2502.19754
5 citations
#4933

Object-aware Sound Source Localization via Audio-Visual Scene Understanding

Sung Jin Um, Dongjin Kim, Sangmin Lee et al.

CVPR 2025 • arXiv:2506.18557
5 citations
#4934

LNS2+RL: Combining Multi-agent Reinforcement Learning with Large Neighborhood Search in Multi-agent Path Finding

Yutong Wang, Tanishq Duhan, Jiaoyang Li et al.

AAAI 2025 • paper • arXiv:2405.17794
5 citations
#4935

Many-Objective Multi-Solution Transport

Ziyue Li, Tian Li, Virginia Smith et al.

ICLR 2025 • arXiv:2403.04099
5 citations
#4936

PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Mobius Spatial Augmentation

Zidong Cao, Jinjing Zhu, Weiming Zhang et al.

CVPR 2025 • arXiv:2406.13378
5 citations
#4937

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Rui Lu, Runzhe Wang, Kaifeng Lyu et al.

ICLR 2025 • arXiv:2503.03595
5 citations
#4938

Cropper: Vision-Language Model for Image Cropping through In-Context Learning

Seung Hyun Lee, Jijun Jiang, Yiran Xu et al.

CVPR 2025 • arXiv:2408.07790
5 citations
#4939

FedSA: A Unified Representation Learning via Semantic Anchors for Prototype-based Federated Learning

Yanbing Zhou, Xiangmou Qu, Chenlong You et al.

AAAI 2025 • paper • arXiv:2501.05496
5 citations
#4940

Model-Free Offline Reinforcement Learning with Enhanced Robustness

Chi Zhang, Zain Ulabedeen Farhat, George Atia et al.

ICLR 2025
5 citations
#4941

Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synthesis

Woojung Han, Yeonkyung Lee, Chanyoung Kim et al.

CVPR 2025 • arXiv:2503.22168
5 citations
#4942

Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input

Jian Wang, Rishabh Dabral, Diogo Luvizon et al.

CVPR 2025 • arXiv:2504.08449
5 citations
#4943

ProtCLIP: Function-Informed Protein Multi-Modal Learning

Hanjing Zhou, Mingze Yin, Wei Wu et al.

AAAI 2025 • paper • arXiv:2412.20014
5 citations
#4944

Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion

Minkyoung Cho, Yulong Cao, Jiachen Sun et al.

ICLR 2025 • arXiv:2410.12592
5 citations
#4945

A Simple Approach to Unifying Diffusion-based Conditional Generation

Xirui Li, Charles Herrmann, Kelvin Chan et al.

ICLR 2025 • arXiv:2410.11439
5 citations
#4946

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025 • oral
5 citations
#4947

Specifying What You Know or Not for Multi-Label Class-Incremental Learning

Aoting Zhang, Dongbao Yang, Chang Liu et al.

AAAI 2025 • paper • arXiv:2503.17017
5 citations
#4948

Lawma: The Power of Specialization for Legal Annotation

Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe et al.

ICLR 2025 • arXiv:2407.16615
5 citations
#4949

Can Students Beyond the Teacher? Distilling Knowledge from Teacher’s Bias

Jianhua Zhang, Yi Gao, Ruyu Liu et al.

AAAI 2025 • paper • arXiv:2412.09874
5 citations
#4950

Convergence of Clipped SGD on Convex $(L_0,L_1)$-Smooth Functions

Ofir Gaash, Kfir Y. Levy, Yair Carmon

NEURIPS 2025 • arXiv:2502.16492
5 citations
#4951

IterIS: Iterative Inference-Solving Alignment for LoRA Merging

Hongxu Chen, Zhen Wang, Runshi Li et al.

CVPR 2025 • arXiv:2411.15231
5 citations
#4952

Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering

Klaus-Rudolf Kladny, Bernhard Schölkopf, Michael Muehlebach

ICLR 2025 • arXiv:2410.01660
5 citations
#4953

Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling

Sirui Li, Wenbin Ouyang, Yining Ma et al.

ICLR 2025 • arXiv:2502.15791
5 citations
#4954

When Are Concepts Erased From Diffusion Models?

Kevin Lu, Nicky Kriplani, Rohit Gandikota et al.

NEURIPS 2025 • arXiv:2505.17013
5 citations
#4955

Feature Clipping for Uncertainty Calibration

Linwei Tao, Minjing Dong, Chang Xu

AAAI 2025 • paper • arXiv:2410.19796
5 citations
#4956

EchoONE: Segmenting Multiple Echocardiography Planes in One Model

Jiongtong Hu, Wei Zhuo, Jun Cheng et al.

CVPR 2025 • arXiv:2412.02993
5 citations
#4957

LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models

Yu Cheng, Fajie Yuan

ICCV 2025 • arXiv:2503.14325
5 citations
#4958

Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection

Gensheng Pei, Tao Chen, Yujia Wang et al.

CVPR 2025 • arXiv:2503.17080
5 citations
#4959

Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters

WenZheng Zhang, Yang Hu, Jing Shi et al.

AAAI 2025 • paper • arXiv:2408.12596
5 citations
#4960

Skip-Vision: Efficient and Scalable Acceleration of Vision-Language Models via Adaptive Token Skipping

Weili Zeng, Ziyuan Huang, Kaixiang Ji et al.

ICCV 2025 • arXiv:2503.21817
5 citations
#4961

MOSCATO: Predicting Multiple Object State Change Through Actions

Parnian Zameni, Yuhan Shen, Ehsan Elhamifar

ICCV 2025
5 citations
#4962

The Fluorescent Veil: A Stealthy and Effective Physical Adversarial Patch Against Traffic Sign Recognition

Shuai Yuan, Xingshuo Han, Hongwei Li et al.

NEURIPS 2025 • arXiv:2409.12394
5 citations
#4963

Spiral: Semantic-Aware Progressive LiDAR Scene Generation and Understanding

Dekai Zhu, Yixuan Hu, Youquan Liu et al.

NEURIPS 2025 • arXiv:2505.22643
5 citations
#4964

Improved Balanced Classification with Theoretically Grounded Loss Functions

Corinna Cortes, Mehryar Mohri, Yutao Zhong

NEURIPS 2025 • arXiv:2512.23947
5 citations
#4965

Difficulty-aware Balancing Margin Loss for Long-tailed Recognition

Minseok Son, Inyong Koo, Jinyoung Park et al.

AAAI 2025 • paper • arXiv:2412.15477
5 citations
#4966

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Linh Tran, Wei Sun, Stacy Patterson et al.

ICLR 2025 • arXiv:2501.13904
5 citations
#4967

Uncertainty Quantification with the Empirical Neural Tangent Kernel

Joseph Wilson, Chris van der Heide, Liam Hodgkinson et al.

NEURIPS 2025 • arXiv:2502.02870
5 citations
#4968

SEPARATE: A Simple Low-rank Projection for Gradient Compression in Modern Large-scale Model Training Process

Hanzhen Zhao, Xingyu Xie, Cong Fang et al.

ICLR 2025
5 citations
#4969

Time-o1: Time-Series Forecasting Needs Transformed Label Alignment

Hao Wang, Licheng Pan, Zhichao Chen et al.

NEURIPS 2025 • oral • arXiv:2505.17847
5 citations
#4970

Harmonizing Visual and Textual Embeddings for Zero-Shot Text-to-Image Customization

Yeji Song, Jimyeong Kim, Wonhark Park et al.

AAAI 2025 • paper • arXiv:2403.14155
5 citations
#4971

Interpretable Generative Models through Post-hoc Concept Bottlenecks

Akshay R. Kulkarni, Ge Yan, Chung-En Sun et al.

CVPR 2025 • arXiv:2503.19377
5 citations
#4972

Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes

Jongmin Lee, Ernest Ryu

ICLR 2025 • arXiv:2504.09913
5 citations
#4973

Uncertainty and Influence aware Reward Model Refinement for Reinforcement Learning from Human Feedback

Zexu Sun, Yiju Guo, Yankai Lin et al.

ICLR 2025
5 citations
#4974

Novel View Synthesis with Pixel-Space Diffusion Models

Noam Elata, Bahjat Kawar, Yaron Ostrovsky-Berman et al.

CVPR 2025 • arXiv:2411.07765
5 citations
#4975

Distilled Prompt Learning for Incomplete Multimodal Survival Prediction

Yingxue Xu, Fengtao Zhou, Chenyu Zhao et al.

CVPR 2025 • arXiv:2503.01653
5 citations
#4976

Exact Expressive Power of Transformers with Padding

Will Merrill, Ashish Sabharwal

NEURIPS 2025 • arXiv:2505.18948
5 citations
#4977

BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models

Dingqiang Ye, Chao Fan, Zhanbo Huang et al.

NEURIPS 2025 • arXiv:2505.18132
5 citations
#4978

Reverse Diffusion Sequential Monte Carlo Samplers

Luhuan Wu, Yi Han, Christian Andersson Naesseth et al.

NEURIPS 2025 • arXiv:2508.05926
5 citations
#4979

Progressive Compression with Universally Quantized Diffusion Models

Yibo Yang, Justus Will, Stephan Mandt

ICLR 2025 • arXiv:2412.10935
5 citations
#4980

Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks

Nikolaos Tsilivis, Gal Vardi, Julia Kempe

ICLR 2025 • arXiv:2410.22069
5 citations
#4981

Multilevel neural simulation-based inference

Yuga Hikida, Ayush Bharti, Niall Jeffrey et al.

NEURIPS 2025 • arXiv:2506.06087
5 citations
#4982

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Qi Wang, Zhipeng Zhang, Baao Xie et al.

ICCV 2025 • arXiv:2503.08751
5 citations
#4983

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Chengyu Du, Jinyi Han, Yizhou Ying et al.

ICLR 2025 • arXiv:2410.13413
5 citations
#4984

Shape it Up! Restoring LLM Safety during Finetuning

ShengYun Peng, Pin-Yu Chen, Jianfeng Chi et al.

NEURIPS 2025 • arXiv:2505.17196
5 citations
#4985

FedSPU: Personalized Federated Learning for Resource-Constrained Devices with Stochastic Parameter Update

Ziru Niu, Hai Dong, A. K. Qin

AAAI 2025 • paper • arXiv:2403.11464
5 citations
#4986

Multimodal Variational Autoencoder: A Barycentric View

Peijie Qiu, Wenhui Zhu, Sayantan Kumar et al.

AAAI 2025 • paper • arXiv:2412.20487
5 citations
#4987

Noise-Resilient Symbolic Regression with Dynamic Gating Reinforcement Learning

Chenglu Sun, Shuo Shen, Wenzhi Tao et al.

AAAI 2025 • paper • arXiv:2501.01085
5 citations
#4988

InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation

Jinlai Liu, Jian Han, Bin Yan et al.

NEURIPS 2025 • oral
5 citations
#4989

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

Po-han Li, Sandeep Chinchali, Ufuk Topcu

ICLR 2025 • arXiv:2410.07610
5 citations
#4990

A Generic Framework for Conformal Fairness

Aditya Vadlamani, Anutam Srinivasan, Pranav Maneriker et al.

ICLR 2025 • arXiv:2505.16115
5 citations
#4991

Learning Normal Flow Directly From Events

Dehao Yuan, Levi Burner, Jiayi Wu et al.

ICCV 2025 • arXiv:2412.11284
5 citations
#4992

SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset

Peng Xie, Xingyuan Liu, Yequan Bie et al.

NEURIPS 2025 • arXiv:2506.00087
5 citations
#4993

A Simple Graph Contrastive Learning Framework for Short Text Classification

Yonghao Liu, Fausto Giunchiglia, Lan Huang et al.

AAAI 2025 • paper • arXiv:2501.09219
5 citations
#4994

SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization

Zhentao Tan, Ben Xue, Jian Jia et al.

ICCV 2025 • arXiv:2412.10443
5 citations
#4995

DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models

Lingshun Kong, Jiawei Zhang, Dongqing Zou et al.

NEURIPS 2025
5 citations
#4996

Rectifying Magnitude Neglect in Linear Attention

Qihang Fan, Huaibo Huang, Yuang Ai et al.

ICCV 2025 • highlight • arXiv:2507.00698
5 citations
#4997

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Kaining Ying, Henghui Ding, Guangquan Jie et al.

ICCV 2025 • arXiv:2507.22886
5 citations
#4998

Subgraph Aggregation for Out-of-Distribution Generalization on Graphs

Bowen Liu, Haoyang Li, Shuning Wang et al.

AAAI 2025 • paper • arXiv:2410.22228
5 citations
#4999

MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks

Nayoung Kim, Seongsu Kim, Minsu Kim et al.

ICLR 2025 • arXiv:2410.17270
5 citations
#5000

Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions

Xiaoran Jiao, Weian Mao, Wengong Jin et al.

ICLR 2025 • arXiv:2410.09543
5 citations