Most Cited 2025 "universal inverted bottleneck" Papers

22,274 papers found • Page 26 of 112

#5001

FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction

Yitong Duan, Weiran Wang, Jian Li

AAAI 2025paperarXiv:2502.05218
5
citations
#5002

RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts

Xuming He, Zhiyuan You, Junchao Gong et al.

NEURIPS 2025arXiv:2508.12291
5
citations
#5003

Calibrating Expressions of Certainty

Peiqi Wang, Barbara Lam, Yingcheng Liu et al.

ICLR 2025arXiv:2410.04315
5
citations
#5004

Convergence of Clipped SGD on Convex $(L_0,L_1)$-Smooth Functions

Ofir Gaash, Kfir Y. Levy, Yair Carmon

NEURIPS 2025arXiv:2502.16492
5
citations
#5005

PSMGD: Periodic Stochastic Multi-Gradient Descent for Fast Multi-Objective Optimization

Mingjing Xu, Peizhong Ju, Jia Liu et al.

AAAI 2025paperarXiv:2412.10961
5
citations
#5006

How to Verify Any (Reasonable) Distribution Property: Computationally Sound Argument Systems for Distributions

Tal Herman, Guy Rothblum

ICLR 2025arXiv:2409.06594
5
citations
#5007

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classifications

Yutong Xia, Runpeng Yu, Yuxuan Liang et al.

AAAI 2025paperarXiv:2401.09953
5
citations
#5008

Strategic Classification With Externalities

Safwan Hossain, Evi Micha, Yiling Chen et al.

ICLR 2025arXiv:2410.08032
5
citations
#5009

Interpretable Generative Models through Post-hoc Concept Bottlenecks

Akshay R. Kulkarni, Ge Yan, Chung-En Sun et al.

CVPR 2025arXiv:2503.19377
5
citations
#5010

Cluster Based Heterogeneous Federated Foundation Model Adaptation and Fine-Tuning

Xianda Wang, Yaqi Qiao, Duo Wu et al.

AAAI 2025paper
5
citations
#5011

Reinforcement learning with combinatorial actions for coupled restless bandits

Lily Xu, Bryan Wilder, Elias Khalil et al.

ICLR 2025arXiv:2503.01919
5
citations
#5012

CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity Prediction

Rong Han, Xiaohong Liu, Tong Pan et al.

AAAI 2025paperarXiv:2409.03773
5
citations
#5013

Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization

Vladimir Boza, Vladimir Macko

ICLR 2025
5
citations
#5014

HeterGP: Bridging Heterogeneity in Graph Neural Networks with Multi-View Prompting

Fengyu Yan, Xiaobao Wang, Dongxiao He et al.

AAAI 2025paper
5
citations
#5015

Latent Radiance Fields with 3D-aware 2D Representations

Chaoyi Zhou, Xi Liu, Feng Luo et al.

ICLR 2025arXiv:2502.09613
5
citations
#5016

Novel View Synthesis with Pixel-Space Diffusion Models

Noam Elata, Bahjat Kawar, Yaron Ostrovsky-Berman et al.

CVPR 2025arXiv:2411.07765
5
citations
#5017

The Fluorescent Veil: A Stealthy and Effective Physical Adversarial Patch Against Traffic Sign Recognition

Shuai Yuan, Xingshuo Han, Hongwei Li et al.

NEURIPS 2025arXiv:2409.12394
5
citations
#5018

Improved Balanced Classification with Theoretically Grounded Loss Functions

Corinna Cortes, Mehryar Mohri, Yutao Zhong

NEURIPS 2025arXiv:2512.23947
5
citations
#5019

Spiral: Semantic-Aware Progressive LiDAR Scene Generation and Understanding

Dekai Zhu, Yixuan Hu, Youquan Liu et al.

NEURIPS 2025arXiv:2505.22643
5
citations
#5020

Towards Learnable Anchor for Deep Multi-View Clustering

Bocheng Wang, Chusheng Zeng, Mulin Chen et al.

AAAI 2025paperarXiv:2503.12427
5
citations
#5021

Distilled Prompt Learning for Incomplete Multimodal Survival Prediction

Yingxue Xu, Fengtao ZHOU, Chenyu Zhao et al.

CVPR 2025arXiv:2503.01653
5
citations
#5022

When Are Concepts Erased From Diffusion Models?

Kevin Lu, Nicky Kriplani, Rohit Gandikota et al.

NEURIPS 2025arXiv:2505.17013
5
citations
#5023

Severing Spurious Correlations with Data Pruning

Varun Mulchandani, Jung-Eun Kim

ICLR 2025arXiv:2503.18258
5
citations
#5024

Bridging Molecular Graphs and Large Language Models

Runze Wang, Mingqi Yang, Yanming Shen

AAAI 2025paperarXiv:2503.03135
5
citations
#5025

Time-o1: Time-Series Forecasting Needs Transformed Label Alignment

Hao Wang, Licheng Pan, Zhichao Chen et al.

NEURIPS 2025oralarXiv:2505.17847
5
citations
#5026

Uncertainty Quantification with the Empirical Neural Tangent Kernel

Joseph Wilson, Chris van der Heide, Liam Hodgkinson et al.

NEURIPS 2025arXiv:2502.02870
5
citations
#5027

Accelerating 3D Molecule Generation via Jointly Geometric Optimal Transport

Haokai Hong, Wanyu LIN, KC Tan

ICLR 2025arXiv:2405.15252
5
citations
#5028

Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity

Wentao Guo, Jikai Long, Yimeng Zeng et al.

ICLR 2025
5
citations
#5029

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Qi Wang, Zhipeng Zhang, Baao Xie et al.

ICCV 2025arXiv:2503.08751
5
citations
#5030

Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection

Chuhan ZHANG, Chaoyang Zhu, Pingcheng Dong et al.

ICLR 2025arXiv:2503.11005
5
citations
#5031

GARLIC: GPT-Augmented Reinforcement Learning with Intelligent Control for Vehicle Dispatching

Xiao Han, Zijian Zhang, Xiangyu Zhao et al.

AAAI 2025paperarXiv:2408.10286
5
citations
#5032

MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi et al.

ICML 2025arXiv:2502.10436
5
citations
#5033

Shape it Up! Restoring LLM Safety during Finetuning

ShengYun Peng, Pin-Yu Chen, Jianfeng Chi et al.

NEURIPS 2025arXiv:2505.17196
5
citations
#5034

On Generalization Across Environments In Multi-Objective Reinforcement Learning

Jayden Teoh, Pradeep Varakantham, Peter Vamplew

ICLR 2025arXiv:2503.00799
5
citations
#5035

Bayesian WeakS-to-Strong from Text Classification to Generation

Ziyun Cui, Ziyang Zhang, Guangzhi Sun et al.

ICLR 2025arXiv:2406.03199
5
citations
#5036

Learning Normal Flow Directly From Events

Dehao Yuan, Levi Burner, Jiayi Wu et al.

ICCV 2025arXiv:2412.11284
5
citations
#5037

SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization

Zhentao Tan, Ben Xue, Jian Jia et al.

ICCV 2025arXiv:2412.10443
5
citations
#5038

Exact Expressive Power of Transformers with Padding

Will Merrill, Ashish Sabharwal

NEURIPS 2025arXiv:2505.18948
5
citations
#5039

Learning to Communicate Through Implicit Communication Channels

Han Wang, Binbin Chen, zhang et al.

ICLR 2025arXiv:2411.01553
5
citations
#5040

Rectifying Magnitude Neglect in Linear Attention

Qihang Fan, Huaibo Huang, Yuang Ai et al.

ICCV 2025highlightarXiv:2507.00698
5
citations
#5041

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Kaining Ying, Henghui Ding, Guangquan Jie et al.

ICCV 2025arXiv:2507.22886
5
citations
#5042

Pushing the Limits of All-Atom Geometric Graph Neural Networks: Pre-Training, Scaling, and Zero-Shot Transfer

Zihan Pengmei, Zhengyuan Shen, Zichen Wang et al.

ICLR 2025arXiv:2410.21683
5
citations
#5043

BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models

Dingqiang Ye, Chao Fan, Zhanbo Huang et al.

NEURIPS 2025arXiv:2505.18132
5
citations
#5044

From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy

Julian Dörfler, Benito van der Zander, Markus Bläser et al.

ICLR 2025arXiv:2405.07373
5
citations
#5045

Reverse Diffusion Sequential Monte Carlo Samplers

Luhuan Wu, Yi Han, Christian Andersson Naesseth et al.

NEURIPS 2025arXiv:2508.05926
5
citations
#5046

Multilevel neural simulation-based inference

Yuga Hikida, Ayush Bharti, Niall Jeffrey et al.

NEURIPS 2025arXiv:2506.06087
5
citations
#5047

VLMaterial: Procedural Material Generation with Large Vision-Language Models

Beichen Li, Rundi Wu, Armando Solar-Lezama et al.

ICLR 2025arXiv:2501.18623
5
citations
#5048

InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation

Jinlai Liu, Jian Han, Bin Yan et al.

NEURIPS 2025oral
5
citations
#5049

Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation

Ao Ma, Jiasong Feng, Ke Cao et al.

ICCV 2025arXiv:2508.08949
5
citations
#5050

Leveraging Attention to Effectively Compress Prompts for Long-Context LLMs

Yunlong Zhao, Haoran Wu, Bo Xu

AAAI 2025paper
5
citations
#5051

AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward

Haonan Han, Xiangzuo Wu, Huan Liao et al.

CVPR 2025arXiv:2411.18654
5
citations
#5052

STAR: Stability-Inducing Weight Perturbation for Continual Learning

Masih Eskandar, Tooba Imtiaz, Davin Hill et al.

ICLR 2025arXiv:2503.01595
5
citations
#5053

Revisiting Source-Free Domain Adaptation: a New Perspective via Uncertainty Control

Gezheng Xu, Hui GUO, Li Yi et al.

ICLR 2025
5
citations
#5054

Rethinking Fair Representation Learning for Performance-Sensitive Tasks

Charles Jones, Fabio De Sousa Ribeiro, Mélanie Roschewitz et al.

ICLR 2025arXiv:2410.04120
5
citations
#5055

SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision

Kangjie Zheng, Siyue Liang, Junwei Yang et al.

ICLR 2025arXiv:2412.05569
5
citations
#5056

Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views

Chong Bao, Xiyu Zhang, Zehao Yu et al.

CVPR 2025arXiv:2503.24382
5
citations
#5057

Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Xingxuan Zhang, Haoran Wang, Jiansheng Li et al.

ICLR 2025arXiv:2503.15579
5
citations
#5058

BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks

Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi et al.

ICLR 2025arXiv:2412.04626
5
citations
#5059

4Deform: Neural Surface Deformation for Robust Shape Interpolation

Lu Sang, Zehranaz Canfes, Dongliang Cao et al.

CVPR 2025arXiv:2502.20208
5
citations
#5060

CaO2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation

Haoxuan Wang, Zhenghao Zhao, Junyi Wu et al.

ICCV 2025
5
citations
#5061

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning

Xiaochuan Li, Zichun Yu, Chenyan Xiong

ICLR 2025arXiv:2410.14208
5
citations
#5062

Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion

Minkyoung Cho, Yulong Cao, Jiachen Sun et al.

ICLR 2025arXiv:2410.12592
5
citations
#5063

Improving Multimodal Learning via Imbalanced Learning

Shicai Wei, Chunbo Luo, Yang Luo

ICCV 2025arXiv:2507.10203
5
citations
#5064

GLoRa: A Benchmark to Evaluate the Ability to Learn Long-Range Dependencies in Graphs

Dongzhuoran Zhou, Evgeny Kharlamov, Egor Kostylev

ICLR 2025
5
citations
#5065

RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs

Jiaxing Wu, Lin Ning, Luyang Liu et al.

AAAI 2025paperarXiv:2409.04421
5
citations
#5066

Learning Diffusion Models with Flexible Representation Guidance

Chenyu Wang, Cai Zhou, Sharut Gupta et al.

NEURIPS 2025arXiv:2507.08980
5
citations
#5067

HERO: Human Reaction Generation from Videos

Chengjun Yu, Wei Zhai, Yuhang Yang et al.

ICCV 2025arXiv:2503.08270
5
citations
#5068

PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Model

Jinhua Zhang, Hualian Sheng, Sijia Cai et al.

ICCV 2025arXiv:2407.06109
5
citations
#5069

DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction

Miaowei Wang, Yibo Zhang, Rui Ma et al.

CVPR 2025arXiv:2503.05484
5
citations
#5070

Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering

Klaus-Rudolf Kladny, Bernhard Schölkopf, Michael Muehlebach

ICLR 2025arXiv:2410.01660
5
citations
#5071

Few-Shot, No Problem: Descriptive Continual Relation Extraction

Nguyen Xuan Thanh, Anh Duc Le, Quyen Tran et al.

AAAI 2025paperarXiv:2502.20596
5
citations
#5072

ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding

Muye Huang, Lingling Zhang, Jie Ma et al.

NEURIPS 2025arXiv:2505.19076
5
citations
#5073

Infer Human’s Intentions Before Following Natural Language Instructions

Yanming Wan, Yue Wu, Yiping Wang et al.

AAAI 2025paperarXiv:2409.18073
5
citations
#5074

Multi-party Collaborative Attention Control for Image Customization

Han Yang, Chuanguang Yang, Qiuli Wang et al.

CVPR 2025arXiv:2505.01428
5
citations
#5075

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Chenxin Tao, Shiqian Su, Xizhou Zhu et al.

CVPR 2025arXiv:2412.16158
5
citations
#5076

LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning

Ante Wang, Linfeng Song, Ye Tian et al.

AAAI 2025paper
5
citations
#5077

SEPARATE: A Simple Low-rank Projection for Gradient Compression in Modern Large-scale Model Training Process

Hanzhen Zhao, Xingyu Xie, Cong Fang et al.

ICLR 2025
5
citations
#5078

From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport

Quentin Bouniot, Ievgen Redko, Anton Mallasto et al.

CVPR 2025arXiv:2310.11439
5
citations
#5079

MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks

Sanjoy Chowdhury, Mohamed Elmoghany, Yohan Abeysinghe et al.

NEURIPS 2025oralarXiv:2506.07016
5
citations
#5080

Model-Free Offline Reinforcement Learning with Enhanced Robustness

Chi Zhang, Zain Ulabedeen Farhat, George Atia et al.

ICLR 2025
5
citations
#5081

Hardware-Rasterized Ray-Based Gaussian Splatting

Samuel Rota Bulò, Lorenzo Porzi, Nemanja Bartolovic et al.

CVPR 2025highlightarXiv:2503.18682
5
citations
#5082

Self-Evolutionary Large Language Models Through Uncertainty-Enhanced Preference Optimization

Jianing Wang, Yang Zhou, Xiaocheng Zhang et al.

AAAI 2025paperarXiv:2409.11212
5
citations
#5083

DOLPHIN: A Programmable Framework for Scalable Neurosymbolic Learning

Aaditya Naik, Jason Liu, Claire Wang et al.

ICML 2025arXiv:2410.03348
5
citations
#5084

Conformal Language Model Reasoning with Coherent Factuality

Maxon Rubin-Toles, Maya Gambhir, Keshav Ramji et al.

ICLR 2025arXiv:2505.17126
5
citations
#5085

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Sotiris Anagnostidis, Gregor Bachmann, Yeongmin Kim et al.

CVPR 2025highlightarXiv:2502.20126
5
citations
#5086

MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging

Zihuan Qiu, Yi Xu, Chiyuan He et al.

NEURIPS 2025arXiv:2505.11883
5
citations
#5087

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025oral
5
citations
#5088

Removing Reflections from RAW Photos

Eric Kee, Adam Pikielny, Kevin Blackburn-Matzen et al.

CVPR 2025arXiv:2404.14414
5
citations
#5089

Curly Flow Matching for Learning Non-gradient Field Dynamics

Katarina Petrović, Lazar Atanackovic, Viggo Moro et al.

NEURIPS 2025arXiv:2510.26645
5
citations
#5090

PanoWan: Lifting Diffusion Video Generation Models to 360$^\circ$ with Latitude/Longitude-aware Mechanisms

Yifei Xia, Shuchen Weng, Siqi Yang et al.

NEURIPS 2025
5
citations
#5091

Lawma: The Power of Specialization for Legal Annotation

Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe et al.

ICLR 2025arXiv:2407.16615
5
citations
#5092

Logic.py: Bridging the Gap between LLMs and Constraint Solvers

Pascal Kesseli, Peter O'Hearn, Ricardo Cabral

NEURIPS 2025arXiv:2502.15776
5
citations
#5093

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Dongfang Li, Zetian Sun, Xinshuo Hu et al.

AAAI 2025paperarXiv:2412.07393
5
citations
#5094

BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments

Xinghao Wang, Pengyu Wang, Bo Wang et al.

ICLR 2025arXiv:2410.23918
5
citations
#5095

CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation

Zhuoyan Luo, Yinghao Wu, Tianheng Cheng et al.

ICCV 2025arXiv:2405.15658
5
citations
#5096

In-Context Learning Strategies Emerge Rationally

Daniel Wurgaft, Ekdeep S Lubana, Core Francisco Park et al.

NEURIPS 2025arXiv:2506.17859
5
citations
#5097

HeGTa: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding

Rihui Jin, Yu Li, Guilin Qi et al.

AAAI 2025paperarXiv:2403.19723
5
citations
#5098

Low-Light Image Enhancement using Event-Based Illumination Estimation

Lei Sun, Yuhan Bao, Jiajun Zhai et al.

ICCV 2025arXiv:2504.09379
5
citations
#5099

Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction

Mingyu Derek Ma, Xiaoxuan Wang, Yijia Xiao et al.

AAAI 2025paperarXiv:2501.17326
5
citations
#5100

Many-Objective Multi-Solution Transport

Ziyue Li, Tian Li, Virginia Smith et al.

ICLR 2025arXiv:2403.04099
5
citations
#5101

Make Your Training Flexible: Towards Deployment-Efficient Video Models

Chenting Wang, Kunchang Li, Tianxiang Jiang et al.

ICCV 2025arXiv:2503.14237
5
citations
#5102

Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts

Qizhou Chen, Chengyu Wang, Dakan Wang et al.

CVPR 2025arXiv:2411.15432
5
citations
#5103

CITI: Enhancing Tool Utilizing Ability in Large Language Models Without Sacrificing General Performance

Yupu Hao, Pengfei Cao, Zhuoran Jin et al.

AAAI 2025paperarXiv:2409.13202
5
citations
#5104

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Linh Tran, Wei Sun, Stacy Patterson et al.

ICLR 2025arXiv:2501.13904
5
citations
#5105

Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images

Junxian Wu, Minheng Chen, Xinyi Ke et al.

CVPR 2025
5
citations
#5106

Importance-Based Token Merging for Efficient Image and Video Generation

Haoyu Wu, Jingyi Xu, Hieu Le et al.

ICCV 2025arXiv:2411.16720
5
citations
#5107

Understanding LLM Behaviors via Compression: Data Generation, Knowledge Acquisition and Scaling Laws

Zhixuan Pan, Shaowen Wang, Liao Pengfei et al.

NEURIPS 2025spotlightarXiv:2504.09597
5
citations
#5108

Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes

Jongmin Lee, Ernest Ryu

ICLR 2025arXiv:2504.09913
5
citations
#5109

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models

Yuchen Fan, Yuzhong Hong, Qiushi Wang et al.

AAAI 2025paperarXiv:2412.12865
5
citations
#5110

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Chengyu Du, Jinyi Han, Yizhou Ying et al.

ICLR 2025arXiv:2410.13413
5
citations
#5111

Time-IMM: A Dataset and Benchmark for Irregular Multimodal Multivariate Time Series

Ching Chang, Jeehyun Hwang, Yidan Shi et al.

NEURIPS 2025arXiv:2506.10412
5
citations
#5112

SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset

Peng Xie, Xingyuan Liu, Yequan Bie et al.

NEURIPS 2025arXiv:2506.00087
5
citations
#5113

Semantic and Expressive Variations in Image Captions Across Languages

Andre Ye, Sebastin Santy, Jena D. Hwang et al.

CVPR 2025arXiv:2310.14356
5
citations
#5114

NightAdapter: Learning a Frequency Adapter for Generalizable Night-time Scene Segmentation

Qi Bi, Jingjun Yi, Huimin Huang et al.

CVPR 2025
5
citations
#5115

Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling

Sirui Li, Wenbin Ouyang, Yining Ma et al.

ICLR 2025arXiv:2502.15791
5
citations
#5116

ABC-Former: Auxiliary Bimodal Cross-domain Transformer with Interactive Channel Attention for White Balance

Yu-Cheng Chiu, GUAN-RONG CHEN, Zihao Chen et al.

CVPR 2025
5
citations
#5117

BEDLAM2.0: Synthetic humans and cameras in motion

Joachim Tesch, Giorgio Becherini, Prerana Achar et al.

NEURIPS 2025oralarXiv:2511.14394
5
citations
#5118

DeblurDiff: Real-Word Image Deblurring with Generative Diffusion Models

Lingshun Kong, Jiawei Zhang, Dongqing Zou et al.

NEURIPS 2025
5
citations
#5119

Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks

Nikolaos Tsilivis, Gal Vardi, Julia Kempe

ICLR 2025arXiv:2410.22069
5
citations
#5120

UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models

Yuning Han, Bingyin Zhao, Rui Chu et al.

CVPR 2025highlightarXiv:2412.11441
5
citations
#5121

LNS2+RL: Combining Multi-agent Reinforcement Learning with Large Neighborhood Search in Multi-agent Path Finding

Yutong Wang, Tanishq Duhan, Jiaoyang Li et al.

AAAI 2025paperarXiv:2405.17794
5
citations
#5122

SocialGesture: Delving into Multi-person Gesture Understanding

Xu Cao, Pranav Virupaksha, Wenqi Jia et al.

CVPR 2025arXiv:2504.02244
5
citations
#5123

TFCustom: Customized Image Generation with Time-Aware Frequency Feature Guidance

Mushui Liu, Dong She, Qihan Huang et al.

CVPR 2025highlight
5
citations
#5124

Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation

Xin Yan, Yuxuan Cai, Qiuyue Wang et al.

CVPR 2025arXiv:2412.01316
5
citations
#5125

A Generic Framework for Conformal Fairness

Aditya Vadlamani, Anutam Srinivasan, Pranav Maneriker et al.

ICLR 2025arXiv:2505.16115
5
citations
#5126

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Yiyang Du, Xiaochen Wang, Chi Chen et al.

CVPR 2025arXiv:2503.23733
5
citations
#5127

Progressive Compression with Universally Quantized Diffusion Models

Yibo Yang, Justus Will, Stephan Mandt

ICLR 2025arXiv:2412.10935
5
citations
#5128

A Simple Approach to Unifying Diffusion-based Conditional Generation

Xirui Li, Charles Herrmann, Kelvin Chan et al.

ICLR 2025arXiv:2410.11439
5
citations
#5129

EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding

Ege Özsoy, Arda Mamur, Felix Tristram et al.

NEURIPS 2025arXiv:2505.24287
5
citations
#5130

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

Po-han Li, Sandeep Chinchali, ufuk topcu

ICLR 2025arXiv:2410.07610
5
citations
#5131

ProtCLIP: Function-Informed Protein Multi-Modal Learning

Hanjing Zhou, Mingze Yin, Wei Wu et al.

AAAI 2025paperarXiv:2412.20014
5
citations
#5132

MindGYM: What Matters in Question Synthesis for Thinking-Centric Fine-Tuning?

Zhe Xu, Daoyuan Chen, Zhenqing Ling et al.

NEURIPS 2025arXiv:2503.09499
5
citations
#5133

Uncertainty and Influence aware Reward Model Refinement for Reinforcement Learning from Human Feedback

Zexu Sun, Yiju Guo, Yankai Lin et al.

ICLR 2025
5
citations
#5134

Incomplete and Unpaired Multi-View Graph Clustering with Cross-View Feature Fusion

Liang Zhao, Ziyue Wang, Xiao Wang et al.

AAAI 2025paper
5
citations
#5135

FedSA: A Unified Representation Learning via Semantic Anchors for Prototype-based Federated Learning

Yanbing Zhou, Xiangmou Qu, Chenlong You et al.

AAAI 2025paperarXiv:2501.05496
5
citations
#5136

Specifying What You Know or Not for Multi-Label Class-Incremental Learning

Aoting Zhang, Dongbao Yang, Chang Liu et al.

AAAI 2025paperarXiv:2503.17017
5
citations
#5137

PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions

Daeun Kyung, Hyunseung Chung, Seongsu Bae et al.

NEURIPS 2025spotlightarXiv:2505.17818
5
citations
#5138

Can Students Beyond the Teacher? Distilling Knowledge from Teacher’s Bias

Jianhua Zhang, Yi Gao, Ruyu Liu et al.

AAAI 2025paperarXiv:2412.09874
5
citations
#5139

Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion

ZhiFei Chen, Tianshuo Xu, Wenhang Ge et al.

CVPR 2025arXiv:2412.15050
5
citations
#5140

Improving Transferable Targeted Attacks with Feature Tuning Mixup

Kaisheng Liang, Xuelong Dai, Yanjie Li et al.

CVPR 2025arXiv:2411.15553
5
citations
#5141

From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

Jiawei Lin, Shizhao Sun, Danqing Huang et al.

CVPR 2025arXiv:2412.19712
5
citations
#5142

Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters

WenZheng Zhang, Yang Hu, Jing Shi et al.

AAAI 2025paperarXiv:2408.12596
5
citations
#5143

DiffVsgg: Diffusion-Driven Online Video Scene Graph Generation

Mu Chen, Liulei Li, Wenguan Wang et al.

CVPR 2025arXiv:2503.13957
5
citations
#5144

Open-World Objectness Modeling Unifies Novel Object Detection

Shan Zhang, Yao Ni, Jinhao Du et al.

CVPR 2025
5
citations
#5145

Noise-Resilient Symbolic Regression with Dynamic Gating Reinforcement Learning

Chenglu Sun, Shuo Shen, Wenzhi Tao et al.

AAAI 2025paperarXiv:2501.01085
5
citations
#5146

Feature Clipping for Uncertainty Calibration

Linwei Tao, Minjing Dong, Chang Xu

AAAI 2025paperarXiv:2410.19796
5
citations
#5147

MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding

YUXIANG WEI, Yanteng Zhang, Xi Xiao et al.

NEURIPS 2025arXiv:2505.15946
5
citations
#5148

One2Any: One-Reference 6D Pose Estimation for Any Object

Mengya Liu, Siyuan Li, Ajad Chhatkuli et al.

CVPR 2025arXiv:2505.04109
5
citations
#5149

Difficulty-aware Balancing Margin Loss for Long-tailed Recognition

Minseok Son, Inyong Koo, Jinyoung Park et al.

AAAI 2025paperarXiv:2412.15477
5
citations
#5150

Harmonizing Visual and Textual Embeddings for Zero-Shot Text-to-Image Customization

Yeji Song, Jimyeong Kim, Wonhark Park et al.

AAAI 2025paperarXiv:2403.14155
5
citations
#5151

Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning

Weidong Liu, Jiyuan Tu, Xi Chen et al.

NEURIPS 2025arXiv:2310.02581
5
citations
#5152

Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions

Xiaoran Jiao, Weian Mao, Wengong Jin et al.

ICLR 2025arXiv:2410.09543
5
citations
#5153

MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks

Nayoung Kim, Seongsu Kim, Minsu Kim et al.

ICLR 2025arXiv:2410.17270
5
citations
#5154

Understanding Prompt Tuning and In-Context Learning via Meta-Learning

Tim Genewein, Kevin Li, Jordi Grau-Moya et al.

NEURIPS 2025spotlightarXiv:2505.17010
5
citations
#5155

Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling

Xinglin Wang, Yiwei Li, Shaoxiong Feng et al.

NEURIPS 2025arXiv:2506.15707
5
citations
#5156

Integral Imprecise Probability Metrics

Siu Lun (Alan) Chau, Michele Caprio, Krikamol Muandet

NEURIPS 2025arXiv:2505.16156
5
citations
#5157

Multimodal Variational Autoencoder: A Barycentric View

Peijie Qiu, Wenhui Zhu, Sayantan Kumar et al.

AAAI 2025paperarXiv:2412.20487
5
citations
#5158

PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation

Ao Wang, Hui Chen, Jianchao Tan et al.

NEURIPS 2025arXiv:2412.03409
5
citations
#5159

JTD-UAV: MLLM-Enhanced Joint Tracking and Description Framework for Anti-UAV Systems

Yifan Wang, Jian Zhao, Zhaoxin Fan et al.

CVPR 2025
5
citations
#5160

FedSPU: Personalized Federated Learning for Resource-Constrained Devices with Stochastic Parameter Update

Ziru Niu, Hai Dong, A. K. Qin

AAAI 2025paperarXiv:2403.11464
5
citations
#5161

Gaussian Splatting with Discretized SDF for Relightable Assets

Zuo-Liang Zhu, jian Yang, Beibei Wang

ICCV 2025arXiv:2507.15629
5
citations
#5162

Atomic Thinking of LLMs: Decoupling and Exploring Mathematical Reasoning Abilities

Jiayi Kuang, Haojing Huang, Yinghui Li et al.

NEURIPS 2025arXiv:2509.25725
5
citations
#5163

TikZero: Zero-Shot Text-Guided Graphics Program Synthesis

Jonas Belouadi, Eddy Ilg, Margret Keuper et al.

ICCV 2025highlightarXiv:2503.11509
5
citations
#5164

PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling

Hao Zhang, Haolan Xu, Chun Feng et al.

ICCV 2025arXiv:2506.20936
5
citations
#5165

Latent Diffusion Models with Masked AutoEncoders

Junho Lee, Jeongwoo Shin, Hyungwook Choi et al.

ICCV 2025arXiv:2507.09984
5
citations
#5166

OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images

Ziyue Huang, Yongchao Feng, Ziqi Liu et al.

ICCV 2025arXiv:2503.06146
5
citations
#5167

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

Chenyu Zhou, Mengdan Zhang, Peixian Chen et al.

ICLR 2025arXiv:2406.10228
5
citations
#5168

Birth and Death of a Rose

Chen Geng, Yunzhi Zhang, Shangzhe Wu et al.

CVPR 2025arXiv:2412.05278
5
citations
#5169

QA-Calibration of Language Model Confidence Scores

Putra Manggala, Atalanti A Mastakouri, Elke Kirschbaum et al.

ICLR 2025arXiv:2410.06615
5
citations
#5170

SimMLM: A Simple Framework for Multi-modal Learning with Missing Modality

Sijie Li, Chen Chen, Jungong Han

ICCV 2025arXiv:2507.19264
5
citations
#5171

Subgraph Aggregation for Out-of-Distribution Generalization on Graphs

Bowen Liu, Haoyang Li, Shuning Wang et al.

AAAI 2025paperarXiv:2410.22228
5
citations
#5172

Adversarial Domain Prompt Tuning and Generation for Single Domain Generalization

Zhipeng Xu, De Cheng, XINYANG JIANG et al.

CVPR 2025
5
citations
#5173

GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments

Lin Zeng, Boming Zhao, Jiarui Hu et al.

ICCV 2025arXiv:2508.08867
5
citations
#5174

InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction

Bin Lei, Weitai Kang, Zijian Zhang et al.

NEURIPS 2025arXiv:2505.10887
5
citations
#5175

SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering

Byeongjun Park, Hyojun Go, Hyelin Nam et al.

ICCV 2025arXiv:2503.12024
5
citations
#5176

Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets

Yuxin Wang, Maresa Schröder, Dennis Frauen et al.

ICLR 2025arXiv:2412.11511
5
citations
#5177

Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation

Shahad Albastaki, Anabia Sohail, IYYAKUTTI IYAPPAN GANAPATHI et al.

CVPR 2025arXiv:2504.18856
5
citations
#5178

Episodic Novelty Through Temporal Distance

Yuhua Jiang, Qihan Liu, Yiqin Yang et al.

ICLR 2025oralarXiv:2501.15418
5
citations
#5179

SEGS-SLAM: Structure-enhanced 3D Gaussian Splatting SLAM with Appearance Embedding

Tianci Wen, Zhiang Liu, Yongchun Fang

ICCV 2025arXiv:2501.05242
5
citations
#5180

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

Junyuan Deng, Wei Yin, Xiaoyang Guo et al.

ICCV 2025arXiv:2411.17240
5
citations
#5181

Natural Language Inference Improves Compositionality in Vision-Language Models

Paola Cascante-Bonilla, Yu (Hope) Hou, Yang Cao et al.

ICLR 2025arXiv:2410.22315
5
citations
#5182

New Perspectives on the Polyak Stepsize: Surrogate Functions and Negative Results

Francesco Orabona, Ryan D'Orazio

NEURIPS 2025arXiv:2505.20219
5
citations
#5183

Self-Updatable Large Language Models by Integrating Context into Model Parameters

Yu Wang, Xinshuang Liu, Xiusi Chen et al.

ICLR 2025arXiv:2410.00487
5
citations
#5184

Re-evaluating Open-ended Evaluation of Large Language Models

Si-Qi Liu, Ian Gemp, Luke Marris et al.

ICLR 2025arXiv:2502.20170
5
citations
#5185

SymMaP: Improving Computational Efficiency in Linear Solvers through Symbolic Preconditioning

Hong Wang, Jie Wang, Minghao Ma et al.

NEURIPS 2025arXiv:2510.24170
5
citations
#5186

Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHR Data

Michael Wornow, Suhana Bedi, Miguel Angel Fuentes Hernandez et al.

ICLR 2025
5
citations
#5187

Adaptive Non-Uniform Timestep Sampling for Accelerating Diffusion Model Training

Myunsoo Kim, Donghyeon Ki, Seong-Woong Shim et al.

CVPR 2025arXiv:2411.09998
5
citations
#5188

Frequency-Dynamic Attention Modulation For Dense Prediction

Linwei Chen, Lin Gu, Ying Fu

ICCV 2025arXiv:2507.12006
5
citations
#5189

HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis

Yuto Nishimura, Takumi Hirose, Masanari Ohi et al.

ICLR 2025arXiv:2410.04380
5
citations
#5190

Orientation Matters: Making 3D Generative Models Orientation-Aligned

Yichong Lu, Yuzhuo Tian, Zijin Jiang et al.

NEURIPS 2025arXiv:2506.08640
5
citations
#5191

Certification of Speaker Recognition Models to Additive Perturbations

Dmitrii Korzh, Elvir Karimov, Mikhail Pautov et al.

AAAI 2025paperarXiv:2404.18791
5
citations
#5192

p-Mean Regret for Stochastic Bandits

Anand Krishna, Philips George John, Adarsh Barik et al.

AAAI 2025paperarXiv:2412.10751
5
citations
#5193

RILQ: Rank-Insensitive LoRA-Based Quantization Error Compensation for Boosting 2-Bit Large Language Model Accuracy

Geonho Lee, Janghwan Lee, Sukjin Hong et al.

AAAI 2025paperarXiv:2412.01129
5
citations
#5194

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

Gengze Zhou, Yicong Hong, Zun Wang et al.

ICCV 2025arXiv:2412.05552
5
citations
#5195

DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models

hongji yang, Wencheng Han, Yucheng Zhou et al.

ICCV 2025arXiv:2502.14779
5
citations
#5196

SAP: Corrective Machine Unlearning with Scaled Activation Projection for Label Noise Robustness

Sangamesh Kodge, Deepak Ravikumar, Gobinda Saha et al.

AAAI 2025paperarXiv:2403.08618
5
citations
#5197

WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection

Haodong Zhu, Wenhao Dong, Linlin Yang et al.

ICCV 2025arXiv:2507.18173
5
citations
#5198

LongDiff: Training-Free Long Video Generation in One Go

Zhuoling Li, Hossein Rahmani, Qiuhong Ke et al.

CVPR 2025arXiv:2503.18150
5
citations
#5199

X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation

jian ma, Qirong Peng, Xu Guo et al.

ICCV 2025arXiv:2503.06134
5
citations
#5200

Is Large-scale Pretraining the Secret to Good Domain Generalization?

Piotr Teterwak, Kuniaki Saito, Theodoros Tsiligkaridis et al.

ICLR 2025arXiv:2412.02856
5
citations