Most Cited 2025 Poster Papers

22,274 papers found • Page 26 of 112

#5001

Generative Medical Segmentation

Jiayu Huo, Xi Ouyang, Sébastien Ourselin et al.

AAAI 2025 • paper • arXiv:2403.18198
5 citations
#5002

Context-Enhanced Memory-Refined Transformer for Online Action Detection

Zhanzhong Pang, Fadime Sener, Angela Yao

CVPR 2025 • poster • arXiv:2503.18359
5 citations
#5003

Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification

Jiayu Jiang, Changxing Ding, Wentao Tan et al.

CVPR 2025 • highlight • arXiv:2503.09962
5 citations
#5004

A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking

Zixiang Zhao, Haowen Bai, Bingxin Ke et al.

NEURIPS 2025 • oral • arXiv:2505.19858
5 citations
#5005

Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing

Zhedong Zhang, Liang Li, Chenggang Yan et al.

CVPR 2025 • poster • arXiv:2503.12042
5 citations
#5006

LuxDiT: Lighting Estimation with Video Diffusion Transformer

Ruofan Liang, Kai He, Zan Gojcic et al.

NEURIPS 2025 • poster • arXiv:2509.03680
5 citations
#5007

On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach

Baoshun Tong, Hanjiang Lai, Yan Pan et al.

CVPR 2025 • poster
5 citations
#5008

3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement

Yihang Luo, Shangchen Zhou, Yushi Lan et al.

CVPR 2025 • poster • arXiv:2412.18565
5 citations
#5009

MonoBox: Tightness-Free Box-Supervised Polyp Segmentation Using Monotonicity Constraint

Qiang Hu, Zhenyu Yi, Ying Zhou et al.

AAAI 2025 • paper • arXiv:2404.01188
5 citations
#5010

Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization

Lingyun Zhang, Yu Xie, Yanwei Fu et al.

CVPR 2025 • poster • arXiv:2412.01244
5 citations
#5011

MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance

Jialong Guo, Ke Liu, Jiangchao Yao et al.

AAAI 2025 • paper • arXiv:2501.02427
5 citations
#5012

Variational Search Distributions

Dan Steinberg, Rafael Oliveira, Cheng Soon Ong et al.

ICLR 2025 • poster • arXiv:2409.06142
5 citations
#5013

Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space

Zelin Peng, Zhengqin Xu, Zhilin Zeng et al.

CVPR 2025 • poster
5 citations
#5014

Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function

Anna Grim, Jayaram Chandrashekar, Uygar Sümbül

AAAI 2025 • paper • arXiv:2501.01022
5 citations
#5015

OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging

Yijie Tang, Jiazhao Zhang, Yuqing Lan et al.

CVPR 2025 • poster • arXiv:2503.01309
5 citations
#5016

LICORICE: Label-Efficient Concept-Based Interpretable Reinforcement Learning

Zhuorui Ye, Stephanie Milani, Geoff Gordon et al.

ICLR 2025 • poster • arXiv:2407.15786
5 citations
#5017

PoseLLaVA: Pose Centric Multimodal LLM for Fine-Grained 3D Pose Manipulation

Dong Feng, Ping Guo, Encheng Peng et al.

AAAI 2025 • paper
5 citations
#5018

ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction

Yi Feng, Yu Han, Xijing Zhang et al.

AAAI 2025 • paper • arXiv:2412.11210
5 citations
#5019

E(3)-equivariant models cannot learn chirality: Field-based molecular generation

Alexandru Dumitrescu, Dani Korpela, Markus Heinonen et al.

ICLR 2025 • poster • arXiv:2402.15864
5 citations
#5020

DOLPHIN: A Programmable Framework for Scalable Neurosymbolic Learning

Aaditya Naik, Jason Liu, Claire Wang et al.

ICML 2025 • poster • arXiv:2410.03348
5 citations
#5021

Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner

Runa Eschenhagen, Aaron Defazio, Tsung-Hsien Lee et al.

NEURIPS 2025 • spotlight • arXiv:2506.03595
5 citations
#5022

MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi et al.

ICML 2025 • poster • arXiv:2502.10436
5 citations
#5023

Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation

Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu et al.

ICLR 2025 • poster • arXiv:2407.03856
5 citations
#5024

CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation

Kai Fang, Anqi Zhang, Guangyu Gao et al.

CVPR 2025 • poster • arXiv:2504.04156
5 citations
#5025

Question-Aware Gaussian Experts for Audio-Visual Question Answering

Hongyeob Kim, Inyoung Jung, Dayoon Suh et al.

CVPR 2025 • highlight • arXiv:2503.04459
5 citations
#5026

Enforcing Latent Euclidean Geometry in Single-Cell VAEs for Manifold Interpolation

Alessandro Palma, Sergei Rybakov, Leon Hetzel et al.

ICML 2025 • spotlight • arXiv:2507.11789
5 citations
#5027

Finding Local Diffusion Schrödinger Bridge using Kolmogorov-Arnold Network

Xingyu Qiu, Mengying Yang, Xinghua Ma et al.

CVPR 2025 • poster • arXiv:2502.19754
5 citations
#5028

Object-aware Sound Source Localization via Audio-Visual Scene Understanding

Sung Jin Um, Dongjin Kim, Sangmin Lee et al.

CVPR 2025 • poster • arXiv:2506.18557
5 citations
#5029

Inverse Problem Sampling in Latent Space Using Sequential Monte Carlo

Idan Achituve, Hai Victor Habi, Amir Rosenfeld et al.

ICML 2025 • poster • arXiv:2502.05908
5 citations
#5030

EvHDR-NeRF: Building High Dynamic Range Radiance Fields with Single Exposure Images and Events

Zehao Chen, Zhanfeng Liao, De Ma et al.

AAAI 2025 • paper
5 citations
#5031

PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Mobius Spatial Augmentation

Zidong Cao, Jinjing Zhu, Weiming Zhang et al.

CVPR 2025 • poster • arXiv:2406.13378
5 citations
#5032

What should a neuron aim for? Designing local objective functions based on information theory

Andreas C. Schneider, Valentin Neuhaus, David Ehrlich et al.

ICLR 2025 • poster • arXiv:2412.02482
5 citations
#5033

Φ-GAN: Physics-Inspired GAN for Generating SAR Images Under Limited Data

Xidan Zhang, Yihan Zhuang, Qian Guo et al.

ICCV 2025 • poster
5 citations
#5034

NeurOp-Diff: Continuous Remote Sensing Image Super-Resolution via Neural Operator Diffusion

Zihao Xu, Yuzhi Tang, Bowen Xu et al.

ICCV 2025
5 citations
#5035

AoP-SAM: Automation of Prompts for Efficient Segmentation

Yi Chen, Muyoung Son, Chuanbo Hua et al.

AAAI 2025 • paper • arXiv:2505.11980
5 citations
#5036

Language Models Can Predict Their Own Behavior

Dhananjay Ashok, Jonathan May

NEURIPS 2025 • poster • arXiv:2502.13329
5 citations
#5037

Causal LLM Routing: End-to-End Regret Minimization from Observational Data

Asterios Tsiourvas, Wei Sun, Georgia Perakis

NEURIPS 2025 • poster • arXiv:2505.16037
5 citations
#5038

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Hui Yuan, Yifan Zeng, Yue Wu et al.

ICLR 2025 • poster • arXiv:2410.13828
5 citations
#5039

Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit

Qizhou Chen, Taolin Zhang, Chengyu Wang et al.

AAAI 2025 • paper • arXiv:2408.09916
5 citations
#5040

SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation

Pengfei Chen, Lingxi Xie, Xinyue Huo et al.

ICLR 2025 • poster • arXiv:2407.16682
5 citations
#5041

Cropper: Vision-Language Model for Image Cropping through In-Context Learning

Seung Hyun Lee, Jijun Jiang, Yiran Xu et al.

CVPR 2025 • poster • arXiv:2408.07790
5 citations
#5042

EMPLACE: Self-Supervised Urban Scene Change Detection

Tim Alpherts, Sennay Ghebreab, Nanne van Noord

AAAI 2025 • paper • arXiv:2503.17716
5 citations
#5043

Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synthesis

Woojung Han, Yeonkyung Lee, Chanyoung Kim et al.

CVPR 2025 • poster • arXiv:2503.22168
5 citations
#5044

Accelerating Training with Neuron Interaction and Nowcasting Networks

Boris Knyazev, Abhinav Moudgil, Guillaume Lajoie et al.

ICLR 2025 • poster • arXiv:2409.04434
5 citations
#5045

Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input

Jian Wang, Rishabh Dabral, Diogo Luvizon et al.

CVPR 2025 • poster • arXiv:2504.08449
5 citations
#5046

Multi-Modal View Enhanced Large Vision Models for Long-Term Time Series Forecasting

ChengAo Shen, Wenchao Yu, Ziming Zhao et al.

NEURIPS 2025 • poster • arXiv:2505.24003
5 citations
#5047

Correlated Errors in Large Language Models

Elliot Myunghoon Kim, Avi Garg, Kenny Peng et al.

ICML 2025 • poster • arXiv:2506.07962
5 citations
#5048

What makes an Ensemble (Un) Interpretable?

Shahaf Bassan, Guy Amir, Meirav Zehavi et al.

ICML 2025 • poster • arXiv:2506.08216
5 citations
#5049

Multi-View Collaborative Learning Network for Speech Deepfake Detection

Kuiyuan Zhang, Zhongyun Hua, Rushi Lan et al.

AAAI 2025 • paper
5 citations
#5050

Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology

Wenhao Tang, Rong Qin, Heng Fang et al.

NEURIPS 2025 • poster • arXiv:2506.02408
5 citations
#5051

ZETA: Leveraging $Z$-order Curves for Efficient Top-$k$ Attention

Qiuhao Zeng, Jierui Huang, Peng Lu et al.

ICLR 2025 • poster • arXiv:2501.14577
5 citations
#5052

mRNA2vec: mRNA Embedding with Language Model in the 5'UTR-CDS for mRNA Design

Honggen Zhang, Xiangrui Gao, June Zhang et al.

AAAI 2025 • paper • arXiv:2408.09048
5 citations
#5053

Lightweight Predictive 3D Gaussian Splats

Junli Cao, Vidit Goel, Chaoyang Wang et al.

ICLR 2025 • poster • arXiv:2406.19434
5 citations
#5054

Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes

Kuiyuan Zhang, Zhongyun Hua, Rushi Lan et al.

AAAI 2025 • paper • arXiv:2412.12619
5 citations
#5055

Learning Orthogonal Multi-Index Models: A Fine-Grained Information Exponent Analysis

Yunwei Ren, Jason Lee

NEURIPS 2025 • poster • arXiv:2410.09678
5 citations
#5056

DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?

Tianhong Zhou, Xu Yin, Yingtao Zhu et al.

NEURIPS 2025 • poster • arXiv:2505.24173
5 citations
#5057

Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning

Fengyu Gao, Ruida Zhou, Tianhao Wang et al.

ICLR 2025 • poster • arXiv:2410.12085
5 citations
#5058

Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement

Chenxu Wu, Qingpeng Kong, Zihang Jiang et al.

ICLR 2025 • oral • arXiv:2501.13514
5 citations
#5059

Graph Structure Learning for Spatial-Temporal Imputation: Adapting to Node and Feature Scales

Xinyu Yang, Yu Sun, Xinyang Chen et al.

AAAI 2025 • paper • arXiv:2412.18535
5 citations
#5060

RaSA: Rank-Sharing Low-Rank Adaptation

Zhiwei He, Zhaopeng Tu, Xing Wang et al.

ICLR 2025 • poster • arXiv:2503.12576
5 citations
#5061

CAT: Content-Adaptive Image Tokenization

Junhong Shen, Kushal Tirumala, Michihiro Yasunaga et al.

NEURIPS 2025 • poster • arXiv:2501.03120
5 citations
#5062

Compliant Residual DAgger: Improving Real-World Contact-Rich Manipulation with Human Corrections

Xiaomeng Xu, Yifan Hou, Zeyi Liu et al.

NEURIPS 2025 • poster • arXiv:2506.16685
5 citations
#5063

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Bingquan Dai, Luo Li, Qihong Tang et al.

NEURIPS 2025 • poster • arXiv:2508.14879
5 citations
#5064

GTG: Generalizable Trajectory Generation Model for Urban Mobility

Jingyuan Wang, Yujing Lin, Yudong Li

AAAI 2025 • paper • arXiv:2502.01107
5 citations
#5065

Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation

Yiwei Shi, Muning Wen, Qi Zhang et al.

AAAI 2025 • paper • arXiv:2409.09541
5 citations
#5066

Tree-Sliced Wasserstein Distance with Nonlinear Projection

Thanh Tran, Viet Hoang Tran, Thanh Chu et al.

ICML 2025 • poster • arXiv:2505.00968
5 citations
#5067

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classifications

Yutong Xia, Runpeng Yu, Yuxuan Liang et al.

AAAI 2025 • paper • arXiv:2401.09953
5 citations
#5068

IterIS: Iterative Inference-Solving Alignment for LoRA Merging

Hongxu Chen, Zhen Wang, Runshi Li et al.

CVPR 2025 • poster • arXiv:2411.15231
5 citations
#5069

PSMGD: Periodic Stochastic Multi-Gradient Descent for Fast Multi-Objective Optimization

Mingjing Xu, Peizhong Ju, Jia Liu et al.

AAAI 2025 • paper • arXiv:2412.10961
5 citations
#5070

ECHOPulse: ECG Controlled Echocardio-gram Video Generation

Yiwei Li, Sekeun Kim, Zihao Wu et al.

ICLR 2025 • poster • arXiv:2410.03143
5 citations
#5071

Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval

Bingjun Luo, Jinpeng Wang, Zewen Wang et al.

AAAI 2025 • paper • arXiv:2501.15052
5 citations
#5072

Bridging Molecular Graphs and Large Language Models

Runze Wang, Mingqi Yang, Yanming Shen

AAAI 2025 • paper • arXiv:2503.03135
5 citations
#5073

Predicting Empirical AI Research Outcomes with Language Models

Jiaxin Wen, Chenglei Si, Yueh-Han Chen et al.

NEURIPS 2025 • poster • arXiv:2506.00794
5 citations
#5074

A Thorough Comparison Between Independent Cascade and Susceptible-Infected-Recovered Models

Panfeng Liu, Guoliang Qiu, Biaoshuai Tao et al.

AAAI 2025 • paper • arXiv:2408.11470
5 citations
#5075

Cluster Based Heterogeneous Federated Foundation Model Adaptation and Fine-Tuning

Xianda Wang, Yaqi Qiao, Duo Wu et al.

AAAI 2025 • paper
5 citations
#5076

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

Yu Chen, Jiatai Huang, Yan Dai et al.

ICLR 2025 • poster • arXiv:2410.03284
5 citations
#5077

DF-MIA: A Distribution-Free Membership Inference Attack on Fine-Tuned Large Language Models

Zhiheng Huang, Yannan Liu, Daojing He et al.

AAAI 2025 • paper
5 citations
#5078

Towards Learnable Anchor for Deep Multi-View Clustering

Bocheng Wang, Chusheng Zeng, Mulin Chen et al.

AAAI 2025 • paper • arXiv:2503.12427
5 citations
#5079

On Speeding Up Language Model Evaluation

Jin Zhou, Christian Belardi, Ruihan Wu et al.

ICLR 2025 • poster • arXiv:2407.06172
5 citations
#5080

EchoONE: Segmenting Multiple Echocardiography Planes in One Model

Jiongtong Hu, Wei Zhuo, Jun Cheng et al.

CVPR 2025 • poster • arXiv:2412.02993
5 citations
#5081

Neural Interactive Proofs

Lewis Hammond, Sam Adam-Day

ICLR 2025 • poster • arXiv:2412.08897
5 citations
#5082

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Junhao Cheng, Yuying Ge, Yixiao Ge et al.

ICCV 2025 • poster • arXiv:2504.01014
5 citations
#5083

GARLIC: GPT-Augmented Reinforcement Learning with Intelligent Control for Vehicle Dispatching

Xiao Han, Zijian Zhang, Xiangyu Zhao et al.

AAAI 2025 • paper • arXiv:2408.10286
5 citations
#5084

StressPrompt: Does Stress Impact Large Language Models and Human Performance Similarly?

Guobin Shen, Dongcheng Zhao, Aorigele Bao et al.

AAAI 2025 • paper • arXiv:2409.17167
5 citations
#5085

LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models

Yu Cheng, Fajie Yuan

ICCV 2025 • poster • arXiv:2503.14325
5 citations
#5086

Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection

Gensheng Pei, Tao Chen, Yujia Wang et al.

CVPR 2025 • poster • arXiv:2503.17080
5 citations
#5087

HeterGP: Bridging Heterogeneity in Graph Neural Networks with Multi-View Prompting

Fengyu Yan, Xiaobao Wang, Dongxiao He et al.

AAAI 2025 • paper
5 citations
#5088

Skip-Vision: Efficient and Scalable Acceleration of Vision-Language Models via Adaptive Token Skipping

Weili Zeng, Ziyuan Huang, Kaixiang Ji et al.

ICCV 2025 • poster • arXiv:2503.21817
5 citations
#5089

MOSCATO: Predicting Multiple Object State Change Through Actions

Parnian Zameni, Yuhan Shen, Ehsan Elhamifar

ICCV 2025 • poster
5 citations
#5090

NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval

Sepanta Zeighami, Zac Wellmer, Aditya Parameswaran

ICLR 2025 • poster • arXiv:2409.02343
5 citations
#5091

MrSteve: Instruction-Following Agents in Minecraft with What-Where-When Memory

Junyeong Park, Junmo Cho, Sungjin Ahn

ICLR 2025 • poster • arXiv:2411.06736
5 citations
#5092

mmFAS: Multimodal Face Anti-Spoofing Using Multi-Level Alignment and Switch-Attention Fusion

Geng Chen, Wuyuan Xie, Di Lin et al.

AAAI 2025 • paper
5 citations
#5093

FactorGCL: A Hypergraph-Based Factor Model with Temporal Residual Contrastive Learning for Stock Returns Prediction

Yitong Duan, Weiran Wang, Jian Li

AAAI 2025 • paper • arXiv:2502.05218
5 citations
#5094

RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts

Xuming He, Zhiyuan You, Junchao Gong et al.

NEURIPS 2025 • poster • arXiv:2508.12291
5 citations
#5095

Calibrating Expressions of Certainty

Peiqi Wang, Barbara Lam, Yingcheng Liu et al.

ICLR 2025 • poster • arXiv:2410.04315
5 citations
#5096

Convergence of Clipped SGD on Convex $(L_0,L_1)$-Smooth Functions

Ofir Gaash, Kfir Y. Levy, Yair Carmon

NEURIPS 2025 • poster • arXiv:2502.16492
5 citations
#5097

Strategic Classification With Externalities

Safwan Hossain, Evi Micha, Yiling Chen et al.

ICLR 2025 • poster • arXiv:2410.08032
5 citations
#5098

Interpretable Generative Models through Post-hoc Concept Bottlenecks

Akshay R. Kulkarni, Ge Yan, Chung-En Sun et al.

CVPR 2025 • poster • arXiv:2503.19377
5 citations
#5099

How to Verify Any (Reasonable) Distribution Property: Computationally Sound Argument Systems for Distributions

Tal Herman, Guy Rothblum

ICLR 2025 • poster • arXiv:2409.06594
5 citations
#5100

Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization

Vladimir Boza, Vladimir Macko

ICLR 2025 • poster
5 citations
#5101

Reinforcement learning with combinatorial actions for coupled restless bandits

Lily Xu, Bryan Wilder, Elias Khalil et al.

ICLR 2025 • poster • arXiv:2503.01919
5 citations
#5102

Severing Spurious Correlations with Data Pruning

Varun Mulchandani, Jung-Eun Kim

ICLR 2025 • poster • arXiv:2503.18258
5 citations
#5103

Latent Radiance Fields with 3D-aware 2D Representations

Chaoyi Zhou, Xi Liu, Feng Luo et al.

ICLR 2025 • poster • arXiv:2502.09613
5 citations
#5104

Novel View Synthesis with Pixel-Space Diffusion Models

Noam Elata, Bahjat Kawar, Yaron Ostrovsky-Berman et al.

CVPR 2025 • poster • arXiv:2411.07765
5 citations
#5105

The Fluorescent Veil: A Stealthy and Effective Physical Adversarial Patch Against Traffic Sign Recognition

Shuai Yuan, Xingshuo Han, Hongwei Li et al.

NEURIPS 2025 • poster • arXiv:2409.12394
5 citations
#5106

Improved Balanced Classification with Theoretically Grounded Loss Functions

Corinna Cortes, Mehryar Mohri, Yutao Zhong

NEURIPS 2025 • poster • arXiv:2512.23947
5 citations
#5107

Spiral: Semantic-Aware Progressive LiDAR Scene Generation and Understanding

Dekai Zhu, Yixuan Hu, Youquan Liu et al.

NEURIPS 2025 • poster • arXiv:2505.22643
5 citations
#5108

Distilled Prompt Learning for Incomplete Multimodal Survival Prediction

Yingxue Xu, Fengtao ZHOU, Chenyu Zhao et al.

CVPR 2025 • poster • arXiv:2503.01653
5 citations
#5109

When Are Concepts Erased From Diffusion Models?

Kevin Lu, Nicky Kriplani, Rohit Gandikota et al.

NEURIPS 2025 • poster • arXiv:2505.17013
5 citations
#5110

Leveraging Attention to Effectively Compress Prompts for Long-Context LLMs

Yunlong Zhao, Haoran Wu, Bo Xu

AAAI 2025 • paper
5 citations
#5111

Time-o1: Time-Series Forecasting Needs Transformed Label Alignment

Hao Wang, Licheng Pan, Zhichao Chen et al.

NEURIPS 2025 • oral • arXiv:2505.17847
5 citations
#5112

Uncertainty Quantification with the Empirical Neural Tangent Kernel

Joseph Wilson, Chris van der Heide, Liam Hodgkinson et al.

NEURIPS 2025 • poster • arXiv:2502.02870
5 citations
#5113

Accelerating 3D Molecule Generation via Jointly Geometric Optimal Transport

Haokai Hong, Wanyu LIN, KC Tan

ICLR 2025 • poster • arXiv:2405.15252
5 citations
#5114

Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection

Chuhan ZHANG, Chaoyang Zhu, Pingcheng Dong et al.

ICLR 2025 • poster • arXiv:2503.11005
5 citations
#5115

Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity

Wentao Guo, Jikai Long, Yimeng Zeng et al.

ICLR 2025 • poster
5 citations
#5116

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Qi Wang, Zhipeng Zhang, Baao Xie et al.

ICCV 2025 • poster • arXiv:2503.08751
5 citations
#5117

CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity Prediction

Rong Han, Xiaohong Liu, Tong Pan et al.

AAAI 2025 • paper • arXiv:2409.03773
5 citations
#5118

Shape it Up! Restoring LLM Safety during Finetuning

ShengYun Peng, Pin-Yu Chen, Jianfeng Chi et al.

NEURIPS 2025 • poster • arXiv:2505.17196
5 citations
#5119

Bayesian WeakS-to-Strong from Text Classification to Generation

Ziyun Cui, Ziyang Zhang, Guangzhi Sun et al.

ICLR 2025 • poster • arXiv:2406.03199
5 citations
#5120

Learning Normal Flow Directly From Events

Dehao Yuan, Levi Burner, Jiayi Wu et al.

ICCV 2025 • poster • arXiv:2412.11284
5 citations
#5121

On Generalization Across Environments In Multi-Objective Reinforcement Learning

Jayden Teoh, Pradeep Varakantham, Peter Vamplew

ICLR 2025 • poster • arXiv:2503.00799
5 citations
#5122

SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization

Zhentao Tan, Ben Xue, Jian Jia et al.

ICCV 2025 • poster • arXiv:2412.10443
5 citations
#5123

Exact Expressive Power of Transformers with Padding

Will Merrill, Ashish Sabharwal

NEURIPS 2025 • poster • arXiv:2505.18948
5 citations
#5124

Rectifying Magnitude Neglect in Linear Attention

Qihang Fan, Huaibo Huang, Yuang Ai et al.

ICCV 2025 • highlight • arXiv:2507.00698
5 citations
#5125

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Kaining Ying, Henghui Ding, Guangquan Jie et al.

ICCV 2025 • poster • arXiv:2507.22886
5 citations
#5126

Pushing the Limits of All-Atom Geometric Graph Neural Networks: Pre-Training, Scaling, and Zero-Shot Transfer

Zihan Pengmei, Zhengyuan Shen, Zichen Wang et al.

ICLR 2025 • poster • arXiv:2410.21683
5 citations
#5127

BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models

Dingqiang Ye, Chao Fan, Zhanbo Huang et al.

NEURIPS 2025 • poster • arXiv:2505.18132
5 citations
#5128

From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy

Julian Dörfler, Benito van der Zander, Markus Bläser et al.

ICLR 2025 • poster • arXiv:2405.07373
5 citations
#5129

Rethinking Fair Representation Learning for Performance-Sensitive Tasks

Charles Jones, Fabio De Sousa Ribeiro, Mélanie Roschewitz et al.

ICLR 2025 • poster • arXiv:2410.04120
5 citations
#5130

Learning to Communicate Through Implicit Communication Channels

Han Wang, Binbin Chen, zhang et al.

ICLR 2025 • poster • arXiv:2411.01553
5 citations
#5131

Reverse Diffusion Sequential Monte Carlo Samplers

Luhuan Wu, Yi Han, Christian Andersson Naesseth et al.

NEURIPS 2025 • poster • arXiv:2508.05926
5 citations
#5132

Multilevel neural simulation-based inference

Yuga Hikida, Ayush Bharti, Niall Jeffrey et al.

NEURIPS 2025 • poster • arXiv:2506.06087
5 citations
#5133

VLMaterial: Procedural Material Generation with Large Vision-Language Models

Beichen Li, Rundi Wu, Armando Solar-Lezama et al.

ICLR 2025 • poster • arXiv:2501.18623
5 citations
#5134

RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs

Jiaxing Wu, Lin Ning, Luyang Liu et al.

AAAI 2025 • paper • arXiv:2409.04421
5 citations
#5135

LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning

Ante Wang, Linfeng Song, Ye Tian et al.

AAAI 2025 • paper
5 citations
#5136

Self-Evolutionary Large Language Models Through Uncertainty-Enhanced Preference Optimization

Jianing Wang, Yang Zhou, Xiaocheng Zhang et al.

AAAI 2025 • paper • arXiv:2409.11212
5 citations
#5137

InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation

Jinlai Liu, Jian Han, Bin Yan et al.

NEURIPS 2025 • oral
5 citations
#5138

Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation

Ao Ma, Jiasong Feng, Ke Cao et al.

ICCV 2025 • poster • arXiv:2508.08949
5 citations
#5139

AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward

Haonan Han, Xiangzuo Wu, Huan Liao et al.

CVPR 2025 • poster • arXiv:2411.18654
5 citations
#5140

Revisiting Source-Free Domain Adaptation: a New Perspective via Uncertainty Control

Gezheng Xu, Hui GUO, Li Yi et al.

ICLR 2025 • poster
5 citations
#5141

Few-Shot, No Problem: Descriptive Continual Relation Extraction

Nguyen Xuan Thanh, Anh Duc Le, Quyen Tran et al.

AAAI 2025 • paper • arXiv:2502.20596
5 citations
#5142

STAR: Stability-Inducing Weight Perturbation for Continual Learning

Masih Eskandar, Tooba Imtiaz, Davin Hill et al.

ICLR 2025 • poster • arXiv:2503.01595
5 citations
#5143

SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision

Kangjie Zheng, Siyue Liang, Junwei Yang et al.

ICLR 2025 • poster • arXiv:2412.05569
5 citations
#5144

Infer Human’s Intentions Before Following Natural Language Instructions

Yanming Wan, Yue Wu, Yiping Wang et al.

AAAI 2025 • paper • arXiv:2409.18073
5 citations
#5145

Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Xingxuan Zhang, Haoran Wang, Jiansheng Li et al.

ICLR 2025 • poster • arXiv:2503.15579
5 citations
#5146

Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views

Chong Bao, Xiyu Zhang, Zehao Yu et al.

CVPR 2025 • poster • arXiv:2503.24382
5 citations
#5147

BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks

Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi et al.

ICLR 2025 • poster • arXiv:2412.04626
5 citations
#5148

Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation

Seyedreza Mohseni, Seyedali Mohammadi, Deepa Tilwani et al.

AAAI 2025 • paper • arXiv:2412.16135
5 citations
#5149

4Deform: Neural Surface Deformation for Robust Shape Interpolation

Lu Sang, Zehranaz Canfes, Dongliang Cao et al.

CVPR 2025 • poster • arXiv:2502.20208
5 citations
#5150

CaO2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation

Haoxuan Wang, Zhenghao Zhao, Junyi Wu et al.

ICCV 2025 • poster
5 citations
#5151

Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction

Mingyu Derek Ma, Xiaoxuan Wang, Yijia Xiao et al.

AAAI 2025 • paper • arXiv:2501.17326
5 citations
#5152

Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion

Minkyoung Cho, Yulong Cao, Jiachen Sun et al.

ICLR 2025 • poster • arXiv:2410.12592
5 citations
#5153

GLoRa: A Benchmark to Evaluate the Ability to Learn Long-Range Dependencies in Graphs

Dongzhuoran Zhou, Evgeny Kharlamov, Egor Kostylev

ICLR 2025 • poster
5 citations
#5154

Improving Multimodal Learning via Imbalanced Learning

Shicai Wei, Chunbo Luo, Yang Luo

ICCV 2025 • poster • arXiv:2507.10203
5 citations
#5155

Learning Diffusion Models with Flexible Representation Guidance

Chenyu Wang, Cai Zhou, Sharut Gupta et al.

NEURIPS 2025 • poster • arXiv:2507.08980
5 citations
#5156

HERO: Human Reaction Generation from Videos

Chengjun Yu, Wei Zhai, Yuhang Yang et al.

ICCV 2025 • poster • arXiv:2503.08270
5 citations
#5157

PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Model

Jinhua Zhang, Hualian Sheng, Sijia Cai et al.

ICCV 2025 • poster • arXiv:2407.06109
5 citations
#5158

DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction

Miaowei Wang, Yibo Zhang, Rui Ma et al.

CVPR 2025 • poster • arXiv:2503.05484
5 citations
#5159

Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering

Klaus-Rudolf Kladny, Bernhard Schölkopf, Michael Muehlebach

ICLR 2025 • poster • arXiv:2410.01660
5 citations
#5160

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Dongfang Li, Zetian Sun, Xinshuo Hu et al.

AAAI 2025 • paper • arXiv:2412.07393
5 citations
#5161

ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding

Muye Huang, Lingling Zhang, Jie Ma et al.

NEURIPS 2025 • poster • arXiv:2505.19076
5 citations
#5162

Multi-party Collaborative Attention Control for Image Customization

Han Yang, Chuanguang Yang, Qiuli Wang et al.

CVPR 2025 • poster • arXiv:2505.01428
5 citations
#5163

HeGTa: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding

Rihui Jin, Yu Li, Guilin Qi et al.

AAAI 2025 • paper • arXiv:2403.19723
5 citations
#5164

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Chenxin Tao, Shiqian Su, Xizhou Zhu et al.

CVPR 2025 • poster • arXiv:2412.16158
5 citations
#5165

CITI: Enhancing Tool Utilizing Ability in Large Language Models Without Sacrificing General Performance

Yupu Hao, Pengfei Cao, Zhuoran Jin et al.

AAAI 2025 • paper • arXiv:2409.13202
5 citations
#5166

SEPARATE: A Simple Low-rank Projection for Gradient Compression in Modern Large-scale Model Training Process

Hanzhen Zhao, Xingyu Xie, Cong Fang et al.

ICLR 2025 • poster
5 citations
#5167

From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport

Quentin Bouniot, Ievgen Redko, Anton Mallasto et al.

CVPR 2025 • poster • arXiv:2310.11439
5 citations
#5168

Model-Free Offline Reinforcement Learning with Enhanced Robustness

Chi Zhang, Zain Ulabedeen Farhat, George Atia et al.

ICLR 2025 • poster
5 citations
#5169

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Rui Lu, Runzhe Wang, Kaifeng Lyu et al.

ICLR 2025 • poster • arXiv:2503.03595
5 citations
#5170

MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks

Sanjoy Chowdhury, Mohamed Elmoghany, Yohan Abeysinghe et al.

NEURIPS 2025 • oral • arXiv:2506.07016
5 citations
#5171

Hardware-Rasterized Ray-Based Gaussian Splatting

Samuel Rota Bulò, Lorenzo Porzi, Nemanja Bartolovic et al.

CVPR 2025 • highlight • arXiv:2503.18682
5 citations
#5172

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning

Xiaochuan Li, Zichun Yu, Chenyan Xiong

ICLR 2025 • poster • arXiv:2410.14208
5 citations
#5173

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Sotiris Anagnostidis, Gregor Bachmann, Yeongmin Kim et al.

CVPR 2025 • highlight • arXiv:2502.20126
5 citations
#5174

MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging

Zihuan Qiu, Yi Xu, Chiyuan He et al.

NEURIPS 2025 • poster • arXiv:2505.11883
5 citations
#5175

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025 • oral
5 citations
#5176

Removing Reflections from RAW Photos

Eric Kee, Adam Pikielny, Kevin Blackburn-Matzen et al.

CVPR 2025 • poster • arXiv:2404.14414
5 citations
#5177

Lawma: The Power of Specialization for Legal Annotation

Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe et al.

ICLR 2025 • poster • arXiv:2407.16615
5 citations
#5178

Curly Flow Matching for Learning Non-gradient Field Dynamics

Katarina Petrović, Lazar Atanackovic, Viggo Moro et al.

NEURIPS 2025 • poster • arXiv:2510.26645
5 citations
#5179

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models

Yuchen Fan, Yuzhong Hong, Qiushi Wang et al.

AAAI 2025 • paper • arXiv:2412.12865
5 citations
#5180

PanoWan: Lifting Diffusion Video Generation Models to 360$^\circ$ with Latitude/Longitude-aware Mechanisms

Yifei Xia, Shuchen Weng, Siqi Yang et al.

NEURIPS 2025 • poster
5 citations
#5181

Conformal Language Model Reasoning with Coherent Factuality

Maxon Rubin-Toles, Maya Gambhir, Keshav Ramji et al.

ICLR 2025 • poster • arXiv:2505.17126
5 citations
#5182

Logic.py: Bridging the Gap between LLMs and Constraint Solvers

Pascal Kesseli, Peter O'Hearn, Ricardo Cabral

NEURIPS 2025 • poster • arXiv:2502.15776
5 citations
#5183

BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments

Xinghao Wang, Pengyu Wang, Bo Wang et al.

ICLR 2025 • poster • arXiv:2410.23918
5 citations
#5184

CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation

Zhuoyan Luo, Yinghao Wu, Tianheng Cheng et al.

ICCV 2025 • poster • arXiv:2405.15658
5 citations
#5185

In-Context Learning Strategies Emerge Rationally

Daniel Wurgaft, Ekdeep S Lubana, Core Francisco Park et al.

NEURIPS 2025 • poster • arXiv:2506.17859
5 citations
#5186

Low-Light Image Enhancement using Event-Based Illumination Estimation

Lei Sun, Yuhan Bao, Jiajun Zhai et al.

ICCV 2025 • poster • arXiv:2504.09379
5 citations
#5187

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Linh Tran, Wei Sun, Stacy Patterson et al.

ICLR 2025 • poster • arXiv:2501.13904
5 citations
#5188

Many-Objective Multi-Solution Transport

Ziyue Li, Tian Li, Virginia Smith et al.

ICLR 2025 • poster • arXiv:2403.04099
5 citations
#5189

Make Your Training Flexible: Towards Deployment-Efficient Video Models

Chenting Wang, Kunchang Li, Tianxiang Jiang et al.

ICCV 2025 • poster • arXiv:2503.14237
5 citations
#5190

Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts

Qizhou Chen, Chengyu Wang, Dakan Wang et al.

CVPR 2025 • poster • arXiv:2411.15432
5 citations
#5191

LNS2+RL: Combining Multi-agent Reinforcement Learning with Large Neighborhood Search in Multi-agent Path Finding

Yutong Wang, Tanishq Duhan, Jiaoyang Li et al.

AAAI 2025 • paper • arXiv:2405.17794
5 citations
#5192

Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images

Junxian Wu, Minheng Chen, Xinyi Ke et al.

CVPR 2025 • poster
5 citations
#5193

Incomplete and Unpaired Multi-View Graph Clustering with Cross-View Feature Fusion

Liang Zhao, Ziyue Wang, Xiao Wang et al.

AAAI 2025 • paper
5 citations
#5194

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Chengyu Du, Jinyi Han, Yizhou Ying et al.

ICLR 2025 • poster • arXiv:2410.13413
5 citations
#5195

Importance-Based Token Merging for Efficient Image and Video Generation

Haoyu Wu, Jingyi Xu, Hieu Le et al.

ICCV 2025 • poster • arXiv:2411.16720
5 citations
#5196

Understanding LLM Behaviors via Compression: Data Generation, Knowledge Acquisition and Scaling Laws

Zhixuan Pan, Shaowen Wang, Liao Pengfei et al.

NEURIPS 2025 • spotlight • arXiv:2504.09597
5 citations
#5197

ProtCLIP: Function-Informed Protein Multi-Modal Learning

Hanjing Zhou, Mingze Yin, Wei Wu et al.

AAAI 2025 • paper • arXiv:2412.20014
5 citations
#5198

Time-IMM: A Dataset and Benchmark for Irregular Multimodal Multivariate Time Series

Ching Chang, Jeehyun Hwang, Yidan Shi et al.

NEURIPS 2025 • poster • arXiv:2506.10412
5 citations
#5199

SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset

Peng Xie, Xingyuan Liu, Yequan Bie et al.

NEURIPS 2025 • poster • arXiv:2506.00087
5 citations
#5200

Semantic and Expressive Variations in Image Captions Across Languages

Andre Ye, Sebastin Santy, Jena D. Hwang et al.

CVPR 2025 • poster • arXiv:2310.14356
5 citations