Most Cited 2025 "diffusion" Papers

22,274 papers found • Page 39 of 112

Filters:Most Cited 2025 diffusion Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#7601

Action-Agnostic Point-Level Supervision for Temporal Action Detection

Shuhei M. Yoshida, Takashi Shibata, Makoto Terao et al.

AAAI 2025paperarXiv:2412.21205

citations

#7602

Advancing Loss Functions in Recommender Systems: A Comparative Study with a Rényi Divergence-Based Solution

Shengjia Zhang, Jiawei Chen, Changdong Li et al.

AAAI 2025paperarXiv:2506.15120

citations

#7603

From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy

Julian Dörfler, Benito van der Zander, Markus Bläser et al.

ICLR 2025arXiv:2405.07373

citations

#7604

On Teacher Hacking in Language Model Distillation

Daniil Tiapkin, Daniele Calandriello, Johan Ferret et al.

ICML 2025arXiv:2502.02671

citations

#7605

GTG: Generalizable Trajectory Generation Model for Urban Mobility

Jingyuan Wang, Yujing Lin, Yudong Li

AAAI 2025paperarXiv:2502.01107

citations

#7606

When Selection Meets Intervention: Additional Complexities in Causal Discovery

Haoyue Dai, Ignavier Ng, Jianle Sun et al.

ICLR 2025arXiv:2503.07302

citations

#7607

Replacing Paths with Connection-Biased Attention for Knowledge Graph Completion

Sharmishtha Dutta, Alex Gittens, Mohammed J. Zaki et al.

AAAI 2025paperarXiv:2410.00876

citations

#7608

Graph Structure Refinement with Energy-based Contrastive Learning

Xianlin Zeng, Yufeng Wang, Yuqi Sun et al.

AAAI 2025paperarXiv:2412.17856

citations

#7609

Blink of an eye: a simple theory for feature localization in generative models

Marvin Li, Aayush Karan, Sitan Chen

ICML 2025oralarXiv:2502.00921

citations

#7610

Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities

Yifang Chen, Xiaoyu Li, Yingyu Liang et al.

ICML 2025

citations

#7611

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Linh Tran, Wei Sun, Stacy Patterson et al.

ICLR 2025arXiv:2501.13904

citations

#7612

Disentangling Representations through Multi-task Learning

Pantelis Vafidis, Aman Bhargava, Antonio Rangel

ICLR 2025arXiv:2407.11249

citations

#7613

Deep Generative Model for Mechanical System Configuration Design

Yasaman Etesam, Hyunmin Cheong, Mohammadmehdi Ataei et al.

AAAI 2025paperarXiv:2409.06016

citations

#7614

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025oral

citations

#7615

CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Jixuan Leng, Chengsong Huang, Langlin Huang et al.

COLM 2025paperarXiv:2504.00043

citations

#7616

On Volume Minimization in Conformal Regression

Batiste Le Bars, Pierre Humbert

ICML 2025arXiv:2502.09985

citations

#7617

How Many Lines to Paint the City: Exact Edge-Cover in Temporal Graphs

Argyrios Deligkas, Michelle Döring, Eduard Eiben et al.

AAAI 2025paperarXiv:2408.17107

citations

#7618

Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space

Xiaoyan Yu, Yifan Wei, Shuaishuai Zhou et al.

AAAI 2025paperarXiv:2412.10712

citations

#7619

Guaranteed Generation from Large Language Models

Minbeom Kim, Thibaut Thonet, Jos Rozen et al.

ICLR 2025arXiv:2410.06716

citations

#7620

Grokking at the Edge of Linear Separability

Alon Beck, Noam Levi, Yohai Bar-Sinai

ICML 2025arXiv:2410.04489

citations

#7621

CultureCLIP: Empowering CLIP with Cultural Awareness through Synthetic Images and Contextualized Captions

Yuchen Huang, Zhiyuan Fan, Zhitao He et al.

COLM 2025paperarXiv:2507.06210

citations

#7622

Multi-Token Attention

Olga Golovneva, Tianlu Wang, Jason E Weston et al.

COLM 2025paperarXiv:2504.00927

citations

#7623

OAC: Output-adaptive Calibration for Accurate Post-training Quantization

Ali Edalati, Alireza Ghaffari, Mahsa Ghazvini Nejad et al.

AAAI 2025paperarXiv:2405.15025

citations

#7624

GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model

Zixiang Ai, Zichen Liu, Yuanhang Lei et al.

ICML 2025arXiv:2505.04119

citations

#7625

Neural Genetic Search in Discrete Spaces

Hyeonah Kim, Sanghyeok Choi, Jiwoo Son et al.

ICML 2025arXiv:2502.10433

citations

#7626

NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval

Sepanta Zeighami, Zac Wellmer, Aditya Parameswaran

ICLR 2025arXiv:2409.02343

citations

#7627

What makes an Ensemble (Un) Interpretable?

Shahaf Bassan, Guy Amir, Meirav Zehavi et al.

ICML 2025arXiv:2506.08216

citations

#7628

Decentralized Federated Learning with Model Caching on Mobile Agents

Xiaoyu Wang, Guojun Xiong, Houwei Cao et al.

AAAI 2025paperarXiv:2408.14001

citations

#7629

Simple, Good, Fast: Self-Supervised World Models Free of Baggage

Jan Robine, Marc Höftmann, Stefan Harmeling

ICLR 2025arXiv:2506.02612

citations

#7630

Frequency-Aware Density Control via Reparameterization for High-Quality Rendering of 3D Gaussian Splatting

Zhaojie Zeng, Yuesong Wang, Lili Ju et al.

AAAI 2025paperarXiv:2503.07000

citations

#7631

Feature Clipping for Uncertainty Calibration

Linwei Tao, Minjing Dong, Chang Xu

AAAI 2025paperarXiv:2410.19796

citations

#7632

Learning to Communicate Through Implicit Communication Channels

Han Wang, Binbin Chen, zhang et al.

ICLR 2025arXiv:2411.01553

citations

#7633

Sharper Error Bounds in Late Fusion Multi-view Clustering with Eigenvalue Proportion Optimization

Liang Du, Henghui Jiang, Xiaodong Li et al.

AAAI 2025paper

citations

#7634

Rebalancing Multi-Label Class-Incremental Learning

Kaile Du, Yifan Zhou, Fan Lyu et al.

AAAI 2025paperarXiv:2408.12161

citations

#7635

STAIR: Manipulating Collaborative and Multimodal Information for E-Commerce Recommendation

Cong Xu, Yunhang He, Jun Wang et al.

AAAI 2025paperarXiv:2412.11729

citations

#7636

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classifications

Yutong Xia, Runpeng Yu, Yuxuan Liang et al.

AAAI 2025paperarXiv:2401.09953

citations

#7637

Unlocking the Power of SAM 2 for Few-Shot Segmentation

Qianxiong Xu, Lanyun Zhu, Xuanyi Liu et al.

ICML 2025arXiv:2505.14100

citations

#7638

CABS: Conflict-Aware and Balanced Sparsification for Enhancing Model Merging

Zongzhen Yang, Binhang Qi, Hailong Sun et al.

ICML 2025arXiv:2503.01874

citations

#7639

MERGE$^3$: Efficient Evolutionary Merging on Consumer-grade GPUs

Tommaso Mencattini, Adrian Robert Minut, Donato Crisostomi et al.

ICML 2025arXiv:2502.10436

citations

#7640

MGDA Converges under Generalized Smoothness, Provably

Qi Zhang, Peiyao Xiao, Shaofeng Zou et al.

ICLR 2025arXiv:2405.19440

citations

#7641

Synonymous Variational Inference for Perceptual Image Compression

Zijian Liang, Kai Niu, Changshuo Wang et al.

ICML 2025arXiv:2505.22438

citations

#7642

Multilingual Contextualization of Large Language Models for Document-Level Machine Translation

Miguel Moura Ramos, Patrick Fernandes, Sweta Agrawal et al.

COLM 2025paperarXiv:2504.12140

citations

#7643

Tree-Sliced Wasserstein Distance: A Geometric Perspective

Viet Hoang Tran, Trang Pham, Tho Tran Huu et al.

ICML 2025arXiv:2406.13725

citations

#7644

Nonlinearly Preconditioned Gradient Methods under Generalized Smoothness

Konstantinos Oikonomidis, Jan Quan, Emanuel Laude et al.

ICML 2025oralarXiv:2502.08532

citations

#7645

Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach

Xu Zhang, Kaidi Xu, Ziqing Hu et al.

ICML 2025arXiv:2502.06832

citations

#7646

Textured Mesh Saliency: Bridging Geometry and Texture for Human Perception in 3D Graphics

Kaiwei Zhang, Dandan Zhu, Xiongkuo Min et al.

AAAI 2025paperarXiv:2412.08188

citations

#7647

Robust Multi-bit Text Watermark with LLM-based Paraphrasers

Xiaojun Xu, jinghan jia, Yuanshun Yao et al.

ICML 2025arXiv:2412.03123

citations

#7648

Positional Biases Shift as Inputs Approach Context Window Limits

Blerta Veseli, Julian Chibane, Mariya Toneva et al.

COLM 2025paperarXiv:2508.07479

citations

#7649

VideoSAVi: Self-Aligned Video Language Models without Human Supervision

Yogesh Kulkarni, Pooyan Fazli

COLM 2025paperarXiv:2412.00624

citations

#7650

CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators

Harry Zhang, Luca Carlone

ICLR 2025arXiv:2407.06141

citations

#7651

LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning

Zihang Liu, Tianyu Pang, Oleg Balabanov et al.

ICML 2025arXiv:2506.00772

citations

#7652

Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization

Jiaxin Deng, Junbiao Pang, Baochang Zhang et al.

AAAI 2025paperarXiv:2406.08001

citations

#7653

Latent Radiance Fields with 3D-aware 2D Representations

Chaoyi Zhou, Xi Liu, Feng Luo et al.

ICLR 2025arXiv:2502.09613

citations

#7654

One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework

Feiran Li, Qianqian Xu, Shilong Bao et al.

ICML 2025arXiv:2505.11131

citations

#7655

mmFAS: Multimodal Face Anti-Spoofing Using Multi-Level Alignment and Switch-Attention Fusion

Geng Chen, Wuyuan Xie, Di Lin et al.

AAAI 2025paper

citations

#7656

ZETA: Leveraging $Z$-order Curves for Efficient Top-$k$ Attention

Qiuhao Zeng, Jierui Huang, Peng Lu et al.

ICLR 2025arXiv:2501.14577

citations

#7657

KV Shifting Attention Enhances Language Modeling

Mingyu Xu, Bingning Wang, Weipeng Chen

ICML 2025oralarXiv:2411.19574

citations

#7658

Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo

Filip Ekström Kelvinius, Zheng Zhao, Fredrik Lindsten

ICML 2025arXiv:2502.06379

citations

#7659

CLIP-PCQA: Exploring Subjective-Aligned Vision-Language Modeling for Point Cloud Quality Assessment

Yating Liu, Yujie Zhang, Ziyu Shan et al.

AAAI 2025paperarXiv:2501.10071

citations

#7660

Clone-Robust AI Alignment

Ariel Procaccia, Benjamin Schiffer, Shirley Zhang

ICML 2025arXiv:2501.09254

citations

#7661

Learning-Augmented Hierarchical Clustering

Vladimir Braverman, Jon C. Ergun, Chen Wang et al.

ICML 2025arXiv:2506.05495

citations

#7662

From Thousands to Billions: 3D Visual Language Grounding via Render-Supervised Distillation from 2D VLMs

Ang Cao, Sergio Arnaud, Oleksandr Maksymets et al.

ICML 2025arXiv:2502.20389

citations

#7663

LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification

Yiding Lu, Mouxing Yang, Dezhong Peng et al.

ICML 2025arXiv:2504.10174

citations

#7664

SCOPE: Sign Language Contextual Processing with Embedding from LLMs

Yuqi Liu, Wenqian Zhang, Sihan Ren et al.

AAAI 2025paperarXiv:2409.01073

citations

#7665

Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning

Mehrdad Moghimi, Hyejin Ku

ICML 2025arXiv:2501.02087

citations

#7666

WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction

Fanmeng Wang, Minjie Cheng, Hongteng Xu

ICML 2025arXiv:2410.09795

citations

#7667

Subgraph Aggregation for Out-of-Distribution Generalization on Graphs

Bowen Liu, Haoyang Li, Shuning Wang et al.

AAAI 2025paperarXiv:2410.22228

citations

#7668

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning

Xiaochuan Li, Zichun Yu, Chenyan Xiong

ICLR 2025arXiv:2410.14208

citations

#7669

Few-Shot, No Problem: Descriptive Continual Relation Extraction

Nguyen Xuan Thanh, Anh Duc Le, Quyen Tran et al.

AAAI 2025paperarXiv:2502.20596

citations

#7670

DPLUT: Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors

Yunlong Lin, Zhenqi Fu, Kairun Wen et al.

AAAI 2025paper

citations

#7671

Differential Privacy Under Class Imbalance: Methods and Empirical Insights

Lucas Rosenblatt, Yuliia Lut, Ethan Turok et al.

ICML 2025arXiv:2411.05733

citations

#7672

How to Verify Any (Reasonable) Distribution Property: Computationally Sound Argument Systems for Distributions

Tal Herman, Guy Rothblum

ICLR 2025arXiv:2409.06594

citations

#7673

AFiRe: Anatomy-Driven Self-Supervised Learning for Fine-Grained Representation in Radiographic Images

Yihang Liu, Lianghua He, Ying Wen et al.

AAAI 2025paperarXiv:2504.10972

citations

#7674

Shallow diffusion networks provably learn hidden low-dimensional structure

Nicholas Boffi, Arthur Jacot, Stephen Tu et al.

ICLR 2025arXiv:2410.11275

citations

#7675

Conditional Diffusion Models Based Conditional Independence Testing

Yanfeng Yang, Shuai Li, Yingjie Zhang et al.

AAAI 2025paperarXiv:2412.11744

citations

#7676

Exploring a Principled Framework for Deep Subspace Clustering

Xianghan Meng, Zhiyuan Huang, Wei He et al.

ICLR 2025arXiv:2503.17288

citations

#7677

Efficient Robust Conformal Prediction via Lipschitz-Bounded Networks

Thomas Massena, Léo Andéol, Thibaut Boissin et al.

ICML 2025arXiv:2506.05434

citations

#7678

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

Wonjun Lee, Doehyeon Lee, Eugene Choi et al.

ICML 2025arXiv:2502.04757

citations

#7679

M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Embedding Predictive Architecture

Hongyang Lei, Xiaolong Cheng, Qi Qin et al.

ICML 2025arXiv:2409.05929

citations

#7680

Learning High-Degree Parities: The Crucial Role of the Initialization

Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła et al.

ICLR 2025arXiv:2412.04910

citations

#7681

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

Heyang Zhao, Chenlu Ye, Wei Xiong et al.

ICML 2025arXiv:2502.07460

citations

#7682

Exploring the Design Space of Visual Context Representation in Video MLLMs

Yifan Du, Yuqi Huo, Kun Zhou et al.

ICLR 2025arXiv:2410.13694

citations

#7683

Variational Search Distributions

Dan Steinberg, Rafael Oliveira, Cheng Soon Ong et al.

ICLR 2025arXiv:2409.06142

citations

#7684

A Simple Graph Contrastive Learning Framework for Short Text Classification

Yonghao Liu, Fausto Giunchiglia, Lan Huang et al.

AAAI 2025paperarXiv:2501.09219

citations

#7685

Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting

Jiaqi Lin, Zhihao Li, Binxiao Huang et al.

AAAI 2025paperarXiv:2501.10788

citations

#7686

On Generalization Across Environments In Multi-Objective Reinforcement Learning

Jayden Teoh, Pradeep Varakantham, Peter Vamplew

ICLR 2025arXiv:2503.00799

citations

#7687

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Chengyu Du, Jinyi Han, Yizhou Ying et al.

ICLR 2025arXiv:2410.13413

citations

#7688

LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning

Ante Wang, Linfeng Song, Ye Tian et al.

AAAI 2025paper

citations

#7689

Preserving AUC Fairness in Learning with Noisy Protected Groups

Mingyang Wu, Li Lin, Wenbin Zhang et al.

ICML 2025arXiv:2505.18532

citations

#7690

Epsilon: Exploring Comprehensive Visual-Semantic Projection for Multi-Label Zero-Shot Learning

Ziming Liu, Jingcai Guo, Song Guo et al.

AAAI 2025paperarXiv:2408.12253

citations

#7691

APIRL: Deep Reinforcement Learning for REST API Fuzzing

Myles Foley, Sergio Maffeis

AAAI 2025paperarXiv:2412.15991

citations

#7692

Self-Evolutionary Large Language Models Through Uncertainty-Enhanced Preference Optimization

Jianing Wang, Yang Zhou, Xiaocheng Zhang et al.

AAAI 2025paperarXiv:2409.11212

citations

#7693

Sharpness-Aware Black-Box Optimization

Feiyang YE, YUEMING LYU, Xuehao Wang et al.

ICLR 2025arXiv:2410.12457

citations

#7694

Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On

Siqi Wan, Jingwen Chen, Yingwei Pan et al.

ICLR 2025arXiv:2505.16977

citations

#7695

Matching While Perceiving: Enhance Image Feature Matching with Applicable Semantic Amalgamation

Shihua Zhang, Zhenjie Zhu, Zizhuo Li et al.

AAAI 2025paper

citations

#7696

Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection

Yuhang Ma, Wenting Xu, Chaoyi Zhao et al.

AAAI 2025paperarXiv:2409.19624

citations

#7697

Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP

Yayuan Li, Jintao Guo, Lei Qi et al.

AAAI 2025paperarXiv:2412.11375

citations

#7698

Diversity-Rewarded CFG Distillation

Geoffrey Cideron, Andrea Agostinelli, Johan Ferret et al.

ICLR 2025arXiv:2410.06084

citations

#7699

Improving Multimodal Social Media Popularity Prediction via Selective Retrieval Knowledge Augmentation

Xovee Xu, Yifan Zhang, Fan Zhou et al.

AAAI 2025paper

citations

#7700

Does Data Scaling Lead to Visual Compositional Generalization?

Arnas Uselis, Andrea Dittadi, Seong Joon Oh

ICML 2025arXiv:2507.07102

citations

#7701

Edge Contrastive Learning: An Augmentation-Free Graph Contrastive Learning Model

Yujun Li, Hongyuan Zhang, Yuan Yuan

AAAI 2025paperarXiv:2412.11075

citations

#7702

In-Context Learning and Occam's Razor

Eric Elmoznino, Tom Marty, Tejas Kasetty et al.

ICML 2025arXiv:2410.14086

citations

#7703

Deep Rank-One Tensor Functional Factorization for Multi-Dimensional Data Recovery

Yanyi Li, Xi Zhang, Yisi Luo et al.

AAAI 2025paper

citations

#7704

Towards Realistic Semi-supervised Medical Image Classification

Wenxue Li, Lie Ju, Feilong Tang et al.

AAAI 2025paper

citations

#7705

Hyperbolic-Constraint Point Cloud Reconstruction from Single RGB-D Images

Wenrui Li, Zhe Yang, Wei Han et al.

AAAI 2025paperarXiv:2412.09055

citations

#7706

Dimension-Independent Rates for Structured Neural Density Estimation

Vandermeulen, Wai Ming Tai, Bryon Aragam

ICML 2025arXiv:2411.15095

citations

#7707

EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models

GuangHao Meng, Sunan He, Jinpeng Wang et al.

AAAI 2025paperarXiv:2505.18594

citations

#7708

DR-VAE: Debiased and Representation-enhanced Variational Autoencoder for Collaborative Recommendation

Fan Wang, Chaochao Chen, Weiming Liu et al.

AAAI 2025paper

citations

#7709

Revisiting CAD Model Generation by Learning Raster Sketch

Pu Li, Wenhao Zhang, Jianwei Guo et al.

AAAI 2025paperarXiv:2503.00928

citations

#7710

Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition

Xinyu Tian, Shu Zou, Zhaoyuan Yang et al.

ICLR 2025arXiv:2502.15809

citations

#7711

Drop the Beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation

Ziqian Ning, Shuai Wang, Yuepeng Jiang et al.

AAAI 2025paperarXiv:2408.15474

citations

#7712

Hierarchically Encapsulated Representation for Protocol Design in Self-Driving Labs

Yu-Zhe Shi, Mingchen Liu, Fanxu Meng et al.

ICLR 2025arXiv:2504.03810

citations

#7713

VIP: Vision Instructed Pre-training for Robotic Manipulation

Zhuoling Li, LiangLiang Ren, Jinrong Yang et al.

ICML 2025arXiv:2410.07169

citations

#7714

CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning

Quanmin Wei, Penglin Dai, Wei Li et al.

AAAI 2025paperarXiv:2502.10705

citations

#7715

UTILITY: Utilizing Explainable Reinforcement Learning to Improve Reinforcement Learning

Shicheng Liu, Minghui Zhu

ICLR 2025

citations

#7716

Exploring Activation Patterns of Parameters in Language Models

Yudong Wang, Damai Dai, Zhe Yang et al.

AAAI 2025paperarXiv:2405.17799

citations

#7717

Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data

Guan Zhong, Likang Wu, Hongke Zhao et al.

ICML 2025arXiv:2505.02130

citations

#7718

Continuous Autoregressive Modeling with Stochastic Monotonic Alignment for Speech Synthesis

Weiwei Lin, Chenhang HE

ICLR 2025arXiv:2502.01084

citations

#7719

Denoising with a Joint-Embedding Predictive Architecture

Chen Dengsheng, Jie Hu, Xiaoming Wei et al.

ICLR 2025arXiv:2410.03755

citations

#7720

Don’t lie to your friends: Learning what you know from collaborative self-play

Jacob Eisenstein, Reza Aghajani, Adam Fisch et al.

COLM 2025paper

citations

#7721

Core Context Aware Transformers for Long Context Language Modeling

Yaofo Chen, Zeng You, Shuhai Zhang et al.

ICML 2025arXiv:2412.12465

citations

#7722

Unlocking Point Processes through Point Set Diffusion

David Lüdke, Enric Rabasseda Raventós, Marcel Kollovieh et al.

ICLR 2025oralarXiv:2410.22493

citations

#7723

Importance Corrected Neural JKO Sampling

Johannes Hertrich, Robert Gruhlke

ICML 2025arXiv:2407.20444

citations

#7724

3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery

Xiuyuan Hu, Guoqing Liu, Can Chen et al.

ICLR 2025arXiv:2502.05107

citations

#7725

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Xin Li, Deshui Miao, Zhenyu He et al.

ICLR 2025arXiv:2407.07760

citations

#7726

Compressed and distributed least-squares regression: convergence rates with applications to federated learning

Constantin Philippenko, Aymeric Dieuleveut

ICML 2025arXiv:2308.01358

citations

#7727

An Evolved Universal Transformer Memory

Edoardo Cetin, Qi Sun, Tianyu Zhao et al.

ICLR 2025arXiv:2410.13166

citations

#7728

CommVQ: Commutative Vector Quantization for KV Cache Compression

Junyan Li, Yang Zhang, Muhammad Yusuf Hassan et al.

ICML 2025arXiv:2506.18879

citations

#7729

Analytic DAG Constraints for Differentiable DAG Learning

Zhen Zhang, Ignavier Ng, Dong Gong et al.

ICLR 2025oralarXiv:2503.19218

citations

#7730

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

Po-han Li, Sandeep Chinchali, ufuk topcu

ICLR 2025arXiv:2410.07610

citations

#7731

Graph Structure Learning for Spatial-Temporal Imputation: Adapting to Node and Feature Scales

Xinyu Yang, Yu Sun, Xinyang Chen et al.

AAAI 2025paperarXiv:2412.18535

citations

#7732

David and Goliath: Small One-step Model Beats Large Diffusion with Score Post-training

Weijian Luo, colin zhang, Debing Zhang et al.

ICML 2025arXiv:2410.20898

citations

#7733

Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM

Zirui Pan, Xin Wang, Yipeng Zhang et al.

AAAI 2025paperarXiv:2504.12048

citations

#7734

SPEX: Scaling Feature Interaction Explanations for LLMs

Justin S. Kang, Landon Butler, Abhineet Agarwal et al.

ICML 2025arXiv:2502.13870

citations

#7735

Out-of-Distribution Detection using Synthetic Data Generation

Momin Abbas, Muneeza Azmat, Raya Horesh et al.

COLM 2025paperarXiv:2502.03323

citations

#7736

Learning Equivariant Non-Local Electron Density Functionals

Nicholas Gao, Eike Eberhard, Stephan Günnemann

ICLR 2025arXiv:2410.07972

citations

#7737

UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting

Haoyuan Li, Yanpeng Zhou, Tao Tang et al.

ICLR 2025arXiv:2502.17860

citations

#7738

Multi-Scale Fusion for Object Representation

Rongzhen Zhao, Vivienne Huiling Wang, Juho Kannala et al.

ICLR 2025arXiv:2410.01539

citations

#7739

HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting

Nian Ran, Peng Xiao, Yue Wang et al.

ICLR 2025arXiv:2409.18885

citations

#7740

Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation

Suho Park, SuBeen Lee, Hyun Seok Seong et al.

AAAI 2025paperarXiv:2501.00752

citations

#7741

CryoFM: A Flow-based Foundation Model for Cryo-EM Densities

Yi Zhou, Yilai Li, Jing Yuan et al.

ICLR 2025arXiv:2410.08631

citations

#7742

PALM: Pushing Adaptive Learning Rate Mechanisms for Continual Test-Time Adaptation

Sarthak Kumar Maharana, Baoming Zhang, Yunhui Guo

AAAI 2025paperarXiv:2403.10650

citations

#7743

SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models

Muxi Diao, Rumei Li, Shiyang Liu et al.

AAAI 2025paperarXiv:2408.02632

citations

#7744

Harnessing Event Sensory Data for Error Pattern Prediction in Vehicles: A Language Model Approach

Hugo Math, Rainer Lienhart, Robin Schön

AAAI 2025paperarXiv:2412.13041

citations

#7745

Homophily Enhanced Graph Domain Adaptation

Ruiyi Fang, Bingheng Li, Jingyu Zhao et al.

ICML 2025arXiv:2505.20089

citations

#7746

Color Transfer with Modulated Flows

Maria Larchenko, Alexander Lobashev, Dmitry Guskov et al.

AAAI 2025paperarXiv:2503.19062

citations

#7747

SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics

Suyuan Zhao, YIZHEN LUO, Ganbo Yang et al.

ICML 2025arXiv:2507.11588

citations

#7748

Is There No Such Thing as a Bad Question? H4R: HalluciBot for Ratiocination, Rewriting, Ranking, and Routing

William Watson, Nicole Cho, Nishan Srishankar

AAAI 2025paperarXiv:2404.12535

citations

#7749

Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions

Yan Ru Pei

ICLR 2025arXiv:2501.13230

citations

#7750

DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis

Yinghao Li, Rithesh Kumar, Zeyu Jin

ICML 2025oralarXiv:2410.11097

citations

#7751

$K^2$VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting

Xingjian Wu, Xiangfei Qiu, Hongfan Gao et al.

ICML 2025spotlightarXiv:2505.23017

citations

#7752

Microcanonical Langevin Ensembles: Advancing the Sampling of Bayesian Neural Networks

Emanuel Sommer, Jakob Robnik, Giorgi Nozadze et al.

ICLR 2025arXiv:2502.06335

citations

#7753

DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model

Siwei Xia, Li Sun, Tiantian Sun et al.

ICML 2025arXiv:2505.12427

citations

#7754

LDMol: A Text-to-Molecule Diffusion Model with Structurally Informative Latent Space Surpasses AR Models

Jinho Chang, Jong Chul YE

ICML 2025arXiv:2405.17829

citations

#7755

TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection

Qiang Qi, Xiao Wang

AAAI 2025paperarXiv:2503.13903

citations

#7756

MaRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers

Ao Li, Wei Fang, Hongbo Zhao et al.

ICLR 2025arXiv:2502.07856

citations

#7757

FlowDrag: 3D-aware Drag-based Image Editing with Mesh-guided Deformation Vector Flow Fields

Gwanhyeong Koo, Sunjae Yoon, Younghwan Lee et al.

ICML 2025spotlightarXiv:2507.08285

citations

#7758

Uncertainty and Influence aware Reward Model Refinement for Reinforcement Learning from Human Feedback

Zexu Sun, Yiju Guo, Yankai Lin et al.

ICLR 2025

citations

#7759

Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning

Adrià López Escoriza, Nicklas Hansen, Stone Tao et al.

ICML 2025arXiv:2503.01837

citations

#7760

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models

Yuchen Fan, Yuzhong Hong, Qiushi Wang et al.

AAAI 2025paperarXiv:2412.12865

citations

#7761

Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization

Timofei Gritsaev, Nikita Morozov, Sergey Samsonov et al.

ICLR 2025arXiv:2410.15474

citations

#7762

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb, Adam Davies, Alasdair J Paren et al.

ICML 2025arXiv:2410.22944

citations

#7763

Universal Neural Optimal Transport

Jonathan Geuter, Gregor Kornhardt, Ingimar Tomasson et al.

ICML 2025arXiv:2212.00133

citations

#7764

Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance

Shogo Iwazaki, Shion Takeno

ICML 2025oralarXiv:2502.06363

citations

#7765

Graph Assisted Offline-Online Deep Reinforcement Learning for Dynamic Workflow Scheduling

Yifan Yang, Gang Chen, Hui Ma et al.

ICLR 2025

citations

#7766

RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing

Yiqing Xie, Alex Xie, Divyanshu Sheth et al.

COLM 2025paperarXiv:2503.07358

citations

#7767

Progressive Compression with Universally Quantized Diffusion Models

Yibo Yang, Justus Will, Stephan Mandt

ICLR 2025arXiv:2412.10935

citations

#7768

CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression

Xinjie Zhang, Shenyuan Gao, Zhening Liu et al.

AAAI 2025paperarXiv:2403.08505

citations

#7769

LOB-Bench: Benchmarking Generative AI for Finance - an Application to Limit Order Book Data

Peer Nagy, Sascha Frey, Kang Li et al.

ICML 2025arXiv:2502.09172

citations

#7770

Self-Updatable Large Language Models by Integrating Context into Model Parameters

Yu Wang, Xinshuang Liu, Xiusi Chen et al.

ICLR 2025arXiv:2410.00487

citations

#7771

How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence

Hongzhe Du, Weikai Li, Min Cai et al.

COLM 2025paperarXiv:2504.02904

citations

#7772

C2PD: Continuity-Constrained Pixelwise Deformation for Guided Depth Super-Resolution

Jiahui Kang, Qing Cai, Runqing Tan et al.

AAAI 2025paperarXiv:2501.07688

citations

#7773

HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents

Tristan Tomilin, Meng Fang, Mykola Pechenizkiy

ICLR 2025arXiv:2503.08241

citations

#7774

Field Matching: an Electrostatic Paradigm to Generate and Transfer Data

Alexander Kolesov, S. Manukhov, Vladimir Palyulin et al.

ICML 2025arXiv:2502.02367

citations

#7775

QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing

Grace Zhang, Ayush Jain, Injune Hwang et al.

ICLR 2025oralarXiv:2302.00671

citations

#7776

A Certified Unlearning Approach without Access to Source Data

Umit Basaran, Sk Miraj Ahmed, Amit Roy-Chowdhury et al.

ICML 2025arXiv:2506.06486

citations

#7777

Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models

Shizhan Gong, Yankai Jiang, DOU QI et al.

ICML 2025arXiv:2506.02557

citations

#7778

Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy

Haoqi Wu, Wei Dai, Wang Li et al.

ICML 2025arXiv:2505.05922

citations

#7779

Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks

Nikolaos Tsilivis, Gal Vardi, Julia Kempe

ICLR 2025arXiv:2410.22069

citations

#7780

Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching

Lei Yuan, Yuqi Bian, Lihe Li et al.

ICLR 2025oral

citations

#7781

DF-MIA: A Distribution-Free Membership Inference Attack on Fine-Tuned Large Language Models

Zhiheng Huang, Yannan Liu, Daojing He et al.

AAAI 2025paper

citations

#7782

BalancEdit: Dynamically Balancing the Generality-Locality Trade-off in Multi-modal Model Editing

Dongliang Guo, Mengxuan Hu, Zihan Guan et al.

ICML 2025arXiv:2505.01343

citations

#7783

RILQ: Rank-Insensitive LoRA-Based Quantization Error Compensation for Boosting 2-Bit Large Language Model Accuracy

Geonho Lee, Janghwan Lee, Sukjin Hong et al.

AAAI 2025paperarXiv:2412.01129

citations

#7784

Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity

Tianqi Shen, Shaohua Liu, Jiaqi Feng et al.

AAAI 2025paperarXiv:2412.16619

citations

#7785

HOLa: Zero-Shot HOI Detection with Low-Rank Decomposed VLM Feature Adaptation

Qinqian Lei, Bo Wang, Robby Tan

ICCV 2025arXiv:2507.15542

citations

#7786

Birth and Death of a Rose

Chen Geng, Yunzhi Zhang, Shangzhe Wu et al.

CVPR 2025arXiv:2412.05278

citations

#7787

4Deform: Neural Surface Deformation for Robust Shape Interpolation

Lu Sang, Zehranaz Canfes, Dongliang Cao et al.

CVPR 2025arXiv:2502.20208

citations

#7788

HUSH: Holistic Panoramic 3D Scene Understanding using Spherical Harmonics

Jongsung Lee, HARIN PARK, Byeong-Uk Lee et al.

CVPR 2025

citations

#7789

Enhancing Facial Privacy Protection via Weakening Diffusion Purification

Ali Salar, Qing Liu, Yingli Tian et al.

CVPR 2025arXiv:2503.10350

citations

#7790

OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP

Mohamad Hassan N C, Divyam Gupta, Mainak Singha et al.

CVPR 2025arXiv:2503.16106

citations

#7791

SimVS: Simulating World Inconsistencies for Robust View Synthesis

Alex Trevithick, Roni Paiss, Philipp Henzler et al.

CVPR 2025arXiv:2412.07696

citations

#7792

Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation

Xin Yan, Yuxuan Cai, Qiuyue Wang et al.

CVPR 2025arXiv:2412.01316

citations

#7793

Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images

Junxian Wu, Minheng Chen, Xinyi Ke et al.

CVPR 2025

citations

#7794

Hardware-Rasterized Ray-Based Gaussian Splatting

Samuel Rota Bulò, Lorenzo Porzi, Nemanja Bartolovic et al.

CVPR 2025highlightarXiv:2503.18682

citations

#7795

Locality-Aware Zero-Shot Human-Object Interaction Detection

Sanghyun Kim, Deunsol Jung, Minsu Cho

CVPR 2025arXiv:2505.19503

citations

#7796

Interpretable Generative Models through Post-hoc Concept Bottlenecks

Akshay R. Kulkarni, Ge Yan, Chung-En Sun et al.

CVPR 2025arXiv:2503.19377

citations

#7797

MITracker: Multi-View Integration for Visual Object Tracking

Mengjie Xu, Yitao Zhu, Haotian Jiang et al.

CVPR 2025highlightarXiv:2502.20111

citations

#7798

HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis

Mengtian Li, Jinshu Chen, Wanquan Feng et al.

CVPR 2025highlightarXiv:2503.16944

citations

#7799

Robust Message Embedding via Attention Flow-Based Steganography

Huayuan Ye, Shenzhuo Zhang, Shiqi Jiang et al.

CVPR 2025arXiv:2405.16414

citations

#7800

ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping

Shun Iwase, Muhammad Zubair Irshad, Katherine Liu et al.

CVPR 2025arXiv:2504.10857

citations

← Previous

1...37 38 39 40 41...112