Most Cited 2025 "semantic proximity" Papers

22,274 papers found • Page 27 of 112

Filters:Most Cited 2025 semantic proximity Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#5201

Diff3DS: Generating View-Consistent 3D Sketch via Differentiable Curve Rendering

Yibo Zhang, Lihong Wang, Changqing Zou et al.

ICLR 2025arXiv:2405.15305

citations

#5202

REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments

Kaustubh Sridhar, Souradeep Dutta, Dinesh Jayaraman et al.

ICLR 2025arXiv:2412.04759

citations

#5203

GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Sarkar Snigdha Sarathi Das, Ryo Kamoi, Bo Pang et al.

ICLR 2025arXiv:2412.09722

citations

#5204

Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues

Tao He, Lizi Liao, Yixin Cao et al.

AAAI 2025paperarXiv:2412.14584

citations

#5205

The Elicitation Game: Evaluating Capability Elicitation Techniques

Felix Hofstätter, Teun van der Weij, Jayden Teoh et al.

ICML 2025arXiv:2502.02180

citations

#5206

Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph

Xujian Liang, Zhaoquan Gu

AAAI 2025paperarXiv:2501.14300

citations

#5207

Near, far: Patch-ordering enhances vision foundation models' scene understanding

Valentinos Pariza, Mohammadreza Salehi, Gertjan J Burghouts et al.

ICLR 2025arXiv:2408.11054

citations

#5208

CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception

Senkang Hu, Yihang Tao, Guowen Xu et al.

AAAI 2025paperarXiv:2412.12000

citations

#5209

Beyond Sequence: Impact of Geometric Context for RNA Property Prediction

Junjie Xu, Artem Moskalev, Tommaso Mansi et al.

ICLR 2025arXiv:2410.11933

citations

#5210

Understanding and Improving Length Generalization in Recurrent Models

Ricardo Buitrago Ruiz, Albert Gu

ICML 2025arXiv:2507.02782

citations

#5211

(Almost Full) EFX for Three (and More) Types of Agents

Pratik Ghosal, Vishwa Prakash HV, Prajakta Nimbhorkar et al.

AAAI 2025paperarXiv:2301.10632

citations

#5212

Constrained Fair and Efficient Allocations

Benjamin Cookson, Soroush Ebadian, Nisarg Shah

AAAI 2025paperarXiv:2411.00133

citations

#5213

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Zhengfeng Lai, Vasileios Saveris, Chen Chen et al.

ICLR 2025arXiv:2410.02740

citations

#5214

FloNa: Floor Plan Guided Embodied Visual Navigation

Jiaxin Li, Weiqi Huang, Zan Wang et al.

AAAI 2025paperarXiv:2412.18335

citations

#5215

Counterfactual Generative Modeling with Variational Causal Inference

Yulun Wu, Louis McConnell, Claudia Iriondo

ICLR 2025arXiv:2410.12730

citations

#5216

KPL: Training-Free Medical Knowledge Mining of Vision-Language Models

Jiaxiang Liu, Tianxiang Hu, Jiawei Du et al.

AAAI 2025paperarXiv:2501.11231

citations

#5217

TimeCHEAT: A Channel Harmony Strategy for Irregularly Sampled Multivariate Time Series Analysis

Jiexi Liu, Meng Cao, Songcan Chen

AAAI 2025paperarXiv:2412.12886

citations

#5218

Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning

Chongyi Zheng, Jens Tuyls, Joanne Peng et al.

ICLR 2025arXiv:2412.08021

citations

#5219

Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation

Sicong Liu, Yang Shu, Chenjuan Guo et al.

ICLR 2025oralarXiv:2503.21200

citations

#5220

Explain Yourself, Briefly! Self-Explaining Neural Networks with Concise Sufficient Reasons

Shahaf Bassan, Ron Eliav, Shlomit Gur

ICLR 2025arXiv:2502.03391

citations

#5221

Semi-Supervised Multi-View Multi-Label Learning with View-Specific Transformer and Enhanced Pseudo-Label

Quanjiang Li, Tingjin Luo, Mingdie Jiang et al.

AAAI 2025paper

citations

#5222

Improved Finite-Particle Convergence Rates for Stein Variational Gradient Descent

Sayan Banerjee, Krishna Balasubramanian, PROMIT GHOSAL

ICLR 2025arXiv:2409.08469

citations

#5223

HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Ling Yang, Xinchen Zhang, Ye Tian et al.

NEURIPS 2025arXiv:2502.12148

citations

#5224

Highly Compressed Tokenizer Can Generate Without Training

Lukas Lao Beyer, Tianhong Li, Xinlei Chen et al.

ICML 2025arXiv:2506.08257

citations

#5225

Multi-Granular Multimodal Clue Fusion for Meme Understanding

Li Zheng, Hao Fei, Ting Dai et al.

AAAI 2025paperarXiv:2503.12560

citations

#5226

Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems

Junyi Ye, Jingyi Gu, Xinyun Zhao et al.

AAAI 2025paperarXiv:2410.18336

citations

#5227

One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models

Yutao Zhu, Zhaoheng Huang, Zhicheng Dou et al.

AAAI 2025paperarXiv:2405.19670

citations

#5228

Intermediate Layer Classifiers for OOD generalization

Arnas Uselis, Seong Joon Oh

ICLR 2025arXiv:2504.05461

citations

#5229

LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning About Actions

Adam Ishay, Joohyung Lee

AAAI 2025paperarXiv:2501.00830

citations

#5230

TabFlex: Scaling Tabular Learning to Millions with Linear Attention

Yuchen Zeng, Tuan Dinh, Wonjun Kang et al.

ICML 2025spotlightarXiv:2506.05584

citations

#5231

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Yekun Chai, Haoran Sun, Huang Fang et al.

ICLR 2025oralarXiv:2410.02743

citations

#5232

Loss Functions and Operators Generated by f-Divergences

Vincent Roulet, Tianlin Liu, Nino Vieillard et al.

ICML 2025arXiv:2501.18537

citations

#5233

Extractive Structures Learned in Pretraining Enable Generalization on Finetuned Facts

Jiahai Feng, Stuart Russell, Jacob Steinhardt

ICML 2025arXiv:2412.04614

citations

#5234

GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions

Heda Zuo, Weitao You, Junxian Wu et al.

AAAI 2025paperarXiv:2501.09972

citations

#5235

MAGE: Model-Level Graph Neural Networks Explanations via Motif-based Graph Generation

Zhaoning Yu, Hongyang Gao

ICLR 2025arXiv:2405.12519

citations

#5236

Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning

Anh Tong, Thanh Nguyen-Tang, Dongeun Lee et al.

ICLR 2025arXiv:2503.01329

citations

#5237

Realistic Evaluation of Deep Partial-Label Learning Algorithms

Wei Wang, Dong-Dong Wu, Jindong Wang et al.

ICLR 2025arXiv:2502.10184

citations

#5238

$\gamma-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models

Yaxin Luo, Gen Luo, Jiayi Ji et al.

ICLR 2025

citations

#5239

Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations

Brian Zheng, Alisa Liu, Orevaoghene Ahia et al.

NEURIPS 2025spotlightarXiv:2506.19004

citations

#5240

ADIFF: Explaining audio difference using natural language

Soham Deshmukh, Shuo Han, Rita Singh et al.

ICLR 2025arXiv:2502.04476

citations

#5241

Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs

Hao Fang, Changle Zhou, Jiawei Kong et al.

NEURIPS 2025arXiv:2505.19678

citations

#5242

QuaDiM: A Conditional Diffusion Model For Quantum State Property Estimation

Yehui Tang, Mabiao Long, Junchi Yan

ICLR 2025

citations

#5243

A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach

Swetha Ganesh, Washim Mondal, Vaneet Aggarwal

ICML 2025arXiv:2407.18878

citations

#5244

A Generalist Intracortical Motor Decoder

Joel Ye, Fabio Rizzoglio, Xuan Ma et al.

NEURIPS 2025

citations

#5245

From Specificity to Generality: Revisiting Generalizable Artifacts in Detecting Face Deepfakes

Long Ma, Zhiyuan Yan, Jin Xu et al.

NEURIPS 2025arXiv:2504.04827

citations

#5246

Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations

Decheng Liu, Zongqi Wang, Chunlei Peng et al.

AAAI 2025paperarXiv:2407.14367

citations

#5247

MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation

Zhiwei Yang, Yucong Meng, Kexue Fu et al.

AAAI 2025paperarXiv:2412.11076

citations

#5248

Stochastic Forward–Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets

Haoye Lu, Qifan Wu, Yaoliang Yu

ICML 2025arXiv:2502.05446

citations

#5249

Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs

Bowen Tan, Zheng Xu, Eric Xing et al.

ICML 2025arXiv:2503.12347

citations

#5250

Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding

Xin Gu, Yaojie Shen, Chenxi Luo et al.

ICLR 2025oralarXiv:2502.11168

citations

#5251

Dialogue Without Limits: Constant-Sized KV Caches for Extended Response in LLMs

Ravi Ghadia, Avinash Kumar, Gaurav Jain et al.

ICML 2025arXiv:2503.00979

citations

#5252

Adversarial Generative Flow Network for Solving Vehicle Routing Problems

Ni Zhang, Jingfeng Yang, Zhiguang Cao et al.

ICLR 2025arXiv:2503.01931

citations

#5253

MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition

Yang Yang, Xunde Dong, Yupeng Qiang

AAAI 2025paperarXiv:2502.12478

citations

#5254

MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval

Haoran Tang, Meng Cao, Jinfa Huang et al.

AAAI 2025paperarXiv:2408.10575

citations

#5255

Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning

Hui-Yue Yang, Hui Chen, Ao Wang et al.

AAAI 2025paperarXiv:2411.17217

citations

#5256

DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes

Yiyuan Liang, Zhiying Yan, Liqun Chen et al.

AAAI 2025paperarXiv:2412.19458

citations

#5257

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Zheyang Xiong, Jack Cai, John Cooper et al.

ICML 2025spotlightarXiv:2410.05603

citations

#5258

Be More Diverse than the Most Diverse: Optimal Mixtures of Generative Models via Mixture-UCB Bandit Algorithms

Parham Rezaei, Farzan Farnia, Cheuk Ting Li

ICLR 2025arXiv:2412.17622

citations

#5259

Deep Linear Network Training Dynamics from Random Initialization: Data, Width, Depth, and Hyperparameter Transfer

Blake Bordelon, Cengiz Pehlevan

ICML 2025arXiv:2502.02531

citations

#5260

Can Textual Gradient Work in Federated Learning?

Minghui Chen, Ruinan Jin, Wenlong Deng et al.

ICLR 2025arXiv:2502.19980

citations

#5261

Value-Based Deep RL Scales Predictably

Oleh Rybkin, Michal Nauman, Preston Fu et al.

ICML 2025arXiv:2502.04327

citations

#5262

VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion

Meng Wang, Huilong Pi, Ruihui Li et al.

AAAI 2025paperarXiv:2503.06219

citations

#5263

A Training-Free Sub-quadratic Cost Transformer Model Serving Framework with Hierarchically Pruned Attention

Heejun Lee, Geon Park, Youngwan Lee et al.

ICLR 2025arXiv:2406.09827

citations

#5264

Feature Denoising Diffusion Model for Blind Image Quality Assessment

Xudong Li, Yan Zhang, Yunhang Shen et al.

AAAI 2025paperarXiv:2401.11949

citations

#5265

Chaos Meets Attention: Transformers for Large-Scale Dynamical Prediction

Yi He, Yiming Yang, Xiaoyuan Cheng et al.

ICML 2025arXiv:2504.20858

citations

#5266

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction

Ziyu Tang, Weicai Ye, Yifan Wang et al.

ICLR 2025arXiv:2408.12598

citations

#5267

LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering

Jonas Kulhanek, Marie-Julie Rakotosaona, Fabian Manhardt et al.

NEURIPS 2025spotlightarXiv:2505.23158

citations

#5268

Do Large Language Models Truly Understand Geometric Structures?

Xiaofeng Wang, Yiming Wang, Wenhong Zhu et al.

ICLR 2025arXiv:2501.13773

citations

#5269

Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class

Annie D'souza, Swetha M, Sunita Sarawagi

AAAI 2025paperarXiv:2412.15657

citations

#5270

Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation

Gao Peng, Le Zhuo, Dongyang Liu et al.

ICLR 2025oral

citations

#5271

Gumbel Counterfactual Generation From Language Models

Shauli Ravfogel, Anej Svete, Vésteinn Snæbjarnarson et al.

ICLR 2025arXiv:2411.07180

citations

#5272

An Interpretable N-gram Perplexity Threat Model for Large Language Model Jailbreaks

Valentyn Boreiko, Alexander Panfilov, Václav Voráček et al.

ICML 2025arXiv:2410.16222

citations

#5273

ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model

Qi Zang, Jiayi Yang, Shuang Wang et al.

AAAI 2025paperarXiv:2412.15541

citations

#5274

C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction

Zichen Wang, Hao Miao, Senzhang Wang et al.

AAAI 2025paperarXiv:2412.13231

citations

#5275

Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen

Alessandro Palma, Till Richter, Hanyi Zhang et al.

ICLR 2025arXiv:2407.11734

citations

#5276

Fast Training of Sinusoidal Neural Fields via Scaling Initialization

Taesun Yeom, Sangyoon Lee, Jaeho Lee

ICLR 2025arXiv:2410.04779

citations

#5277

TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings

Alexander Shabalin, Viacheslav Meshchaninov, Egor Chimbulatov et al.

AAAI 2025paperarXiv:2402.19097

citations

#5278

Sortformer: A Novel Approach for Permutation-Resolved Speaker Supervision in Speech-to-Text Systems

Taejin Park, Ivan Medennikov, Kunal Dhawan et al.

ICML 2025arXiv:2409.06656

citations

#5279

HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models

Zhifeng Xie, Hao Li, Huiming Ding et al.

AAAI 2025paperarXiv:2401.07450

citations

#5280

Leveraging Large Vision-Language Model as User Intent-Aware Encoder for Composed Image Retrieval

Zelong Sun, Dong Jing, Guoxing Yang et al.

AAAI 2025paperarXiv:2412.11087

citations

#5281

Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models

Jingcheng Deng, Zihao Wei, Liang Pang et al.

ICLR 2025arXiv:2405.15349

citations

#5282

Does learning the right latent variables necessarily improve in-context learning?

Sarthak Mittal, Eric Elmoznino, Léo Gagnon et al.

ICML 2025arXiv:2405.19162

citations

#5283

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

Yaxin Luo, Zhaoyi Li, Jiacheng Liu et al.

NEURIPS 2025arXiv:2505.24878

citations

#5284

Temporal Difference Flows

Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni et al.

ICML 2025oralarXiv:2503.09817

citations

#5285

CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning

Qiwei Li, Jiahuan Zhou

AAAI 2025paperarXiv:2412.08929

citations

#5286

STAR: Synthesis of Tailored Architectures

Armin Thomas, Rom Parnichkun, Alexander Amini et al.

ICLR 2025arXiv:2411.17800

citations

#5287

Accelerated Over-Relaxation Heavy-Ball Method: Achieving Global Accelerated Convergence with Broad Generalization

Jingrong Wei, Long Chen

ICLR 2025arXiv:2406.09772

citations

#5288

Incomplete Multi-view Deep Clustering with Data Imputation and Alignment

Jiyuan Liu, Xinwang Liu, Xinhang Wan et al.

NEURIPS 2025

citations

#5289

DenseGrounding: Improving Dense Language-Vision Semantics for Ego-centric 3D Visual Grounding

Henry Zheng, Hao Shi, Qihang Peng et al.

ICLR 2025arXiv:2505.04965

citations

#5290

Enhancing Large Language Model Performance with Gradient-Based Parameter Selection

Haoling Li, Xin Zhang, Xiao Liu et al.

AAAI 2025paperarXiv:2406.15330

citations

#5291

ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids

Hannes Stärk, Bowen Jing, Tomas Geffner et al.

ICLR 2025arXiv:2503.05025

citations

#5292

TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models

Leigang Qu, Haochuan Li, Tan Wang et al.

ICLR 2025arXiv:2406.05814

citations

#5293

DataMan: Data Manager for Pre-training Large Language Models

Ru Peng, Kexin Yang, Yawen Zeng et al.

ICLR 2025arXiv:2502.19363

citations

#5294

A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents

Kaiwen Wang, Dawen Liang, Nathan Kallus et al.

ICML 2025arXiv:2403.06323

citations

#5295

Differentially Private Steering for Large Language Model Alignment

Anmol Goel, Yaxi Hu, Iryna Gurevych et al.

ICLR 2025arXiv:2501.18532

citations

#5296

Unlocking the Potential of Reverse Distillation for Anomaly Detection

Xinyue Liu, Jianyuan Wang, Biao Leng et al.

AAAI 2025paperarXiv:2412.07579

citations

#5297

Disentangling and Integrating Relational and Sensory Information in Transformer Architectures

Awni Altabaa, John Lafferty

ICML 2025arXiv:2405.16727

citations

#5298

Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models

Bingdong Li, Zixiang Di, Yongfan Lu et al.

AAAI 2025paperarXiv:2405.08674

citations

#5299

Elucidating the Design Space of Multimodal Protein Language Models

Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang et al.

ICML 2025spotlightarXiv:2504.11454

citations

#5300

AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment

Yuqin Cao, Xiongkuo Min, Yixuan Gao et al.

ICML 2025arXiv:2501.18314

citations

#5301

HC-LLM: Historical-Constrained Large Language Models for Radiology Report Generation

Tengfei Liu, Jiapu Wang, Yongli Hu et al.

AAAI 2025paperarXiv:2412.11070

citations

#5302

Enhancing Federated Domain Adaptation with Multi-Domain Prototype-Based Federated Fine-Tuning

Jingyuan Zhang, Yiyang Duan, Shuaicheng Niu et al.

ICLR 2025arXiv:2410.07738

citations

#5303

Multi-Marginal Stochastic Flow Matching for High-Dimensional Snapshot Data at Irregular Time Points

Justin Lee, Behnaz Moradi-Jamei, Heman Shakeri

ICML 2025arXiv:2508.04351

citations

#5304

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Roman Abramov, Felix Steinbauer, Gjergji Kasneci

ICML 2025arXiv:2504.20752

citations

#5305

Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences

Alan Amin, Nate Gruver, Yilun Kuang et al.

ICLR 2025arXiv:2412.07763

citations

#5306

Conformal Prediction Sets Can Cause Disparate Impact

Jesse Cresswell, Bhargava Kumar, Yi Sui et al.

ICLR 2025arXiv:2410.01888

citations

#5307

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Jie Cheng, Ruixi Qiao, ma yingwei et al.

ICLR 2025oralarXiv:2410.00564

citations

#5308

Embedding Safety into RL: A New Take on Trust Region Methods

Nikola Milosevic, Johannes Müller, Nico Scherf

ICML 2025arXiv:2411.02957

citations

#5309

MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation

Trung X. Pham, Tri Ton, Chang Yoo

ICLR 2025oralarXiv:2410.02130

citations

#5310

Fast and Slow Streams for Online Time Series Forecasting Without Information Leakage

Ying-yee Ava Lau, Zhiwen Shao, Dit-Yan Yeung

ICLR 2025oral

citations

#5311

VQTalker: Towards Multilingual Talking Avatars Through Facial Motion Tokenization

Tao Liu, Ziyang Ma, Qi Chen et al.

AAAI 2025paperarXiv:2412.09892

citations

#5312

Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility

Martin Kuo, Jingyang Zhang, Jianyi Zhang et al.

ICLR 2025arXiv:2502.17591

citations

#5313

Data Taggants: Dataset Ownership Verification Via Harmless Targeted Data Poisoning

Wassim Bouaziz, Nicolas Usunier, El-Mahdi El-Mhamdi

ICLR 2025arXiv:2410.09101

citations

#5314

FaceMe: Robust Blind Face Restoration with Personal Identification

Siyu Liu, Zheng-Peng Duan, Jia OuYang et al.

AAAI 2025paperarXiv:2501.05177

citations

#5315

Does Training with Synthetic Data Truly Protect Privacy?

Yunpeng Zhao, Jie Zhang

ICLR 2025arXiv:2502.12976

citations

#5316

Modality-Specialized Synergizers for Interleaved Vision-Language Generalists

Zhiyang Xu, Minqian Liu, Ying Shen et al.

ICLR 2025arXiv:2407.03604

citations

#5317

Approaching Rate-Distortion Limits in Neural Compression with Lattice Transform Coding

Eric Lei, Hamed Hassani, Shirin Saeedi Bidokhti

ICLR 2025arXiv:2403.07320

citations

#5318

GravMAD: Grounded Spatial Value Maps Guided Action Diffusion for Generalized 3D Manipulation

Yangtao Chen, Zixuan Chen, Junhui Yin et al.

ICLR 2025arXiv:2409.20154

citations

#5319

Unlocking the Power of Function Vectors for Characterizing and Mitigating Catastrophic Forgetting in Continual Instruction Tuning

Gangwei Jiang, caigao jiang, Zhaoyi Li et al.

ICLR 2025arXiv:2502.11019

citations

#5320

Robustness of Quantum Algorithms for Nonconvex Optimization

Weiyuan Gong, Chenyi Zhang, Tongyang Li

ICLR 2025arXiv:2212.02548

citations

#5321

Accurate Link Prediction for Edge-Incomplete Graphs via PU Learning

Junghun Kim, Ka Hyun Park, Hoyoung Yoon et al.

AAAI 2025paperarXiv:2405.11911

citations

#5322

Efficiently Serving Large Multimodal Models Using EPD Disaggregation

Gursimran Singh, Xinglu Wang, Yifan Hu et al.

ICML 2025arXiv:2501.05460

citations

#5323

Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning

Yinglun Xu, Qi Zeng, Gagandeep Singh

ICLR 2025arXiv:2205.14842

citations

#5324

A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops

Shi Fu, Yingjie Wang, Yuzhu Chen et al.

ICLR 2025arXiv:2502.18865

citations

#5325

LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh

Jing Wen, Alex Schwing, Shenlong Wang

ICLR 2025arXiv:2502.09617

citations

#5326

Visual Attention Never Fades: Selective Progressive Attention ReCalibration for Detailed Image Captioning in Multimodal Large Language Models

Mingi Jung, Saehyung Lee, Eunji Kim et al.

ICML 2025arXiv:2502.01419

citations

#5327

Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation

Xie Tianyidan, Rui Ma, Qian Wang et al.

AAAI 2025paperarXiv:2404.18598

citations

#5328

Free Hunch: Denoiser Covariance Estimation for Diffusion Models Without Extra Costs

Severi Rissanen, Markus Heinonen, Arno Solin

ICLR 2025arXiv:2410.11149

citations

#5329

Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets

Haoran He, Can Chang, Huazhe Xu et al.

ICLR 2025arXiv:2406.01150

citations

#5330

Injecting Universal Jailbreak Backdoors into LLMs in Minutes

Zhuowei Chen, qiannan zhang, Shichao Pei

ICLR 2025arXiv:2502.10438

citations

#5331

Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs

Yaniv Nikankin, Dana Arad, Yossi Gandelsman et al.

NEURIPS 2025arXiv:2506.09047

citations

#5332

Growth Inhibitors for Suppressing Inappropriate Image Concepts in Diffusion Models

Die Chen, Zhiwen Li, Mingyuan Fan et al.

ICLR 2025arXiv:2408.01014

citations

#5333

From Kernels to Features: A Multi-Scale Adaptive Theory of Feature Learning

Noa Rubin, Kirsten Fischer, Javed Lindner et al.

ICML 2025arXiv:2502.03210

citations

#5334

Accessing Vision Foundation Models via ImageNet-1K

Yitian Zhang, Xu Ma, Yue Bai et al.

ICLR 2025arXiv:2407.10366

citations

#5335

The Canary’s Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text

Matthieu Meeus, Lukas Wutschitz, Santiago Zanella-Beguelin et al.

ICML 2025arXiv:2502.14921

citations

#5336

Bayesian Optimization via Continual Variational Last Layer Training

Paul Brunzema, Mikkel Jordahn, John Willes et al.

ICLR 2025arXiv:2412.09477

citations

#5337

Episodic Novelty Through Temporal Distance

Yuhua Jiang, Qihan Liu, Yiqin Yang et al.

ICLR 2025oralarXiv:2501.15418

citations

#5338

A Two-Stage Learning-to-Defer Approach for Multi-Task Learning

Yannis Montreuil, Shu Heng Yeo, Axel Carlier et al.

ICML 2025arXiv:2410.15729

citations

#5339

Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Chenhui Hu, Pengfei Cao, Yubo Chen et al.

AAAI 2025paperarXiv:2408.07413

citations

#5340

Towards Bridging Generalization and Expressivity of Graph Neural Networks

Shouheng Li, Floris Geerts, Dongwoo Kim et al.

ICLR 2025arXiv:2410.10051

citations

#5341

CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation

Jie Liu, Pan Zhou, Yingjun Du et al.

ICLR 2025arXiv:2411.04679

citations

#5342

Learning Evolving Tools for Large Language Models

Guoxin Chen, Zhong Zhang, Xin Cong et al.

ICLR 2025arXiv:2410.06617

citations

#5343

Fair Submodular Cover

Wenjing Chen, Shuo Xing, Samson Zhou et al.

ICLR 2025arXiv:2407.04804

citations

#5344

Preference Diffusion for Recommendation

Shuo Liu, An Zhang, Guoqing Hu et al.

ICLR 2025arXiv:2410.13117

citations

#5345

Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis

Chengzhi Liu, Zile Huang, Zhe Chen et al.

AAAI 2025paperarXiv:2502.11724

citations

#5346

Principled Algorithms for Optimizing Generalized Metrics in Binary Classification

Anqi Mao, Mehryar Mohri, Yutao Zhong

ICML 2025arXiv:2512.23133

citations

#5347

Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models

Minh-Tung Luu, Younghwan Lee, Donghoon Lee et al.

ICML 2025arXiv:2506.12822

citations

#5348

RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection

Jingtong Yue, Zhiwei Lin, Xin Lin et al.

ICLR 2025arXiv:2502.13071

citations

#5349

Better than Your Teacher: LLM Agents that learn from Privileged AI Feedback

Sanjiban Choudhury, Paloma Sodhi

ICLR 2025arXiv:2410.05434

citations

#5350

Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments

Marharyta Domnich, Julius Välja, Rasmus Moorits Veski et al.

AAAI 2025paperarXiv:2410.21131

citations

#5351

De-mark: Watermark Removal in Large Language Models

Ruibo Chen, Yihan Wu, Junfeng Guo et al.

ICML 2025arXiv:2410.13808

citations

#5352

Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation

Jingyu Liu, Beidi Chen, Ce Zhang

ICML 2025arXiv:2502.02789

citations

#5353

Balancing the Scales: A Theoretical and Algorithmic Framework for Learning from Imbalanced Data

Corinna Cortes, Anqi Mao, Mehryar Mohri et al.

ICML 2025arXiv:2502.10381

citations

#5354

Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images

Yihui Li, Chengxin Lv, Hongyu Yang et al.

AAAI 2025paperarXiv:2501.14231

citations

#5355

EgoPrivacy: What Your First-Person Camera Says About You?

Yijiang Li, Genpei Zhang, Jiacheng Cheng et al.

ICML 2025arXiv:2506.12258

citations

#5356

Offline-to-Online Hyperparameter Transfer for Stochastic Bandits

Dravyansh Sharma, Arun Suggala

AAAI 2025paperarXiv:2501.02926

citations

#5357

Unsupervised Audio-Visual Segmentation with Modality Alignment

Swapnil Bhosale, Haosen Yang, Diptesh Kanojia et al.

AAAI 2025paperarXiv:2403.14203

citations

#5358

Towards Understanding Why Label Smoothing Degrades Selective Classification and How to Fix It

Guoxuan Xia, Olivier Laurent, Gianni Franchi et al.

ICLR 2025arXiv:2403.14715

citations

#5359

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Sreyan Ghosh, Sonal Kumar, Zhifeng Kong et al.

ICLR 2025arXiv:2410.02056

citations

#5360

Meta-Black-Box-Optimization through Offline Q-function Learning

Zeyuan Ma, Zhiguang Cao, Zhou Jiang et al.

ICML 2025arXiv:2505.02010

citations

#5361

Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP

Yating Yu, Congqi Cao, Yueran Zhang et al.

AAAI 2025paperarXiv:2412.09895

citations

#5362

MergeNet: Knowledge Migration Across Heterogeneous Models, Tasks, and Modalities

Kunxi Li, Tianyu Zhan, Kairui Fu et al.

AAAI 2025paperarXiv:2404.13322

citations

#5363

OpenViewer: Openness-Aware Multi-View Learning

Shide Du, Zihan Fang, Yanchao Tan et al.

AAAI 2025paperarXiv:2412.12596

citations

#5364

World Knowledge-Enhanced Reasoning Using Instruction-Guided Interactor in Autonomous Driving

Mingliang Zhai, Cheng Li, Zengyuan Guo et al.

AAAI 2025paperarXiv:2412.06324

citations

#5365

Understanding High-Dimensional Bayesian Optimization

Leonard Papenmeier, Matthias Poloczek, Luigi Nardi

ICML 2025arXiv:2502.09198

citations

#5366

Decoding Game: On Minimax Optimality of Heuristic Text Generation Strategies

Sijin Chen, Omar Hagrass, Jason Klusowski

ICLR 2025arXiv:2410.03968

citations

#5367

Compositional simulation-based inference for time series

Manuel Gloeckler, Shoji Toyota, Kenji Fukumizu et al.

ICLR 2025arXiv:2411.02728

citations

#5368

Outsourced Diffusion Sampling: Efficient Posterior Inference in Latent Spaces of Generative Models

Siddarth Venkatraman, Mohsin Hasan, Minsu Kim et al.

ICML 2025arXiv:2502.06999

citations

#5369

Efficient stagewise pretraining via progressive subnetworks

Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu et al.

ICLR 2025arXiv:2402.05913

citations

#5370

GraphCL: Graph-based Clustering for Semi-Supervised Medical Image Segmentation

Mengzhu Wang, houcheng su, Jiao Li et al.

ICML 2025arXiv:2411.13147

citations

#5371

Stable Mean Teacher for Semi-supervised Video Action Detection

Akash Kumar, Sirshapan Mitra, Yogesh Singh Rawat

AAAI 2025paperarXiv:2412.07072

citations

#5372

The Belief State Transformer

Edward Hu, Kwangjun Ahn, Qinghua Liu et al.

ICLR 2025arXiv:2410.23506

citations

#5373

Do We Need to Verify Step by Step? Rethinking Process Supervision from a Theoretical Perspective

Zeyu Jia, Alexander Rakhlin, Tengyang Xie

ICML 2025arXiv:2502.10581

citations

#5374

Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic

Ruochen Jin, Bojian Hou, Jiancong Xiao et al.

ICLR 2025arXiv:2407.07089

citations

#5375

Topograph: An Efficient Graph-Based Framework for Strictly Topology Preserving Image Segmentation

Laurin Lux, Alexander H Berger, Alexander Weers et al.

ICLR 2025arXiv:2411.03228

citations

#5376

ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL

Yang Qin, Chao Chen, Zhihang Fu et al.

ICLR 2025arXiv:2412.10138

citations

#5377

Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension

Yaxian Wang, Henghui Ding, Shuting He et al.

AAAI 2025paperarXiv:2501.01416

citations

#5378

SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning

Minjun Kim, Jongjin Kim, U Kang

ICLR 2025

citations

#5379

Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction

Quan Zhang, Yuxin Qi, Xi Tang et al.

AAAI 2025paperarXiv:2501.11124

citations

#5380

Video Action Differencing

James Burgess, Xiaohan Wang, Yuhui Zhang et al.

ICLR 2025arXiv:2503.07860

citations

#5381

CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation

Matan Rusanovsky, Or Hirschorn, Shai Avidan

ICLR 2025arXiv:2406.00384

citations

#5382

SplatFormer: Point Transformer for Robust 3D Gaussian Splatting

Yutong Chen, Marko Mihajlovic, Xiyi Chen et al.

ICLR 2025arXiv:2411.06390

citations

#5383

DCBM: Data-Efficient Visual Concept Bottleneck Models

Katharina Prasse, Patrick Knab, Sascha Marton et al.

ICML 2025arXiv:2412.11576

citations

#5384

An All-Atom Generative Model for Designing Protein Complexes

Ruizhe Chen, Dongyu Xue, Xiangxin Zhou et al.

ICML 2025arXiv:2504.13075

citations

#5385

Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence

Wenbo Huang, Jinghui Zhang, Guang Li et al.

AAAI 2025paperarXiv:2412.07481

citations

#5386

Gradient descent with generalized Newton’s method

Zhiqi Bu, Shiyun Xu

ICLR 2025arXiv:2407.02772

citations

#5387

Glauber Generative Model: Discrete Diffusion Models via Binary Classification

Harshit Varma, Dheeraj Nagaraj, Karthikeyan Shanmugam

ICLR 2025arXiv:2405.17035

citations

#5388

Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry

Zhaoxing Zhang, Junda Cheng, Gangwei Xu et al.

AAAI 2025paperarXiv:2412.16923

citations

#5389

Neural Context Flows for Meta-Learning of Dynamical Systems

Roussel Desmond Nzoyem, David Barton, Tom Deakin

ICLR 2025arXiv:2405.02154

citations

#5390

From Attention to Activation: Unraveling the Enigmas of Large Language Models

Prannay Kaul, Chengcheng Ma, Ismail Elezi et al.

ICLR 2025arXiv:2410.17174

citations

#5391

Evaluating LLM Reasoning in the Operations Research Domain with ORQA

Mahdi Mostajabdaveh, Timothy Tin Long Yu, Samarendra Chandan Bindu Dash et al.

AAAI 2025paperarXiv:2412.17874

citations

#5392

Boosting Fine-Grained Visual Anomaly Detection with Coarse-Knowledge-Aware Adversarial Learning

Qingqing Fang, Qinliang Su, Wenxi Lv et al.

AAAI 2025paperarXiv:2412.12850

citations

#5393

How many samples are needed to train a deep neural network?

Pegah Golestaneh, Mahsa Taheri, Johannes Lederer

ICLR 2025arXiv:2405.16696

citations

#5394

Position: The Future of Bayesian Prediction Is Prior-Fitted

Samuel Gabriel Müller, Arik Reuter, Noah Hollmann et al.

ICML 2025arXiv:2505.23947

citations

#5395

A transfer learning framework for weak to strong generalization

Seamus Somerstep, Felipe Maia Polo, Moulinath Banerjee et al.

ICLR 2025

citations

#5396

DuMo: Dual Encoder Modulation Network for Precise Concept Erasure

Feng Han, Kai Chen, Chao Gong et al.

AAAI 2025paperarXiv:2501.01125

citations

#5397

Learning Chaos In A Linear Way

Xiaoyuan Cheng, Yi He, Yiming Yang et al.

ICLR 2025arXiv:2503.14702

citations

#5398

Interactive Speculative Planning: Enhance Agent Efficiency through Co-design of System and User Interface

Wenyue Hua, Mengting Wan, JAGANNATH VADREVU et al.

ICLR 2025arXiv:2410.00079

citations

#5399

VIoTGPT: Learning to Schedule Vision Tools Towards Intelligent Video Internet of Things

Yaoyao Zhong, Mengshi Qi, Rui Wang et al.

AAAI 2025paper

citations

#5400

DISCO: learning to DISCover an evolution Operator for multi-physics-agnostic prediction

Rudy Morel, Jiequn Han, Edouard Oyallon

ICML 2025oralarXiv:2504.19496

citations

← Previous

1...25 26 27 28 29...112

Most Cited 2025 "semantic proximity" Papers

Conference

Paper Type

Diff3DS: Generating View-Consistent 3D Sketch via Differentiable Curve Rendering

REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments

GReaTer: Gradients Over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues

The Elicitation Game: Evaluating Capability Elicitation Techniques

Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph

Near, far: Patch-ordering enhances vision foundation models' scene understanding

CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird’s Eye View Perception

Beyond Sequence: Impact of Geometric Context for RNA Property Prediction

Understanding and Improving Length Generalization in Recurrent Models

(Almost Full) EFX for Three (and More) Types of Agents

Constrained Fair and Efficient Allocations

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

FloNa: Floor Plan Guided Embodied Visual Navigation

Counterfactual Generative Modeling with Variational Causal Inference

KPL: Training-Free Medical Knowledge Mining of Vision-Language Models

TimeCHEAT: A Channel Harmony Strategy for Irregularly Sampled Multivariate Time Series Analysis

Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning

Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation

Explain Yourself, Briefly! Self-Explaining Neural Networks with Concise Sufficient Reasons

Semi-Supervised Multi-View Multi-Label Learning with View-Specific Transformer and Enhanced Pseudo-Label

Improved Finite-Particle Convergence Rates for Stein Variational Gradient Descent

HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Highly Compressed Tokenizer Can Generate Without Training

Multi-Granular Multimodal Clue Fusion for Meme Understanding

Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems

One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models

Intermediate Layer Classifiers for OOD generalization

LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning About Actions

TabFlex: Scaling Tabular Learning to Millions with Linear Attention

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Loss Functions and Operators Generated by f-Divergences

Extractive Structures Learned in Pretraining Enable Generalization on Finetuned Facts

GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions

MAGE: Model-Level Graph Neural Networks Explanations via Motif-based Graph Generation

Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning

Realistic Evaluation of Deep Partial-Label Learning Algorithms

$\gamma-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models

Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations

ADIFF: Explaining audio difference using natural language

Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs

QuaDiM: A Conditional Diffusion Model For Quantum State Property Estimation

A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach

A Generalist Intracortical Motor Decoder

From Specificity to Generality: Revisiting Generalizable Artifacts in Detecting Face Deepfakes

Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations

MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation

Stochastic Forward–Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets

Synthesizing Privacy-Preserving Text Data via Finetuning *without* Finetuning Billion-Scale LLMs

Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding

Dialogue Without Limits: Constant-Sized KV Caches for Extended Response in LLMs

Adversarial Generative Flow Network for Solving Vehicle Routing Problems

MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition

MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval

Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning

DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes

Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Be More Diverse than the Most Diverse: Optimal Mixtures of Generative Models via Mixture-UCB Bandit Algorithms

Deep Linear Network Training Dynamics from Random Initialization: Data, Width, Depth, and Hyperparameter Transfer

Can Textual Gradient Work in Federated Learning?

Value-Based Deep RL Scales Predictably

VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion

A Training-Free Sub-quadratic Cost Transformer Model Serving Framework with Hierarchically Pruned Attention

Feature Denoising Diffusion Model for Blind Image Quality Assessment

Chaos Meets Attention: Transformers for Large-Scale Dynamical Prediction

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction

LODGE: Level-of-Detail Large-Scale Gaussian Splatting with Efficient Rendering

Do Large Language Models Truly Understand Geometric Structures?

Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class

Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation

Gumbel Counterfactual Generation From Language Models

An Interpretable N-gram Perplexity Threat Model for Large Language Model Jailbreaks

ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model

C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction

Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen

Fast Training of Sinusoidal Neural Fields via Scaling Initialization

TEncDM: Understanding the Properties of the Diffusion Model in the Space of Language Model Encodings

Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs