Most Cited ICLR "cross-modality retrieval" Papers
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Weiyang Liu, Zeju Qiu, Yao Feng et al.
How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models
Pascal Chang, Jingwei Tang, Markus Gross et al.
Improving Long-Text Alignment for Text-to-Image Diffusion Models
Luping Liu, Chao Du, Tianyu Pang et al.
On the Computation of the Fisher Information in Continual Learning
Gido van de Ven
Efficient Planning with Latent Diffusion
Wenhao Li
Physics-Regulated Deep Reinforcement Learning: Invariant Embeddings
Hongpeng Cao, Yanbing Mao, Lui Sha et al.
3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery
Xiuyuan Hu, Guoqing Liu, Can Chen et al.
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong, Yifu Yuan, Jianye HAO et al.
A Geometric Framework for Understanding Memorization in Generative Models
Brendan Ross, Hamidreza Kamkari, Tongzi Wu et al.
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection
Guangsheng Bao, Yanbin Zhao, Juncai He et al.
Certified Adversarial Robustness for Rate Encoded Spiking Neural Networks
Bhaskar Mukhoty, Hilal AlQuabeh, Giulia De Masi et al.
Investigating Pattern Neurons in Urban Time Series Forecasting
Chengxin Wang, Yiran Zhao, Shaofeng Cai et al.
Rethinking Complex Queries on Knowledge Graphs with Neural Link Predictors
Hang Yin, Zihao Wang, Yangqiu Song
Reward Design for Justifiable Sequential Decision-Making
Aleksa Sukovic, Goran Radanovic
Can Watermarks be Used to Detect LLM IP Infringement For Free?
Zhengyue Zhao, Xiaogeng Liu, Somesh Jha et al.
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
Sam Toyer, Olivia Watkins, Ethan Mendes et al.
Neural Approximate Mirror Maps for Constrained Diffusion Models
Berthy Feng, Ricardo Baptista, Katherine Bouman
GANDALF: Generative AttentioN based Data Augmentation and predictive modeLing Framework for personalized cancer treatment
Aishwarya Jayagopal, Yanrong Zhang, Robert Walsh et al.
Graphical Multioutput Gaussian Process with Attention
Yijue Dai, Wenzhong Yan, Feng Yin
Dynamic Discounted Counterfactual Regret Minimization
Hang Xu, Kai Li, Haobo Fu et al.
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking
Kaifeng Lyu, Jikai Jin, Zhiyuan Li et al.
On the Fourier analysis in the SO(3) space: the EquiLoPO Network
Dmitrii Zhemchuzhnikov, Sergei Grudinin
Faster Approximation of Probabilistic and Distributional Values via Least Squares
Weida Li, Yaoliang Yu
SF(DA)$^2$: Source-free Domain Adaptation Through the Lens of Data Augmentation
Uiwon Hwang, Jonghyun Lee, Juhyeon Shin et al.
Grammar Reinforcement Learning: path and cycle counting in graphs with a Context-Free Grammar and Transformer approach
Jason Piquenot, Maxime Berar, Romain Raveaux et al.
Stable Anisotropic Regularization
William Rudman, Carsten Eickhoff
HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere
Hatef Otroshi Shahreza, Sébastien Marcel
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Dongmin Park, Sebin Kim, Taehong Moon et al.
Learning with Language-Guided State Abstractions
Andi Peng, Ilia Sucholutsky, Belinda Li et al.
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Boyu Gou, Demi Ruohan Wang, Boyuan Zheng et al.
Closing the Curious Case of Neural Text Degeneration
Matthew Finlayson, John Hewitt, Alexander Koller et al.
Decentralized Optimization with Coupled Constraints
Demyan Yarmoshik, Alexander Rogozin, Nikita Kiselev et al.
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets
Dominique Beaini, Shenyang (Andy) Huang, Joao Cunha et al.
A Visual Dive into Conditional Flow Matching
Anne Gagneux, Ségolène Martin, Rémi Emonet et al.
Proper Laplacian Representation Learning
Diego Gomez, Michael Bowling, Marlos C. Machado
A3D: Does Diffusion Dream about 3D Alignment?
Savva Ignatyev, Nina Konovalova, Daniil Selikhanovych et al.
Denoising Diffusion Step-aware Models
Shuai Yang, Yukang Chen, Luozhou WANG et al.
Long Context Compression with Activation Beacon
Peitian Zhang, Zheng Liu, Shitao Xiao et al.
K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models
Jaehyung Seo, Heuiseok Lim
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
Junyuan Hong, Jiachen (Tianhao) Wang, Chenhui Zhang et al.
CipherPrune: Efficient and Scalable Private Transformer Inference
Yancheng Zhang, Jiaqi Xue, Mengxin Zheng et al.
The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory before In-context Learning
Tian Jin, Nolan Clement, Xin Dong et al.
Learning Flexible Body Collision Dynamics with Hierarchical Contact Mesh Transformer
Youn-Yeol Yu, Jeongwhan Choi, Woojin Cho et al.
Data Selection via Optimal Control for Language Models
Yuxian Gu, Li Dong, Hongning Wang et al.
VVC-Gym: A Fixed-Wing UAV Reinforcement Learning Environment for Multi-Goal Long-Horizon Problems
Xudong Gong, Feng Dawei, Kele Xu et al.
Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection
Qinyu Zhao, Ming Xu, Kartik Gupta et al.
Scaling Laws for Downstream Task Performance in Machine Translation
Berivan Isik, Natalia Ponomareva, Hussein Hazimeh et al.
Domain-Agnostic Molecular Generation with Chemical Feedback
Yin Fang, Ningyu Zhang, Zhuo Chen et al.
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback
Yu Chen, Yihan Du, Pihe Hu et al.
CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning
Hao Cui, Zahra Shamsi, Gowoon Cheon et al.
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
Xin Wang, Yu Zheng, Zhongwei Wan et al.
Coeditor: Leveraging Repo-level Diffs for Code Auto-editing
Jiayi Wei, Greg Durrett, Isil Dillig
Consistent Multi-Class Classification from Multiple Unlabeled Datasets
Zixi Wei, Senlin Shu, Yuzhou Cao et al.
Federated Continual Learning Goes Online: Uncertainty-Aware Memory Management for Vision Tasks and Beyond
Giuseppe Serra, Florian Buettner
Diversity-Rewarded CFG Distillation
Geoffrey Cideron, Andrea Agostinelli, Johan Ferret et al.
Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning
Yucheng Yang, Tianyi Zhou, Qiang HE et al.
Risk Bounds of Accelerated SGD for Overparameterized Linear Regression
Xuheng Li, Yihe Deng, Jingfeng Wu et al.
In defense of parameter sharing for model-compression
Aditya Desai, Anshumali Shrivastava
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee, Rajarshi Roy, Mengyao Xu et al.
GenXD: Generating Any 3D and 4D Scenes
Yuyang Zhao, Chung-Ching Lin, Kevin Lin et al.
OmniControl: Control Any Joint at Any Time for Human Motion Generation
Yiming Xie, Varun Jampani, Lei Zhong et al.
Adaptive Federated Learning with Auto-Tuned Clients
Junhyung Lyle Kim, Mohammad Taha Toghani, Cesar Uribe et al.
Meta-Continual Learning of Neural Fields
Seungyoon Woo, Junhyeog Yun, Gunhee Kim
Accelerating Distributed Stochastic Optimization via Self-Repellent Random Walks
Jie Hu, Vishwaraj Doshi, Do Young Eun
Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving
Xiang Li, Pengfei Li, Yupeng Zheng et al.
DPLM-2: A Multimodal Diffusion Protein Language Model
Xinyou Wang, Zaixiang Zheng, Fei YE et al.
Provably Robust Conformal Prediction with Improved Efficiency
Ge Yan, Yaniv Romano, Tsui-Wei Weng
Large-scale Training of Foundation Models for Wearable Biosignals
Salar Abbaspourazad, Oussama Elachqar, Andrew Miller et al.
LRM: Large Reconstruction Model for Single Image to 3D
Yicong Hong, Kai Zhang, Jiuxiang Gu et al.
Stochastic Modified Equations and Dynamics of Dropout Algorithm
Zhongwang Zhang, Yuqing Li, Tao Luo et al.
Continuous Autoregressive Modeling with Stochastic Monotonic Alignment for Speech Synthesis
Weiwei Lin, Chenhang HE
Trained Transformer Classifiers Generalize and Exhibit Benign Overfitting In-Context
Spencer Frei, Gal Vardi
Elastic Feature Consolidation For Cold Start Exemplar-Free Incremental Learning
Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais et al.
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
Ziheng Qin, Kai Wang, Zangwei Zheng et al.
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang, Mingfei Gao, Zhe Gan et al.
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
Seonghyeon Ye, Doyoung Kim, Sungdong Kim et al.
Transferring Labels to Solve Annotation Mismatches Across Object Detection Datasets
Yuan-Hong Liao, David Acuna, Rafid Mahmood et al.
NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative
Asmar Nadeem, Faegheh Sardari, Robert Dawes et al.
Idempotence and Perceptual Image Compression
Tongda Xu, Ziran Zhu, Dailan He et al.
Multi-Resolution Diffusion Models for Time Series Forecasting
Lifeng Shen, Weiyu Chen, James Kwok
Exploring Local Memorization in Diffusion Models via Bright Ending Attention
Chen Chen, Daochang Liu, Mubarak Shah et al.
Prompt Gradient Projection for Continual Learning
Jingyang Qiao, Zhizhong Zhang, Xin Tan et al.
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Wenhao Zhan, Scott Fujimoto, Zheqing Zhu et al.
Towards Generalization Bounds of GCNs for Adversarially Robust Node Classification
Wen Wen, Han Li, Tieliang Gong et al.
InfoCon: Concept Discovery with Generative and Discriminative Informativeness
Ruizhe Liu, Qian Luo, Yanchao Yang
TGB-Seq Benchmark: Challenging Temporal GNNs with Complex Sequential Dynamics
Lu Yi, Jie Peng, Yanping Zheng et al.
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq, Qingfeng Lan, Pan Xu et al.
You Only Query Once: An Efficient Label-Only Membership Inference Attack
Yutong Wu, Han Qiu, Shangwei Guo et al.
Process Reward Model with Q-value Rankings
Wendi Li, Yixuan Li
Symmetric Basis Convolutions for Learning Lagrangian Fluid Mechanics
Rene Winchenbach, Nils Thuerey
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
Fanghua Yu, Jinjin Gu, Jinfan Hu et al.
Efficient Cross-Episode Meta-RL
Gresa Shala, André Biedenkapp, Pierre Krack et al.
Teach LLMs to Phish: Stealing Private Information from Language Models
Ashwinee Panda, Christopher Choquette-Choo, Zhengming Zhang et al.
On Bias-Variance Alignment in Deep Models
Lin Chen, Michal Lukasik, Wittawat Jitkrittum et al.
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”
Lukas Berglund, Meg Tong, Maximilian Kaufmann et al.
Successor Heads: Recurring, Interpretable Attention Heads In The Wild
Rhys Gould, Euan Ong, George Ogden et al.
Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video
Yanqin Jiang, Li Zhang, Jin Gao et al.
On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning
Rohan Subramani, Marcus Williams, Max Heitmann et al.
Weaker MVI Condition: Extragradient Methods with Multi-Step Exploration
Yifeng Fan, Yongqiang Li, Bo Chen
Rethinking Neural Multi-Objective Combinatorial Optimization via Neat Weight Embedding
Jinbiao Chen, Zhiguang Cao, Jiahai Wang et al.
On the Hardness of Online Nonconvex Optimization with Single Oracle Feedback
Ziwei Guan, Yi Zhou, Yingbin Liang
MMD Graph Kernel: Effective Metric Learning for Graphs via Maximum Mean Discrepancy
Yan Sun, Jicong Fan
Denoising Diffusion via Image-Based Rendering
Titas Anciukevičius, Fabian Manhardt, Federico Tombari et al.
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
Souradip Chakraborty, Amrit Bedi, Alec Koppel et al.
Differentially Private SGD Without Clipping Bias: An Error-Feedback Approach
Xinwei Zhang, Zhiqi Bu, Steven Wu et al.
RB-Modulation: Training-Free Stylization using Reference-Based Modulation
Litu Rout, Yujia Chen, Nataniel Ruiz et al.
Training Graph Transformers via Curriculum-Enhanced Attention Distillation
Yisong Huang, Jin Li, Xinlong Chen et al.
Incremental Randomized Smoothing Certification
Shubham Dipak Ugare, Tarun Suresh, Debangshu Banerjee et al.
Single Teacher, Multiple Perspectives: Teacher Knowledge Augmentation for Enhanced Knowledge Distillation
Md Imtiaz Hossain, Sharmen Akhter, Choong Seon Hong et al.
Effective Data Augmentation With Diffusion Models
Brandon Trabucco, Kyle Doherty, Max Gurinas et al.
Forward $\chi^2$ Divergence Based Variational Importance Sampling
Chengrui Li, Yule Wang, Weihan Li et al.
Your Weak LLM is Secretly a Strong Teacher for Alignment
Leitian Tao, Yixuan Li
CONDA: Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts
Jihye Choi, Jayaram Raghuram, Yixuan Li et al.
Rethinking the Benefits of Steerable Features in 3D Equivariant Graph Neural Networks
Shih-Hsin Wang, Yung-Chang Hsu, Justin Baker et al.
Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching
Ziyao Guo, Kai Wang, George Cazenavette et al.
Boosting Graph Anomaly Detection with Adaptive Message Passing
Jingyan Chen, Guanghui Zhu, Chunfeng Yuan et al.
Looped Transformers are Better at Learning Learning Algorithms
Liu Yang, Kangwook Lee, Robert Nowak et al.
Lean-STaR: Learning to Interleave Thinking and Proving
Haohan Lin, Zhiqing Sun, Sean Welleck et al.
Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation
Ryan Wong, Necati Cihan Camgoz, Richard Bowden
Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Yiming Chen, Yuan Zhang, Liyuan Cao et al.
Exploring the cloud of feature interaction scores in a Rashomon set
Sichao Li, Rong Wang, Quanling Deng et al.
4K4DGen: Panoramic 4D Generation at 4K Resolution
Renjie Li, Panwang Pan, Bangbang Yang et al.
Latent Trajectory Learning for Limited Timestamps under Distribution Shift over Time
Qiuhao Zeng, Changjian Shui, Long-Kai Huang et al.
Online Stabilization of Spiking Neural Networks
Yaoyu Zhu, Jianhao Ding, Tiejun Huang et al.
Set Learning for Accurate and Calibrated Models
Lukas Muttenthaler, Robert A Vandermeulen, Qiuyi (Richard) Zhang et al.
Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning
Chongjie Si, Zhiyi Shi, Shifan Zhang et al.
Diffusing to the Top: Boost Graph Neural Networks with Minimal Hyperparameter Tuning
Lequan Lin, Dai Shi, Andi Han et al.
When Graph Neural Networks Meet Dynamic Mode Decomposition
Dai Shi, Lequan Lin, Andi Han et al.
DINOv2: Learning Robust Visual Features without Supervision
Pierre Fernandez, Piotr Bojanowski, Gabriel Synnaeve et al.
Uncertainty Herding: One Active Learning Method for All Label Budgets
Wonho Bae, Danica Sutherland, Gabriel Oliveira
Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised Learning
Johnathan Xie, Yoonho Lee, Annie Chen et al.
$q$-exponential family for policy optimization
Lingwei Zhu, Haseeb Shah, Han Wang et al.
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
Han Shen, Pin-Yu Chen, Payel Das et al.
Addressing Label Shift in Distributed Learning via Entropy Regularization
Zhiyuan Wu, Changkyu Choi, Xiangcheng Cao et al.
CollabEdit: Towards Non-destructive Collaborative Knowledge Editing
Jiamu Zheng, Jinghuai Zhang, Tianyu Du et al.
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Power
Lijia Yu, Yibo Miao, Yifan Zhu et al.
An Empirical Analysis of Uncertainty in Large Language Model Evaluations
Qiujie Xie, Qingqiu Li, Zhuohao Yu et al.
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Yucheng Suo, Fan Ma, Kaixin Shen et al.
TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights
Aiwei Liu, Haoping Bai, Zhiyun Lu et al.
Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models
Qiong Wu, Zhaoxi Ke, Yiyi Zhou et al.
Cross-Entropy Is All You Need To Invert the Data Generating Process
Patrik Reizinger, Alice Bizeul, Attila Juhos et al.
Point Cluster: A Compact Message Unit for Communication-Efficient Collaborative Perception
Zihan Ding, Jiahui Fu, Si Liu et al.
In Search of Forgotten Domain Generalization
Prasanna Mayilvahanan, Roland Zimmermann, Thaddäus Wiedemer et al.
RazorAttention: Efficient KV Cache Compression Through Retrieval Heads
Hanlin Tang, Yang Lin, Jing Lin et al.
Towards Hierarchical Rectified Flow
Yichi Zhang, Yici Yan, Alex Schwing et al.
Language Models Need Inductive Biases to Count Inductively
Yingshan Chang, Yonatan Bisk
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning
Prajwal Koirala, Zhanhong Jiang, Soumik Sarkar et al.
Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting
Yilun Zheng, Xiang Li, Sitao Luan et al.
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models
Qingni Wang, Tiantian Geng, Zhiyuan Wang et al.
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh, Reza Shirkavand, Shangqian Gao et al.
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan, Andranik Sargsyan, Barsegh Atanyan et al.
Relax and Merge: A Simple Yet Effective Framework for Solving Fair $k$-Means and $k$-sparse Wasserstein Barycenter Problems
Shihong Song, Guanlin Mo, Hu Ding
A Stochastic Approach to the Subset Selection Problem via Mirror Descent
Dan Greenstein, Elazar Gershuni, Ilan Ben-Bassat et al.
To Tackle Adversarial Transferability: A Novel Ensemble Training Method with Fourier Transformation
Wanlin Zhang, Weichen Lin, Ruomin Huang et al.
Latent Representation and Simulation of Markov Processes via Time-Lagged Information Bottleneck
Marco Federici, Patrick Forré, Ryota Tomioka et al.
Iterative Substructure Extraction for Molecular Relational Learning with Interactive Graph Information Bottleneck
Shuai Zhang, Junfeng Fang, Xuqiang Li et al.
Enhancing Neural Subset Selection: Integrating Background Information into Set Representations
Binghui Xie, Yatao Bian, Kaiwen Zhou et al.
Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets
Yudong Chen, Xuwei Xu, Frank de Hoog et al.
Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking
Cassidy Laidlaw, Shivam Singhal, Anca Dragan
AgentStudio: A Toolkit for Building General Virtual Agents
Longtao Zheng, Zhiyuan Huang, Zhenghai Xue et al.
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Yiqin Yang, Quanwei Wang, Chenghao Li et al.
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists
Zhiyang Xu, Minqian Liu, Ying Shen et al.
VisualPredicator: Learning Abstract World Models with Neuro-Symbolic Predicates for Robot Planning
Yichao Liang, Nishanth Kumar, Hao Tang et al.
Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning
Giseung Park, Youngchul Sung
MP-Mat: A 3D-and-Instance-Aware Human Matting and Editing Framework with Multiplane Representation
Siyi Jiao, Wenzheng Zeng, Yerong Li et al.
SPD Attack - Prevention of AI Powered Image Editing by Image Immunization
Parth Badgujar, Shorya Singhal, Devansh Bhardwaj
Partitioning Message Passing for Graph Fraud Detection
Wei Zhuo, Zemin Liu, Bryan Hooi et al.
Value-aligned Behavior Cloning for Offline Reinforcement Learning via Bi-level Optimization
Xingyu Jiang, Ning Gao, Xiuhui Zhang et al.
Wasserstein Distances, Neuronal Entanglement, and Sparsity
Shashata Sawmya, Linghao Kong, Ilia Markov et al.
Hybrid Regularization Improves Diffusion-based Inverse Problem Solving
Hongkun Dou, Zeyu Li, Jinyang Du et al.
Effective and Efficient Time-Varying Counterfactual Prediction with State-Space Models
Haotian Wang, Haoxuan Li, Hao Zou et al.
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data
Shubham Toshniwal, Wei Du, Ivan Moshkov et al.
HGM³: Hierarchical Generative Masked Motion Modeling with Hard Token Mining
Minjae Jeong, Yechan Hwang, Jaejin Lee et al.
Logic-Logit: A Logic-Based Approach to Choice Modeling
Shuhan Zhang, Wendi Ren, Shuang Li
Learning Evolving Tools for Large Language Models
Guoxin Chen, Zhong Zhang, Xin Cong et al.
Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds
Shuangqi Li, Hieu Le, Jingyi Xu et al.
Robust Representation Consistency Model via Contrastive Denoising
Jiachen Lei, Julius Berner, Jiongxiao Wang et al.
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning
Amin Karimi Monsefi, Mengxi Zhou, Nastaran Monsefi et al.
Learning View-invariant World Models for Visual Robotic Manipulation
Jing-Cheng Pang, Nan Tang, Kaiyuan Li et al.
Meta-Evolve: Continuous Robot Evolution for One-to-many Policy Transfer
Xingyu Liu, Deepak Pathak, DING ZHAO
Zero-shot Imputation with Foundation Inference Models for Dynamical Systems
Patrick Seifner, Kostadin Cvejoski, Antonia Körner et al.
OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees
Kaiyan Zhang, Jiayuan Zhang, Haoxin Li et al.
Provably Safeguarding a Classifier from OOD and Adversarial Samples
Nicolas Atienza, Johanne Cohen, Christophe Labreuche et al.
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models
Guanting Dong, Keming Lu, Chengpeng Li et al.
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
Zheng Chong, Xiao Dong, Haoxiang Li et al.
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving
Xiaosong Jia, Junqi You, Zhiyuan Zhang et al.
Reframing Structure-Based Drug Design Model Evaluation via Metrics Correlated to Practical Needs
Bowen Gao, Haichuan Tan, Yanwen Huang et al.
CertainlyUncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
Khyathi Chandu, Linjie Li, Anas Awadalla et al.
Unsupervised Zero-Shot Reinforcement Learning via Dual-Value Forward-Backward Representation
Jingbo Sun, Songjun Tu, Qichao Zhang et al.
Boltzmann Semantic Score: A Semantic Metric for Evaluating Large Vision Models Using Large Language Models
Ali Khajegili Mirabadi, Katherine Rich, Hossein Farahani et al.
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Jiahao Cui, Hui Li, Yao Yao et al.
GaussianBlock: Building Part-Aware Compositional and Editable 3D Scene by Primitives and Gaussians
Shuyi Jiang, Qihao Zhao, Hossein Rahmani et al.
Models trained with unnormalized density functions: A need for a course correction
Rishal Aggarwal, Daniel Penaherrera, Justin Shao et al.
Pedestrian Motion Reconstruction: A Large-scale Benchmark via Mixed Reality Rendering with Multiple Perspectives and Modalities
Yichen Wang, Yiyi Zhang, Xinhao Hu et al.
Flat Reward in Policy Parameter Space Implies Robust Reinforcement Learning
HyunKyu Lee, Sung Whan Yoon
Integral Performance Approximation for Continuous-Time Reinforcement Learning Control
Brent Wallace, Jennie Si
Collaborative Discrete-Continuous Black-Box Prompt Learning for Language Models
Hualin Zhang, Haozhen Zhang, Zhekai Liu et al.
BTBS-LNS: Binarized-Tightening, Branch and Search on Learning LNS Policies for MIP
Hao Yuan, Wenli Ouyang, Changwen Zhang et al.
A Theoretically-Principled Sparse, Connected, and Rigid Graph Representation of Molecules
Shih-Hsin Wang, Yuhao Huang, Justin Baker et al.
Efficient Perplexity Bound and Ratio Matching in Discrete Diffusion Language Models
Etrit Haxholli, Yeti Z. Gurbuz, Oğul Can et al.