Most Cited 2025 "reduction step elimination" Papers
22,274 papers found • Page 75 of 112
Conference
Language Decoupling with Fine-grained Knowledge Guidance for Referring Multi-object Tracking
guangyao Li, Siping Zhuang, Yajun Jian et al.
Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering
Liang Chen, Zhe Xue, Yawen Li et al.
RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks
Mingxuan Yan, Yuping Wang, Zechun Liu et al.
VIBE: Annotation-Free Video-to-Text Information Bottleneck Evaluation for TL;DR
Shenghui Chen, Po-han Li, Sandeep Chinchali et al.
Towards Fine-grained Interactive Segmentation in Images and Videos
Yuan Yao, Qiushi Yang, Miaomiao Cui et al.
Reminiscence Attack on Residuals: Exploiting Approximate Machine Unlearning for Privacy
Yaxin Xiao, Qingqing Ye, Li Hu et al.
Devil is in the Uniformity: Exploring Diverse Learners within Transformer for Image Restoration
Shihao Zhou, Dayu Li, Jinshan Pan et al.
On the Entropy Calibration of Language Models
Steven Cao, Gregory Valiant, Percy Liang
From Synapses to Dynamics: Obtaining Function from Structure in a Connectome Constrained Model of the Head Direction Circuit
Sunny Duan, Ling L. Dong, Ila Fiete
Learning on the Go: A Meta-learning Object Navigation Model
Xiaorong Qin, Xinhang Song, Sixian Zhang et al.
De^2Gaze: Deformable and Decoupled Representation Learning for 3D Gaze Estimation
Yunfeng Xiao, Xiaowei Bai, Baojun Chen et al.
IM360: Large-scale Indoor Mapping with 360 Cameras
Dongki Jung, Jaehoon Choi, Yonghan Lee et al.
EMatch: A Unified Framework for Event-based Optical Flow and Stereo Matching
Pengjie Zhang, Lin Zhu, Xiao Wang et al.
Mamba Goes HoME: Hierarchical Soft Mixture-of-Experts for 3D Medical Image Segmentation
Szymon Płotka, Gizem Mert, Maciej Chrabaszcz et al.
Customized Condition Controllable Generation for Video Soundtrack
Fan Qi, KunSheng Ma, Changsheng Xu
Granular Concept Circuits: Toward a Fine-Grained Circuit Discovery for Concept Representations
Dahee Kwon, Sehyun Lee, Jaesik Choi
CanFields: Consolidating Diffeomorphic Flows for Non-Rigid 4D Interpolation from Arbitrary-Length Sequences
Miaowei Wang, Changjian Li, Amir Vaxman
GeneFlow: Translation of Single-cell Gene Expression to Histopathological Images via Rectified Flow
Mengbo Wang, Shourya Verma, Aditya Malusare et al.
QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation
Jiahui Yang, Yongjia Ma, Donglin Di et al.
RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors
Sicong Du, Jiarun Liu, Qifeng Chen et al.
Scene Coordinate Reconstruction Priors
Wenjing Bian, Axel Barroso-Laguna, Tommaso Cavallari et al.
Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering
Yuanlin Wang, Yiyang Zhang, Ruiqin Xiong et al.
Multi-Agent Learning under Uncertainty: Recurrence vs. Concentration
Kyriakos Lotidis, Panayotis Mertikopoulos, Nicholas Bambos et al.
AMBER: Adaptive Mesh Generation by Iterative Mesh Resolution Prediction
Niklas Freymuth, Tobias Würth, Nicolas Schreiber et al.
Self-Calibrating BCIs: Ranking and Recovery of Mental Targets Without Labels
Jonathan Grizou, Carlos De la Torre-Ortiz, Tuukka Ruotsalo
Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport
Mengnan Liu, Le Wang, Sanping Zhou et al.
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation
Tim Elsner, Paula Usinger, Julius Nehring-Wirxel et al.
PixPerfect: Seamless Latent Diffusion Local Editing with Discriminative Pixel-Space Refinement
Haitian Zheng, Yuan Yao, yongsheng yu et al.
Exploiting LLMs for Automatic Hypothesis Assessment via a Logit-Based Calibrated Prior
Yue Gong, Raul Fernandez
Any-stepsize Gradient Descent for Separable Data under Fenchel–Young Losses
Han Bao, Shinsaku Sakaue, Yuki Takezawa
Efficient Data Driven Mixture-of-Expert Extraction from Trained Networks
Uranik Berisha, Jens Mehnert, Alexandru Paul Condurache
Generalizable Reasoning through Compositional Energy Minimization
Alexandru Oarga, Yilun Du
PoseAnchor: Robust Root Position Estimation for 3D Human Pose Estimation
Jun-Hee Kim, Jumin Han, Seong-Whan Lee
DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis
Dongheon Lee, Younghoo Kwon, Jung-Woo Choi
A Controllable Examination for Long-Context Language Models
Yijun Yang, Zeyu Huang, Wenhao Zhu et al.
CryptoMoE: Privacy-Preserving and Scalable Mixture of Experts Inference via Balanced Expert Routing
Yifan Zhou, Tianshi Xu, Jue Hong et al.
Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction
Xiaolu Liu, Ruizi Yang, Song Wang et al.
PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation
Chikai Shang, Mengke Li, Yiqun Zhang et al.
SAO-Instruct: Free-form Audio Editing using Natural Language Instructions
Michael Ungersböck, Florian Grötschla, Luca Lanzendörfer et al.
Structured Temporal Causality for Interpretable Multivariate Time Series Anomaly Detection
Dongchan Cho, Jiho Han, Keumyeong Kang et al.
Time-Embedded Algorithm Unrolling for Computational MRI
Junno Yun, Yasar Utku Alcalar, Mehmet Akcakaya
Transductive Conformal Inference for Full Ranking
Jean-Baptiste Fermanian, Pierre Humbert, Gilles Blanchard
SPACE: SPike-Aware Consistency Enhancement for Test-Time Adaptation in Spiking Neural Networks
Xinyu Luo, Kecheng Chen, Pao-Sheng Sun et al.
VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models
Haidong Xu, Guangwei Xu, Zhedong Zheng et al.
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Edoardo Palladin, Samuel Brucker, Filippo Ghilotti et al.
Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions
Simon Matrenok, Skander Moalla, Caglar Gulcehre
PHGC: Procedural Heterogeneous Graph Completion for Natural Language Task Verification in Egocentric Videos
Xun Jiang, Zhiyi Huang, Xing Xu et al.
Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation
Yuxin Liu, Zhenghao (Mark) Peng, Xuanhao Cui et al.
Machine Unlearning under Overparameterization
Jacob Block, Aryan Mokhtari, Sanjay Shakkottai
TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models
Pooyan Rahmanzadehgervi, Hung Nguyen, Rosanne Liu et al.
Probabilistic Prompt Distribution Learning for Animal Pose Estimation
Jiyong Rao, Brian Nlong Zhao, Yu Wang
MEET: Towards Memory-Efficient Temporal Sparse Deep Neural Networks
Zeqi Zhu, Ibrahim Batuhan Akkaya, Luc Waeijen et al.
Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness
Lucas Piper, Arlindo L Oliveira, Tiago Marques
Context Guided Transformer Entropy Modeling for Video Compression
Junlong Tong, Wei Zhang, Yaohui Jin et al.
TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation
Changsong Lei, Yaqian Liang, Shaofeng Wang et al.
UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning
Long Zhou, Fereshteh Shakeri, Aymen Sadraoui et al.
The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis
Hoang Pham, The Anh Ta, Tom Jacobs et al.
Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation
Siyu Chen, Ting Han, Chengzheng Fu et al.
Saliuitl: Ensemble Salience Guided Recovery of Adversarial Patches against CNNs
Mauricio Byrd Victorica, György Dán, Henrik Sandberg
WISNet: Pseudo Label Generation on Unbalanced and Patch Annotated Waste Images
Shifan Zhang, Hongzi Zhu, Yinan He et al.
Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders
James Oldfield, Shawn Im, Sharon Li et al.
Implicit Counterfactual Learning for Audio-Visual Segmentation
Mingfeng Zha, Tianyu Li, Guoqing Wang et al.
Memory-Efficient Generative Models via Product Quantization
Jie Shao, Hanxiao Zhang, Hao Yu et al.
Libra-Merging: Importance-redundancy and Pruning-merging Trade-off for Acceleration Plug-in in Large Vision-Language Model
Longrong Yang, Dong Shen, Chaoxiang Cai et al.
ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection
Haowei Zhu, Tianxiang Pan, Rui Qin et al.
SegMASt3R: Geometry Grounded Segment Matching
Rohit Jayanti, Swayam Agrawal, Vansh Garg et al.
Continuity and Isolation Lead to Doubts or Dilemmas in Large Language Models
Hector Pasten, Felipe Urrutia, Hector Orellana et al.
Characterization and Learning of Causal Graphs from Hard Interventions
Zihan Zhou, Muhammad Qasim Elahi, Murat Kocaoglu
Dual-Flow: Transferable Multi-Target, Instance-Agnostic Attacks via $\textit{In-the-wild}$ Cascading Flow Optimization
Yixiao Chen, Shikun Sun, Jianshu Li et al.
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
Changyao Tian, Hao Li, Gen Luo et al.
Cooperative Retrieval-Augmented Generation for Question Answering: Mutual Information Exchange and Ranking by Contrasting Layers
Youmin Ko, Sungjong Seo, Hyunjoon Kim
LLM Interpretability with Identifiable Temporal-Instantaneous Representation
Xiangchen Song, Jiaqi Sun, Zijian Li et al.
TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning
Sheng Wang, Pengan CHEN, Jingqi Zhou et al.
PathVQ: Reforming Computational Pathology Foundation Model for Whole Slide Image Analysis via Vector Quantization
Honglin Li, Zhongyi Shui, Yunlong Zhang et al.
Blind Bitstream-corrupted Video Recovery via Metadata-guided Diffusion Model
Shuyun Wang, Hu Zhang, Xin Shen et al.
Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge
Linshen Liu, Boyan Su, Junyue Jiang et al.
DreamLight: Towards Harmonious and Consistent Image Relighting
Yong Liu, Wenpeng Xiao, Qianqian Wang et al.
Explaining and Mitigating Crosslingual Tokenizer Inequities
Catherine Arnett, Tyler Chang, Stella Biderman et al.
Competitive Distillation: A Simple Learning Strategy for Improving Visual Classification
Daqian Shi, Xiaolei Diao, Xu Chen et al.
AIComposer: Any Style and Content Image Composition via Feature Integration
Haowen Li, Zhenfeng Fan, Zhang Wen et al.
3D Prior Is All You Need: Cross-Task Few-shot 2D Gaze Estimation
Yihua Cheng, Hengfei Wang, Zhongqun Zhang et al.
Reliably detecting model failures in deployment without labels
Viet Nguyen, Changjian Shui, Vijay Giri et al.
StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs
Qijun Luo, Mengqi Li, Lei Zhao et al.
Rethink Sparse Signals for Pose-guided Text-to-image Generation
Wenjie Xuan, Jing Zhang, Juhua Liu et al.
LTD-Bench: Evaluating Large Language Models by Letting Them Draw
Liuhao Lin, Ke Li, Zihan Xu et al.
IRGPT: Understanding Real-world Infrared Image with Bi-cross-modal Curriculum on Large-scale Benchmark
Zhe Cao, Jin Zhang, Ruiheng Zhang
RDB2G-Bench: A Comprehensive Benchmark for Automatic Graph Modeling of Relational Databases
Dongwon Choi, Sunwoo Kim, Juyeon Kim et al.
Conditional Representation Learning for Customized Tasks
Honglin Liu, Chao Sun, Peng Hu et al.
Conformal Online Learning of Deep Koopman Linear Embeddings
Ben Gao, Jordan Patracone, Stephane Chretien et al.
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions
Tommaso Galliena, Tommaso Apicella, Stefano Rosa et al.
Attention on the Sphere
Boris Bonev, Max Rietmann, Andrea Paris et al.
Neural Tangent Knowledge Distillation for Optical Convolutional Networks
Jinlin Xiang, Minho Choi, Yubo Zhang et al.
Composing Parts for Expressive Object Generation
Harsh Rangwani, Aishwarya Agarwal, Kuldeep Kulkarni et al.
Grids Often Outperform Implicit Neural Representation at Compressing Dense Signals
Namhoon Kim, Sara Fridovich-Keil
Test-time Augmentation Improves Efficiency in Conformal Prediction
Divya M Shanmugam, Helen Lu, Swami Sankaranarayanan et al.
Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2D
Jiawei Tan, Hongxing Wang, Junwu Weng et al.
Beyond the Average: Distributional Causal Inference under Imperfect Compliance
Undral Byambadalai, Tomu Hirata, Tatsushi Oka et al.
Feature-Based Instance Neighbor Discovery: Advanced Stable Test-Time Adaptation in Dynamic World
Qinting Jiang, Chuyang Ye, Dongyan Wei et al.
Adaptive Algorithms with Sharp Convergence Rates for Stochastic Hierarchical Optimization
Xiaochuan Gong, Jie Hao, Mingrui Liu
DrivAerStar: An Industrial-Grade CFD Dataset for Vehicle Aerodynamic Optimization
Jiyan Qiu, Lyulin Kuang, Guan Wang et al.
Accelerating data-driven algorithm selection for combinatorial partitioning problems
Vaggos Chatziafratis, Ishani Karmarkar, Yingxi Li et al.
Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras
Petr Hruby, Marc Pollefeys
Transformers are almost optimal metalearners for linear classification
Roey Magen, Gal Vardi
Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection
Hyewon Park, Hyejin Park, Jueun Ko et al.
Linearly Constrained Diffusion Implicit Models
Vivek Jayaram, Ira Kemelmacher-Shlizerman, Steve Seitz et al.
OURO: A Self-Bootstrapped Framework for Enhancing Multimodal Scene Understanding
Tianrun Xu, Guanyu Chen, Ye Li et al.
Activation Subspaces for Out-of-Distribution Detection
Barış Zöngür, Robin Hesse, Stefan Roth
Valid Inference with Imperfect Synthetic Data
Yewon Byun, Shantanu Gupta, Zachary Lipton et al.
Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning
Zedong Wang, Siyuan Li, Dan Xu
Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning
Zhengxuan Wei, Jiajin Tang, Sibei Yang
Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape
Ruichen Chen, Keith Mills, Liyao Jiang et al.
$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization
Rahul Vaze, Abhishek Sinha
Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency
Jun Yang, Cheng-Chi Wang, Bogdan Stoica et al.
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
Jialong Zuo, Yongtai Deng, Mengdan Tan et al.
ClaraVid: A Holistic Scene Reconstruction Benchmark From Aerial Perspective With Delentropy-Based Complexity Profiling
Radu Beche, Sergiu Nedevschi
A learnability analysis on neuro-symbolic learning
Hao-Yuan He, Ming LI
Exponential Dynamic Energy Network for High Capacity Sequence Memory
Arjun Karuvally, Pichsinee Lertsaroj, Terrence Sejnowski et al.
Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation
Jiahua Dong, Hui Yin, Wenqi Liang et al.
Discontinuity-aware Normal Integration for Generic Central Camera Models
Francesco Milano, Manuel Lopez-Antequera, Naina Dhingra et al.
Advancing Manga Analysis: Comprehensive Segmentation Annotations for the Manga109 Dataset
Minshan Xie, Jian Lin, Hanyuan Liu et al.
Boost the Inference with Co-training: A Depth-guided Mutual Learning Framework for Semi-supervised Medical Polyp Segmentation
Yuxin Li, Zihao Zhu, Yuxiang Zhang et al.
Time-Masked Transformers with Lightweight Test-Time Adaptation for Neural Speech Decoding
Ebrahim Feghhi, Shreyas Kaasyap, Nima Hadidi et al.
RAGD: Regional-Aware Diffusion Model for Text-to-Image Generation
Chen Zhennan, Yajie Li, Haofan Wang et al.
SL2A-INR: Single-Layer Learnable Activation for Implicit Neural Representation
Reza Rezaeian, Moein Heidari, Reza Azad et al.
Nonlinearly Preconditioned Gradient Methods: Momentum and Stochastic Analysis
Konstantinos Oikonomidis, Jan Quan, Panagiotis Patrinos
Energy-based generator matching: A neural sampler for general state space
Dongyeop Woo, Minsu Kim, Minkyu Kim et al.
WildAvatar: Learning In-the-wild 3D Avatars from the Web
Zihao Huang, Shoukang Hu, Guangcong Wang et al.
Seemingly Redundant Modules Enhance Robust Odor Learning in Fruit Flies
HaiYang Li, Liao Yu, Qiang Yu et al.
Domain Generalizable Portrait Style Transfer
Xinbo Wang, Wenju Xu, Qing Zhang et al.
SCoT: Unifying Consistency Models and Rectified Flows via Straight-Consistent Trajectories
zhangkai wu, Xuhui Fan, Hongyu Wu et al.
Token Bottleneck: One Token to Remember Dynamics
Taekyung Kim, Dongyoon Han, Byeongho Heo et al.
High-quality Point Cloud Oriented Normal Estimation via Hybrid Angular and Euclidean Distance Encoding
Yuanqi Li, Jingcheng Huang, Hongshen Wang et al.
DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection
Yuval Haitman, Oded Bialer
LC-Opt: Benchmarking Reinforcement Learning and Agentic AI for End-to-End Liquid Cooling Optimization in Data Centers
Avisek Naug, Antonio Guillen-Perez, Vineet Kumar et al.
Look-Ahead Reasoning on Learning Platforms
Haiqing Zhu, Tijana Zrnic, Celestine Mendler-Dünner
Comprehensive Assessment and Analysis for NSFW Content Erasure in Text-to-Image Diffusion models
Die Chen, Zhiwen Li, Cen Chen et al.
MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild
Muhammad Usama Saleem, Ekkasit Pinyoanuntapong, Mayur Patel et al.
ResidualViT for Efficient Temporally Dense Video Encoding
Mattia Soldan, Fabian Caba Heilbron, Bernard Ghanem et al.
Stochastically Dominant Peer Prediction
Yichi Zhang, Shengwei Xu, Grant Schoenebeck et al.
GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion
Karlo Koledic, Luka Petrovic, Ivan Marković et al.
Regularized least squares learning with heavy-tailed noise is minimax optimal
Mattes Mollenhauer, Nicole Muecke, Dimitri Meunier et al.
Aligning Effective Tokens with Video Anomaly in Large Language Models
YINGXIAN Chen, Jiahui Liu, Ruidi Fan et al.
VQ-SGen: A Vector Quantized Stroke Representation for Creative Sketch Generation
Jiawei Wang, Zhiming Cui, Changjian Li
Self-Supervised Learning of Graph Representations for Network Intrusion Detection
Lorenzo Guerra, Thomas Chapuis, Guillaume Duc et al.
Adaptive Data Analysis for Growing Data
Neil Marchant, Benjamin Rubinstein
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes
Christodoulos Constantinides, Dhaval Patel, Shuxin Lin et al.
Beyond Low-Rank Tuning: Model Prior-Guided Rank Allocation for Effective Transfer in Low-Data and Large-Gap Regimes.
Chuyan Zhang, Kefan Wang, Yun Gu
Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning
Zichen Tian, Yaoyao Liu, Qianru Sun
Geometry of Decision Making in Language Models
Abhinav Joshi, Divyanshu Bhatt, Ashutosh Modi
MFogHub: Bridging Multi-Regional and Multi-Satellite Data for Global Marine Fog Detection and Forecasting
Mengqiu XU, Kaixin Chen, Heng Guo et al.
Hybrid-Balance GFlowNet for Solving Vehicle Routing Problems
Ni Zhang, Zhiguang Cao
No More Sibling Rivalry: Debiasing Human-Object Interaction Detection
Bin Yang, Yulin Zhang, Hong-Yu Zhou et al.
Kernel Learning with Adversarial Features: Numerical Efficiency and Adaptive Regularization
Antonio Ribeiro, David Vävinggren, Dave Zachariah et al.
Borrowing Eyes for the Blind Spot: Overcoming Data Scarcity in Malicious Video Detection via Cross-Domain Retrieval Augmentation
Rongpei Hong, Jian Lang, Ting Zhong et al.
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
Langyu Wang, Langyu Wang, Yingying Chen et al.
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
Yuan Zhou, Qingshan Xu, Jiequan Cui et al.
Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing
Shengzhi Wang, Yingkang Zhong, Jiangchuan Mu et al.
LLM Query Scheduling with Prefix Reuse and Latency Constraints
Gregory Dexter, Shao Tang, Ata Fatahi et al.
Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction
Yuanbo Wang, Zhaoxuan Zhang, Jiajin Qiu et al.
Implicit Correspondence Learning for Image-to-Point Cloud Registration
Xinjun Li, Wenfei Yang, Jiacheng Deng et al.
Instant4D: 4D Gaussian Splatting in Minutes
Zhanpeng Luo, Haoxi Ran, Li Lu
G2PDiffusion: Cross-species Genotype-to-Phenotype Prediction via Evolutionary Diffusion
Mengdi Liu, Zhangyang Gao, Hong Chang et al.
Risk Management for Mitigating Benchmark Failure Modes: BenchRisk
Sean McGregor, Vassil Tashev, Armstrong Foundjem et al.
Individual Regret in Cooperative Stochastic Multi-Armed Bandits
Idan Barnea, Tal Lancewicki, Yishay Mansour
AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs
Di He, Songjun Tu, Ajay Jaiswal et al.
What Really is a Member? Discrediting Membership Inference via Poisoning
Neal Mangaokar, Ashish Hooda, Zhuohang Li et al.
From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics
Zheng-An Chen, Tao Luo
ConViS-Bench: Estimating Video Similarity Through Semantic Concepts
Benedetta Liberatori, Alessandro Conti, Lorenzo Vaquero et al.
How many measurements are enough? Bayesian recovery in inverse problems with general distributions
Ben Adcock, Zi Yuan (Nick) Huang
Progressive Homeostatic and Plastic Prompt Tuning for Audio-Visual Multi-Task Incremental Learning
Jiong Yin, Liang Li, Jiehua Zhang et al.
COS3D: Collaborative Open-Vocabulary 3D Segmentation
Runsong Zhu, Ka-Hei Hui, Zhengzhe Liu et al.
Take the Bull by the Horns: Learning to Segment Hard Samples
Yuan Guo, Jingyu Kong, Yu Wang et al.
FairDD: Fair Dataset Distillation
Qihang Zhou, ShenHao Fang, Shibo He et al.
UniGS: Modeling Unitary 3D Gaussians for Novel View Synthesis from Sparse-view Images
Jiamin WU, Kenkun Liu, Xiaoke Jiang et al.
Task-Specific Zero-shot Quantization-Aware Training for Object Detection
Changhao Li, Xinrui Chen, Ji Wang et al.
Tree-Guided Diffusion Planner
Hyeonseong Jeon, Cheolhong Min, Jaesik Park
PRESTO: Preimage-Informed Instruction Optimization for Prompting Black-Box LLMs
Jaewon Chu, Seunghun Lee, Hyunwoo J. Kim
Differentially Private Bilevel Optimization: Efficient Algorithms with Near-Optimal Rates
Andrew Lowy, Daogao Liu
Image Super-Resolution with Guarantees via Conformalized Generative Models
Eduardo Adame, Daniel Csillag, Guilherme Tegoni Goedert
SharpZO: Hybrid Sharpness-Aware Vision Language Model Prompt Tuning via Forward-Only Passes
Yifan Yang, Zhen Zhang, Rupak Vignesh Swaminathan et al.
Bridging Human and LLM Judgments: Understanding and Narrowing the Gap
Felipe Maia Polo, Xinhe Wang, Mikhail Yurochkin et al.
AutoPrompt: Automated Red-Teaming of Text-to-Image Models via LLM-Driven Adversarial Prompts
Yufan Liu, Wanqian Zhang, Huashan Chen et al.
Minimizing False-Positive Attributions in Explanations of Non-Linear Models
Anders Gjølbye, Stefan Haufe, Lars Kai Hansen
FreeDance: Towards Harmonic Free-Number Group Dance Generation via a Unified Framework
Yiwen Zhao, Yang Wang, Liting Wen et al.
Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models
Lexiang Xiong, Liu Chengyu, Jingwen Ye et al.
Towards A Translative Model of Sperm Whale Vocalization
Orr Paradise, Liangyuan Chen, Pranav Muralikrishnan et al.
Counterfactual Identifiability via Dynamic Optimal Transport
Fabio De Sousa Ribeiro, Ainkaran Santhirasekaram, Ben Glocker
KnowMol: Advancing Molecular Large Language Models with Multi-Level Chemical Knowledge
Zaifei Yang, Hong Chang, RuiBing Hou et al.
REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing
Weihan Xu, Yimeng Ma, Jingyue Huang et al.
Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable
Xin Jin, Simon Niklaus, Zhoutong Zhang et al.
Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning
Yash Jhaveri, Harley Wiltzer, Patrick Shafto et al.
Dual Semantic Guidance for Open Vocabulary Semantic Segmentation
ZhengYang Wang, Tingliang Feng, Fan Lyu et al.
Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather
Longyu Yang, Ping Hu, Shangbo Yuan et al.
From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks
Awa Khouna, Julien Ferry, Thibaut Vidal
Error Broadcast and Decorrelation as a Potential Artificial and Natural Learning Mechanism
Mete Erdogan, Cengiz Pehlevan, Alper Erdogan
Architectural and Inferential Inductive Biases for Exchangeable Sequence Modeling
Daksh Mittal, Leon Li, Thomson Yen et al.
Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility
Yidi Li, Jun Xiao, Zhengda Lu et al.
BitMark: Watermarking Bitwise Autoregressive Image Generative Models
Louis Kerner, Michel Meintz, Bihe Zhao et al.
Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models
Xiaoyu Zhan, Wenxuan Huang, Hao Sun et al.
UnMix-NeRF: Spectral Unmixing Meets Neural Radiance Fields
Fabian Perez, Sara Rojas Martinez, Carlos Hinojosa et al.