Most Cited 2025 "microtransactions" Papers
22,274 papers found • Page 60 of 112
Conference
Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Inho Kim, YOUNGKIL SONG, Jicheol Park et al.
UniFoil: A Universal Dataset of Airfoils in Transitional and Turbulent Regimes for Subsonic and Transonic Flows
Rohit Kanchi, Benjamin Melanson, Nithin Somasekharan et al.
A Provable Approach for End-to-End Safe Reinforcement Learning
Akifumi Wachi, Kohei Miyaguchi, Takumi Tanabe et al.
metaTextGrad: Automatically optimizing language model optimizers
Guowei Xu, Mert Yuksekgonul, Carlos Guestrin et al.
EntityErasure: Erasing Entity Cleanly via Amodal Entity Segmentation and Completion
Yixing Zhu, Qing Zhang, Yitong Wang et al.
REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving
Annabelle Sujun Tang, Christopher Priebe, Rohan Mahapatra et al.
SING: SDE Inference via Natural Gradients
Amber Hu, Henry Smith, Scott Linderman
Bigram Subnetworks: Mapping to Next Tokens in Transformer Language Models
Tyler Chang, Benjamin Bergen
MUSTAFAR: Promoting Unstructured Sparsity for KV Cache Pruning in LLM Inference
Donghyeon Joo, Helya Hosseini, Ramyad Hadidi et al.
Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted
Shuaiwei Yuan, Junyu Dong, Yuezun Li
MoE-Gyro: Self-Supervised Over-Range Reconstruction and Denoising for MEMS Gyroscopes
Feiyang Pan, Shenghe Zheng, Chunyan Yin et al.
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
Yisong Xiao, Aishan Liu, Siyuan Liang et al.
CALM-PDE: Continuous and Adaptive Convolutions for Latent Space Modeling of Time-dependent PDEs
Jan Hagnberger, Daniel Musekamp, Mathias Niepert
Unsupervised Learning for Optimal Transport plan prediction between unbalanced graphs
Sonia Mazelet, Rémi Flamary, Bertrand Thirion
Forensic Self-Descriptions Are All You Need for Zero-Shot Detection, Open-Set Source Attribution, and Clustering of AI-generated Images
Tai Nguyen, Aref Azizpour, Matthew Stamm
Stable Matching with Ties: Approximation Ratios and Learning
Shiyun Lin, Simon Mauras, Nadav Merlis et al.
Incomplete Multi-view Clustering via Hierarchical Semantic Alignment and Cooperative Completion
Xiaojian Ding, Lin Zhao, Xian Li et al.
Efficient Multimodal Dataset Distillation via Generative Models
Zhenghao Zhao, Haoxuan Wang, Junyi Wu et al.
Just Dance with pi! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection
Snehashis Majhi, Giacomo D'Amicantonio, Antitza Dantcheva et al.
Dyn-O: Building Structured World Models with Object-Centric Representations
Zizhao Wang, Kaixin Wang, Li Zhao et al.
Tackling Feature-Classifier Mismatch in Federated Learning via Prompt-Driven Feature Transformation
Xinghao Wu, Xuefeng Liu, Jianwei Niu et al.
MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans
Shubhankar Borse, Seokeon Choi, Sunghyun Park et al.
Improving Diffusion-based Inverse Algorithms under Few-Step Constraint via Linear Extrapolation
Jiawei Zhang, Ziyuan Liu, Leon Yan et al.
ADPretrain: Advancing Industrial Anomaly Detection via Anomaly Representation Pretraining
Xincheng Yao, Yan Luo, Zefeng Qian et al.
HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance
JUE GONG, Tingyu Yang, Jingkai Wang et al.
On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization
Shaocong Ma, Heng Huang
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models
Haoyi Song, Ruihan Ji, Naichen Shi et al.
Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation
Xingguo Lv, Xingbo Dong, Liwen Wang et al.
PBR-SR: Mesh PBR Texture Super Resolution from 2D Image Priors
Yujin Chen, Yinyu Nie, Benjamin Ummenhofer et al.
DTOS: Dynamic Time Object Sensing with Large Multimodal Model
Jirui Tian, Jinrong Zhang, Shenglan Liu et al.
Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model
Changchang Sun, Gaowen Liu, Charles Fleming et al.
SimSort: A Data-Driven Framework for Spike Sorting by Large-Scale Electrophysiology Simulation
Yimu Zhang, Dongqi Han, Yansen Wang et al.
Auto-Encoded Supervision for Perceptual Image Super-Resolution
MinKyu Lee, Sangeek Hyun, Woojin Jun et al.
TSP-Mamba: The Travelling Salesman Problem Meets Mamba for Image Super-resolution and Beyond
Kun Zhou, Xinyu Lin, Jiangbo Lu
Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training
Lexington Whalen, Zhenbang Du, Haoran You et al.
Flexible Group Count Enables Hassle-Free Structured Pruning
Jiamu Zhang, Shaochen Zhong, Andrew Ye et al.
ROGR: Relightable 3D Objects using Generative Relighting
Jiapeng Tang, Matthew Levine, Dor Verbin et al.
Scaling Image Geo-Localization to Continent Level
Philipp Lindenberger, Paul-Edouard Sarlin, Jan Hosang et al.
With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You
Fabian Gröger, Shuo Wen, Huyen Le et al.
RADAR: Benchmarking Language Models on Imperfect Tabular Data
Ken Gu, Zhihan Zhang, Kate Lin et al.
Auto-Compressing Networks
Evangelos Dorovatas, Georgios Paraskevopoulos, Alexandros Potamianos
Multi-Modal Aerial-Ground Cross-View Place Recognition with Neural ODEs
Sijie Wang, Rui She, Qiyu Kang et al.
HeroFilter: Adaptive Spectral Graph Filter for Varying Heterophilic Relations
Shuaicheng Zhang, Haohui Wang, Junhong Lin et al.
Bridging Theory and Practice in Link Representation with Graph Neural Networks
Veronica Lachi, Francesco Ferrini, Antonio Longa et al.
MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation
Chenhui Zhu, Yilu Wu, Shuai Wang et al.
Tail-Optimized Caching for LLM Inference
Wenxin Zhang, Yueying Li, Ciamac C Moallemi et al.
JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data
Runjian Chen, Wenqi Shao, Bo Zhang et al.
NOBLE - Neural Operator with Biologically-informed Latent Embeddings to Capture Experimental Variability in Biological Neuron Models
Luca Ghafourpour, Valentin Duruisseaux, Bahareh Tolooshams et al.
AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners
Reiss Koh, Wonbeen Oh, Jaein Jang et al.
Synthetic Visual Genome
Jae Sung Park, Zixian Ma, Linjie Li et al.
From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling
Jinhong Lin, Cheng-En Wu, Huanran Li et al.
Improved Bounds for Swap Multicalibration and Swap Omniprediction
Haipeng Luo, Spandan Senapati, Vatsal Sharan
Sequential Monte Carlo for Policy Optimization in Continuous POMDPs
Hany Abdulsamad, Sahel Mohammad Iqbal, Simo Sarkka
Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control
Danfeng Li, Hui Zhang, Sheng Wang et al.
Temporal Logic-Based Multi-Vehicle Backdoor Attacks against Offline RL Agents in End-to-end Autonomous Driving
Xuan Chen, Shiwei Feng, Zikang Xiong et al.
MotionMap: Representing Multimodality in Human Pose Forecasting
Reyhaneh Hosseininejad, Megh Shukla, Saeed Saadatnejad et al.
T-CIL: Temperature Scaling using Adversarial Perturbation for Calibration in Class-Incremental Learning
Seong-Hyeon Hwang, Minsu Kim, Steven Euijong Whang
CoE: Chain-of-Explanation via Automatic Visual Concept Circuit Description and Polysemanticity Quantification
wenlong yu, Qilong Wang, Chuang Liu et al.
LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents
Rui Li, Zixuan Hu, Wenxi Qu et al.
Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes
Kaiqing Lin, Zhiyuan Yan, Ke-Yue Zhang et al.
Robust Transfer Learning with Unreliable Source Data
Jianqing Fan, Cheng Gao, Jason Klusowski
See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction
Yuan Wu, Zhiqiang Yan, Yigong Zhang et al.
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long, Jorge Cortes, Nikolay Atanasov
Sampling 3D Molecular Conformers with Diffusion Transformers
J. Thorben Frank, Winfried Ripken, Gregor Lied et al.
Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning
Haolin Yang, Hakaze Cho, Yiqiao Zhong et al.
OmniTry: Virtual Try-On Anything without Masks
Yutong Feng, Linlin Zhang, Hengyuan Cao et al.
Block Coordinate Descent for Neural Networks Provably Finds Global Minima
Shunta Akiyama
STDD: Spatio-Temporal Dual Diffusion for Video Generation
Shuaizhen Yao, Xiaoya Zhang, Xin Liu et al.
Final-Model-Only Data Attribution with a Unifying View of Gradient-Based Methods
Dennis Wei, Inkit Padhi, Soumya Ghosh et al.
Exploring the Noise Robustness of Online Conformal Prediction
HuaJun Xi, Kangdao Liu, Hao Zeng et al.
Self-Verifying Reflection Helps Transformers with CoT Reasoning
Zhongwei Yu, Wannian Xia, Xue Yan et al.
Constrained Discrete Diffusion
Michael Cardei, Jacob K Christopher, Bhavya Kailkhura et al.
BraVE: Offline Reinforcement Learning for Discrete Combinatorial Action Spaces
Matthew Landers, Taylor W. Killian, Hugo Barnes et al.
Localizing Knowledge in Diffusion Transformers
Arman Zarei, Samyadeep Basu, Keivan Rezaei et al.
DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration
Tianteng Gu, Bei Liu, Bo Xiao et al.
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation
Cécile Rousseau, Tobia Boschi, Giandomenico Cornacchia et al.
3DOT: Texture Transfer for 3DGS Objects from a Single Reference Image
Xiao Cao, Beibei Lin, Bo Wang et al.
A2Seek: Towards Reasoning-Centric Benchmark for Aerial Anomaly Understanding
Mengjingcheng Mo, Xinyang Tong, Mingpi Tan et al.
EZSR: Event-based Zero-Shot Recognition
Yan Yang, Liyuan Pan, Dongxu Li et al.
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
Seanie Lee, Sangwoo Park, Dong Bok Lee et al.
Neural-Driven Image Editing
Pengfei Zhou, Jie Xia, Xiaopeng Peng et al.
Maintaining Consistent Inter-Class Topology in Continual Test-Time Adaptation
Chenggong Ni, Fan Lyu, Jiayao Tan et al.
Visual Consensus Prompting for Co-Salient Object Detection
Jie Wang, Nana Yu, Zihao Zhang et al.
AlignedGen: Aligning Style Across Generated Images
Jiexuan Zhang, Yiheng Du, Qian Wang et al.
Towards Unsupervised Domain Bridging via Image Degradation in Semantic Segmentation
Wangkai Li, Rui Sun, Huayu Mai et al.
Fancy123: One Image to High-Quality 3D Mesh Generation via Plug-and-Play Deformation
Qiao Yu, Xianzhi Li, Yuan Tang et al.
BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization
Tonghan Wang, Yanchen Jiang, David Parkes
Filter Like You Test: Data-Driven Data Filtering for CLIP Pretraining
Mikey Shechter, Yair Carmon
Learning non-equilibrium diffusions with Schrödinger bridges: from exactly solvable to simulation-free
Stephen Zhang, Michael Stumpf
Hierarchical Flow Diffusion for Efficient Frame Interpolation
Yang Hai, Guo Wang, Tan Su et al.
Reconciling Stochastic and Deterministic Strategies for Zero-shot Image Restoration using Diffusion Model in Dual
Chong Wang, Lanqing Guo, Zixuan Fu et al.
PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies
Mojtaba Nafez, Amirhossein Koochakian, Arad Maleki et al.
Sound Bridge: Associating Egocentric and Exocentric Videos via Audio Cues
Sihong Huang, Jiaxin Wu, Xiaoyong Wei et al.
Multi-Modal Synergistic Implicit Image Enhancement for Efficient Optical Flow Estimation
Weichen Dai, wu hexing, xiaoyang weng et al.
Towards Million-Scale Adversarial Robustness Evaluation With Stronger Individual Attacks
Yong Xie, Weijie Zheng, Hanxun Huang et al.
Volumetric Surfaces: Representing Fuzzy Geometries with Layered Meshes
Stefano Esposito, Anpei Chen, Christian Reiser et al.
Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector
Haoyan Yang, Runxue Bao, Cao (Danica) Xiao et al.
Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems
Jongyeong Lee, Junya Honda, Shinji Ito et al.
Practical Bayes-Optimal Membership Inference Attacks
Marcus Lassila, Johan Oestman, Khac-Hoang Ngo et al.
Adapting Dense Matching for Homography Estimation with Grid-based Acceleration
Kaining Zhang, Yuxin Deng, Jiayi Ma et al.
Multi-Group Proportional Representations for Text-to-Image Models
Sangwon Jung, Alex Oesterling, Claudio Mayrink Verdun et al.
Encapsulated Composition of Text-to-Image and Text-to-Video Models for High-Quality Video Synthesis
Tongtong Su, Chengyu Wang, Bingyan Liu et al.
SHAP Meets Tensor Networks: Provably Tractable Explanations with Parallelism
Reda Marzouk, Shahaf Bassan, Guy Katz
Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures
Nina Vesseron, Louis Bethune, Marco Cuturi
Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and Challenges
Pengrui Quan, Brian Wang, Kang Yang et al.
The Computational Complexity of Counting Linear Regions in ReLU Neural Networks
Moritz Stargalla, Christoph Hertrich, Daniel Reichman
LLM Meeting Decision Trees on Tabular Data
Hangting Ye, Jinmeng Li, He Zhao et al.
Stochastic Regret Guarantees for Online Zeroth- and First-Order Bilevel Optimization
Parvin Nazari, Bojian Hou, Davoud Ataee Tarzanagh et al.
BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception
junyan ye, Dongzhi JIANG, Jun He et al.
Beyond Higher Rank: Token-wise Input-Output Projections for Efficient Low-Rank Adaptation
Shiwei Li, Xiandi Luo, Haozhao Wang et al.
A High-Dimensional Statistical Method for Optimizing Transfer Quantities in Multi-Source Transfer Learning
Qingyue Zhang, Haohao Fu, Guanbo Huang et al.
RSCC: A Large-Scale Remote Sensing Change Caption Dataset for Disaster Events
Zhenyuan Chen, Chenxi Wang, Ningyu Zhang et al.
GnnXemplar: Exemplars to Explanations - Natural Language Rules for Global GNN Interpretability
Burouj Armgaan, Eshan Jain, Harsh Pandey et al.
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
Shinji Ito, Kevin Jamieson, Haipeng Luo et al.
ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding
Jialiang Kang, Han Shu, Wenshuo Li et al.
Taming Hyperparameter Sensitivity in Data Attribution: Practical Selection Without Costly Retraining
Weiyi Wang, Junwei Deng, Yuzheng Hu et al.
EvoBrain: Dynamic Multi-Channel EEG Graph Modeling for Time-Evolving Brain Networks
Rikuto Kotoge, Zheng Chen, Tasuku Kimura et al.
Improved Approximation Algorithms for Chromatic and Pseudometric-Weighted Correlation Clustering
Chenglin Fan, Dahoon Lee, Euiwoong Lee
Follow the Energy, Find the Path: Riemannian Metrics from Energy-Based Models
Louis Bethune, David Vigouroux, Yilun Du et al.
DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization
Hongshu Guo, Zeyuan Ma, Yining Ma et al.
Certified Human Trajectory Prediction
Mohammadhossein Bahari, Saeed Saadatnejad, Amirhossein Askari Farsangi et al.
Conformal Arbitrage: Risk-Controlled Balancing of Competing Objectives in Language Models
William Overman, Mohsen Bayati
Efficient Adaptive Federated Optimization
Su Hyeong Lee, Sidharth Sharma, Manzil Zaheer et al.
Automatic Synthetic Data and Fine-grained Adaptive Feature Alignment for Composed Person Retrieval
Delong Liu, Haiwen Li, Zhaohui Hou et al.
Disentangling Latent Shifts of In-Context Learning with Weak Supervision
Josip Jukić, Jan Šnajder
ArchCAD-400K: A Large-Scale CAD drawings Dataset and New Baseline for Panoptic Symbol Spotting
Ruifeng Luo, Zhengjie Liu, Tianxiao Cheng et al.
ScaleLSD: Scalable Deep Line Segment Detection Streamlined
Zeran Ke, Bin Tan, Xianwei Zheng et al.
AI2TALE: An Innovative Information Theory-based Approach for Learning to Localize Phishing Attacks
Van Nguyen, Tingmin Wu, Xingliang YUAN et al.
Neural Collapse in Cumulative Link Models for Ordinal Regression: An Analysis with Unconstrained Feature Model
Chuang Ma, Tomoyuki Obuchi, Toshiyuki Tanaka
On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events
Jesse Hagenaars, Yilun Wu, Federico Paredes Valles et al.
Multivariate Latent Recalibration for Conditional Normalizing Flows
Victor Dheur, Souhaib Ben Taieb
Track Any Anomalous Object:A Granular Video Anomaly Detection Pipeline
Yuzhi Huang, Chenxin Li, Haitao Zhang et al.
Teaching Language Models to Reason with Tools
Chengpeng Li, Zhengyang Tang, Ziniu Li et al.
Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs
Jinzhe Liu, Junshu Sun, Shufan Shen et al.
HMARL-CBF – Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems
H M Sabbir Ahmad, Ehsan Sabouni, Alexander Wasilkoff et al.
MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning
Xu Han, Yuan Tang, Jinfeng Xu et al.
A Theoretical Framework for Grokking: Interpolation followed by Riemannian Norm Minimisation
Etienne Boursier, Scott Pesme, Radu-Alexandru Dragomir
Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation
Fei Wang, Li Shen, Liang Ding et al.
Chirality in Action: Time-Aware Video Representation Learning by Latent Straightening
Piyush Nitin Bagad, Andrew Zisserman
Learning Visual Composition through Improved Semantic Guidance
Austin Stone, Hagen Soltau, Robert Geirhos et al.
Adaptive Kernel Design for Bayesian Optimization Is a Piece of CAKE with LLMs
Richard Suwandi, Feng Yin, Juntao Wang et al.
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao liang, Baoquan Zhang, Zhiyuan Wen et al.
VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment
Qing Li, Huifang Feng, Xun Gong et al.
System-Embedded Diffusion Bridge Models
Bartlomiej Sobieski, Matthew Tivnan, Yuang Wang et al.
Statistical Inference under Performativity
Xiang Li, Yunai Li, Huiying Zhong et al.
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table
Yusuke Matsui
PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation
Yanlong Chen, Mattia Orlandi, Pierangelo Rapa et al.
Template-Guided 3D Molecular Pose Generation via Flow Matching and Differentiable Optimization
Noémie Bergues, Arthur Carré, Paul Join-Lambert et al.
Homogeneous Algorithms Can Reduce Competition in Personalized Pricing
Nathanael Jo, Ashia Wilson, Kathleen Creel et al.
Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds
Fan Wang, Pengtao Shao, Yiming Zhang et al.
Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos
Kaihua Chen, Tarasha Khurana, Deva Ramanan
Zero-Shot Head Swapping in Real-World Scenarios
Sohyun Jeong, Taewoong Kang, Hyojin Jang et al.
STAR: A Benchmark for Astronomical Star Fields Super-Resolution
WU KUO-CHENG, Guohang Zhuang, Jinyang Huang et al.
Harnessing the Computation Redundancy in ViTs to Boost Adversarial Transferability
Jiani Liu, Zhiyuan Wang, Zeliang Zhang et al.
CLIMB: Class-imbalanced Learning Benchmark on Tabular Data
Zhining Liu, Zihao Li, Ze Yang et al.
Panoptic Captioning: An Equivalence Bridge for Image and Text
Kun-Yu Lin, Hongjun Wang, Weining Ren et al.
Autoregressive Distillation of Diffusion Transformers
Yeongmin Kim, Sotiris Anagnostidis, Yuming Du et al.
Do ImageNet-trained Models Learn Shortcuts? The Impact of Frequency Shortcuts on Generalization
Shunxin Wang, Raymond Veldhuis, Nicola Strisciuglio
Semi-off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning
Junhao Shen, Haiteng Zhao, Yuzhe Gu et al.
Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality
Alex Fang, Hadi Pouransari, Matt Jordan et al.
Path Gradients after Flow Matching
Lorenz Vaitl, Leon Klein
Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning
Juntae Lee, Munawar Hayat, Sungrack Yun
Rethinking Nighttime Image Deraining via Learnable Color Space Transformation
Qiyuan Guan, Xiang Chen, Guiyue Jin et al.
Bias for Action: Video Implicit Neural Representations with Bias Modulation
Alper Kayabasi, Anil Kumar Vadathya, Guha Balakrishnan et al.
msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML
Zhaolan Huang, Emmanuel Baccelli
Bayes optimal learning of attention-indexed models
Fabrizio Boncoraglio, Emanuele Troiani, Vittorio Erba et al.
Breaking the Performance Ceiling in Reinforcement Learning requires Inference Strategies
Felix Chalumeau, Daniel Rajaonarivonivelomanantsoa, Ruan John de Kock et al.
StarTrail: Concentric Ring Sequence Parallelism for Efficient Near-Infinite-Context Transformer Model Training
Ziming Liu, Shaoyu Wang, Shenggan Cheng et al.
HORP: Human-Object Relation Priors Guided HOI Detection
Pei Geng, Jian Yang, Shanshan Zhang
Robustness in Both Domains: CLIP Needs a Robust Text Encoder
Elias Abad Rocamora, Christian Schlarmann, Naman Deep Singh et al.
The Gaussian Mixing Mechanism: Renyi Differential Privacy via Gaussian Sketches
Omri Lev, Vishwak Srinivasan, Moshe Shenfeld et al.
Generative Modeling of Full-Atom Protein Conformations using Latent Diffusion on Graph Embeddings
Aditya Sengar, Ali Hariri, Daniel Probst et al.
FlareX: A Physics-Informed Dataset for Lens Flare Removal via 2D Synthesis and 3D Rendering
Lishen Qu, Zhihao Liu, Jinshan Pan et al.
Style Quantization for Data-Efficient GAN Training
Jian Wang, Xin Lan, Ji-Zhe Zhou et al.
Point Cloud Upsampling Using Conditional Diffusion Module with Adaptive Noise Suppression
Boqian Zhang, shen yang, Hao Chen et al.
Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention
Kyungmin Jo, Jooyeol Yun, Jaegul Choo
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark
Hanlei Zhang, zhuohang li, Hua Xu et al.
Neural Mutual Information Estimation with Vector Copulas
Yanzhi Chen, Zijing Ou, Adrian Weller et al.
Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training
Minhak Song, Beomhan Baek, Kwangjun Ahn et al.
Visual Anagrams Reveal Hidden Differences in Holistic Shape Processing Across Vision Models
Fenil Doshi, Thomas Fel, Talia Konkle et al.
LoTA-QAF: Lossless Ternary Adaptation for Quantization-Aware Fine-Tuning
Junyu Chen, Junzhuo Li, Zhen Peng et al.
Consistency-aware Self-Training for Iterative-based Stereo Matching
Jingyi Zhou, Peng Ye, Haoyu Zhang et al.
Non-Stationary Dueling Bandits Under a Weighted Borda Criterion
Joe Suk, Arpit Agarwal
Diffusion-based Event Generation for High-Quality Image Deblurring
Xinan Xie, Qing Zhang, Wei-Shi Zheng
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
Yonatan Slutzky, Yotam Alexander, Noam Razin et al.
Unified Scaling Laws for Compressed Representations
Andrei Panferov, Alexandra Volkova, Ionut-Vlad Modoranu et al.
RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility
Haoyu He, Haozheng Luo, Yan Chen et al.
An Efficient Local Search Approach for Polarized Community Discovery in Signed Networks
Linus Aronsson, Morteza Haghir Chehreghani
Generative diffusion for perceptron problems: statistical physics analysis and efficient algorithms
Davide Straziota, Elizaveta Demyanenko, Carlo Baldassi et al.
EDELINE: Enhancing Memory in Diffusion-based World Models via Linear-Time Sequence Modeling
Jia-Hua Lee, Bor-Jiun Lin, Wei-Fang Sun et al.
WeatherPrompt: Multi-modality Representation Learning for All-Weather Drone Visual Geo-Localization
Jiahao Wen, Hang Yu, Zhedong Zheng
What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization
Omar Bennouna, Amine Bennouna, Saurabh Amin et al.
Red-Teaming Text-to-Image Systems by Rule-based Preference Modeling
Yichuan Cao, Yibo Miao, Xiao-Shan Gao et al.
TexGarment: Consistent Garment UV Texture Generation via Efficient 3D Structure-Guided Diffusion Transformer
Jialun Liu, Jinbo Wu, Xiaobo Gao et al.
Insightful Instance Features for 3D Instance Segmentation
Wonseok Roh, Hwanhee Jung, Giljoo Nam et al.
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
Jinpeng Wang, Tianci Luo, Yaohua Zha et al.
Uncertainty Quantification for Physics-Informed Neural Networks with Extended Fiducial Inference
Frank Shih, Zhenghao Jiang, Faming Liang
Escaping Plato's Cave: Towards the Alignment of 3D and Text Latent Spaces
Souhail Hadgi, Luca Moschella, Andrea Santilli et al.
Aligning Compound AI Systems via System-level DPO
Xiangwen Wang, Yibo Jacky Zhang, Zhoujie Ding et al.
Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation
Zhuoran ZHAO, Linlin Yang, Pengzhan Sun et al.