Most Cited 2025 "hardware robotic control" Papers
22,274 papers found • Page 92 of 112
Conference
Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps
Jeeyung Kim, Erfan Esmaeili Fakhabi, Qiang Qiu
SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection
Phi Vu Tran
The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement
Ruihan Yang, Fanghua Ye, Jian Li et al.
Spatiotemporal Consensus with Scene Prior for Unsupervised Domain Adaptive Person Search
Yimin Jiang, Huibing Wang, Jinjia peng
COSTARR: Consolidated Open Set Technique with Attenuation for Robust Recognition
Ryan Rabinowitz, Steve Cruz, Walter Scheirer et al.
MVSMamba: Multi-View Stereo with State Space Model
Jianfei Jiang, Qiankun Liu, Hongyuan Liu et al.
Understanding and Enhancing Message Passing on Heterophilic Graphs via Compatibility Matrix
Zhuonan Zheng, Yuanchen Bei, Zhiyao Zhou et al.
Consistency Posterior Sampling for Diverse Image Synthesis
Vishal Purohit, Matthew Repasky, Jianfeng Lu et al.
Radiant Foam: Real-Time Differentiable Ray Tracing
Shrisudhan Govindarajan, Daniel Rebain, Kwang Moo Yi et al.
Balancing Positive and Negative Classification Error Rates in Positive-Unlabeled Learning
Ximing Li, Yuanchao Dai, Bing Wang et al.
Robust LLM Alignment via Distributionally Robust Direct Preference Optimization
Zaiyan Xu, Sushil Vemuri, Kishan Panaganti et al.
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
Haibo Wang, Bo Feng, Zhengfeng Lai et al.
FreqExit: Enabling Early-Exit Inference for Visual Autoregressive Models via Frequency-Aware Guidance
Ying Li, Chengfei Lyu, Huan Wang
Assignments for Congestion-Averse Agents: Seeking Competitive and Envy-Free Solutions
Jiehua Chen, Jiong Guo, Yinghui Wen
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Ho Kei Cheng, Masato Ishii, Akio Hayakawa et al.
Transductive Conformal Inference for Full Ranking
Jean-Baptiste Fermanian, Pierre Humbert, Gilles Blanchard
Improved Approximation Algorithms for Chromatic and Pseudometric-Weighted Correlation Clustering
Chenglin Fan, Dahoon Lee, Euiwoong Lee
Prototype-based Contrastive Learning with Stage-wise Progressive Augmentation for Self-Supervised Fine-Grained Learning
BaoFeng Tan, Xiu-Shen Wei, Lin Zhao
Guiding LLM Decision-Making with Fairness Reward Models
Zara Hall, Melanie Subbiah, Thomas Zollo et al.
Convergence Rates of Constrained Expected Improvement
Haowei Wang, Jingyi Wang, Zhongxiang Dai et al.
Gaussian Regression-Driven Tensorized Incomplete Multi-View Clustering with Dual Manifold Regularization
Zhenhao Zhong, Zhibin Gu, Pengpeng Yang et al.
Noise-Modeled Diffusion Models for Low-Light Spike Image Restoration
Ruonan Liu, Lin Zhu, Xijie Xiang et al.
Machine Unlearning under Overparameterization
Jacob Block, Aryan Mokhtari, Sanjay Shakkottai
Personalized Federated Learning under Local Supervision
Qiqi Liu, Jiaqiang Li, Yuchen Liu et al.
Frequency-Aware Token Reduction for Efficient Vision Transformer
DongJae Lee, Jiwan Hur, Jaehyun Choi et al.
When Models Don’t Collapse: On the Consistency of Iterative MLE
Daniel Barzilai, Ohad Shamir
Uncalibrated Structure from Motion on a Sphere
Jonathan Ventura, Viktor Larsson, Fredrik Kahl
To Label or Not to Label: PALM – A Predictive Model for Evaluating Sample Efficiency in Active Learning Models
Julia Machnio, Mads Nielsen, Mostafa Mehdipour Ghazi
Find a Scapegoat: Poisoning Membership Inference Attack and Defense to Federated Learning
Wenjin Mo, Zhiyuan Li, Minghong Fang et al.
OCN: Effectively Utilizing Higher-Order Common Neighbors for Better Link Prediction
Juntong Wang, Xiyuan Wang, Muhan Zhang
ZIUM: Zero-Shot Intent-Aware Adversarial Attack on Unlearned Models
Hyun Jun Yook, Ga San Jhun, Cho Hyun et al.
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
Weihao Bo, Yanpeng Sun, Yu Wang et al.
Efficient Data Selection at Scale via Influence Distillation
Mahdi Nikdan, Vincent Cohen-Addad, Dan Alistarh et al.
LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation
Subhojyoti Khastagir, KISHALAY DAS, Pawan Goyal et al.
The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis
Hoang Pham, The Anh Ta, Tom Jacobs et al.
An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval
Jaeseok Byun, Seokhyeon Jeong, Wonjae Kim et al.
ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models
Zixun Fang, Kai Zhu, Zhiheng Liu et al.
Nearly Dimension-Independent Convergence of Mean-Field Black-Box Variational Inference
Kyurae Kim, Yian Ma, Trevor Campbell et al.
Mind the Gap: Preserving and Compensating for the Modality Gap in CLIP-Based Continual Learning
Linlan Huang, Xusheng Cao, Haori Lu et al.
Geometric Algorithms for Neural Combinatorial Optimization with Constraints
Nikolaos Karalias, Akbar Rafiey, Yifei Xu et al.
Controlling Thinking Speed in Reasoning Models
Zhengkai Lin, Zhihang Fu, Ze Chen et al.
MiCADangelo: Fine-Grained Reconstruction of Constrained CAD Models from 3D Scans
Ahmet Karadeniz, Dimitrios Mallis, Danila Rukhovich et al.
Understanding the Generalization of Stochastic Gradient Adam in Learning Neural Networks
Xuan Tang, Han Zhang, Yuan Cao et al.
Non-Singularity of the Gradient Descent Map for Neural Networks with Piecewise Analytic Activations
Alexandru Crăciun, Debarghya Ghoshdastidar
Complexity Scaling Laws for Neural Models using Combinatorial Optimization
Lowell Weissman, Michael Krumdick, A. Abbott
VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning
Qiuchen Wang, Ruixue Ding, Yu Zeng et al.
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
Byung-Kwan Lee, Ryo Hachiuma, Yu-Chiang Frank Wang et al.
Registration beyond Points: General Affine Subspace Alignment via Geodesic Distance on Grassmann Manifold
Jaeho Shin, Hyeonjae Gil, Junwoo Jang et al.
How to Scale Second-Order Optimization
Charlie Chen, Shikai Qiu, Hoang Phan et al.
Robust Satisficing Gaussian Process Bandits Under Adversarial Attacks
Artun Saday, Yaşar Cahit Yıldırım, Cem Tekin
Category-Specific Selective Feature Enhancement for Long-Tailed Multi-Label Image Classification
Ruiqi Du, Xu Tang, Xiangrong Zhang et al.
PathVQ: Reforming Computational Pathology Foundation Model for Whole Slide Image Analysis via Vector Quantization
Honglin Li, Zhongyi Shui, Yunlong Zhang et al.
Optimal Mistake Bounds for Transductive Online Learning
Zachary Chase, Steve Hanneke, Shay Moran et al.
A Private Approximation of the 2nd-Moment Matrix of Any Subsamplable Input
Bar Mahpud, Or Sheffet
Out-of-Distribution Generalized Graph Anomaly Detection with Homophily-aware Environment Mixup
Sibo Tian, Xin Wang, Zeyang Zhang et al.
Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
Wenxuan Guo, Xiuwei Xu, Ziwei Wang et al.
Dual Alignment Framework for Few-shot Learning with Inter-Set and Intra-Set Shifts
Siyang Jiang, Rui Fang, Hsi-Wen Chen et al.
FedDifRC: Unlocking the Potential of Text-to-Image Diffusion Models in Heterogeneous Federated Learning
Huan Wang, Haoran Li, Huaming Chen et al.
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
Jun Zhang, Desen Meng, Zhengming Zhang et al.
What Expressivity Theory Misses: Message Passing Complexity for GNNs
Niklas Kemper, Tom Wollschläger, Stephan Günnemann
Continuous Soft Actor-Critic: An Off-Policy Learning Method Robust to Time Discretization
Huimin Han, Shaolin Ji
Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
Kaichen Zhang, Yifei Shen, Bo Li et al.
Statistical Guarantees for High-Dimensional Stochastic Gradient Descent
Jiaqi Li, Zhipeng Lou, Johannes Schmidt-Hieber et al.
Lark: Low-Rank Updates After Knowledge Localization for Few-shot Class-Incremental Learning
Jinxin Shi, Jiabao Zhao, Yifan Yang et al.
Conditional Representation Learning for Customized Tasks
Honglin Liu, Chao Sun, Peng Hu et al.
Non-Stationary Structural Causal Bandits
Yeahoon Kwon, Yesong Choe, Soungmin Park et al.
Hierarchical Divide-and-Conquer Grouping for Classification Adaptation of Pre-Trained Models
Ziqian Lu, Yunlong Yu, Qinyue Tong et al.
Learning to Inference Adaptively for Multimodal Large Language Models
Zhuoyan Xu, Khoi Nguyen, Preeti Mukherjee et al.
On the Complexity-Faithfulness Trade-off of Gradient-Based Explanations
Amir Mehrpanah, Matteo Gamba, Kevin Smith et al.
MoESD: Unveil Speculative Decoding's Potential for Accelerating Sparse MoE
Zongle Huang, Lei Zhu, ZongYuan Zhan et al.
LA-MOTR: End-to-End Multi-Object Tracking by Learnable Association
Peng Wang, Yongcai Wang, Hualong Cao et al.
Learning from Delayed Feedback in Games via Extra Prediction
Yuma Fujimoto, Kenshi Abe, Kaito Ariu
TransiT: Transient Transformer for Non-line-of-sight Videography
Ruiqian Li, Siyuan Shen, Suan Xia et al.
SpectraLDS: Provable Distillation for Linear Dynamical Systems
Devan Shah, Shlomo Fortgang, Sofiia Druchyna et al.
LLM Safety Alignment is Divergence Estimation in Disguise
Rajdeep Haldar, Ziyi Wang, Guang Lin et al.
scPilot: Large Language Model Reasoning Toward Automated Single-Cell Analysis and Discovery
Yiming Gao, Zhen Wang, Jefferson Chen et al.
Edit Flows: Variable Length Discrete Flow Matching with Sequence-Level Edit Operations
Marton Havasi, Brian Karrer, Itai Gat et al.
Spectral Sensitivity Estimation with an Uncalibrated Diffraction Grating
Lilika Makabe, Hiroaki Santo, Fumio Okura et al.
Integrating Biological Knowledge for Robust Microscopy Image Profiling on De Novo Cell Lines
Jiayuan Chen, Thai-Hoang Pham, Yuanlong Wang et al.
SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment
Wonje Jeung, Yoon Sangyeon, Minsuk Kahng et al.
A-Mem: Agentic Memory for LLM Agents
Wujiang Xu, Zujie Liang, Kai Mei et al.
Investigating and Mitigating Catastrophic Forgetting in Medical Knowledge Injection through Internal Knowledge Augmentation Learning
Yuxuan Zhou, Xien Liu, Xiao Zhang et al.
Learning Parameterized Skills from Demonstrations
Vedant Gupta, Haotian Fu, Calvin Luo et al.
NeuroH-TGL: Neuro-Heterogeneity Guided Temporal Graph Learning Strategy for Brain Disease Diagnosis
Shengrong Li, Qi Zhu, Chunwei Tian et al.
Keep Your Friends Close, and Your Enemies Farther: Distance-aware Voxel-wise Contrastive Learning for Semi-supervised Multi-organ Segmentation
Haochen Zhao, Jianwei Niu, Xuefeng Liu et al.
Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning
Xinyao Liu, Diping Song
A Gradient Guided Diffusion Framework for Chance Constrained Programming
Boyang Zhang, Zhiguo Wang, Ya-Feng Liu
FACT: Mitigating Inconsistent Hallucinations in LLMs via Fact-Driven Alternating Code-Text Training
Xinxin You, Qixin Sun, Chenwei Yan et al.
A Counterfactual Semantics for Hybrid Dynamical Systems
Andy Zane, Dmitry Batenkov, Rafal Urbaniak et al.
Adaptive Learning of High-Value Regions for Semi-Supervised Medical Image Segmentation
Tao Lei, Ziyao Yang, Xingwu wang et al.
Fully Spiking Neural Networks for Unified Frame-Event Object Tracking
Jingjun Yang, Liangwei Fan, Jinpu Zhang et al.
MA-CIR: A Multimodal Arithmetic Benchmark for Composed Image Retrieval
Jaeseok Byun, Young Kyun Jang, Seokhyeon Jeong et al.
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Qi Chen, Lingxiao Yang, Yun Chen et al.
TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields
Alan Arazi, Eilam Shapira, Roi Reichart
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
Yiyang Wang, Xi Chen, Xiaogang Xu et al.
CacheQuant: Comprehensively Accelerated Diffusion Models
Xuewen Liu, Zhikai Li, Qingyi Gu
OASIS: One-Shot Federated Graph Learning via Wasserstein Assisted Knowledge Integration
Guancheng Wan, Jiaru Qian, Wenke Huang et al.
Guiding Diffusion Models with Adaptive Negative Sampling Without External Resources
Alakh Desai, Nuno Vasconcelos
CCL: Causal-aware In-context Learning for Out-of-Distribution Generalization
Hoyoon Byun, Gyeongdeok Seo, Joonseong Kang et al.
SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis
Xiangyue Zhang, Jianfang Li, Jiaxu Zhang et al.
PersonaCraft: Personalized and Controllable Full-Body Multi-Human Scene Generation Using Occlusion-Aware 3D-Conditioned Diffusion
Gwanghyun Kim, Suh Jeon Jeon, Seunggyu Lee et al.
UEPI: Universal Energy-Behavior-Preserving Integrators for Energy Conservative/Dissipative Differential Equations
Elena Celledoni, Brynjulf Owren, Chong Shen et al.
NPN: Non-Linear Projections of the Null-Space for Imaging Inverse Problems
Roman Jacome, Romario Gualdrón-Hurtado, León Suárez-Rodríguez et al.
When Lighting Deceives: Exposing Vision-Language Models' Illumination Vulnerability Through Illumination Transformation Attack
Hanqing Liu, Shouwei Ruan, Yao Huang et al.
A Closer Look at NTK Alignment: Linking Phase Transitions in Deep Image Regression
Giuseppe Castiglione, Christopher L Buckley, Ivor Simpson
Harnessing Input-Adaptive Inference for Efficient VLN
Dongwoo Kang, Akhil Perincherry, Zachary Coalson et al.
seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models
Hafez Ghaemi, Eilif B. Muller, Shahab Bakhtiari
Rethinking the Upsampling Process in Light Field Super-Resolution with Spatial-Epipolar Implicit Image Function
Ruixuan Cong, Yu Wang, Mingyuan Zhao et al.
ElliCE: Efficient and Provably Robust Algorithmic Recourse via the Rashomon Sets
Bohdan Turbal, Iryna Voitsitska, Lesia Semenova
AtlasGS: Atlanta-world Guided Surface Reconstruction with Implicit Structured Gaussians
Xiyu Zhang, Chong Bao, YiPeng Chen et al.
Fourier Token Merging: Understanding and Capitalizing Frequency Domain for Efficient Image Generation
Jiesong Liu, Xipeng Shen
FoGE: Fock Space inspired encoding for graph prompting
Takis Chytas, Rudrasis Chakraborty, Vikas Singh
What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains
Chanakya Ekbote, Ashok Vardhan Makkuva, Marco Bondaschi et al.
Low Precision Streaming PCA
Sanjoy Dasgupta, Syamantak Kumar, Shourya Pandey et al.
Obliviator Reveals the Cost of Nonlinear Guardedness in Concept Erasure
Ramin Akbari, Milad Afshari, Vishnu Boddeti
Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking
Junhyuk So, Chiwoong Lee, Shinyoung Lee et al.
3D Human Pose Estimation with Muscles
Kevin Zhu, AliAsghar MohammadiNasrabadi, Alexander Wong et al.
EventUPS: Uncalibrated Photometric Stereo Using an Event Camera
Jinxiu Liang, Bohan Yu, Siqi Yang et al.
Non-exchangeable Conformal Prediction with Optimal Transport: Tackling Distribution Shift with Unlabeled Data
Alvaro Correia, Christos Louizos
Optimal Online Change Detection via Random Fourier Features
Florian Kalinke, Shakeel Gavioli-Akilagun
SORTeD Rashomon Sets of Sparse Decision Trees: Anytime Enumeration
Elif Arslan, Jacobus van der Linden, Serge Hoogendoorn et al.
Towards 3D Objectness Learning in an Open World
Taichi Liu, Zhenyu Wang, Ruofeng Liu et al.
Stitch and Tell: A Structured Data Augmentation Method for Spatial Understanding
Yin Hang, Xiaomin He, Peiwen Yuan et al.
IM360: Large-scale Indoor Mapping with 360 Cameras
Dongki Jung, Jaehoon Choi, Yonghan Lee et al.
Less is More: Empowering GUI Agent with Context-Aware Simplification
Gongwei Chen, Xurui Zhou, Rui Shao et al.
Generalized Deep Multi-view Clustering via Causal Learning with Partially Aligned Cross-view Correspondence
Xihong Yang, Siwei Wang, Jiaqi Jin et al.
Adaptive Data Analysis for Growing Data
Neil Marchant, Benjamin Rubinstein
Knowledge Distillation of Uncertainty using Deep Latent Factor Model
Sehyun Park, Jongjin Lee, Yunseop Shin et al.
Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach
Yuchen Wu, Edward Sun, Kaijie Zhu et al.
VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation
Sicheng Yang, Zhaohu Xing, Lei Zhu
Language-Driven Multi-Label Zero-Shot Learning with Semantic Granularity
Shouwen Wang, Qian Wan, Junbin Gao et al.
PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation
Chikai Shang, Mengke Li, Yiqun Zhang et al.
HMVLM:Human Motion-Vision-Language Model via MoE LoRA
Lei Hu, Yongjing Ye, Shihong Xia
Instruction-Grounded Visual Projectors for Continual Learning of Generative Vision-Language Models
Hyundong Jin, Hyung Jin Chang, Eunwoo Kim
From Faces to Voices: Learning Hierarchical Representations for High-quality Video-to-Speech
Jihoon Kim, Jeongsoo Choi, Jaehun Kim et al.
OmniGen-AR: AutoRegressive Any-to-Image Generation
Junke Wang, Xun Wang, Qiushan Guo et al.
CRRL: Learning Channel-invariant Neural Representations for High-performance Cross-day Decoding
Xianhan Tan, Binli Luo, Yu Qi et al.
Unextractable Protocol Models: Collaborative Training and Inference without Weight Materialization
Alexander Long, Chamin Hewa Koneputugodage, Thalaiyasingam Ajanthan et al.
Token Cropr: Faster ViTs for Quite a Few Tasks
Benjamin Bergner, Christoph Lippert, Aravindh Mahendran
PRVQL: Progressive Knowledge-guided Refinement for Robust Egocentric Visual Query Localization
Bing Fan, Yunhe Feng, Yapeng Tian et al.
Online Prediction with Limited Selectivity
Licheng Liu, Mingda Qiao
Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression
Yuning Shen, Lihao Wang, Huizhuo Yuan et al.
Learning Separable Fine-Grained Representation via Dendrogram Construction from Coarse Labels for Fine-grained Visual Recognition
Guanghui Shi, Xuefeng liang, Wenjie Li et al.
Probabilistic Reasoning with LLMs for Privacy Risk Estimation
Jonathan Zheng, Alan Ritter, Sauvik Das et al.
Hyper-Depth: Hypergraph-based Multi-Scale Representation Fusion for Monocular Depth Estimation
Lin Bie, Siqi Li, Yifan Feng et al.
4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time
Ziqiao Ma, Xuweiyi Chen, Shoubin Yu et al.
Instant4D: 4D Gaussian Splatting in Minutes
Zhanpeng Luo, Haoxi Ran, Li Lu
Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization
Xu Zheng, Yuanhuiyi Lyu, Lutao Jiang et al.
A Unified Stability Analysis of SAM vs SGD: Role of Data Coherence and Emergence of Simplicity Bias
WEI-KAI CHANG, Rajiv Khanna
Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale
James Michaelov, Roger Levy, Benjamin Bergen
The Temporal Graph of Bitcoin Transactions
Vahid Jalili
Conformal Prediction for Time-series Forecasting with Change Points
Sophia Sun, Rose Yu
Amortized Active Generation of Pareto Sets
Daniel Steinberg, Asiri Wijesinghe, Rafael Oliveira et al.
Hybrid Latent Representations for PDE Emulation
Ali Can Bekar, Siddhant Agarwal, Christian Hüttig et al.
Fractional Diffusion Bridge Models
Gabriel Nobis, Maximilian Springenberg, Arina Belova et al.
SPMDM: Enhancing Masked Diffusion Models through Simplifing Sampling Path
Yichen Zhu, Weiyu Chen, James Kwok et al.
Unknown Text Learning for CLIP-based Few-Shot Open-set Recognition
Rui Ma, Qilong Wang, Bing Cao et al.
Adaptable Safe Policy Learning from Multi-task Data with Constraint Prioritized Decision Transformer
Ruiqi Xue, Ziqian Zhang, Lihe Li et al.
One Object, Multiple Lies: A Benchmark for Cross-task Adversarial Attack on Unified Vision-Language Models
Jiale Zhao, XINYANG JIANG, Junyao Gao et al.
Confidence-Aware With Prototype Alignment for Partial Multi-label Learning
Weijun Lv, Yu Chen, Xiaozhao Fang et al.
A Unified Latent Schrödinger Bridge Diffusion Model for Unsupervised Anomaly Detection and Localization
Shilhora Akshay, Niveditha Lakshmi Narasimhan, Jacob George et al.
Mitigating Intra- and Inter-modal Forgetting in Continual Learning of Unified Multimodal Models
Xiwen Wei, Mustafa Munir, Radu Marculescu
Fostering the Ecosystem of AI for Social Impact Requires Expanding and Strengthening Evaluation Standards
Bryan Wilder, Angela Zhou
Temporal In‑Context Fine‑Tuning for Versatile Control of Video Diffusion Models
Kinam Kim, Junha Hyung, Jaegul Choo
FairDD: Fair Dataset Distillation
Qihang Zhou, ShenHao Fang, Shibo He et al.
Distributionally Robust Feature Selection
Maitreyi Swaroop, Tamar Krishnamurti, Bryan Wilder
Toward Artificial Palpation: Representation Learning of Touch on Soft Bodies
Zohar Rimon, Elisei Shafer, Tal Tepper et al.
Volume Transmission Implements Context Factorization to Target Online Credit Assignment and Enable Compositional Generalization
Matthew Bull, Po-Chen Kuo, Andrew Smith et al.
Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator
Beier Luo, Shuoyuan Wang, Sharon Li et al.
Continuous Locomotive Crowd Behavior Generation
Inhwan Bae, Junoh Lee, Hae-Gon Jeon
SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning
Yiting Wang, Wanghao Ye, Ping Guo et al.
Predicting the Performance of Black-box Language Models with Follow-up Queries
Dylan Sam, Marc Finzi, Zico Kolter
Bridging the gap to real-world language-grounded visual concept learning
whie jung, Semin Kim, Junee Kim et al.
MoRIC: A Modular Region-based Implicit Codec for Image Compression
Gen Li, Haotian Wu, Deniz Gunduz
IRGPT: Understanding Real-world Infrared Image with Bi-cross-modal Curriculum on Large-scale Benchmark
Zhe Cao, Jin Zhang, Ruiheng Zhang
Differentially Private Fine-Tuning of Diffusion Models
Yu-Lin Tsai, Yizhe Li, Zekai Chen et al.
Private Continual Counting of Unbounded Streams
Ben Jacobsen, Kassem Fawaz
4KAgent: Agentic Any Image to 4K Super-Resolution
Yushen Zuo, Qi Zheng, Mingyang Wu et al.
Reasoning Planning for Language Models
Ngoc Bao Nguyen, Trung Hieu Nguyen, Ruifeng She et al.
A Single-Loop First-Order Algorithm for Linearly Constrained Bilevel Optimization
Wei Shen, Jiawei Zhang, Minhui Huang et al.
Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios
Deng Li, Aming WU, Yang Li et al.
DP²O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution
Rongyuan Wu, Lingchen Sun, Zhengqiang ZHANG et al.
PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening
Jeonghyeok Do, Sungpyo Kim, Geunhyuk Youk et al.
Fully Dynamic Algorithms for Chamfer Distance
Gramoz Goranci, Shaofeng Jiang, Peter Kiss et al.
When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
Zhuo Cao, Heming Du, Bingqing Zhang et al.
Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning
Zhengxuan Wei, Jiajin Tang, Sibei Yang
From Sharp to Blur: Unsupervised Domain Adaptation for 2D Human Pose Estimation Under Extreme Motion Blur Using Event Cameras
Youngho Kim, Hoonhee Cho, Kuk-Jin Yoon
Error Broadcast and Decorrelation as a Potential Artificial and Natural Learning Mechanism
Mete Erdogan, Cengiz Pehlevan, Alper Erdogan
Estimation of Treatment Effects in Extreme and Unobserved Data
Jiyuan Tan, Vasilis Syrgkanis, Jose Blanchet
MemDistill: Distilling LiDAR Knowledge into Memory for Camera-Only 3D Object Detection
Donghyeon Kwon, Youngseok Yoon, Hyeongseok Son et al.
BioOSS: A Bio-Inspired Oscillatory State System with Spatio-Temporal Dynamics
Zhongju Yuan, Geraint Wiggins, Dick Botteldooren
BevSplat: Resolving Height Ambiguity via Feature-Based Gaussian Primitives for Weakly-Supervised Cross-View Localization
Qiwei Wang, Shaoxun Wu, Yujiao Shi
Cooperative Pseudo Labeling for Unsupervised Federated Classification
Kuangpu Guo, Lijun Sheng, Yongcan Yu et al.
External Knowledge Injection for CLIP-Based Class-Incremental Learning
Da-Wei Zhou, Kai-Wen Li, Jingyi Ning et al.
No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views
Ranran Huang, Krystian Mikolajczyk
CATransformers: Carbon Aware Transformers Through Joint Model-Hardware Optimization
Irene Wang, Mostafa Elhoushi, H Ekin Sumbul et al.
Quantum speedup of non-linear Monte Carlo problems
Jose Blanchet, Yassine Hamoudi, Mario Szegedy et al.
Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model
Yuting Zhang, Hao Lu, Qingyong Hu et al.
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training
Jierun Chen, Dongting Hu, Xijie Huang et al.
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining
Zhiqi Ge, Juncheng Li, Xinglei Pang et al.