Most Cited 2025 "hardware robotic control" Papers
22,274 papers found • Page 74 of 112
Conference
3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation
Jianzhe Gao, Rui Liu, Wenguan Wang
Text-IRSTD: Leveraging Semantic Text to Promote Infrared Small Target Detection in Complex Scenes
Feng Huang, Shuyuan Zheng, Zhaobing Qiu et al.
Balancing Conservatism and Aggressiveness: Prototype-Affinity Hybrid Network for Few-Shot Segmentation
Tianyu Zou, Shengwu Xiong, Ruilin Yao et al.
ShadowHack: Hacking Shadows via Luminance-Color Divide and Conquer
Jin Hu, Mingjia Li, Xiaojie Guo
The Complexity of Correlated Equilibria in Generalized Games
Martino Bernasconi, Matteo Castiglioni, Andrea Celli et al.
Uncovering the Spectral Bias in Diagonal State Space Models
Ruben Solozabal, Velibor Bojkovic, Hilal AlQuabeh et al.
MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Cultural Learning
Mircea Lică, Ojas Shirekar, Baptiste Colle et al.
Accelerating Visual-Policy Learning through Parallel Differentiable Simulation
Haoxiang You, Yilang Liu, Ian Abraham
EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision
Dmitrii Torbunov, Yihui Ren, Animesh Ghose et al.
Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting
Hyeongmin Lee, Kyungjune Baek
NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge
Hanyu Zhu, Lance Fiondella, Jiawei Yuan et al.
Achilles' Heel of Mamba: Essential difficulties of the Mamba architecture demonstrated by synthetic data
Tianyi Chen, Pengxiao Lin, Zhiwei Wang et al.
Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
Zitian Wang, Yue Liao, RONG KANG et al.
MCOP: Multi-UAV Collaborative Occupancy Prediction
Zefu Lin, Wenbo Chen, Xiaojuan Jin et al.
ViT-EnsembleAttack: Augmenting Ensemble Models for Stronger Adversarial Transferability in Vision Transformers
Hanwen Cao, Haobo Lu, Xiaosen Wang et al.
MetaFind: Scene-Aware 3D Asset Retrieval for Coherent Metaverse Scene Generation
Zhenyu Pan, Yucheng Lu, Han Liu
Sampling by averaging: A multiscale approach to score estimation
Paula Cordero-Encinar, Andrew Duncan, Sebastian Reich et al.
Towards Graph Foundation Models: Training on Knowledge Graphs Enables Transferability to General Graphs
Kai Wang, Siqiang Luo, Caihua Shan et al.
How to Auto-optimize Prompts for Domain Tasks? Adaptive Prompting and Reasoning through Evolutionary Domain Knowledge Adaptation
Yang Zhao, Pu Wang, Hao Frank Yang
Benchmarking Egocentric Visual-Inertial SLAM at City Scale
Anusha Krishnan, Shaohui Liu, Paul-Edouard Sarlin et al.
SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models
Kevin Miller, Aditya Gangrade, Samarth Mishra et al.
From Cradle to Cane: A Two-Pass Framework for High-Fidelity Lifespan Face Aging
Tao Liu, Dafeng Zhang, Gengchen Li et al.
Wavelet Canonical Coherence for Nonstationary Signals
Haibo Wu, Marina Knight, Keiland Cooper et al.
Two‑Stage Learning of Stabilizing Neural Controllers via Zubov Sampling and Iterative Domain Expansion
Haoyu Li, Xiangru Zhong, Bin Hu et al.
Minimal Interaction Seperated Tuning: A New Paradigm for Visual Adaptation
Ningyuan Tang, Minghao Fu, Jianxin Wu
Serialization based Point Cloud Oversegmentation
chenghui Lu, Dilong Li, Jianlong Kwan et al.
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
Hyunsoo Kim, Donghyun Kim, Suhyun Kim
Revisiting Fairness in Multitask Learning: A Performance-Driven Approach for Variance Reduction
Xiaohan Qin, Xiaoxing Wang, Junchi Yan
Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior
Yulin Li, Haokun GUI, Ziyang Fan et al.
Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence
Octave Mariotti, Zhipeng Du, Yash Bhalgat et al.
FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning
Lu Zhang, Jiazuo Yu, Haomiao Xiong et al.
InstructRestore: Region-Customized Image Restoration with Human Instructions
Shuaizheng Liu, Jianqi Ma, Lingchen Sun et al.
Self-diffusion for Solving Inverse Problems
Guanxiong Luo, Shoujin Huang
Progressive Artwork Outpainting via Latent Diffusion Models
Dae-Young Song, Jung-Jae Yu, Donghyeon Cho
Reinforcement Learning-Guided Data Selection via Redundancy Assessment
Suorong Yang, Peijia Li, Furao Shen et al.
After the Party: Navigating the Mapping From Color to Ambient Lighting
Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.
Distance-informed Neural Processes
Aishwarya Venkataramanan, Joachim Denzler
Recognizing Actions from Robotic View for Natural Human-Robot Interaction
Ziyi Wang, Peiming Li, Hong Liu et al.
Flex-Judge: Text-Only Reasoning Unleashes Zero-Shot Multimodal Evaluators
Jongwoo Ko, Sungnyun Kim, Sungwoo Cho et al.
Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training
Reza Shirkavand, Peiran Yu, Qi He et al.
A Real-world Display Inverse Rendering Dataset
Seokjun Choi, Hoon-Gyu Chung, Yujin Jeon et al.
DDB: Diffusion Driven Balancing to Address Spurious Correlations
Aryan Yazdan Parast, Basim Azam, Naveed Akhtar
Efficiently Verifiable Proofs of Data Attribution
Ari Karchmer, Seth Neel, Martin Pawelczyk
ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model
Sagnik Bhattacharya, Abhiram Gorle, Ahsan Bilal et al.
TurboVSR: Fantastic Video Upscalers and Where to Find Them
Zhongdao Wang, Guodongfang Zhao, Jingjing Ren et al.
TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels
Jiahao Lu, Weitao Xiong, Jiacheng Deng et al.
PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction
Manahil Raza, Ayesha Azam, Talha Qaiser et al.
Handling Label Noise via Instance-Level Difficulty Modeling and Dynamic Optimization
Kuan Zhang, Chengliang Chai, Jingzhe Xu et al.
FedFACT: A Provable Framework for Controllable Group-Fairness Calibration in Federated Learning
Li Zhang, Zhongxuan Han, XiaoHua Feng et al.
FRET: Feature Redundancy Elimination for Test Time Adaptation
Linjing You, Jiabao Lu, Xiayuan Huang et al.
Tune-Your-Style: Intensity-tunable 3D Style Transfer with Gaussian Splatting
Yian Zhao, rushi ye, Ruochong Zheng et al.
Revisiting Point Cloud Completion: Are We Ready For The Real-World?
Stuti Pathak, Prashant Kumar, Dheeraj Baiju et al.
Interpretable Next-token Prediction via the Generalized Induction Head
Eunji Kim, Sriya Mantena, Weiwei Yang et al.
Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel
Yilan Chen, Zhichao Wang, Wei Huang et al.
A Geometric Analysis of PCA
Ayoub El Hanchi, Murat Erdogdu, Chris Maddison
PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers
Wooju Lee, Juhye Park, Dasol Hong et al.
Causal Spatio-Temporal Prediction: An Effective and Efficient Multi-Modal Approach
Yuting Huang, Ziquan Fang, Zhihao Zeng et al.
An Image-like Diffusion Method for Human-Object Interaction Detection
Xiaofei Hui, Haoxuan Qu, Hossein Rahmani et al.
SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation
Jiayuan Zhu, Junde Wu, Cheng Ouyang et al.
Automatic Spectral Calibration of Hyperspectral Images: Method, Dataset and Benchmark
Zhuoran Du, Shaodi You, Cheng Cheng et al.
Towards Robust Zero-Shot Reinforcement Learning
Kexin ZHENG, Lauriane Teyssier, Yinan Zheng et al.
Unified Reconstruction of Static and Dynamic Scenes from Events
Qiyao Gao, Peiqi Duan, Hanyue Lou et al.
Training-free Detection of AI-generated images via Cropping Robustness
Sungik Choi, Hankook Lee, Moontae Lee
Bandit and Delayed Feedback in Online Structured Prediction
Yuki Shibukawa, Taira Tsuchiya, Shinsaku Sakaue et al.
Zero-shot Denoising via Neural Compression: Theoretical and algorithmic framework
Ali Zafari, Xi Chen, Shirin Jalali
EditInfinity: Image Editing with Binary-Quantized Generative Models
Jiahuan Wang, Yuxin Chen, Jun Yu et al.
Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting
Hanxi Liu, Yifang Men, Zhouhui Lian
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning
Yongxin He, Shan Zhang, Yixuan Cao et al.
A Generalized Binary Tree Mechanism for Private Approximation of All-Pair Shortest Distances
Zongrui Zou, Chenglin Fan, Michael Dinitz et al.
Balancing Two Classifiers via A Simplex ETF Structure for Model Calibration
Jiani Ni, He Zhao, Jintong Gao et al.
Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)
Ruaridh Mon-Williams, Max Taylor-Davies, Elizabeth Mieczkowski et al.
Statistical Inference for Gradient Boosting Regression
Haimo Fang, Kevin Tan, Giles Hooker
The Rise of Parameter Specialization for Knowledge Storage in Large Language Models
Yihuai Hong, Yiran Zhao, Wei Tang et al.
Controllable and Expressive One-Shot Video Head Swapping
Chaonan Ji, Jinwei Qi, Peng Zhang et al.
Dynamic Group Normalization: Spatio-Temporal Adaptation to Evolving Data Statistics
Yair Smadar, Assaf Hoogi
EA3D: Online Open-World 3D Object Extraction from Streaming Videos
Xiaoyu Zhou, Jingqi Wang, Yuang Jia et al.
It’s Hard to Be Normal: The Impact of Noise on Structure-agnostic Estimation
Jikai Jin, Lester Mackey, Vasilis Syrgkanis
STRIDER: Navigation via Instruction-Aligned Structural Decision Space Optimization
Diqi He, Xuehao Gao, Hao Li et al.
Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space
Yingping Liang, Yutao Hu, Wenqi Shao et al.
DialNav: Multi-turn Dialog Navigation with a Remote Guide
Leekyeung Han, Hyunji Min, Gyeom Hwangbo et al.
Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions
Yizhou Xu, Florent Krzakala, Lenka Zdeborová
Seeing 3D Through 2D Lenses: 3D Few-Shot Class-Incremental Learning via Cross-Modal Geometric Rectification
Tuo Xiang, Xuemiao Xu, Bangzhen Liu et al.
OpenBox: Annotate Any Bounding Boxes in 3D
In-Jae Lee, Mungyeom Kim, Kwonyoung Ryu et al.
Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation
Andrea Simonelli, Norman Müller, Peter Kontschieder
RayGaussX: Accelerating Gaussian-Based Ray Marching for Real-Time and High-Quality Novel View Synthesis
Hugo Blanc, Jean-Emmanuel Deschaud, Alexis Paljic
CLIPTTA: Robust Contrastive Vision-Language Test-Time Adaptation
Marc Lafon, Gustavo Vargas Hakim, Clément Rambour et al.
Tight Generalization Bounds for Large-Margin Halfspaces
Kasper Green Larsen, Natascha Schalburg
VIRES: Video Instance Repainting via Sketch and Text Guided Generation
Shuchen Weng, Haojie Zheng, Peixuan Zhang et al.
Degrees of Freedom for Linear Attention: Distilling Softmax Attention with Optimal Feature Efficiency
Naoki Nishikawa, Rei Higuchi, Taiji Suzuki
UnCLe: Towards Scalable Dynamic Causal Discovery in Non-linear Temporal Systems
Tingzhu Bi, Yicheng Pan, Xinrui Jiang et al.
Graph Your Own Prompt
Xi Ding, Lei Wang, Piotr Koniusz et al.
Video Language Model Pretraining with Spatio-temporal Masking
Yue Wu, Zhaobo Qi, Junshu Sun et al.
FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed
Jiaqi Zhang, Juntuo Wang, Zhixin Sun et al.
Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability
Boyong He, Yuxiang Ji, Zhuoyue Tan et al.
Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts
Yanguang Sun, Jiawei Lian, jian Yang et al.
ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods
Michal Kmicikiewicz, Vincent Fortuin, Ewa Szczurek
Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning
Jieyi Tan, Chengwei Zhang, Bo Dang et al.
Pinpointing Attention-Causal Communication in Language Models
Gabriel Franco, Mark Crovella
Towards Cost-Effective Learning: A Synergy of Semi-Supervised and Active Learning
Tianxiang Yin, Ningzhong Liu, Han Sun
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers
Daiqing Qi, Handong Zhao, Jing Shi et al.
Aggregation Hides Out-of-Distribution Generalization Failures from Spurious Correlations
Olawale Salaudeen, Haoran Zhang, Kumail Alhamoud et al.
Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement
Junyu Lou, Xiaorui Zhao, Kexuan Shi et al.
Visual Diversity and Region-aware Prompt Learning for Zero-shot HOI Detection
Chanhyeong Yang, Taehoon song, Jihwan Park et al.
Masked Diffusion Models as Energy Minimization
Sitong Chen, Shen Nie, Jiacheng Sun et al.
Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection
Shizhen Zhao, Jiahui Liu, Xin Wen et al.
CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering
xinyi zheng, Steve Zhang, Weizhe Lin et al.
Continual Optimization with Symmetry Teleportation for Multi-Task Learning
Zhipeng Zhou, Ziqiao Meng, Pengcheng Wu et al.
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
Shi-Chen Zhang, Yunheng Li, Yu-Huan Wu et al.
Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning
Huajie Jiang, Zhengxian Li, Xiaohan Yu et al.
Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation
Xiang Li, Zirui Wang, Zixuan Huang et al.
VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models
Silin Cheng, Kai Han
Pseudo-SD: Pseudo Controlled Stable Diffusion for Semi-Supervised and Cross-Domain Semantic Segmentation
Dong Zhao, Qi Zang, Shuang Wang et al.
A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations
Theo Bodrito, Olivier Flasseur, Julien Mairal et al.
Attention (as Discrete-Time Markov) Chains
Yotam Erel, Olaf Dünkel, Rishabh Dabral et al.
HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery
Yu Wang, Bo Dang, Wanchun Li et al.
MODEM: A Morton-Order Degradation Estimation Mechanism for Adverse Weather Image Recovery
Hainuo Wang, Qiming Hu, Xiaojie Guo
RNNs perform task computations by dynamically warping neural representations
Arthur Pellegrino, Angus Chadwick
Linear Differential Vision Transformer: Learning Visual Contrasts via Pairwise Differentials
Yifan Pu, Jixuan Ying, Qixiu Li et al.
No-Regret Online Autobidding Algorithms in First-price Auctions
Yilin LI, Yuan Deng, Wei Tang et al.
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
Dongliang Luo, Hanshen Zhu, Ziyang Zhang et al.
GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields
Shunsuke Yasuki, Taiki Miyanishi, Nakamasa Inoue et al.
Verbalized Representation Learning for Interpretable Few-Shot Generalization
Cheng-Fu Yang, Da Yin, Wenbo Hu et al.
Revisiting Bi-Linear State Transitions in Recurrent Neural Networks
Reza Ebrahimi, Roland Memisevic
Asymmetric Duos: Sidekicks Improve Uncertainty
Tim G. Zhou, Evan Shelhamer, Geoff Pleiss
Information-Bottleneck Driven Binary Neural Network for Change Detection
Kaijie Yin, Zhiyuan Zhang, Shu Kong et al.
A Differential and Pointwise Control Approach to Reinforcement Learning
Minh Nguyen, Chandrajit Bajaj
VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions
Haoang Lu, Yuanqi Su, Xiaoning Zhang et al.
Equi-mRNA: Protein Translation Equivariant Encoding for mRNA Language Models
Mehdi Yazdani-Jahromi, Ali Khodabandeh Yalabadi, Ozlem Garibay
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Qi Chen, Lingxiao Yang, Yun Chen et al.
Practical Solutions to the Relative Pose of Three Calibrated Cameras
Charalambos Tzamos, Viktor Kocur, Yaqing Ding et al.
SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation
Hritam Basak, Zhaozheng Yin
RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety
Andrei Dumitriu, Florin Tatui, Florin Miron et al.
Towards Pre-trained Graph Condensation via Optimal Transport
Yeyu Yan, Shuai Zheng, Wenjun Hui et al.
HELVIPAD: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
Mehdi Zayene, Albias Havolli, Jannik Endres et al.
SAGE-Eval: Evaluating LLMs for Systematic Generalizations of Safety Facts
Yueh-Han Chen, Guy Davidson, Brenden Lake
An Adaptive Algorithm for Bilevel Optimization on Riemannian Manifolds
Xu Shi, Rufeng Xiao, Rujun Jiang
Evidential Knowledge Distillation
Liangyu Xiang, Junyu Gao, Changsheng Xu
Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models
Wei Suo, Ji Ma, Mengyang Sun et al.
Improving Large Vision and Language Models by Learning from a Panel of Peers
Jefferson Hernandez, Jing Shi, Simon Jenni et al.
Precise Asymptotics and Refined Regret of Variance-Aware UCB
Yingying Fan, Yuxuan Han, Jinchi Lv et al.
Enhancing Infrared Vision: Progressive Prompt Fusion Network and Benchmark
Jinyuan Liu, Zihang Chen, Zhu Liu et al.
SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation
Zhenyuan Qin, Xincheng Shuai, Henghui Ding
Probabilistic Prototype Calibration of Vision-language Models for Generalized Few-shot Semantic Segmentation
Jie Liu, Jiayi Shen, Pan Zhou et al.
Thumb on the Scale: Optimal Loss Weighting in Last Layer Retraining
Nathan Stromberg, Christos Thrampoulidis, Lalitha Sankar
Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling
Nannan Li, Kevin Shih, Bryan A. Plummer
MAPLE: Multi-scale Attribute-enhanced Prompt Learning for Few-shot Whole Slide Image Classification
Junjie Zhou, WEI SHAO, Yagao Yue et al.
Prompt Tuning Decision Transformers with Structured and Scalable Bandits
Finn Rietz, Oleg Smirnov, Sara Karimi et al.
Mint: A Simple Test-Time Adaptation of Vision-Language Models against Common Corruptions
Wenxuan Bao, Ruxi Deng, Jingrui He
Beyond $\tilde{O}(\sqrt{T})$ Constraint Violation for Online Convex Optimization with Adversarial Constraints
Abhishek Sinha, Rahul Vaze
Deeper with Riemannian Geometry: Overcoming Oversmoothing and Oversquashing for Graph Foundation Models
Li Sun, Zhenhao Huang, Ming Zhang et al.
Competitive Advantage Attacks to Decentralized Federated Learning
Yuqi Jia, Minghong Fang, Neil Gong
PDPO: Parametric Density Path Optimization
Sebastian Gutierrez Hernandez, Peng Chen, Hao-Min Zhou
Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation
Luca Bartolomei, Enrico Mannocci, Fabio Tosi et al.
Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision
Tianma Shen, Aditya Shrish Puranik, James Vong et al.
Formal Models of Active Learning from Contrastive Examples
Farnam Mansouri, Hans Simon, Adish Singla et al.
Holistic Large-Scale Scene Reconstruction via Mixed Gaussian Splatting
Chuandong Liu, Huijiao Wang, Lei YU et al.
TRiCo: Triadic Game-Theoretic Co-Training for Robust Semi-Supervised Learning
Hongyang He, Xinyuan Song, Yangfan He et al.
Optimizing Retrieval for RAG via Reinforced Contrastive Learning
Jiawei Zhou, Lei Chen
Sketched Gaussian Mechanism for Private Federated Learning
Qiaobo Li, Zhijie Chen, Arindam Banerjee
WIPES: Wavelet-based Visual Primitives
Wenhao Zhang, Hao Zhu, Delong Wu et al.
Learning in Compact Spaces with Approximately Normalized Transformer
Jörg Franke, Urs Spiegelhalter, Marianna Nezhurina et al.
PhysVLM-AVR: Active Visual Reasoning for Multimodal Large Language Models in Physical Environments
Weijie Zhou, Xuantang Xiong, Yi Peng et al.
DLFR-Gen: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Zhihang Yuan, Rui Xie, Yuzhang Shang et al.
Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation
Kendong Liu, Zhiyu Zhu, Hui LIU et al.
High-order Interactions Modeling for Interpretable Multi-Agent Q-Learning
Qinyu Xu, Yuanyang Zhu, Xuefei Wu et al.
Neuro-Spectral Architectures for Causal Physics-Informed Networks
Arthur Bizzi, Leonardo Moreira, Márcio Marques et al.
BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning
Shengao Wang, Arjun Chandra, Aoming Liu et al.
Revisiting Orbital Minimization Method for Neural Operator Decomposition
Jongha (Jon) Ryu, Samuel Zhou, Gregory Wornell
SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction
Wenyue Chen, Peng Li, Wangguandong Zheng et al.
Graph–Smoothed Bayesian Black-Box Shift Estimator and Its Information Geometry
Masanari Kimura
A Minimalist Example of Edge-of-Stability and Progressive Sharpening
Liming Liu, Zixuan Zhang, Simon Du et al.
Revisiting Logit Distributions for Reliable Out-of-Distribution Detection
Jiachen Liang, RuiBing Hou, Minyang Hu et al.
Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis
Dong Yang, YIYI CAI, Yuki Saito et al.
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
Kairun Wen, Yuzhihuang, Runyu Chen et al.
In Silico Mapping of Visual Categorical Selectivity Across the Whole Brain
Ethan Hwang, Hossein Adeli, Wenxuan Guo et al.
Variational Task Vector Composition
Boyuan Zhang, Yingjun Du, Xiantong Zhen et al.
AI Debate Aids Assessment of Controversial Claims
Salman Rahman, Sheriff Issaka, Ashima Suvarna et al.
DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions
Hengyuan Zhang, Zhe Li, Xingqun Qi et al.
TAG-WM: Tamper-Aware Generative Image Watermarking via Diffusion Inversion Sensitivity
Yuzhuo Chen, Zehua Ma, Han Fang et al.
Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
Connor Dunlop, Matthew Zheng, Kavana Venkatesh et al.
GeRaF: Neural Geometry Reconstruction from Radio Frequency Signals
Jiachen Lu, Hailan Shanbhag, Haitham Al Hassanieh
LIM: Large Interpolator Model for Dynamic Reconstruction
Remy Sabathier, Niloy J. Mitra, David Novotny
OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps
Bingnan Li, Chen-Yu Wang, Haiyang Xu et al.
EnzyControl: Adding Functional and Substrate-Specific Control for Enzyme Backbone Generation
Chao Song, ZHIYUAN LIU, Han Huang et al.
Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs
Georgios Tzannetos, Parameswaran Kamalaruban, Adish Singla
Fuse Before Transfer: Knowledge Fusion for Heterogeneous Distillation
Guopeng Li, Qiang Wang, Ke Yan et al.
EnliveningGS: Active Locomotion of 3DGS
Siyuan Shen, Tianjia Shao, Kun Zhou et al.
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference
KUO WANG, Quanlong Zheng, Junlin Xie et al.
CamPoint: Boosting Point Cloud Segmentation with Virtual Camera
Jianhui Zhang, Luo Yizhi, Zicheng Zhang et al.
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
Jiacheng Chen, Ziyu Jiang, Mingfu Liang et al.
Scaling can lead to compositional generalization
Florian Redhardt, Yassir Akram, Simon Schug
A Dataset for Distilling Knowledge Priors from Literature for Therapeutic Design
Haydn Jones, Natalie Maus, Josh magnus Ludan et al.
Bootstrap Off-policy with World Model
Guojian Zhan, Likun Wang, Xiangteng Zhang et al.
Diffusion-based 3D Hand Motion Recovery with Intuitive Physics
Yufei Zhang, Zijun Cui, Jeffrey Kephart et al.
Conditional Distribution Compression via the Kernel Conditional Mean Embedding
Dominic Broadbent, Nick Whiteley, Robert Allison et al.
Adversarial Diffusion for Robust Reinforcement Learning
Daniele Foffano, Alessio Russo, Alexandre Proutiere
Dense Dispersed Structured Light for Hyperspectral 3D Imaging of Dynamic Scenes
Suhyun Shin, Seungwoo Yoon, Ryota Maeda et al.
Adjusted Count Quantification Learning on Graphs
Clemens Damke, Eyke Hüllermeier
OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields
Lisa Weijler, Sebastian Koch, Fabio Poiesi et al.
Clip-and-Verify: Linear Constraint-Driven Domain Clipping for Accelerating Neural Network Verification
Duo Zhou, Jorge Chavez, Hesun Chen et al.