Most Cited 2025 "knowledge base uncertainty" Papers
22,274 papers found • Page 98 of 112
Conference
Inference-Time Personalized Alignment with a Few User Preference Queries
Victor-Alexandru Pădurean, Parameswaran Kamalaruban, Nachiket Kotalwar et al.
Seeing is Not Believing: Adversarial Natural Object Optimization for Hard-Label 3D Scene Attacks
Daizong Liu, Wei Hu
GauSAM: Contour‑Guided 2D Gaussian Fields for Multi‑Scale Medical Image Segmentation with Segment Anything
Jinxuan Wu, Jiange Wang, Dongdong Zhang
Auto-Search and Refinement: An Automated Framework for Gender Bias Mitigation in Large Language Models
Yue Xu, Chengyan Fu, Li Xiong et al.
Embodied Cognition Augmented End2End Autonomous Driving
Ling Niu, Xiaoji Zheng, han wang et al.
HomoGen: Enhanced Video Inpainting via Homography Propagation and Diffusion
Ding Ding, Yueming Pan, Ruoyu Feng et al.
Exploiting Dynamic Sparsity in Einsum
Christoph Staudt, Mark Blacher, Tim Hoffmann et al.
When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective
Alireza Mousavi-Hosseini, Clayton Sanford, Denny Wu et al.
VLM in a flash: I/O-Efficient Sparsification of Vision-Language Model via Neuron Chunking
Kichang Yang, Seonjun Kim, Minjae Kim et al.
Blindfolded Experts Generalize Better: Insights from Robotic Manipulation and Videogames
Ev Zisselman, Mirco Mutti, Shelly Francis-Meretzki et al.
Discovering Symbolic Partial Differential Equation by Abductive Learning
En-Hao Gao, Cunjing Ge, Yuan Jiang et al.
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Yifang Men, Yuan Yao, Miaomiao Cui et al.
Why Popular MOEAs are Popular: Proven Advantages in Approximating the Pareto Front
Mingfeng Li, Qiang Zhang, Weijie Zheng et al.
Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data
Haoxin Li, Boyang Li
Contextual Online Pricing with (Biased) Offline Data
Yixuan Zhang, Ruihao Zhu, Qiaomin Xie
Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses
Yongfan Liu, Hyoukjun Kwon
Less but More: Linear Adaptive Graph Learning Empowering Spatiotemporal Forecasting
Jiaming Ma, Binwu Wang, Guanjun Wang et al.
CADMorph: Geometry‑Driven Parametric CAD Editing via a Plan–Generate–Verify Loop
Weijian Ma, Shizhao Sun, Ruiyu Wang et al.
CHiQPM: Calibrated Hierarchical Interpretable Image Classification
Thomas Norrenbrock, Timo Kaiser, Sovan Biswas et al.
Scalable, Explainable and Provably Robust Anomaly Detection with One-Step Flow Matching
Zhong Li, Qi Huang, Yuxuan Zhu et al.
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
Penghui Qi, Zichen Liu, Tianyu Pang et al.
MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining
Zhixun Chen, Ping Guo, Wenhan Han et al.
Efficient Spectral Control of Partially Observed Linear Dynamical Systems
Anand Brahmbhatt, Gon Buzaglo, Sofiia Druchyna et al.
Robotic Visual Instruction
Yanbang Li, ZiYang Gong, Haoyang Li et al.
FedGPS: Statistical Rectification Against Data Heterogeneity in Federated Learning
Zhiqin Yang, Yonggang Zhang, Chenxin Li et al.
Robust Neural Rendering in the Wild with Asymmetric Dual 3D Gaussian Splatting
CHENGQI LI, Zhihao Shi, Yangdi Lu et al.
Learned Binocular-Encoding Optics for RGBD Imaging Using Joint Stereo and Focus Cues
Yuhui Liu, Liangxun Ou, Qiang Fu et al.
Greedy Sampling Is Provably Efficient For RLHF
Di Wu, Chengshuai Shi, Jing Yang et al.
SAGE: A Unified Framework for Generalizable Object State Recognition with State-Action Graph Embedding
Yuan Zang, Zitian Tang, Junho Cho et al.
Martian World Model: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
Longfei Li, Zhiwen Fan, Wenyan Cong et al.
Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
Teng Hu, Jiangning Zhang, Ran Yi et al.
Multimodal 3D Genome Pre-training
Minghao Yang, Pengteng Li, Yan Liang et al.
Query Efficient Black-Box Visual Prompting with Subspace Learning
Haozhen Zhang, Zhaogeng Liu, Hualin Zhang et al.
Fingerprinting Denoising Diffusion Probabilistic Models
Huan Teng, Yuhui Quan, Chengyu Wang et al.
Generalized Top-k Mallows Model for Ranked Choices
Shahrzad Haddadan, Sara Ahmadian
Spectral Analysis of Representational Similarity with Limited Neurons
Hyunmo Kang, Abdulkadir Canatar, SueYeon Chung
Exploring and Leveraging Class Vectors for Classifier Editing
Jaeik Kim, Jaeyoung Do
DyG-Mamba: Continuous State Space Modeling on Dynamic Graphs
Dongyuan Li, Shiyin Tan, Ying Zhang et al.
Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs
Jinzhe Liu, Junshu Sun, Shufan Shen et al.
AdaDARE-gamma: Balancing Stability and Plasticity in Multi-modal LLMs through Efficient Adaptation
Jingyi Xie, Jintao Yang, Zhunchen Luo et al.
Improved Scaling Laws in Linear Regression via Data Reuse
Licong Lin, Jingfeng Wu, Peter Bartlett
Harnessing the Universal Geometry of Embeddings
Rishi Jha, Collin Zhang, Vitaly Shmatikov et al.
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
Qi Zhu, Jiangwei Lao, Deyi Ji et al.
DisMo: Disentangled Motion Representations for Open-World Motion Transfer
Thomas Ressler-Antal, Frank Fundel, Malek Ben Alaya et al.
RLVR-World: Training World Models with Reinforcement Learning
Jialong Wu, Shaofeng Yin, Ningya Feng et al.
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
Zhehao Li, Zhehao Li, Kangbo Lyu et al.
Shaping Sequence Attractor Schema in Recurrent Neural Networks
Zhikun Chu, Bo Ho, xiaolong zou et al.
Reverse-Annealed Sequential Monte Carlo for Efficient Bayesian Optimal Experiment Design
Jake Callahan, Andrew Chin, Jason Pacheco et al.
T-REGS: Minimum Spanning Tree Regularization for Self-Supervised Learning
Julie Mordacq, David Loiseaux, Vicky Kalogeiton et al.
LoMix: Learnable Weighted Multi-Scale Logits Mixing for Medical Image Segmentation
Md Mostafijur Rahman, Radu Marculescu
E-MoFlow: Learning Egomotion and Optical Flow from Event Data via Implicit Regularization
Wenpu Li, Bangyan Liao, Yi Zhou et al.
Knowledge Bridger: Towards Training-Free Missing Modality Completion
Guanzhou Ke, Shengfeng He, Xiao-Li Wang et al.
Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning
Feng Chen, Allan Raventós, Nan Cheng et al.
Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models
Chenhang Cui, Gelei Deng, An Zhang et al.
Differentiable Structure Learning and Causal Discovery for General Binary Data
Chang Deng, Bryon Aragam
KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment
Yuxing Lu, Wei Wu, Xukai Zhao et al.
BimArt: A Unified Approach for the Synthesis of 3D Bimanual Interaction with Articulated Objects
Wanyue Zhang, Rishabh Dabral, Vladislav Golyanik et al.
Integration Matters for Learning PDEs with Backwards SDEs
Sungje Park, Stephen Tu
Generating Multi-Table Time Series EHR from Latent Space with Minimal Preprocessing
Eunbyeol Cho, Jiyoun Kim, Minjae Lee et al.
Adaptive Re-calibration Learning for Balanced Multimodal Intention Recognition
Qu Yang, Xiyang Li, Fu Lin et al.
Unlearning-Aware Minimization
Hoki Kim, Keonwoo Kim, Sungwon Chae et al.
FEEDBACK FRICTION: LLMs Struggle to Fully Incorporate External Feedback
Dongwei Jiang, Bowei Zhang, Andrew Wang et al.
Model–Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn’t the Right One
Itamar Avitan, Tal Golan
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Xinshuai Song, weixing chen, Yang Liu et al.
Shape Abstraction via Marching Differentiable Support Functions
Sunkyung Park, Jeongmin Lee, Dongjun Lee
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Yang Yue, Yulin Wang, Chenxin Tao et al.
Cameras as Relative Positional Encoding
Ruilong Li, Brent Yi, Junchen Liu et al.
Generalization vs Specialization under Concept Shift
Alex Nguyen, David Schwab, Vudtiwat Ngampruetikorn
Joint Vision-Language Social Bias Removal for CLIP
Haoyu Zhang, Yangyang Guo, Mohan Kankanhalli
R1-ShareVL: Incentivizing Reasoning Capabilities of Multimodal Large Language Models via Share-GRPO
Huanjin Yao, Qixiang Yin, Jingyi Zhang et al.
Federated Dialogue-Semantic Diffusion for Emotion Recognition under Incomplete Modalities
Xihang Qiu, Jiarong Cheng, Yuhao Fang et al.
When and how can inexact generative models still sample from the data manifold?
Nisha Chandramoorthy, Adriaan de Clercq
IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification
Yuhao Wang, Yongfeng Lv, Pingping Zhang et al.
Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities
Taehyun Hwang, Dahngoon Kim, Min-hwan Oh
Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward
Dipendra Misra, Aldo Pacchiano, Ta-Chung Chi et al.
Hyper-Modality Enhancement for Multimodal Sentiment Analysis with Missing Modalities
Yan Zhuang, Minhao Liu, Wei Bai et al.
Learning Across the Gap: Hybrid Multi-armed Bandits with Heterogeneous Offline and Online Data
Qijia He, Minghan Wang, Xutong Liu et al.
DiCoFlex: Model-Agnostic Diverse Counterfactuals with Flexible Control
Oleksii Furman, Ulvi Movsum-zada, Patryk Marszałek et al.
RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection
Yunfei Long, Abhinav Kumar, Xiaoming Liu et al.
HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories
Eric Hedlin, Munawar Hayat, Fatih Porikli et al.
WaveAR: Wavelet-Aware Continuous Autoregressive Diffusion for Accurate Human Motion Prediction
shengchuan gao, Shuo Wang, Yabiao Wang et al.
L2DGCN: Learnable Enhancement and Label Selection Dynamic Graph Convolutional Networks for Mitigating Degree Bias
jingxiao zhang, Shifei Ding, Jian Zhang et al.
iG-6DoF: Model-free 6DoF Pose Estimation for Unseen Object via Iterative 3D Gaussian Splatting
Tuo Cao, Fei LUO, Jiongming Qin et al.
Simple and Effective Specialized Representations for Fair Classifiers
Alberto Sinigaglia, Davide Sartor, Marina Ceccon et al.
Near-Optimal Sample Complexity for Online Constrained MDPs
Chang Liu, Yunfan Li, Lin Yang
Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling
Shuhong Zheng, Ashkan Mirzaei, Igor Gilitschenski
Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability
Eline M. Bovy, Caleb Probine, Marnix Suilen et al.
Enhancing Visual Prompting through Expanded Transformation Space and Overfitting Mitigation
Shohei Enomoto
POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation
Lanyun Zhu, Tianrun Chen, Qianxiong Xu et al.
Generating Informative Samples for Risk-Averse Fine-Tuning of Downstream Tasks
Heasung Kim, Taekyun Lee, Hyeji Kim et al.
Symmetry-Preserving Conformer Ensemble Networks for Molecular Representation Learning
Yanqiao Zhu, Yidan Shi, Yuanzhou Chen et al.
Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Zhangqi Jiang, Junkai Chen, Beier Zhu et al.
Differentially Private Relational Learning with Entity-level Privacy Guarantees
Yinan Huang, Haoteng Yin, Eli Chien et al.
UNIALIGN: Scaling Multimodal Alignment within One Unified Model
bo zhou, Liulei Li, Yujia Wang et al.
Zero-Shot 4D Lidar Panoptic Segmentation
Yushan Zhang, Aljoša Ošep, Laura Leal-Taixe et al.
Tight Bounds for Maximum Weight Matroid Independent Set and Matching in the Zero Communication Model
Ilan Doron-Arad
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
Jingzhou Luo, Yang Liu, weixing chen et al.
Efficient Motion-Aware Video MLLM
Zijia Zhao, Yuqi Huo, Tongtian Yue et al.
Registration is a Powerful Rotation-Invariance Learner for 3D Anomaly Detection
Yuyang Yu, Zhengwei Chen, Xuemiao Xu et al.
Mechanistic Interpretability of RNNs emulating Hidden Markov Models
Elia Torre, Michele Viscione, Lucas Pompe et al.
Hyperspectral Pansharpening via Diffusion Models with Iteratively Zero-Shot Guidance
Jin-Liang Xiao, Ting-Zhu Huang, Liang-Jian Deng et al.
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization
Michael Green, Matan Levy, Issar Tzachor et al.
Object-centric 3D Motion Field for Robot Learning from Human Videos
Zhao-Heng Yin, Sherry Yang, Pieter Abbeel
TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering
Chun Gu, Xiaofei Wei, Li Zhang et al.
ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs
Michal Nazarczuk, Sibi Catley-Chandar, Thomas Tanay et al.
Fundamental Limitations in Pointwise Defences of LLM Finetuning APIs
Xander Davies, Eric Winsor, Alexandra Souly et al.
Balanced Conic Rectified Flow
Kim Shin seong, Mingi Kwon, Jaeseok Jeong et al.
Hierarchical Retrieval: The Geometry and a Pretrain-Finetune Recipe
Chong You, Rajesh Jayaram, Ananda Theertha Suresh et al.
Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL
Joey Hong, Anca Dragan, Sergey Levine
RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization
Dongki Jung, Jaehoon Choi, Yonghan Lee et al.
Imitation Beyond Expectation Using Pluralistic Stochastic Dominance
Ali Farajzadeh, Danyal Saeed, Syed M Abbas et al.
Irrational Complex Rotations Empower Low-bit Optimizers
Zhen Tian, Xin Zhao, Ji-Rong Wen
Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation
Zhuoran ZHAO, Linlin Yang, Pengzhan Sun et al.
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Andrea Maracani, Savas Ozkan, Sijun Cho et al.
Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment
DEOKJAE LEE, Hyun Oh Song
Accelerating Parallel Diffusion Model Serving with Residual Compression
Jiajun Luo, Yicheng Xiao, Jianru Xu et al.
TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making
Shanshan Li, Da Huang, Yu He et al.
Weak-shot Keypoint Estimation via Keyness and Correspondence Transfer
Junjie Chen, Zeyu Luo, Zezheng Liu et al.
Towards Smart Point-and-Shoot Photography
Jiawan Li, Fei Zhou, Zhipeng Zhong et al.
Uni-RL: Unifying Online and Offline RL via Implicit Value Regularization
Haoran Xu, Liyuan Mao, Hui Jin et al.
ModHiFi: Identifying High Fidelity predictive components for Model Modification
Dhruva Kashyap, Chaitanya Murti, Pranav K Nayak et al.
Image Reconstruction from Readout-Multiplexed Single-Photon Detector Arrays
Shashwath Bharadwaj, Ruangrawee Kitichotkul, Akshay Agarwal et al.
Bilevel Optimization for Adversarial Learning Problems: Sharpness, Generation, and Beyond
Risheng Liu, Zhu Liu, Weihao Mao et al.
REDOUBT: Duo Safety Validation for Autonomous Vehicle Motion Planning
Shuguang Wang, Qian Zhou, Kui Wu et al.
Vertical Federated Feature Screening
Huajun Yin, Liyuan Wang, Yingqiu Zhu et al.
GenColor: Generative and Expressive Color Enhancement with Pixel-Perfect Texture Preservation
Yi Dong, Yuxi Wang, Xianhui Lin et al.
Learning on Model Weights using Tree Experts
Eliahu Horwitz, Bar Cavia, Jonathan Kahana et al.
Don't be lazy: CompleteP enables compute-efficient deep transformers
Nolan Dey, Bin Zhang, Lorenzo Noci et al.
Large-scale Multi-view Tensor Clustering with Implicit Linear Kernels
Jiyuan Liu, Xinwang Liu, chuankun Li et al.
$\mu$PC: Scaling Predictive Coding to 100+ Layer Networks
Francesco Innocenti, El Mehdi Achour, Christopher L Buckley
LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities
Florian Sestak, Artur Toshev, Andreas Fürst et al.
DualMPNN: Harnessing Structural Alignments for High-Recovery Inverse Protein Folding
Xuhui Liao, qiyu wang, Zhiqiang Liang et al.
Towards Comprehensive Scene Understanding: Integrating First and Third-Person Views for LVLMs
Insu Lee, Wooje Park, Jaeyun Jang et al.
Gated Integration of Low-Rank Adaptation for Continual Learning of Large Language Models
Yan-Shuo Liang, Jia-Rui Chen, Wu-Jun Li
The Impact Label Noise and Choice of Threshold has on Cross-Entropy and Soft-Dice in Image Segmentation
Marcus Nordström, Atsuto Maki, Henrik Hult
Towards Straggler-Resilient Split Federated Learning: An Unbalanced Update Approach
Dandan Liang, Jianing Zhang, Evan Chen et al.
Conformal Prediction in The Loop: A Feedback-Based Uncertainty Model for Trajectory Optimization
Han Wang, Chao Ning
ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts
Jinho Choi, Hyesu Lim, Steffen Schneider et al.
Graphs Help Graphs: Multi-Agent Graph Socialized Learning
Jialu Li, Yu Wang, Pengfei Zhu et al.
EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models
Yinan Liang, Ziwei Wang, Xiuwei Xu et al.
Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning
Bastien Dubail, Stefan Stojanovic, Alexandre Proutiere
Towards a General Attention Framework on Gyrovector Spaces for Matrix Manifolds
Rui Wang, Chen Hu, Xiaoning Song et al.
Motions as Queries: One-Stage Multi-Person Holistic Human Motion Capture
Kenkun Liu, Yurong Fu, Weihao Yuan et al.
ArticulatedGS: Self-supervised Digital Twin Modeling of Articulated Objects using 3D Gaussian Splatting
Guo Junfu, Yu Xin, Gaoyi Liu et al.
VTON-VLLM: Aligning Virtual Try-On Models with Human Preferences
Siqi Wan, Jingwen Chen, Qi Cai et al.
Mitigating Reward Over-optimization in Direct Alignment Algorithms with Importance Sampling
Nguyen Phuc, Ngoc-Hieu Nguyen, Duy M. H. Nguyen et al.
Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options
Joongkyu Lee, Seouh-won Yi, Min-hwan Oh
When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective
Beatrix Nielsen, Emanuele Marconato, Andrea Dittadi et al.
Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment
Huakai Lai, Guoxin Xiong, Huayu Mai et al.
Mitigating Spurious Features in Contrastive Learning with Spectral Regularization
Naghmeh Ghanooni, Waleed Mustafa, Dennis Wagner et al.
Per-Architecture Training-Free Metric Optimization for Neural Architecture Search
Mingzhuo Lin, Jianping Luo
RoMa: A Robust Model Watermarking Scheme for Protecting IP in Diffusion Models
Yingsha Xie, Rui Min, Zeyu Qin et al.
Pruning Spurious Subgraphs for Graph Out-of-Distribution Generalization
Tianjun Yao, Haoxuan Li, Yongqiang Chen et al.
On Logic-based Self-Explainable Graph Neural Networks
Alessio Ragno, Marc Plantevit, Céline Robardet
Low-degree evidence for computational transition of recovery rate in stochastic block model
Jingqiu Ding, Yiding Hua, Lucas Slot et al.
Low-Rank Head Avatar Personalization with Registers
Sai Tanmay Reddy Chakkera, Aggelina Chatziagapi, Md Moniruzzaman et al.
Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning
Buzhen Huang, Chen Li, Chongyang Xu et al.
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
Gholamali Aminian, Amir R. Asadi, Idan Shenfeld et al.
Integrating Drug Substructures and Longitudinal Electronic Health Records for Personalized Drug Recommendation
Wenjie Du, Xuqiang Li, Jinke Feng et al.
OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction
Gehui Li, Bin Chen, Chen Zhao et al.
The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models
Ke Ji, Jiahao Xu, Tian Liang et al.
M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings
Qingzheng Xu, Ru Cao, Xin Shen et al.
Star with Bilinear Mapping
Zelin Peng, Yu Huang, Zhengqin Xu et al.
Preference-Guided Diffusion for Multi-Objective Offline Optimization
Yashas Annadani, Syrine Belakaria, Stefano Ermon et al.
Memorization in Graph Neural Networks
Adarsh Jamadandi, Jing Xu, Adam Dziedzic et al.
RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges
Thibaut Loiseau, Guillaume Bourmaud
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding
junliang ye, Zhengyi Wang, Ruowen Zhao et al.
Behavior Injection: Preparing Language Models for Reinforcement Learning
Zhepeng Cen, Yihang Yao, William Han et al.
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Dubing Chen, Huan Zheng, Jin Fang et al.
Reconciling Geospatial Prediction and Retrieval via Sparse Representations
YI LI, CHEN YUANLONG, Weiming Huang et al.
A Unified Framework for Variable Selection in Model-Based Clustering with Missing Not at Random
Binh Ho, Long Nguyen-Chi, TrungTin Nguyen et al.
Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks
Wei-Jin Huang, Yuan-Ming Li, Zhi-Wei Xia et al.
Planning and Learning in Average Risk-aware MDPs
Weikai Wang, Erick Delage
Bipolar Self-attention for Spiking Transformers
Shuai Wang, Malu Zhang, Jingya Wang et al.
LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians
Jiamin WU, Kenkun Liu, Han Gao et al.
FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts
Heming Zou, Yunliang Zang, Wutong Xu et al.
STEP: Enhancing Video-LLMs’ Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training
Haiyi Qiu, Minghe Gao, Long Qian et al.
RigGS: Rigging of 3D Gaussians for Modeling Articulated Objects in Videos
Yuxin Yao, Zhi Deng, Junhui Hou
Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
Qing-Yuan Jiang, Longfei Huang, Yang Yang
ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation
Jiuhong Xiao, Roshan Nayak, Ning Zhang et al.
Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
Chengxiu HUA, Jiawen Gu, Yushun Tang
C-SafeGen: Certified Safe LLM Generation with Claim-Based Streaming Guardrails
Mintong Kang, Zhaorun Chen, Bo Li
Evolutionary Reasoning Does Not Arise in Standard Usage of Protein Language Models
Yasha Ektefaie, Andrew Shen, Lavik Jain et al.
VarFlow: Proper Scoring-Rule Diffusion Distillation via Energy Matching
Huiyang Shao, Xin Xia, Yuxi Ren et al.
Stop Learning it all to Mitigate Visual Hallucination, Focus on the Hallucination Target.
Dokyoon Yoon, Youngsook Song, Woomyoung Park
From Kolmogorov to Cauchy: Shallow XNet Surpasses KANs
Xin Li, Xiaotao Zheng, Zhihong Xia
LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
Zixuan Hu, Yongxian Wei, Li Shen et al.
HOT: Hadamard-based Optimized Training
Seonggon Kim, Juncheol Shin, Seung-taek Woo et al.
Language Models (Mostly) Know When to Stop Reading
Roy Xie, Junlin Wang, Paul Rosu et al.
On Local Limits of Sparse Random Graphs: Color Convergence and the Refined Configuration Model
Alexander Pluska, Sagar Malhotra
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Large Model Enhancement
Qianhan Feng, Wenshuo Li, Tong Lin et al.
Representational Difference Explanations
Neehar Kondapaneni, Oisin Mac Aodha, Pietro Perona
G-Net: A Provably Easy Construction of High-Accuracy Random Binary Neural Networks
Alireza Aghasi, Nicholas F. Marshall, Saeid Pourmand et al.
Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation
Yuanqi Yao, Siao Liu, Haoming Song et al.
FLUX: Efficient Descriptor-Driven Clustered Federated Learning under Arbitrary Distribution Shifts
Dario Fenoglio, Mohan Li, Pietro Barbiero et al.
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye, Burhan Yaman, Sheng Cheng et al.
Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning
Xueyi Ke, Satoshi Tsutsui, Yayun Zhang et al.
New Parallel and Streaming Algorithms for Directed Densest Subgraph
Slobodan Mitrovic, Theodore Pan, Mahdi Qaempanah et al.
Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts
Ben Schiffer, Mark Sellke
Learning Textual Prompts for Open-World Semi-Supervised Learning
Yuxin Fan, Junbiao Cui, Jiye Liang