Most Cited 2025 "risk allocation" Papers
22,274 papers found • Page 94 of 112
Conference
Token Activation Map to Visually Explain Multimodal LLMs
Yi Li, Hualiang Wang, Xinpeng Ding et al.
Hadamard Test is Sufficient for Efficient Quantum Gradient Estimation with Lie Algebraic Symmetries
Mohsen Heidari, Masih Mozakka, Wojciech Szpankowski
Can Knowledge be Transferred from Unimodal to Multimodal? Investigating the Transitivity of Multimodal Knowledge Editing
Lingyong Fang, Xinzhong Wang, Depeng depeng wang et al.
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
Yarden As, Chengrui (Ray) Qu, Benjamin Unger et al.
FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Jiawei Zhang, Zijian Wu, Zhiyang Liang et al.
From Information to Generative Exponent: Learning Rate Induces Phase Transitions in SGD
Konstantinos Tsiolis, Alireza Mousavi-Hosseini, Murat Erdogdu
UniMotion: A Unified Motion Framework for Simulation, Prediction and Planning
Nan Song, Junzhe Jiang, jingyu li et al.
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
Chengyue Wu, Xiaokang Chen, Zhiyu Wu et al.
Counterfactual Implicit Feedback Modeling
Chuan Zhou, Lina Yao, Haoxuan Li et al.
ImageSentinel: Protecting Visual Datasets from Unauthorized Retrieval-Augmented Image Generation
Ziyuan Luo, Yangyi Zhao, Ka Chun Cheung et al.
Epistemic Uncertainty Estimation in Regression Ensemble Models with Pairwise Epistemic Estimators
Lucas Berry, David Meger
Dynamic Focused Masking for Autoregressive Embodied Occupancy Prediction
Yuan Sun, Julio Contreras, Jorge Ortiz
WarpGAN: Warping-Guided 3D GAN Inversion with Style-Based Novel View Inpainting
Kaitao Huang, Yan Yan, Jing-Hao Xue et al.
KOALA++: Efficient Kalman-Based Optimization with Gradient-Covariance Products
Zixuan XIa, Aram Davtyan, Paolo Favaro
Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement
Xinjie Li, Ziyi Chen, Xinlu Yu et al.
Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models
Wen-Tse Chen, Jiayu Chen, Fahim Tajwar et al.
Forensics Adapter: Adapting CLIP for Generalizable Face Forgery Detection
Xinjie Cui, Yuezun Li, Ao Luo et al.
Object Concepts Emerge from Motion
Haoqian Liang, Xiaohui Wang, Zhichao Li et al.
Towards Prospective Medical Image Reconstruction via Knowledge-Informed Dynamic Optimal Transport
Taoran Zheng, Yan Yang, Xing Li et al.
ROOT: Rethinking Offline Optimization as Distributional Translation via Probabilistic Bridge
Cuong Dao, The Hung Tran, Phi Le Nguyen et al.
Alleviating Hallucinations in Large Language Models through Multi-Model Contrastive Decoding and Dynamic Hallucination Detection
Chenyu Zhu, Yefeng Liu, Hao Zhang et al.
Seg-VAR:Image Segmentation with Visual Autoregressive Modeling
Rongkun Zheng, Lu Qi, Xi Chen et al.
VidTwin: Video VAE with Decoupled Structure and Dynamics
Yuchi Wang, Junliang Guo, Xinyi Xie et al.
GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction
Jiahe Li, Jiawei Zhang, Youmin Zhang et al.
Theoretical Investigation of Adafactor for Non-Convex Smooth Optimization
Yusu Hong, Junhong Lin
PyraMotion: Attentional Pyramid-Structured Motion Integration for Co-Speech 3D Gesture Synthesis
Zhizhuo Yin, Yuk Hang Tsui, Pan Hui
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Jiantao Lin, Xin Yang, Meixi Chen et al.
MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks
Yifei Liu, Zhihang Zhong, Yifan Zhan et al.
Length Generalization via Auxiliary Tasks
Pranjal Awasthi, Anupam Gupta, Ravi Kumar
EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling
Songpengcheng Xia, Yu Zhang, Zhuo Su et al.
Building 3D Representations and Generating Motions From a Single Image via Video-Generation
Weiming Zhi, Ziyong Ma, Tianyi Zhang et al.
Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning
Till Freihaut, Luca Viano, Volkan Cevher et al.
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
Shengqu Cai, Eric Ryan Chan, Yunzhi Zhang et al.
RENO: Real-Time Neural Compression for 3D LiDAR Point Clouds
Kang You, Tong Chen, Dandan Ding et al.
How Classifier Features Transfer to Downstream: An Asymptotic Analysis in a Two-Layer Model
HEE BIN YOO, Sungyoon Lee, Cheongjae Jang et al.
FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression
Yifei Gao, Yong Chen, Chen Zhang
Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment
Chen Liu, Peike Li, Liying Yang et al.
Learning Preferences without Interaction for Cooperative AI: A Hybrid Offline-Online Approach
Haitong Ma, Haoran Yu, Haobo Fu et al.
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
Chen Tang, Xinzhu Ma, Encheng Su et al.
X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability
Yu Yang, Alan Liang, Jianbiao Mei et al.
ChatbotID: Identifying Chatbots with Granger Causality Test
Xiaoquan Yi, Haozhao Wang, Yining Qi et al.
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
Datao Tang, Xiangyong Cao, Xuan Wu et al.
Rising from Ashes: Generalized Federated Learning via Dynamic Parameter Reset
Jiahao Wu, Ming Hu, Yanxin Yang et al.
Automated Detection of Visual Attribute Reliance with a Self-Reflective Agent
Christy Li, Josep Lopez Camuñas, Jake Touchet et al.
Rectifying Shortcut Behaviors in Preference-based Reward Learning
Wenqian Ye, Guangtao Zheng, Aidong Zhang
Perceptual Inductive Bias Is What You Need Before Contrastive Learning
Junru Zhao, Tianqin Li, Dunhan Jiang et al.
Approximate Gradient Coding for Distributed Learning with Heterogeneous Stragglers
Heekang Song, Wan Choi
Improving planning and MBRL with temporally-extended actions
Palash Chatterjee, Roni Khardon
Pool Me Wisely: On the Effect of Pooling in Transformer-Based Models
Sofiane Ennadir, Levente Zólyomi, Oleg Smirnov et al.
Joint Design of Protein Surface and Backbone Using a Diffusion Bridge Model
Guanlue Li, Xufeng Zhao, Fang Wu et al.
CARL: A Framework for Equivariant Image Registration
Hastings Greer, Lin Tian, François-Xavier Vialard et al.
Fine-grained List-wise Alignment for Generative Medication Recommendation
Chenxiao Fan, Chongming Gao, Wentao Shi et al.
ProDAG: Projected Variational Inference for Directed Acyclic Graphs
Ryan Thompson, Edwin Bonilla, Robert Kohn
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
Sangmin Bae, Yujin Kim, Reza Bayat et al.
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
Taeyoung Yun, Dinghuai Zhang, Jinkyoo Park et al.
MemEIC: A Step Toward Continual and Compositional Knowledge Editing
Jin Seong, Jiyun Park, Wencke Liermann et al.
DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
Hyeongjin Nam, Donghwan Kim, Jeongtaek Oh et al.
Knowledge Distillation Detection for Open-weights Models
Qin Shi, Amber Yijia Zheng, Qifan Song et al.
Automatic Synthetic Data and Fine-grained Adaptive Feature Alignment for Composed Person Retrieval
Delong Liu, Haiwen Li, Zhaohui Hou et al.
Coloring Learning for Heterophilic Graph Representation
Miaomiao Huang, Yuhai Zhao, Daniel Zhengkui Wang et al.
FedLPA: Local Prior Alignment for Heterogeneous Federated Generalized Category Discovery
Geeho Kim, Jinu Lee, Bohyung Han
Minimizing Labeled, Maximizing Unlabeled: An Image-Driven Approach for Video Instance Segmentation
Fangyun Wei, Jinjing Zhao, Kun Yan et al.
Practical Kernel Selection for Kernel-based Conditional Independence Test
Wenjie Wang, Mingming Gong, Biwei Huang et al.
Less is More: Improving LLM Alignment via Preference Data Selection
Xun Deng, Han Zhong, Rui Ai et al.
Alias-Free ViT: Fractional Shift Invariance via Linear Attention
Hagay Michaeli, Daniel Soudry
Beyond Clean Training Data: A Versatile and Model-Agnostic Framework for Out-of-Distribution Detection with Contaminated Training Data
Yuchuan Li, Jae-Mo Kang, Il-Min Kim
Learning Endogenous Attention for Incremental Object Detection
Xiang Song, Yuhang He, Jingyuan Li et al.
How Does Sequence Modeling Architecture Influence Base Capabilities of Pre-trained Language Models? Exploring Key Architecture Design Principles to Avoid Base Capabilities Degradation
Xin Lu, Yanyan Zhao, Si Wei et al.
Learning-Augmented Streaming Algorithms for Correlation Clustering
Yinhao Dong, Shan Jiang, Shi Li et al.
Spectral Conditioning of Attention Improves Transformer Performance
Hemanth Saratchandran, Simon Lucey
TimePerceiver: An Encoder-Decoder Framework for Generalized Time-Series Forecasting
Jaebin Lee, Hankook Lee
Structured Sparse Transition Matrices to Enable State Tracking in State-Space Models
Aleksandar Terzic, Nicolas Menet, Michael Hersche et al.
On Linear Mode Connectivity of Mixture-of-Experts Architectures
Viet-Hoang Tran, Van Hoan Trinh, Khanh-Vinh Bui et al.
Graph-Theoretic Insights into Bayesian Personalized Ranking for Recommendation
Kai Zheng, Jianxin Wang, Jinhui Xu
Generalizable Insights for Graph Transformers in Theory and Practice
Timo Stoll, Luis Müller, Christopher Morris
EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
Sagar Soni, Akshay Dudhane, Hiyam Debary et al.
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Haoyang He, Jiangning Zhang, Yuxuan Cai et al.
Metis: A Foundation Speech Generation Model with Masked Generative Pre-training
Yuancheng Wang, Jiachen Zheng, Junan Zhang et al.
One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling
Nimrod Berman, Ilan Naiman, Moshe Eliasof et al.
Holistic Order Prediction in Natural Scenes
Pierre Musacchio, Hyunmin Lee, Jaesik Park
Jailbreaking the Non-Transferable Barrier via Test-Time Data Disguising
Yongli Xiang, Ziming Hong, Lina Yao et al.
FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Language Models
shengming yuan, Xinyu Lyu, Shuailong Wang et al.
Using Diffusion Priors for Video Amodal Segmentation
Kaihua Chen, Deva Ramanan, Tarasha Khurana
Interpreting vision transformers via residual replacement model
Jinyeong Kim, Junhyeok Kim, Yumin Shim et al.
Functional Complexity-adaptive Temporal Tensor Decomposition
Panqi Chen, Lei Cheng, Jianlong Li et al.
SIFusion: A Unified Fusion Framework for Multi-granularity Arctic Sea Ice Forecasting
Jingyi Xu, Shengnan Wang, Weidong Yang et al.
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Jianwen Jiang, Gaojie Lin, Zhengkun Rong et al.
GeoRemover: Removing Objects and Their Causal Visual Artifacts
Zixin Zhu, Haoxiang Li, Xuelu Feng et al.
Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining
Shangquan Sun, Wenqi Ren, Juxiang Zhou et al.
A Latent Multilayer Graphical Model For Complex, Interdependent Systems
Martin Ondrus, Ivor Cribben, Yang Feng
GUARD: Constructing Realistic Two-Player Matrix and Security Games for Benchmarking Game-Theoretic Algorithms
Noah Krever, Jakub Cerny, Moise Blanchard et al.
Analogy-based Multi-Turn Jailbreak against Large Language Models
Mengjie Wu, Yihao Huang, Zhenjun Lin et al.
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
Wufei Ma, Luoxin Ye, Nessa McWeeney et al.
DuSA: Fast and Accurate Dual-Stage Sparse Attention Mechanism Accelerating Both Training and Inference
Chong Wu, Jiawang Cao, Renjie Xu et al.
Omnidirectional 3D Scene Reconstruction from Single Image
Ren Yang, Jiahao Li, Yan Lu
Randomized-MLP Regularization Improves Domain Adaptation and Interpretability in DINOv2
Joel Valdivia Ortega, Lorenz Lamm, Franziska Eckardt et al.
Informed Initialization for Bayesian Optimization and Active Learning
Carl Hvarfner, David Eriksson, Eytan Bakshy et al.
MVBoost: Boost 3D Reconstruction with Multi-View Refinement
Xiangyu Liu, Xiaomei Zhang, Zhiyuan Ma et al.
Text-to-Code Generation for Modular Building Layouts in Building Information Modeling
YINYI WEI, Xiao LI
Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models
Zichen Miao, WEI CHEN, Qiang Qiu
Disentangling Latent Shifts of In-Context Learning with Weak Supervision
Josip Jukić, Jan Šnajder
Curl Descent : Non-Gradient Learning Dynamics with Sign-Diverse Plasticity
Hugo Ninou, Jonathan Kadmon, N Alex Cayco Gajic
IceDiff: High Resolution and High-Quality Arctic Sea Ice Forecasting with Generative Diffusion Prior
Jingyi Xu, Siwei Tu, Weidong Yang et al.
Low-Rank Graphon Learning for Networks
Xinyuan Fan, Feiyan Ma, Chenlei Leng et al.
Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model
Yuhan Wang, Suzhi Bi, Ying-Jun Angela Zhang et al.
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Models
Dar-Yen Chen, Hmrishav Bandyopadhyay, Kai Zou et al.
PLMTrajRec: A Scalable and Generalizable Trajectory Recovery Method with Pre-trained Language Models
Tonglong Wei, Yan Lin, Youfang Lin et al.
A Few Moments Please: Scalable Graphon Learning via Moment Matching
Reza Ramezanpour, Victor Manuel Tenorio Gomez, Antonio G. Marques et al.
Infrequent Exploration in Linear Bandits
Harin Lee, Min-hwan Oh
Towards Open-Vocabulary Audio-Visual Event Localization
Jinxing Zhou, Dan Guo, Ruohao Guo et al.
An Iterative Algorithm for Differentially Private $k$-PCA with Adaptive Noise
Johanna Düngler, Amartya Sanyal
Multidimensional Bayesian Utility Maximization: Tight Approximations to Welfare
Kira Goldner, Taylor Lundy
On Optimal Steering to Achieve Exact Fairness
mohit sharma, Amit Deshpande, Chiranjib Bhattacharyya et al.
Forming Auxiliary High-confident Instance-level Loss to Promote Learning from Label Proportions
Tianhao Ma, Han Chen, Juncheng Hu et al.
Adjoint Schrödinger Bridge Sampler
Guan-Horng Liu, Jaemoo Choi, Yongxin Chen et al.
MOSDT: Self-Distillation-Based Decision Transformer for Multi-Agent Offline Safe Reinforcement Learning
Yuchen Xia, Yunjian Xu
Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis
Tianrui Wang, Haoyu Wang, Meng Ge et al.
Geometry-Aware Collaborative Multi-Solutions Optimizer for Model Fine-Tuning with Parameter Efficiency
Van-Anh Nguyen, Trung Le, Mehrtash Harandi et al.
Dual Focus-Attention Transformer for Robust Point Cloud Registration
Kexue Fu, Ming'zhi Yuan, Changwei Wang et al.
Rectification-specific Supervision and Constrained Estimator for Online Stereo Rectification
Rui Gong, Kim-Hui Yap, Weide Liu et al.
SpikingVTG: A Spiking Detection Transformer for Video Temporal Grounding
Malyaban Bal, Brian Matejek, Susmit Jha et al.
Learning Flow Fields in Attention for Controllable Person Image Generation
Zijian Zhou, Shikun Liu, Xiao Han et al.
DKC: Differentiated Knowledge Consolidation for Cloth-Hybrid Lifelong Person Re-identification
Zhenyu Cui, Jiahuan Zhou, Yuxin Peng
Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation
Hongmei Yin, Tingliang Feng, Fan Lyu et al.
Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty
Xu Wan, Chao Yang, Cheng Yang et al.
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins
Yao Mu, Tianxing Chen, Zanxin Chen et al.
Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation
Tal Zeevi, Ravid Shwartz-Ziv, Yann LeCun et al.
Embodied Scene Understanding for Vision Language Models via MetaVQA
Weizhen Wang, Chenda Duan, Zhenghao Peng et al.
RAPID Hand: Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platfrom for Embodied Intelligence
Zhaoliang Wan, Zetong Bi, Zida Zhou et al.
Multiscale guidance of protein structure prediction with heterogeneous cryo-EM data
Rishwanth Raghu, Axel Levy, Gordon Wetzstein et al.
Attribute-formed Class-specific Concept Space: Endowing Language Bottleneck Model with Better Interpretability and Scalability
Jianyang Zhang, Qianli Luo, Guowu Yang et al.
Certifying Deep Network Risks and Individual Predictions with PAC-Bayes Loss via Localized Priors
Wen Dong
X-Mahalanobis: Transformer Feature Mixing for Reliable OOD Detection
Tong Wei, Bolin Wang, Jiang-Xin Shi et al.
Shape and Texture: What Influences Reliable Optical Flow Estimation?
Libo Long, Xiao Hu, Jochen Lang
Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models
Yan Xie, Zequn Zeng, Hao Zhang et al.
MaintainCoder: Maintainable Code Generation Under Dynamic Requirements
Zhengren Wang, Rui ling, Chufan Wang et al.
Dependency Parsing is More Parameter-Efficient with Normalization
Paolo Gajo, Domenic Rosati, Hassan Sajjad et al.
From Pixels to Views: Learning Angular-Aware and Physics-Consistent Representations for Light Field Microscopy
Feng He, Guodong Tan, Qiankun Li et al.
Design-Based Bandits Under Network Interference: Trade-Off Between Regret and Statistical Inference
Zichen Wang, Haoyang Hong, Chuanhao Li et al.
Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
Zeqing Wang, Qingyang Ma, Wentao Wan et al.
Generating Creative Chess Puzzles
Xidong Feng, Vivek Veeriah, Marcus Chiam et al.
How Patterns Dictate Learnability in Sequential Data
Mario Morawski, Anaïs Després, Remi Rehm
One Model for ALL: Low-Level Task Interaction Is a Key to Task-Agnostic Image Fusion
Chunyang Cheng, Tianyang Xu, Zhenhua Feng et al.
MiCo: Multi-image Contrast for Reinforcement Visual Reasoning
Xi Chen, Mingkang Zhu, Shaoteng Liu et al.
In-context Learning of Linear Dynamical Systems with Transformers: Approximation Bounds and Depth-separation
Frank Cole, Yuxuan Zhao, Yulong Lu et al.
MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
Zifan Wang, Ziqing Chen, Junyu Chen et al.
TransMLA: Migrating GQA Models to MLA with Full DeepSeek Compatibility and Speedup
Fanxu Meng, Pingzhi Tang, Zengwei Yao et al.
Tackling Biased Evaluators in Dueling Bandits
Ming Tang, Yuxuan Zhou, Chao Huang
Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes
JunYong Choi, Min-Cheol Sagong, SeokYeong Lee et al.
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
Junha Lee, Chunghyun Park, Jaesung Choe et al.
Is Grokking a Computational Glass Relaxation?
Xiaotian Zhang, Yue Shang, Entao Yang et al.
PUATE: Efficient ATE Estimation from Treated (Positive) and Unlabeled Units
Masahiro Kato, Fumiaki Kozai, RYO INOKUCHI
TESTING STATIONARITY AND CHANGE POINT DETECTION IN REINFORCEMENT LEARNING
Mengbing Li, Chengchun Shi, Zhenke Wu et al.
Parallelized Autoregressive Visual Generation
Yuqing Wang, Shuhuai Ren, Zhijie Lin et al.
Learning Extremely High Density Crowds as Active Matters
Feixiang He, Jiangbei Yue, Jialin Zhu et al.
DKDR: Dynamic Knowledge Distillation for Reliability in Federated Learning
Yueyang Yuan, Wenke Huang, Guancheng Wan et al.
CPO: Condition Preference Optimization for Controllable Image Generation
Zonglin Lyu, Ming Li, Xinxin Liu et al.
SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model
Shuhan Tan, John Wheatley Lambert, Hong Jeon et al.
Private Statistical Estimation via Truncation
Manolis Zampetakis, Felix Zhou
Enhancing Tactile-based Reinforcement Learning for Robotic Control
Elle Miller, Trevor McInroe, David Abel et al.
Incremental Object Keypoint Learning
Mingfu Liang, Jiahuan Zhou, Xu Zou et al.
CaricatureBooth: Data-Free Interactive Caricature Generation in a Photo Booth
Zhiyu Qu, Yunqi Miao, Zhensong Zhang et al.
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Zhiwei Jia, Yuesong Nan, Huixi Zhao et al.
Coupled Data and Measurement Space Dynamics for Enhanced Diffusion Posterior Sampling
Shayan Mohajer Hamidi, Ben Liang, EN-HUI YANG
OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation
Xiao Cui, Yulei Qin, Wengang Zhou et al.
Improving the Transferability of Adversarial Attacks on Face Recognition with Diverse Parameters Augmentation
Fengfan Zhou, Bangjie Yin, Hefei Ling et al.
Order-Level Attention Similarity Across Language Models: A Latent Commonality
Jinglin Liang, Jin Zhong, Shuangping Huang et al.
Bilevel Network Learning via Hierarchically Structured Sparsity
Jiayi Fan, Jingyuan Yang, Shuangge Ma et al.
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
Jiahao Cui, Hui Li, Qingkun Su et al.
PolyPose: Deformable 2D/3D Registration via Polyrigid Transformations
Vivek Gopalakrishnan, Neel Dey, Polina Golland
An Optimized Franz-Parisi Criterion and its Equivalence with SQ Lower Bounds
Siyu Chen, Theodor Misiakiewicz, Ilias Zadik et al.
Long-Tailed Recognition via Information-Preservable Two-Stage Learning
Fudong Lin, Xu Yuan
Approximation theory for 1-Lipschitz ResNets
Davide Murari, Takashi Furuya, Carola-Bibiane Schönlieb
Improving Regret Approximation for Unsupervised Dynamic Environment Generation
Harry Mead, Bruno Lacerda, Jakob Foerster et al.
Improving Accuracy and Calibration via Differentiated Deep Mutual Learning
Han Liu, Peng Cui, Bingning Wang et al.
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Yuxiang Wei, Olivier Duchenne, Jade Copet et al.
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models
Tianchen Zhao, Ke Hong, Xinhao Yang et al.
Gaussian Splashing: Unified Particles for Versatile Motion Synthesis and Rendering
Yutao Feng, Xiang Feng, Yintong Shang et al.
GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill
Jieming Cui, Tengyu Liu, Ziyu Meng et al.
Neural Combinatorial Optimization for Time Dependent Traveling Salesman Problem
Ruixiao Yang, Chuchu Fan
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
Shivam Agarwal, Zimin Zhang, Lifan Yuan et al.
OPMapper: Enhancing Open-Vocabulary Semantic Segmentation with Multi-Guidance Information
Xuehui Wang, Chongjie Si, Xue Yang et al.
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting
Jiaxin Zhang, Junjun Jiang, Youyu Chen et al.
ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning
Kailin Li, Puhao Li, Tengyu Liu et al.
Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples
Shiva Sreeram, Alaa Maalouf, Pratyusha Sharma et al.
Sequentially Auditing Differential Privacy
Tomás González Lara, Mateo Dulce Rubio, Aaditya Ramdas et al.
ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking
Lequan Lin, Dai Shi, Andi Han et al.
Rethinking Scale-Aware Temporal Encoding for Event-based Object Detection
Lin Zhu, Tengyu Long, Xiao Wang et al.
Adaptive Parameter Selection for Tuning Vision-Language Models
Yi Zhang, Yi-Xuan Deng, Meng-Hao Guo et al.
Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method
Pan Yin, Kaiyu Li, Xiangyong Cao et al.
Spike4DGS: Towards High-Speed Dynamic Scene Rendering with 4D Gaussian Splatting via a Spike Camera Array
Qinghong Ye, Yiqian Chang, Jianing Li et al.
MMRL: Multi-Modal Representation Learning for Vision-Language Models
Yuncheng Guo, Xiaodong Gu
Native Segmentation Vision Transformers
Guillem Brasó, Aljosa Osep, Laura Leal-Taixé
On the Universal Near Optimality of Hedge in Combinatorial Settings
Zhiyuan Fan, Arnab Maiti, Lillian Ratliff et al.
Shortcut Features as Top Eigenfunctions of NTK: A Linear Neural Network Case and More
Jinwoo Lim, Suhyun Kim, Soo-Mook Moon
Exploiting Task Relationships in Continual Learning via Transferability-Aware Task Embeddings
Yanru Wu, Jianning Wang, Xiangyu Chen et al.
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
Wenyi Hong, Yean Cheng, Zhuoyi Yang et al.
Efficient Multi-bit Quantization Network Training via Weight Bias Correction and Bit-wise Coreset Sampling
Jinhee Kim, Jae Jun An, Kang Eun Jeon et al.
Language‑Bias‑Resilient Visual Question Answering via Adaptive Multi‑Margin Collaborative Debiasing
Huanjia Zhu, Shuyuan Zheng, Yishu Liu et al.
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations
Xunzhi Zheng, Dan Xu