Most Cited 2025 "constrained online convex optimization" Papers
Bi-Directional Communication-Efficient Stochastic FL via Remote Source Generation
Maximilian Egger, Rawad Bitar, Antonia Wachter-Zeh et al.
Extragradient Method for $(L_0, L_1)$-Lipschitz Root-finding Problems
Sayantan Choudhury, Nicolas Loizou
OmniTry: Virtual Try-On Anything without Masks
Yutong Feng, Linlin Zhang, Hengyuan Cao et al.
Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes
Ludwic Leonard, Nils Thuerey, Rüdiger Westermann
MobileODE: An Extra Lightweight Network
Le Yu, Jun Wu, Bo Gou et al.
Purest Quantum State Identification
Yingqi Yu, Honglin Chen, Jun Wu et al.
Plug-and-play Feature Causality Decomposition for Multimodal Representation Learning
Ye Liu, Zihan Ji, Hongmin Cai
Aligning by Misaligning: Boundary-aware Curriculum Learning for Multimodal Alignment
Hua Ye, Hang Ding, Siyuan Chen et al.
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
Zixuan Xie, Xinyu Liu, Rohan Chandra et al.
A Dataset for Semantic Segmentation in the Presence of Unknowns
Zakaria Laskar, Tomas Vojir, Matej Grcic et al.
Principled Model Routing for Unknown Mixtures of Source Domains
Christoph Dann, Yishay Mansour, Teodor Vanislavov Marinov et al.
FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation
Kefan Chen, Chaerin Min, Linguang Zhang et al.
The $\varphi$ Curve: The Shape of Generalization through the Lens of Norm-based Capacity Control
Yichen Wang, Yudong Chen, Lorenzo Rosasco et al.
Objective Soups: Multilingual Multi-Task Modeling for Speech Processing
A F M Saif, Lisha Chen, Xiaodong Cui et al.
Native-Resolution Image Synthesis
ZiDong Wang, Lei Bai, Xiangyu Yue et al.
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
Xi Chen, Kaituo Feng, Changsheng Li et al.
DON’T NEED RETRAINING: A Mixture of DETR and Vision Foundation Models for Cross-Domain Few-Shot Object Detection
Changhan Liu, Xunzhi Xiang, Zixuan Duan et al.
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Jiaming Han, Hao Chen, Yang Zhao et al.
Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention
Soikat Hasan Ahmed, Jan Finkbeiner, Emre Neftci
HiMoLE: Towards OOD-Robust LoRA via Hierarchical Mixture of Experts
Yinuo Jiang, Yan Xiaodong, Keyan Ding et al.
Locally Orderless Images for Optimization in Differentiable Rendering
Ishit Mehta, Manmohan Chandraker, Ravi Ramamoorthi
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training
Jintao Zhang, Jia Wei, Haoxu Wang et al.
Spike-RetinexFormer: Rethinking Low-light Image Enhancement with Spiking Neural Networks
Hongzhi Wang, Xiubo Liang, Jinxing Han et al.
TrajMamba: An Efficient and Semantic-rich Vehicle Trajectory Pre-training Model
Yichen Liu, Yan Lin, Shengnan Guo et al.
Representation-Level Counterfactual Calibration for Debiased Zero-Shot Recognition
Pei Peng, Ming-Kun Xie, Hang Hao et al.
DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders
Sizai Hou, Songze Li, Duanyi Yao
Gate to the Vessel: Residual Experts Restore What SAM Overlooks
Weili Jiang, Jinrong Lv, Xun Gong et al.
Do Computer Vision Foundation Models Learn the Low-level Characteristics of the Human Visual System?
Yancheng Cai, Fei Yin, Dounia Hammou et al.
CSBrain: A Cross-scale Spatiotemporal Brain Foundation Model for EEG Decoding
Yuchen Zhou, Jiamin Wu, Zichen Ren et al.
A Driving-Style-Adaptive Framework for Vehicle Trajectory Prediction
Di Wen, Yu Wang, Zhigang Wu et al.
CHASM: Unveiling Covert Advertisements on Chinese Social Media
Jingyi Zheng, Tianyi Hu, Yule Liu et al.
Style-Editor: Text-driven Object-centric Style Editing
Jihun Park, Jongmin Gim, Kyoungmin Lee et al.
ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling
Shuyuan Zhang, ChenHan Jiang, Zuoou Li et al.
Exploring Temporally-Aware Features for Point Tracking
Inès Hyeonsu Kim, Seokju Cho, Gabriel Huang et al.
Generative Inbetweening through Frame-wise Conditions-Driven Video Generation
Tianyi Zhu, Dongwei Ren, Qilong Wang et al.
What do you know? Bayesian knowledge inference for navigating agents
Matthias Schultheis, Jana-Sophie Schönfeld, Constantin Rothkopf et al.
Empowering Decision Trees via Shape Function Branching
Nakul Upadhya, Eldan Cohen
Block-Biased Mamba for Long-Range Sequence Processing
Annan Yu, N. Benjamin Erichson
To Think or Not To Think: A Study of Thinking in Rule-Based Visual Reinforcement Fine-Tuning
Ming Li, Jike Zhong, Shitian Zhao et al.
Segment then Splat: Unified 3D Open-Vocabulary Segmentation via Gaussian Splatting
Yiren Lu, Yunlai Zhou, Yiran Qiao et al.
Nearly-Linear Time and Massively Parallel Algorithms for $k$-anonymity
Kevin Aydin, Honghao Lin, David Woodruff et al.
Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation
Xiaoyu Yue, ZiDong Wang, Yuqing Wang et al.
Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
Yibo Wen, Chenwei Xu, Jerry Yao-Chieh Hu et al.
Final-Model-Only Data Attribution with a Unifying View of Gradient-Based Methods
Dennis Wei, Inkit Padhi, Soumya Ghosh et al.
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Shijie Zhou, Hui Ren, Yijia Weng et al.
Faster Video Diffusion with Trainable Sparse Attention
Peiyuan Zhang, Yongqi Chen, Haofeng Huang et al.
Accurate Differential Operators for Hybrid Neural Fields
Aditya Chetan, Guandao Yang, Zichen Wang et al.
PhySwin: An Efficient and Physically-Informed Foundation Model for Multispectral Earth Observation
Chong Tang, Joseph Powell, Dirk Koch et al.
Does Representation Guarantee Welfare?
Jakob de Raaij, Ariel Procaccia, Alexandros Psomas
BlurDM: A Blur Diffusion Model for Image Deblurring
Jin-Ting He, Fu-Jen Tsai, Yan-Tsung Peng et al.
Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models
Zidi Xiong, Shan Chen, Zhenting Qi et al.
Multi-Agent Reinforcement Learning with Communication-Constrained Priors
Guang Yang, Jingwen Qiao, Tianpei Yang et al.
Query-Efficient Locally Private Hypothesis Selection via the Scheffe Graph
Gautam Kamath, Alireza F. Pour, Matthew Regehr et al.
Model Reconciliation via Cost-Optimal Explanations in Probabilistic Logic Programming
Yinxu Tang, Stylianos Loukas Vasileiou, Vincent Derkinderen et al.
A Beyond-Worst-Case Analysis of Greedy k-means++
Qingyun Chen, Sungjin Im, Ben Moseley et al.
Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality
Junyan Liu, Ziyun Chen, Kun Wang et al.
Restricted Global-Aware Graph Filters Bridging GNNs and Transformer for Node Classification
Jingyuan Zhang, Xin Wang, Lei Yu et al.
Open Ad-hoc Categorization with Contextualized Feature Learning
Zilin Wang, Sangwoo Mo, Stella X. Yu et al.
DAA: Amplifying Unknown Discrepancy for Test-Time Discovery
Tianle Liu, Fan Lyu, Chenggong Ni et al.
Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding
Yue Guan, Changming Yu, Shihan Fang et al.
Value Diffusion Reinforcement Learning
Xiaoliang Hu, Fuyun Wang, Tong Zhang et al.
MERGE: Multi-faceted Hierarchical Graph-based GNN for Gene Expression Prediction from Whole Slide Histopathology Images
Aniruddha Ganguly, Debolina Chatterjee, Wentao Huang et al.
Does Object Binding Naturally Emerge in Large Pretrained Vision Transformers?
Yihao Li, Saeed Salehi, Lyle Ungar et al.
GIST: Greedy Independent Set Thresholding for Max-Min Diversification with Submodular Utility
Matthew Fahrbach, Srikumar Ramalingam, Morteza Zadimoghaddam et al.
Lifelong Test-Time Adaptation via Online Learning in Tracked Low-Dimensional Subspace
Dexin Duan, Rui Xu, Peilin Liu et al.
Towards Reliable LLM-based Robots Planning via Combined Uncertainty Estimation
Shiyuan Yin, Chenjia Bai, Zihao Zhang et al.
Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)
Ruaridh Mon-Williams, Max Taylor-Davies, Elizabeth Mieczkowski et al.
Train on Pins and Test on Obstacles for Rectilinear Steiner Minimum Tree
Xingbo Du, Ruizhe Zhong, Junchi Yan
Omni-ID: Holistic Identity Representation Designed for Generative Tasks
Guocheng Qian, Kuan-Chieh Wang, Or Patashnik et al.
SoundVista: Novel-View Ambient Sound Synthesis via Visual-Acoustic Binding
Mingfei Chen, Israel D. Gebru, Ishwarya Ananthabhotla et al.
MVDoppler-Pose: Multi-Modal Multi-View mmWave Sensing for Long-Distance Self-Occluded Human Walking Pose Estimation
Jae-Ho Choi, Soheil Hor, Shubo Yang et al.
BraVE: Offline Reinforcement Learning for Discrete Combinatorial Action Spaces
Matthew Landers, Taylor W. Killian, Hugo Barnes et al.
Beyond Node-Centric Modeling: Sketching Signed Networks with Simplicial Complexes
Wei Wu, Xuan Tan, Yan Peng et al.
KScope: A Framework for Characterizing the Knowledge Status of Language Models
Yuxin Xiao, Shan Chen, Jack Gallifant et al.
From Contextual Combinatorial Semi-Bandits to Bandit List Classification: Improved Sample Complexity with Sparse Rewards
Liad Erez, Tomer Koren
A Bias-Free Training Paradigm for More General AI-generated Image Detection
Fabrizio Guillaro, Giada Zingarini, Ben Usman et al.
MultiNet: Adaptive Multi-Viewed Subgraph Convolutional Networks for Graph Classification
Xinya Qin, Lu Bai, Lixin Cui et al.
DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration
Tianteng Gu, Bei Liu, Bo Xiao et al.
OpenMMEgo: Enhancing Egocentric Understanding for LMMs with Open Weights and Data
Hao Luo, Zihao Yue, Wanpeng Zhang et al.
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
Xiangyan Liu, Jinjie Ni, Zijian Wu et al.
Policy Gradient Methods Converge Globally in Imperfect-Information Extensive-Form Games
Fivos Kalogiannis, Gabriele Farina
ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate
Ming Yan, Xincheng Lin, Yuhua Luo et al.
3DOT: Texture Transfer for 3DGS Objects from a Single Reference Image
Xiao Cao, Beibei Lin, Bo Wang et al.
PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset
Jiazhen Liu, Yuhan Fu, Ruobing Xie et al.
Where Does It Exist from the Low-Altitude: Spatial Aerial Video Grounding
Yang Zhan, Yuan Yuan
Fast Computation and Optimization for Opinion-Based Quantities of Friedkin-Johnsen Model
Haoxin Sun, Yubo Sun, Xiaotian Zhou et al.
DeepLA-Net: Very Deep Local Aggregation Networks for Point Cloud Analysis
Ziyin Zeng, Mingyue Dong, Jian Zhou et al.
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
Yikun Liu, Yajie Zhang, Jiayin Cai et al.
Descriptor-In-Pixel: Point-Feature Tracking For Pixel Processor Arrays
Laurie Bose, Piotr Dudek, Jianing Chen
PandaPose: 3D Human Pose Lifting from a Single Image via Propagating 2D Pose Prior to 3D Anchor Space
Jinghong Zheng, Changlong Jiang, Yang Xiao et al.
The quest for the GRAph Level autoEncoder (GRALE)
Paul Krzakala, Gabriel Melo, Charlotte Laclau et al.
MiniMax-Remover: Taming Bad Noise Helps Video Object Removal
Bojia Zi, Weixuan Peng, Xianbiao Qi et al.
Faster Parameter-Efficient Tuning with Token Redundancy Reduction
Kwonyoung Kim, Jungin Park, Jin Kim et al.
Mixture of Scope Experts at Test: Generalizing Deeper Graph Neural Networks with Shallow Variants
Gangda Deng, Hongkuan Zhou, Rajgopal Kannan et al.
Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation
Hadi Alzayer, Philipp Henzler, Jonathan T. Barron et al.
Enhancing the Maximum Effective Window for Long-Term Time Series Forecasting
Jiahui Zhang, Zhengyang Zhou, Wenjie Du et al.
Feature Selection for Latent Factor Models
Rittwika Kansabanik, Adrian Barbu
Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting
Kangjie Chen, Yingji Zhong, Zhihao Li et al.
Learning Task-Agnostic Representations through Multi-Teacher Distillation
Philippe Formont, Maxime Darrin, Banafsheh Karimian et al.
Efficient Representativeness-Aware Coreset Selection
Zihao Cheng, Binrui Wu, Zhiwei Li et al.
The Parameterized Complexity of Computing the VC-Dimension
Florent Foucaud, Harmender Gahlawat, Fionn Mc Inerney et al.
Scalable Signature Kernel Computations via Local Neumann Series Expansions
Matthew Tamayo-Rios, Alexander Schell, Rima Alaifari
Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic
Jianwei Tang, Hong Yang, Tengyue Chen et al.
Quantum Visual Fields with Neural Amplitude Encoding
Shuteng Wang, Christian Theobalt, Vladislav Golyanik
Reinforcement learning for one-shot DAG scheduling with comparability identification and dense reward
Xumai Qi, Dongdong Zhang, Taotao Liu et al.
Inverse Methods for Missing Data Imputation
Hao Wang, Zhengnan Li, Zhichao Chen et al.
Meta Guidance: Incorporating Inductive Biases into Deep Time Series Imputers
Jiacheng You, Xinyang Chen, Yu Sun et al.
DEGauss: Defending Against Malicious 3D Editing for Gaussian Splatting
Lingzhuang Meng, Mingwen Shao, Yuanjian Qiao et al.
Attention IoU: Examining Biases in CelebA using Attention Maps
Aaron Serianni, Tyler Zhu, Olga Russakovsky et al.
Locally Optimal Private Sampling: Beyond the Global Minimax
Hrad Ghoukasian, Bonwoo Lee, Shahab Asoodeh
Attribution-Driven Adaptive Token Pruning for Transformers
Yaoyao Yan, Hui Yu, Weizhi Xu
Bootstrap Your Uncertainty: Adaptive Robust Classification Driven by Optimal-Transport
Jiawei Huang, Minming Li, Hu Ding
Kernel von Mises Formula of the Influence Function
Yaroslav Mukhin
Closed-Form Training Dynamics Reveal Learned Features and Linear Structure in Word2Vec-like Models
Dhruva Karkada, James Simon, Yasaman Bahri et al.
Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task
Sunqi Fan, Jiashuo Cui, Meng-Hao Guo et al.
Co-Regularization Enhances Knowledge Transfer in High Dimensions
Shuo Shuo Liu, Haotian Lin, Matthew Reimherr et al.
Prof. Robot: Differentiable Robot Rendering Without Static and Self-Collisions
Quanyuan Ruan, Jiabao Lei, Wenhao Yuan et al.
Low-Biased General Annotated Dataset Generation
Dengyang Jiang, Haoyu Wang, Lei Zhang et al.
TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks
Xiang Meng, Mehdi Makni, Rahul Mazumder
ArtFormer: Controllable Generation of Diverse 3D Articulated Objects
Jiayi Su, Youhe Feng, Zheng Li et al.
Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking
Liangliang Zhang, Zhuorui Jiang, Hongliang Chi et al.
Do Neural Networks Need Gradient Descent to Generalize? A Theoretical Study
Yotam Alexander, Yonatan Slutzky, Yuval Ran-Milo et al.
Learning Physics-Based Full-Body Human Reaching and Grasping from Brief Walking References
Yitang Li, Mingxian Lin, Zhuo Lin et al.
MotiF: Making Text Count in Image Animation with Motion Focal Loss
Shijie Wang, Samaneh Azadi, Rohit Girdhar et al.
DOVTrack: Data-Efficient Open-Vocabulary Tracking
Zekun Qian, Ruize Han, Zhixiang Wang et al.
Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Video Temporal Grounding
Jian Hu, Zixu Cheng, Shaogang Gong et al.
QuARI: Query Adaptive Retrieval Improvement
Eric Xing, Abby Stylianou, Robert Pless et al.
Prior-Guided Diffusion Planning for Offline Reinforcement Learning
Donghyeon Ki, JunHyeok Oh, Seong-Woong Shim et al.
Stable Cinemetrics: Structured Taxonomy and Evaluation for Professional Video Generation
Agneet Chatterjee, Rahim Entezari, Maksym Zhuravinskyi et al.
Partition-Then-Adapt: Combating Prediction Bias for Reliable Multi-Modal Test-Time Adaptation
Guowei Wang, Fan Lyu, Changxing Ding
Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning
Yang Li, Aming Wu, Zihao Zhang et al.
FP4 All the Way: Fully Quantized Training of Large Language Models
Brian Chmiel, Maxim Fishman, Ron Banner et al.
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ali Hatamizadeh, Jan Kautz
Generative Photomontage
Sean J. Liu, Nupur Kumari, Ariel Shamir et al.
A Black-Box Debiasing Framework for Conditional Sampling
Han Cui, Jingbo Liu
Vinci: Deep Thinking in Text-to-Image Generation using Unified Model with Reinforcement Learning
Wang Lin, Wentao Hu, Liyu Jia et al.
Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves?
Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler et al.
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement
Xiyao Wang, Zhengyuan Yang, Chao Feng et al.
Seeing through Uncertainty: Robust Task-Oriented Optimization in Visual Navigation
Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.
Community Forensics: Using Thousands of Generators to Train Fake Image Detectors
Jeongsoo Park, Andrew Owens
Fortifying Time Series: DTW-Certified Robust Anomaly Detection
Shijie Liu, Tansu Alpcan, Christopher Leckie et al.
AlignedGen: Aligning Style Across Generated Images
Jiexuan Zhang, Yiheng Du, Qian Wang et al.
SAP: Exact Sorting in Splatting via Screen-Aligned Primitives
Zhanke Wang, Zhiyan Wang, Kaiqiang Xiong et al.
QiMeng-MuPa: Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
Changxin Ke, Rui Zhang, Shuo Wang et al.
Interpreting Object-level Foundation Models via Visual Precision Search
Ruoyu Chen, Siyuan Liang, Jingzhi Li et al.
Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling
Jiahao Wang, Weiye Xu, Aijun Yang et al.
An Adaptive Quantum Circuit of Dempster's Rule of Combination for Uncertain Pattern Classification
Fuyuan Xiao, Yu Zhou, Witold Pedrycz
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
Hongjia Zhai, Hai Li, Zhenzhe Li et al.
An Information-theoretical Framework for Understanding Out-of-distribution Detection with Pretrained Vision-Language Models
Bo Peng, Jie Lu, Guangquan Zhang et al.
Hyperbolic Safety-Aware Vision-Language Models
Tobia Poppi, Tejaswi Kasarla, Pascal Mettes et al.
Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
Sanjana Ramprasad, Byron Wallace
MixPrompt: Efficient Mixed Prompting for Multimodal Semantic Segmentation
Zhiwei Hao, Zhongyu Xiao, Jianyuan Guo et al.
Real-Time Scene-Adaptive Tone Mapping for High-Dynamic Range Object Detection
Gongzhe Li, Linwei Qiu, Peibei Cao et al.
Discovering Data Structures: Nearest Neighbor Search and Beyond
Omar Salemohamed, Laurent Charlin, Shivam Garg et al.
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes
Xiang Xu, Lingdong Kong, Hui Shuai et al.
Orient Anything V2: Unifying Orientation and Rotation Understanding
Zehan Wang, Ziang Zhang, Jiayang Xu et al.
GLane3D: Detecting Lanes with Graph of 3D Keypoints
Halil İbrahim Öztürk, Muhammet Esat Kalfaoglu, Ozsel Kilinc
Foundation Cures Personalization: Improving Personalized Models’ Prompt Consistency via Hidden Foundation Knowledge
Yiyang Cai, Zhengkai Jiang, Yulong Liu et al.
OPTFM: A Scalable Multi-View Graph Transformer for Hierarchical Pre-Training in Combinatorial Optimization
Hao Yuan, Wenli Ouyang, Changwen Zhang et al.
Factor Decorrelation Enhanced Data Removal from Deep Predictive Models
Wenhao Yang, Lin Li, Xiaohui Tao et al.
Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding
Yan Wang, Baoxiong Jia, Ziyu Zhu et al.
Wavy Transformer
Satoshi Noguchi, Yoshinobu Kawahara
Efficient Algorithms for Robust and Partial Semi-Discrete Optimal Transport
Pankaj Agarwal, Sharath Raghvendra, Pouyan Shirzadian et al.
Minimax Adaptive Online Nonparametric Regression over Besov spaces
Paul Liautaud, Pierre Gaillard, Olivier Wintenberger
The Unseen Threat: Residual Knowledge in Machine Unlearning under Perturbed Samples
Hsiang Hsu, Pradeep Niroula, Zichang He et al.
TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions
Wang Yu-Hang, Junkang Guo, Aolei Liu et al.
PermLLM: Learnable Channel Permutation for N:M Sparse Large Language Models
Lancheng Zou, Shuo Yin, Zehua Pei et al.
AneuG-Flow: A Large-Scale Synthetic Dataset of Diverse Intracranial Aneurysm Geometries and Hemodynamics
Wenhao Ding, Yiying Sheng, Simão de Castro et al.
HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks
Maria Pilligua, Danna Xue, Javier Vazquez-Corral
Scalable Cross-View Sample Alignment for Multi-View Clustering with View Structure Similarity
Jun Wang, Zhenglai Li, Chang Tang et al.
Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery
Ming Hu, Zhengdi Yu, Feilong Tang et al.
Non-Asymptotic Analysis Of Data Augmentation For Precision Matrix Estimation
Lucas Morisset, Adrien Hardy, Alain Durmus
Time of the Flight of the Gaussians: Optimizing Depth Indirectly in Dynamic Radiance Fields
Runfeng Li, Mikhail Okunev, Zixuan Guo et al.
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views
Ethan Griffiths, Maryam Haghighat, Simon Denman et al.
PerLA: Perceptive 3D Language Assistant
Guofeng Mei, Wei Lin, Luigi Riz et al.
Disentangled Cross-Modal Representation Learning with Enhanced Mutual Supervision
Lu Gao, Wenlan Chen, Daoyuan Wang et al.
EAG3R: Event-Augmented 3D Geometry Estimation for Dynamic and Extreme-Lighting Scenes
Xiaoshan Wu, Yifei Yu, Xiaoyang Lyu et al.
Interactive and Hybrid Imitation Learning: Provably Beating Behavior Cloning
Yichen Li, Chicheng Zhang
Learning to Add, Multiply, and Execute Algorithmic Instructions Exactly with Neural Networks
Artur Back de Luca, George Giapitzakis, Kimon Fountoulakis
Follow-the-Perturbed-Leader Nearly Achieves Best-of-Both-Worlds for the m-Set Semi-Bandit Problems
Jingxin Zhan, Yuchen Xin, Chenjie Sun et al.
Pro3D-Editor: A Progressive Framework for Consistent and Precise 3D Editing
Yang Zheng, Mengqi Huang, Nan Chen et al.
Magma: A Foundation Model for Multimodal AI Agents
Jianwei Yang, Reuben Tan, Qianhui Wu et al.
Estimation of Stochastic Optimal Transport Maps
Sloan Nietert, Ziv Goldfeld
UFT: Unifying Supervised and Reinforcement Fine-Tuning
Mingyang Liu, Gabriele Farina, Asuman Ozdaglar
Computational Budget Should Be Considered in Data Selection
Weilin Wan, Weizhong Zhang, Cheng Jin
GOAL: Global-local Object Alignment Learning
Hyungyu Choi, Young Kyun Jang, Chanho Eom
Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought
Hanlin Zhu, Shibo Hao, Zhiting Hu et al.
Modeling Neural Activity with Conditionally Linear Dynamical Systems
Victor Geadah, Amin Nejatbakhsh, David Lipshutz et al.
GenAssets: Generating in-the-wild 3D Assets in Latent Space
Ze Yang, Jingkang Wang, Haowei Zhang et al.
Dense Metric Depth Estimation via Event-based Differential Focus Volume Prompting
Boyu Li, Peiqi Duan, Zhaojun Huang et al.
Compress, Gather, and Recompute: REFORMing Long-Context Processing in Transformers
Woomin Song, Sai Muralidhar Jayanthi, Srikanth Ronanki et al.
Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency
Yikai Wang, Chenjie Cao, Junqiu Yu et al.
Progressive Data Dropout: An Embarrassingly Simple Approach to Train Faster
Shriram M S, Xinyue Hao, Shihao Hou et al.
Learning non-equilibrium diffusions with Schrödinger bridges: from exactly solvable to simulation-free
Stephen Zhang, Michael Stumpf
LT3SD: Latent Trees for 3D Scene Diffusion
Quan Meng, Lei Li, Matthias Nießner et al.
Realistic Test-Time Adaptation of Vision-Language Models
Maxime Zanella, Clément Fuchs, Christophe De Vleeschouwer et al.
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Zengqun Zhao, Ziquan Liu, Yu Cao et al.
On Evaluating Policies for Robust POMDPs
Merlijn Krale, Eline M. Bovy, Maris F. L. Galesloot et al.
Iterative Predictor-Critic Code Decoding for Real-World Image Dehazing
Jiayi Fu, Siyu Liu, Zikun Liu et al.
Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation
Bozhong Zheng, Jinye Gan, Xiaohao Xu et al.