Most Cited 2025 "distribution approximation" Papers
Differentially Private Boxplots
Kelly Ramsay, Jairo Diaz-Rodriguez
ELBOing Stein: Variational Bayes with Stein Mixture Inference
Ola Rønning, Eric Nalisnick, Christophe Ley et al.
COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning
Chamika Sudusinghe, Gerasimos Gerogiannis, Damitha Lenadora et al.
Understanding Sharpness Dynamics in NN Training with a Minimalist Example: The Effects of Dataset Difficulty, Depth, Stochasticity, and More
Geonhui Yoo, Minhak Song, Chulhee Yun
Incremental Gradient Descent with Small Epoch Counts is Surprisingly Slow on Ill-Conditioned Problems
Yujun Kim, Jaeyoung Cha, Chulhee Yun
SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models
Zhenwei Tang, Difan Jiao, Blair Yang et al.
High Dynamic Range Novel View Synthesis with Single Exposure
Kaixuan Zhang, Hu Wang, Minxian Li et al.
Dynamic Syntactic Feature Filtering and Injecting Networks for Cross-lingual Dependency Parsing
Jianjian Liu, Zhengtao Yu, Ying Li et al.
Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences
Shuchen Wu, Mirko Thalmann, Peter Dayan et al.
LAMA-UT: Language Agnostic Multilingual ASR Through Orthography Unification and Language-Specific Transliteration
Sangmin Lee, Woojin Chung, Hong-Goo Kang
InstaTrain: Adaptive Training via Ultra-Fast Natural Annealing within Dynamical Systems
Chuan Liu, Ruibing Song, Chunshu Wu et al.
Uncertainty-Aware Self-Training for CTC-Based Automatic Speech Recognition
Eungbeom Kim, Kyogu Lee
Catoni Contextual Bandits are Robust to Heavy-tailed Rewards
Chenlu Ye, Yujia Jin, Alekh Agarwal et al.
Efficient Multivariate Robust Mean Estimation Under Mean-Shift Contamination
Ilias Diakonikolas, Giannis Iakovidis, Daniel Kane et al.
An Optimistic Algorithm for Online CMDPs with Anytime Adversarial Constraints
Jiahui Zhu, Kihyun Yu, Dabeen Lee et al.
KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors
Benson Chen, Tomasz Danel, Gabriel Dreiman et al.
Deep Submodular Optimization and LLM for Multimodal Content Extraction and Automatic Poster Generation from Long Document
Vijay Jaisankar, Sambaran Bandyopadhyay, Kalp Vyas et al.
Empowering Self-Learning of LLMs: Inner Knowledge Explicitation as a Catalyst
Shijue Huang, Wanjun Zhong, Deng Cai et al.
Logic Induced High-Order Reasoning Network for Event-Event Relation Extraction
Peixin Huang, Xiang Zhao, Minghao Hu et al.
Learning from Noisy Labels via Self-Taught On-the-Fly Meta Loss Rescaling
Michael Heck, Christian Geishauser, Nurul Lubis et al.
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis
Xiangheng He, Junjie Chen, Zixing Zhang et al.
QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration
HamidReza Imani, Jiaxin Peng, Peiman Mohseni et al.
CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder
Jianwei Cui, Yu Gu, Shihao Chen et al.
Small Language Model Makes an Effective Long Text Extractor
Yelin Chen, Fanjin Zhang, Jie Tang
CSL-L2M: Controllable Song-Level Lyric-to-Melody Generation Based on Conditional Transformer with Fine-Grained Lyric and Musical Controls
Li Chai, Donglin Wang
Implicit In-Context Learning: Evidence from Artificial Language Experiments
Xiaomeng Ma, Qihui Xu
On the Learnability of Distribution Classes with Adaptive Adversaries
Tosca Lechner, Alex Bie, Gautam Kamath
SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning
Xu Wan, Chao Yang, Cheng Yang et al.
A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation
Redha Taguelmimt, Samir Aknine, Djamila Boukredera et al.
Towards Attributions of Input Variables in a Coalition
Xinhao Zheng, Huiqi Deng, Quanshi Zhang
Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach
Johan Peralez, Aurélien Delage, Jacopo Castellini et al.
NTK-DFL: Enhancing Decentralized Federated Learning in Heterogeneous Settings via Neural Tangent Kernel
Gabriel Thompson, Kai Yue, Chau-Wai Wong et al.
Unsupervised Translation of Emergent Communication
Ido Levy, Orr Paradise, Boaz Carmeli et al.
Discovering Spoofing Attempts on Language Model Watermarks
Thibaud Gloaguen, Nikola Jovanović, Robin Staab et al.
Craftium: Bridging Flexibility and Efficiency for Rich 3D Single- and Multi-Agent Environments
Mikel Malagón, Josu Ceberio, Jose A Lozano
PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation
Chikai Shang, Mengke Li, Yiqun Zhang et al.
FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed
Jiaqi Zhang, Juntuo Wang, Zhixin Sun et al.
Targeted Forgetting of Image Subgroups in CLIP Models
Zeliang Zhang, Gaowen Liu, Charles Fleming et al.
Resolution of Simpson's paradox via the common cause principle
Arshak Hovhannisyan, Armen Allahverdyan
Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration
Aocheng Li, James R. Zimmer-Dauphinee, Rajesh Kalyanam et al.
BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models
Jianting Tang, Yubo Wang, Haoyu Cao et al.
Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models
Donghoon Ahn, Jiwon Kang, Sanghyun Lee et al.
SpEx: A Spectral Approach to Explainable Clustering
Tal Argov, Tal Wagner
Non-Stationary Lipschitz Bandits
Nicolas Nguyen, Solenne Gaucher, Claire Vernade
ESC: Erasing Space Concept for Knowledge Deletion
Tae-Young Lee, Sundong Park, Minwoo Jeon et al.
MMPB: It’s Time for Multi-Modal Personalization
Jaeik Kim, Woojin Kim, Woohyeon Park et al.
Taxonomy of reduction matrices for Graph Coarsening
Antonin Joly, Nicolas Keriven, Aline Roumy
DONUT: A Decoder-Only Model for Trajectory Prediction
Markus Knoche, Daan de Geus, Bastian Leibe
Flattening Hierarchies with Policy Bootstrapping
John Zhou, Jonathan Kao
FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers
Yanbing Zhang, Zhe Wang, Qin Zhou et al.
Understanding Generalization in Physics Informed Models through Affine Variety Dimensions
Takeshi Koshizuka, Issei Sato
Simulation-Based Inference for Adaptive Experiments
Brian Cho, Aurelien Bibaut, Nathan Kallus
Military AI Needs Technically-Informed Regulation to Safeguard AI Research and its Applications
Riley Simmons-Edler, Jean Dong, Paul Lushenko et al.
CF3: Compact and Fast 3D Feature Fields
Hyunjoon Lee, Joonkyu Min, Jaesik Park
From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers
Praneet Suresh, Jack Stanley, Sonia Joseph et al.
MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Chenglong Wang, Yang Gan, Hang Zhou et al.
Benchmarking Egocentric Visual-Inertial SLAM at City Scale
Anusha Krishnan, Shaohui Liu, Paul-Edouard Sarlin et al.
Fitted Neural Lossless Image Compression
Zhe Zhang, Zhenzhong Chen, Shan Liu
Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval
Zhichuan Wang, Yang Zhou, Zhe Liu et al.
Towards Unsupervised Training of Matching-based Graph Edit Distance Solver via Preference-aware GAN
Wei Huang, Hanchen Wang, Dong Wen et al.
Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules
John Vastola, Samuel J Gershman, Kanaka Rajan
Neural Tangent Knowledge Distillation for Optical Convolutional Networks
Jinlin Xiang, Minho Choi, Yubo Zhang et al.
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering
Zhen Yang, Zhuo Tao, Qi Chen et al.
Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability
Boyong He, Yuxiang Ji, Zhuoyue Tan et al.
Planning and Learning in Average Risk-aware MDPs
Weikai Wang, Erick Delage
Adam Reduces a Unique Form of Sharpness: Theoretical Insights Near the Minimizer Manifold
Xinghan Li, Haodong Wen, Kaifeng Lyu
TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes
Yan Xia, Yunxiang Lu, Rui Song et al.
Active Test-time Vision-Language Navigation
Heeju Ko, Sung June Kim, Gyeongrok Oh et al.
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
Mohan Zhang, Yihua Zhang, Jinghan Jia et al.
LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation
Xinyu Yan, Meijun Sun, Ge-Peng Ji et al.
Grids Often Outperform Implicit Neural Representation at Compressing Dense Signals
Namhoon Kim, Sara Fridovich-Keil
EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization
Yize Wu, Ke Gao, Ling Li et al.
ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods
Michal Kmicikiewicz, Vincent Fortuin, Ewa Szczurek
UniZyme: A Unified Protein Cleavage Site Predictor Enhanced with Enzyme Active-Site Knowledge
Chenao Li, Shuo Yan, Enyan Dai
FaCT: Faithful Concept Traces for Explaining Neural Network Decisions
Amin Parchami-Araghi, Sukrut Rao, Jonas Fischer et al.
Improving Large Vision and Language Models by Learning from a Panel of Peers
Jefferson Hernandez, Jing Shi, Simon Jenni et al.
CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction
Yiyi Liu, Chunyang Liu, Bohan Wang et al.
Learning Generalizable Shape Completion with SIM(3) Equivariance
Yuqing Wang, Zhaiyu Chen, Xiaoxiang Zhu
Sinusoidal Initialization, Time for a New Start
Alberto Fernandez-Hernandez, Jose Mestre, Manuel F. Dolz et al.
Learning Cocoercive Conservative Denoisers via Helmholtz Decomposition for Poisson Imaging Inverse Problems
Deliang Wei, Peng Chen, Haobo Xu et al.
UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation
Jiyu Guo, Shuo Yang, Yiming Huang et al.
Borrowing Eyes for the Blind Spot: Overcoming Data Scarcity in Malicious Video Detection via Cross-Domain Retrieval Augmentation
Rongpei Hong, Jian Lang, Ting Zhong et al.
Pinpointing Attention-Causal Communication in Language Models
Gabriel Franco, Mark Crovella
Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
Sayak Nag, Udita Ghosh, Calvin-Khang Ta et al.
Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning
Jongchan Park, Mingyu Park, Donghwan Lee
Not All Data are Good Labels: On the Self-supervised Labeling for Time Series Forecasting
Yuxuan Yang, Dalin Zhang, Yuxuan Liang et al.
Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining
Ping Guo, Yubing Ren, Binbin Liu et al.
SSTAG: Structure-Aware Self-Supervised Learning Method for Text-Attributed Graphs
Ruyue Liu, Rong Yin, Xiangzhen Bo et al.
MigGPT: Harnessing Large Language Models for Automated Migration of Out-of-Tree Linux Kernel Patches Across Versions
Pucheng Dang, Di Huang, Dong Li et al.
A Real-world Display Inverse Rendering Dataset
Seokjun Choi, Hoon-Gyu Chung, Yujin Jeon et al.
Amortized Variational Transdimensional Inference
Laurence Davies, Daniel MacKinlay, Rafael Oliveira et al.
What’s in Common? Multimodal Models Hallucinate When Reasoning Across Scenes
Candace Ross, Florian Bordes, Adina Williams et al.
Position: Biology is the Challenge Physics-Informed ML Needs to Evolve
Julien Martinelli
Feature-Based Instance Neighbor Discovery: Advanced Stable Test-Time Adaptation in Dynamic World
Qinting Jiang, Chuyang Ye, Dongyan Wei et al.
Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models
Wei Suo, Ji Ma, Mengyang Sun et al.
Evidential Knowledge Distillation
Liangyu Xiang, Junyu Gao, Changsheng Xu
Dynamical modeling of nonlinear latent factors in multiscale neural activity with real-time inference
Eray Erturk, Maryam Shanechi
SpecMER: Fast Protein Generation with K-mer Guided Speculative Decoding
Thomas Walton, Darin Tsui, Aryan Musharaf et al.
Aggregation Hides Out-of-Distribution Generalization Failures from Spurious Correlations
Olawale Salaudeen, Haoran Zhang, Kumail Alhamoud et al.
TEMPO: Temporal Multi-scale Autoregressive Generation of Protein Conformational Ensembles
Yaoyao Xu, Di Wang, Zihan Zhou et al.
Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?
Yuechen Xie, Jie Song, Huiqiong Wang et al.
Towards Fine-grained Interactive Segmentation in Images and Videos
Yuan Yao, Qiushi Yang, Miaomiao Cui et al.
Restoring Pruned Large Language Models via Lost Component Compensation
Zijian Feng, Hanzhang Zhou, Zixiao Zhu et al.
Enhancing LLM Watermark Resilience Against Both Scrubbing and Spoofing Attacks
Huanming Shen, Baizhou Huang, Xiaojun Wan
Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies
Yankai Chen, Xinni Zhang, Yifei Zhang et al.
DyFlow: Dynamic Workflow Framework for Agentic Reasoning
Yanbo Wang, Zixiang Xu, Yue Huang et al.
Position: Benchmarking is Broken - Don't Let AI be Its Own Judge
Zerui Cheng, Stella Wohnig, Ruchika Gupta et al.
PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer
Zhiwei Yang, Chen Gao, Mike Zheng Shou
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Bhishma Dedhia, David Bourgin, Krishna Kumar Singh et al.
RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions
Shihang Du, Sanqing Qu, Tianhang Wang et al.
DrivAerStar: An Industrial-Grade CFD Dataset for Vehicle Aerodynamic Optimization
Jiyan Qiu, Lyulin Kuang, Guan Wang et al.
VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions
Haoang Lu, Yuanqi Su, Xiaoning Zhang et al.
DIO: Decomposable Implicit 4D Occupancy-Flow World Model
Christopher Diehl, Quinlan Sykora, Ben Agro et al.
MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents
Ziming Wei, Bingqian Lin, Zijian Jiao et al.
GlobalTomo: A global dataset for physics-ML seismic wavefield modeling and FWI
Shiqian Li, Zhi Li, Zhancun Mu et al.
Gaussian Splatting Feature Fields for (Privacy-Preserving) Visual Localization
Maxime Pietrantoni, Gabriela Csurka, Torsten Sattler
Accelerating data-driven algorithm selection for combinatorial partitioning problems
Vaggos Chatziafratis, Ishani Karmarkar, Yingxi Li et al.
DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction
Junjie Zhou, Shouju Wang, Yuxia Tang et al.
The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models
Alessandro Serra, Francesco Ortu, Emanuele Panizon et al.
Geminio: Language-Guided Gradient Inversion Attacks in Federated Learning
Junjie Shan, Ziqi Zhao, Jialin Lu et al.
Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation
Nguyen Do, Bach Ngo, Youval Kashuv et al.
TRIM: Scalable 3D Gaussian Diffusion Inference with Temporal and Spatial Trimming
Zeyuan Yin, Xiaoming Liu
Information-Bottleneck Driven Binary Neural Network for Change Detection
Kaijie Yin, Zhiyuan Zhang, Shu Kong et al.
GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection
Wenxue Li, Tian Ye, Xinyu Xiong et al.
Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control
Seongmin Park, Hyungmin Kim, Sangwoo Kim et al.
Value Improved Actor Critic Algorithms
Yaniv Oren, Moritz Zanger, Pascal van der Vaart et al.
SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound
Yunke Ao, Masoud Moghani, Mayank Mittal et al.
Open Ad-hoc Categorization with Contextualized Feature Learning
Zilin Wang, Sangwoo Mo, Stella X. Yu et al.
Stepsize anything: A unified learning rate schedule for budgeted-iteration training
Anda Tang, Yiming Dong, Yutao Zeng et al.
Visual Diversity and Region-aware Prompt Learning for Zero-shot HOI Detection
Chanhyeong Yang, Taehoon Song, Jihwan Park et al.
Benefit From Seen: Enhancing Open-Vocabulary Object Detection by Bridging Visual and Textual Co-Occurrence Knowledge
Yanqi Li, Jianwei Niu, Tao Ren
Understanding and Enhancing Mask-Based Pretraining towards Universal Representations
Mingze Dong, Leda Wang, Yuval Kluger
From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics
Zheng-An Chen, Tao Luo
InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention
Qiang Xiang, Shuang Sun, Binglei Li et al.
Wasserstein Style Distribution Analysis and Transform for Stylized Image Generation
Xi Yu, Xiang Gu, Zhihao Shi et al.
VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization
Xinye Cao, Hongcan Guo, Jiawen Qian et al.
Linearly Constrained Diffusion Implicit Models
Vivek Jayaram, Ira Kemelmacher-Shlizerman, Steve Seitz et al.
IRGPT: Understanding Real-world Infrared Image with Bi-cross-modal Curriculum on Large-scale Benchmark
Zhe Cao, Jin Zhang, Ruiheng Zhang
To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable RL
Yuda Song, Dhruv Rohatgi, Aarti Singh et al.
Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning
Zihao Jing, Yan Sun, Yan Yi Li et al.
Valid Inference with Imperfect Synthetic Data
Yewon Byun, Shantanu Gupta, Zachary Lipton et al.
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
Hyunjun Lee, Hyunsoo Lee, Sookwan Han
Words That Unite The World: A Unified Framework for Deciphering Central Bank Communications
Agam Shah, Siddhant Sukhani, Huzaifa Pardawala et al.
Learning Robust Representations with Long-Term Information for Generalization in Visual Reinforcement Learning
Rui Yang, Jie Wang, Qijie Peng et al.
ModuLM: Enabling Modular and Multimodal Molecular Relational Learning with Large Language Models
Zhuo Chen, Yizhen Zheng, Huan Yee Koh et al.
CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering
Xinyi Zheng, Steve Zhang, Weizhe Lin et al.
Memory-Efficient Generative Models via Product Quantization
Jie Shao, Hanxiao Zhang, Hao Yu et al.
SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting
Shuaiting Li, Juncan Deng, Chengxuan Wang et al.
FlowFeat: Pixel-Dense Embedding of Motion Profiles
Nikita Araslanov, Anna Sonnweber, Daniel Cremers
Ask and Remember: A Questions-Only Replay Strategy for Continual Visual Question Answering
Imad Eddine Marouf, Enzo Tartaglione, Stéphane Lathuilière et al.
VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations
Qianqian Qiao, DanDan Zheng, Yihang Bo et al.
Adapting to Observation Length of Trajectory Prediction via Contrastive Learning
Ruiqi Qiu, Jun Gong, Xinyu Zhang et al.
How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?
Tuan Tran Anh, Duy M. H. Nguyen, Hoai-Chau Tran et al.
Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
Haoran Chen, Ping Wang, Zihan Zhou et al.
Prediction-Powered Semi-Supervised Learning with Online Power Tuning
Noa Shoham, Ron Dorfman, Shalev Shaer et al.
Curious Causality-Seeking Agents Learn Meta Causal World
Zhiyu Zhao, Haoxuan Li, Haifeng Zhang et al.
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference
Kuo Wang, Quanlong Zheng, Junlin Xie et al.
RoFt-Mol: Benchmarking Robust Fine-tuning with Molecular Graph Foundation Models
Shikun Liu, Deyu Zou, Nima Shoghi et al.
Correspondence-Free Fast and Robust Spherical Point Pattern Registration
Anik Sarker, Alan Asbeck
Intrinsic Goals for Autonomous Agents: Model-Based Exploration in Virtual Zebrafish Predicts Ethological Behavior and Whole-Brain Dynamics
Reece Keller, Alyn Kirsch, Felix Pei et al.
Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization
Milad Sefidgaran, Kimia Nadjahi, Abdellatif Zaidi
Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement
Junyu Lou, Xiaorui Zhao, Kexuan Shi et al.
Instance-wise Supervision-level Optimization in Active Learning
Shinnosuke Matsuo, Riku Togashi, Ryoma Bise et al.
Reducing Class-wise Confusion for Incremental Learning with Disentangled Manifolds
Huitong Chen, Yu Wang, Yan Fan et al.
Text Embedding Knows How to Quantize Text-Guided Diffusion Models
Hongjae Lee, Myungjun Son, Dongjea Kang et al.
Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape
Ruichen Chen, Keith Mills, Liyao Jiang et al.
Tight Bounds on the Distortion of Randomized and Deterministic Distributed Voting
Mohammad Abam, Davoud Kareshki, Marzieh Nilipour et al.
LiSu: A Dataset and Method for LiDAR Surface Normal Estimation
Dušan Malić, Christian Fruhwirth-Reisinger, Samuel Schulter et al.
RGNMR: A Gauss-Newton method for robust matrix completion with theoretical guarantees
Eilon Vaknin Laufer, Boaz Nadler
DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images
Kazuma Nagata, Naoshi Kaneko
ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation
Tao Tan, Qiulei Dong
RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis
Yang Songxiao, Haolin Wang, Yao Fu et al.
Measuring the Impact of Rotation Equivariance on Aerial Object Detection
Xiuyu Wu, Xinhao Wang, Xiubin Zhu et al.
Injecting Frame-Event Complementary Fusion into Diffusion for Optical Flow in Challenging Scenes
Haonan Wang, Hanyu Zhou, Haoyue Liu et al.
MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World
Ankit Dhiman, Manan Shah, R. Venkatesh Babu
VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs
Shmuel Berman, Jia Deng
DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding
Xiaoyi Bao, Chen-Wei Xie, Hao Tang et al.
Seeing the Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation
Peng Ren, Tian Bai, Jing Sun et al.
Decoupled Subgraph Federated Learning
Javad Aliakbari, Johan Östman, Alexandre Graell i Amat
Improved Diffusion-based Generative Model with Better Adversarial Robustness
Zekun Wang, Mingyang Yi, Shuchen Xue et al.
DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation
Xiaoliang Ju, Hongsheng Li
GeoDepth: From Point-to-Depth to Plane-to-Depth Modeling for Self-Supervised Monocular Depth Estimation
Haifeng Wu, Shuhang Gu, Lixin Duan et al.
On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization
Jincheng Cao, Ruichen Jiang, Erfan Yazdandoost Hamedani et al.
SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors
Yufan Wu, Xuanhong Chen, Wen Li et al.
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data
Rui Miao, Babak Shahbaba, Annie Qu
Active Event-based Stereo Vision
Jianing Li, Yunjian Zhang, Haiqian Han et al.
LiFT: Learning to Fine-Tune via Bayesian Parameter Efficient Meta Fine-Tuning
Minyoung Kim, Timothy Hospedales
Visual Structures Help Visual Reasoning: Addressing the Binding Problem in LVLMs
Amirmohammad Izadi, Mohammadali Banayeeanzade, Fatemeh Askari et al.
Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency
Jun Yang, Cheng-Chi Wang, Bogdan Stoica et al.
Continuity and Isolation Lead to Doubts or Dilemmas in Large Language Models
Hector Pasten, Felipe Urrutia, Hector Orellana et al.
MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects
Kevin Zhang, Jia-Bin Huang, Jose Echevarria et al.
Fast exact recovery of noisy matrix from few entries: the infinity norm approach
BaoLinh Tran, Van Vu
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
Jiacheng Chen, Ziyu Jiang, Mingfu Liang et al.
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
Jialong Zuo, Yongtai Deng, Mengdan Tan et al.
DICE: Staleness-Centric Optimizations for Parallel Diffusion MoE Inference
Jiajun Luo, Lizhuo Luo, Jianru Xu et al.
RayGaussX: Accelerating Gaussian-Based Ray Marching for Real-Time and High-Quality Novel View Synthesis
Hugo Blanc, Jean-Emmanuel Deschaud, Alexis Paljic
BlinkTrack: Feature Tracking over 80 FPS via Events and Images
Yichen Shen, Yijin Li, Shuo Chen et al.
ZeCO: Zero-Communication Overhead Sequence Parallelism for Linear Attention
Yuhong Chou, Zehao Liu, Rui-Jie Zhu et al.
Towards Learning High-Precision Least Squares Algorithms with Sequence Models
Jerry Liu, Jessica Grogan, Owen Dugan et al.
C-LoRA: Contextual Low-Rank Adaptation for Uncertainty Estimation in Large Language Models
Amir Hossein Rahmati, Sanket Jantre, Weifeng Zhang et al.