Most Cited 2025 "squared error loss" Papers
22,274 papers found • Page 62 of 112
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models
Jinhui Yi, Syed Talal Wasim, Yanan Luo et al.
Beyond Circuit Connections: A Non-Message Passing Graph Transformer Approach for Quantum Error Mitigation
Tianyi Bao, Xinyu Ye, Hang Ruan et al.
Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning
Kunyu Wang, Xueyang Fu, Xin Lu et al.
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Adaptation
Arnav Mohanty Das, Gantavya Bhatt, Lilly Kumari et al.
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
Edson Araujo, Andrew Rouditchenko, Yuan Gong et al.
Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis
Hua Yu, Weiming Liu, Gui Xu et al.
Dense Match Summarization for Faster Two-view Estimation
Jonathan Astermark, Anders Heyden, Viktor Larsson
VEU-Bench: Towards Comprehensive Understanding of Video Editing
Bozheng Li, Yongliang Wu, Yi Lu et al.
SEC-Prompt: SEmantic Complementary Prompting for Few-Shot Class-Incremental Learning
Ye Liu, Meng Yang
Learning from Imperfect Human Feedback: A Tale from Corruption-Robust Dueling
Yuwei Cheng, Fan Yao, Xuefeng Liu et al.
Swing-by Dynamics in Concept Learning and Compositional Generalization
Yongyi Yang, Core Francisco Park, Ekdeep Singh Lubana et al.
A Differentiable Rank-Based Objective for Better Feature Learning
Krunoslav Lehman Pavasovic, Giulio Biroli, Levent Sagun
Segment Anything, Even Occluded
Wei-En Tai, Yu-Lin Shih, Cheng Sun et al.
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Yuwei Luo, Mohsen Bayati
Graph Neural Networks Can (Often) Count Substructures
Paolo Pellizzoni, Till Schulz, Karsten Borgwardt
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation
Diljeet Jagpal, Xi Chen, Vinay P. Namboodiri
MAGNet: Motif-Agnostic Generation of Molecules from Scaffolds
Leon Hetzel, Johanna Sommer, Bastian Rieck et al.
MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing
Feifei Shao, Ping Liu, Zhao Wang et al.
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Yingmao Miao, Zhanpeng Huang, Rui Han et al.
A Multiscale Frequency Domain Causal Framework for Enhanced Pathological Analysis
Xiaoyu Cui, Weixing Chen, Jiandong Su
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting
Hengyu Liu, Yuehao Wang, Chenxin Li et al.
L-SWAG: Layer-Sample Wise Activation with Gradients Information for Zero-Shot NAS on Vision Transformers
Sofia Casarin, Sergio Escalera, Oswald Lanz
Durable Quantization Conditioned Misalignment Attack on Large Language Models
Peiran Dong, Haowei Li, Song Guo
Ferret: An Efficient Online Continual Learning Framework under Varying Memory Constraints
Yuhao Zhou, Yuxin Tian, Jindi Lv et al.
Where's the Liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content
Haoyue Bai, Yiyou Sun, Wei Cheng et al.
Incomplete Multi-modal Brain Tumor Segmentation via Learnable Sorting State Space Model
Zheyu Zhang, Yayuan Lu, Feipeng Ma et al.
Residual Deep Gaussian Processes on Manifolds
Kacper Wyrwal, Andreas Krause, Viacheslav (Slava) Borovitskiy
Bayesian Analysis of Combinatorial Gaussian Process Bandits
Jack Sandberg, Niklas Åkerblom, Morteza Haghir Chehreghani
LLM-driven Multimodal and Multi-Identity Listening Head Generation
Peiwen Lai, Weizhi Zhong, Yipeng Qin et al.
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance
Qijun Gan, Song Wang, Shengtao Wu et al.
FSBench: A Figure Skating Benchmark for Advancing Artistic Sports Understanding
Rong Gao, Xin Liu, Zhuozhao Hu et al.
DiN: Diffusion Model for Robust Medical VQA with Semantic Noisy Labels
Erjian Guo, Zhen Zhao, Zicheng Wang et al.
V2V3D: View-to-View Denoised 3D Reconstruction for Light Field Microscopy
Jiayin Zhao, Zhenqi Fu, Tao Yu et al.
Handling Spatial-Temporal Data Heterogeneity for Federated Continual Learning via Tail Anchor
Hao Yu, Xin Yang, Le Zhang et al.
Infighting in the Dark: Multi-Label Backdoor Attack in Federated Learning
Ye Li, Yanchao Zhao, Chengcheng Zhu et al.
SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis
Bangbang Zhou, Zuan Gao, Zixiao Wang et al.
Beyond Human Perception: Understanding Multi-Object World from Monocular View
Keyu Guo, Yongle Huang, Shijie Sun et al.
Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models
Jungwon Park, Jungmin Ko, Dongnam Byun et al.
Inner Information Analysis Algorithm for Deep Neural Network based on Community
Guipeng Lan, Shuai Xiao, Meng Xi et al.
BOE-ViT: Boosting Orientation Estimation with Equivariance in Self-Supervised 3D Subtomogram Alignment
Runmin Jiang, Jackson Daggett, Shriya Pingulkar et al.
Optimal Transport-Guided Source-Free Adaptation for Face Anti-Spoofing
Zhuowei Li, Tianchen Zhao, Xiang Xu et al.
MDP: Multidimensional Vision Model Pruning with Latency Constraint
Xinglong Sun, Barath Lakshmanan, Maying Shen et al.
Hyperbolic Uncertainty-Aware Few-Shot Incremental Point Cloud Segmentation
Tanuj Sur, Samrat Mukherjee, Kaizer Rahaman et al.
Prevalence of Negative Transfer in Continual Reinforcement Learning: Analyses and a Simple Baseline
Hongjoon Ahn, Jinu Hyeon, Youngmin Oh et al.
Isometric Regularization for Manifolds of Functional Data
Hyeongjun Heo, Seonghun Oh, JaeYong Lee et al.
Perceptual Video Compression with Neural Wrapping
Muhammad Umar Karim Khan, Aaron Chadha, Mohammad Ashraful Anam et al.
Privacy-Aware Lifelong Learning
Ozan Özdenizci, Elmar Rueckert, Robert Legenstein
Test-time Adaptation for Image Compression with Distribution Regularization
Kecheng Chen, Pingping Zhang, Tiexin Qin et al.
Parameter Expanded Stochastic Gradient Markov Chain Monte Carlo
Hyunsu Kim, Giung Nam, Chulhee Yun et al.
MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis
Tianyu Wang, Jianming Zhang, Haitian Zheng et al.
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae, Junwoo Ha, Ha Young Kim
Self-Evolving Visual Concept Library using Vision-Language Critics
Atharva Sehgal, Patrick Yuan, Ziniu Hu et al.
Deep Fair Multi-View Clustering with Attention KAN
HaiMing Xu, Qianqian Wang, Boyue Wang et al.
Distinguish Then Exploit: Source-free Open Set Domain Adaptation via Weight Barcode Estimation and Sparse Label Assignment
Weiming Liu, Jun Dan, Fan Wang et al.
Test-Time Adaptation for Combating Missing Modalities in Egocentric Videos
Merey Ramazanova, Alejandro Pardo, Bernard Ghanem et al.
QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock, Timo Kaiser, Sovan Biswas et al.
One-Step Event-Driven High-Speed Autofocus
Yuhan Bao, Shaohua Gao, Wenyong Li et al.
Condensing Action Segmentation Datasets via Generative Network Inversion
Guodong Ding, Rongyu Chen, Angela Yao
Overcoming Shortcut Problem in VLM for Robust Out-of-Distribution Detection
Zhuo Xu, Xiang Xiang, Yifan Liang
T2V-Turbo-v2: Enhancing Video Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li, Qian Long, Jian (Skyler) Zheng et al.
Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries
Wei Xu, Charlie Wagner, Junjie Luo et al.
Color Alignment in Diffusion
Ka Chun Shum, Binh-Son Hua, Thanh Nguyen et al.
Harnessing Frozen Unimodal Encoders for Flexible Multimodal Alignment
Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali et al.
When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach
Vaibhav Rathore, Shubhranil B, Saikat Dutta et al.
Towards Faster Decentralized Stochastic Optimization with Communication Compression
Rustem Islamov, Yuan Gao, Sebastian Stich
Few-shot Personalized Scanpath Prediction
Ruoyu Xue, Jingyi Xu, Sounak Mondal et al.
COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation
Fanding Huang, Jingyan Jiang, Qinting Jiang et al.
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao, Sid Kiblawi, Mu Wei et al.
Learning 3D Perception from Others' Predictions
Jinsu Yoo, Zhenyang Feng, Tai-Yu Pan et al.
Dynamic Neural Surfaces for Elastic 4D Shape Representation and Analysis
Awais Nizamani, Hamid Laga, Guanjin Wang et al.
Protecting against simultaneous data poisoning attacks
Neel Alex, Muhammad Shoaib Ahmed Siddiqui, Amartya Sanyal et al.
Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning
Lei-Lei Ma, Shuo Xu, Ming-Kun Xie et al.
Diff-Palm: Realistic Palmprint Generation with Polynomial Creases and Intra-Class Variation Controllable Diffusion Models
Jianlong Jin, Chenglong Zhao, Ruixin Zhang et al.
PIDSR: Complementary Polarized Image Demosaicing and Super-Resolution
Shuangfan Zhou, Chu Zhou, Youwei Lyu et al.
Debiasing Mini-Batch Quadratics for Applications in Deep Learning
Lukas Nicola Tatzel, Bálint Mucsányi, Osane Hackel et al.
A Theory of Initialisation's Impact on Specialisation
Devon Jarvis, Sebastian Lee, Clementine Domine et al.
L-WISE: Boosting Human Visual Category Learning Through Model-Based Image Selection and Enhancement
Morgan B Talbot, Gabriel Kreiman, James DiCarlo et al.
CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation
Yuxing Long, Jiyao Zhang, Mingjie Pan et al.
Privately Counting Partially Ordered Data
Matthew Joseph, Mónica Ribero, Alexander Yu
Controllable Human Image Generation with Personalized Multi-Garments
Yisol Choi, Sangkyung Kwak, Sihyun Yu et al.
DeepCompress-ViT: Rethinking Model Compression to Enhance Efficiency of Vision Transformers at the Edge
Sabbir Ahmed, Abdullah Al Arafat, Deniz Najafi et al.
ProHOC: Probabilistic Hierarchical Out-of-Distribution Classification via Multi-Depth Networks
Erik Wallin, Fredrik Kahl, Lars Hammarstrand
Provable unlearning in topic modeling and downstream tasks
Stanley Wei, Sadhika Malladi, Sanjeev Arora et al.
From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models
German Barquero, Nadine Bertsch, Manojkumar Marramreddy et al.
OptionZero: Planning with Learned Options
Po-Wei Huang, Pei-Chiun Peng, Hung Guei et al.
Discovering Group Structures via Unitary Representation Learning
Dongsung Huh
Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation
Tanner Schmidt, Richard Newcombe
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Wei Huang, Qinying Gu, Nanyang Ye
A Non-Contrastive Learning Framework for Sequential Recommendation with Preference-Preserving Profile Generation
Huimin Zeng, Xiaojie Wang, Anoop Jain et al.
FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation
Kefan Chen, Chaerin Min, Linguang Zhang et al.
Rethinking Correspondence-based Category-Level Object Pose Estimation
Huan Ren, Wenfei Yang, Shifeng Zhang et al.
DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders
Sizai Hou, Songze Li, Duanyi Yao
Lambda-Skip Connections: the architectural component that prevents Rank Collapse
Federico Arangath Joseph, Jerome Sieber, Melanie Zeilinger et al.
GenVDM: Generating Vector Displacement Maps From a Single Image
Yuezhi Yang, Qimin Chen, Vladimir G. Kim et al.
Controlled LLM Decoding via Discrete Auto-regressive Biasing
Patrick Pynadath, Ruqi Zhang
SoundVista: Novel-View Ambient Sound Synthesis via Visual-Acoustic Binding
Mingfei Chen, Israel D. Gebru, Ishwarya Ananthabhotla et al.
A Unified, Resilient, and Explainable Adversarial Patch Detector
Vishesh Kumar, Akshay Agarwal
AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements
Adriana-Eufrosina Bora, Pierre-Luc St-Charles, Mirko Bronzi et al.
BaB-ND: Long-Horizon Motion Planning with Branch-and-Bound and Neural Dynamics
Keyi Shen, Jiangwei Yu, Jose Barreiros et al.
The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG to Noisier EEG
Francesco Carzaniga, Gary Hoppeler, Michael Hersche et al.
3D-Spatial Multimodal Memory
Xueyan Zou, Yuchen Song, Ri-Zhao Qiu et al.
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations
Yupei Yang, Biwei Huang, Fan Feng et al.
CPSample: Classifier Protected Sampling for Guarding Training Data During Diffusion
Joshua Kazdan, Hao Sun, Jiaqi Han et al.
SACB-Net: Spatial-awareness Convolutions for Medical Image Registration
Xinxing Cheng, Tianyang Zhang, Wenqi Lu et al.
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide
Dohun Lee, Bryan Sangwoo Kim, Geon Yeong Park et al.
Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood
Qingmao Yao, Zhichao Lei, Tianyuan Chen et al.
Approximating Full Conformal Prediction for Neural Network Regression with Gauss-Newton Influence
Dharmesh Tailor, Alvaro Correia, Eric Nalisnick et al.
EBS-EKF: Accurate and High Frequency Event-based Star Tracking
Albert Reed, Connor Hashemi, Dennis Melamed et al.
HOTFormerLoc: Hierarchical Octree Transformer for Versatile Lidar Place Recognition Across Ground and Aerial Views
Ethan Griffiths, Maryam Haghighat, Simon Denman et al.
Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields
Navami Kairanda, Marc Habermann, Shanthika Shankar Naik et al.
Learning on One Mode: Addressing Multi-modality in Offline Reinforcement Learning
Mianchu Wang, Yue Jin, Giovanni Montana
Kaputt: A Large-Scale Dataset for Visual Defect Detection
Sebastian Höfer, Dorian Henning, Artemij Amiranashvili et al.
VTimeCoT: Thinking by Drawing for Video Temporal Grounding and Reasoning
Jinglei Zhang, Yuanfan Guo, Rolandos Alexandros Potamias et al.
SG-LDM: Semantic-Guided LiDAR Generation via Latent-Aligned Diffusion
Zhengkang Xiang, Zizhao Li, Amir Khodabandeh et al.
LookOut: Real-World Humanoid Egocentric Navigation
Boxiao Pan, Adam Harley, Francis Engelmann et al.
GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion
Li-Heng Chen, Zi-Xin Zou, Chang Liu et al.
Global Regulation and Excitation via Attention Tuning for Stereo Matching
Jiahao Li, Xinhong Chen, Zhengmin Jiang et al.
Object-level Correlation for Few-Shot Segmentation
Chunlin Wen, Yu Zhang, Jie Fan et al.
SketchSplat: 3D Edge Reconstruction via Differentiable Multi-view Sketch Splatting
Haiyang Ying, Matthias Zwicker
Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography
Jianing Zhang, Jiayi Zhu, Feiyu Ji et al.
Free-running vs Synchronous: Single-Photon Lidar for High-flux 3D Imaging
Ruangrawee Kitichotkul, Shashwath Bharadwaj, Joshua Rapp et al.
Noise2Score3D: Tweedie's Approach for Unsupervised Point Cloud Denoising
Xiangbin Wei, Yuanfeng Wang, Ao Xu et al.
Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation
Sebastian Schmidt, Julius Koerner, Dominik Fuchsgruber et al.
CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images
Jungho Lee, DongHyeong Kim, Dogyoon Lee et al.
Towards Open-World Generation of Stereo Images and Unsupervised Matching
Feng Qiao, Zhexiao Xiong, Eric Xing et al.
Bridging 3D Anomaly Localization and Repair via High-Quality Continuous Geometric Representation
Bozhong Zheng, Jinye Gan, Xiaohao Xu et al.
Leveraging Local Patch Alignment to Seam-cutting for Large Parallax Image Stitching
Tianli Liao, Chenyang Zhao, Lei Li et al.
Cross-Architecture Distillation Made Simple with Redundancy Suppression
Weijia Zhang, Yuehao Liu, Wu Ran et al.
MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions
Qingyuan Zhou, Yuehu Gong, Weidong Yang et al.
StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions
Bo-Hsu Ke, You-Zhe Xie, Yu-Lun Liu et al.
Continual Multiple Instance Learning with Enhanced Localization for Histopathological Whole Slide Image Analysis
Byung Hyun Lee, Wongi Jeong, Woojae Han et al.
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
Zhixuan Liu, Haokun Zhu, Rui Chen et al.
Boosting Vision Semantic Density with Anatomy Normality Modeling for Medical Vision-language Pre-training
Weiwei Cao, Jianpeng Zhang, Zhongyi Shui et al.
Hybrid Concept Bottleneck Models
Yang Liu, Tianwei Zhang, Shi Gu
Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model
Daehee Park, Monu Surana, Pranav Desai et al.
2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update
Jeongyun Kim, Seunghoon Jeong, Giseop Kim et al.
ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis
Onkar Susladkar, Gayatri Deshmukh, Yalcin Tur et al.
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
Jiajin Tang, Zhengxuan Wei, Yuchen Zhu et al.
Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs
Bhavya Goyal, Felipe Gutierrez-Barragan, Wei Lin et al.
Representing 3D Shapes With 64 Latent Vectors for 3D Diffusion Models
In Cho, Youngbeom Yoo, Subin Jeon et al.
LINR-PCGC: Lossless Implicit Neural Representations for Point Cloud Geometry Compression
Wenjie Huang, Qi Yang, Shuting Xia et al.
Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views
Xiangdong Zhang, Shaofeng Zhang, Junchi Yan
Demeter: A Parametric Model of Crop Plant Morphology from the Real World
Tianhang Cheng, Albert Zhai, Evan Chen et al.
Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration
Katie Luo, Minh-Quan Dao, Zhenzhen Liu et al.
HUG: Hierarchical Urban Gaussian Splatting with Block-Based Reconstruction for Large-Scale Aerial Scenes
Mai Su, Zhongtao Wang, Huishan Au et al.
Understanding Co-speech Gestures in-the-wild
Sindhu Hegde, K R Prajwal, Taein Kwon et al.
DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior
Junzhe Lu, Jing Lin, Hongkun Dou et al.
Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation
Shuchang Ye, Usman Naseem, Mingyuan Meng et al.
RePoseD: Efficient Relative Pose Estimation With Known Depth Information
Yaqing Ding, Viktor Kocur, Vaclav Vavra et al.
Advancing Visual Large Language Model for Multi-granular Versatile Perception
Wentao Xiang, Haoxian Tan, Cong Wei et al.
Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection
Ji Du, Xin Wang, Fangwei Hao et al.
Everything is a Video: Unifying Modalities through Next-Frame Prediction
G Thomas Hudson, Dean Slack, Thomas Winterbottom et al.
Multi-Modal Few-Shot Temporal Action Segmentation
Zijia Lu, Ehsan Elhamifar
Topology-Aware Dynamic Reweighting for Distribution Shifts on Graph
Weihuang Zheng, Jiashuo Liu, Jiaxing Li et al.
Fast unsupervised ground metric learning with tree-Wasserstein distance
Kira Michaela Düsterwald, Samo Hromadka, Makoto Yamada
Learning system dynamics without forgetting
Xikun Zhang, Dongjin Song, Yushan Jiang et al.
Gaussian Mixture Counterfactual Generator
Jong-Hoon Ahn, Akshay Vashist
NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions
Tue Cao, Nhat Hoang-Xuan, Hieu Pham et al.
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich, Yumin Suh, Samuel Schulter et al.
Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation
Hiroshi Takahashi, Tomoharu Iwata, Atsutoshi Kumagai et al.
Fine-tuning with Reserved Majority for Noise Reduction
Shuyang Jiang, Yusheng Liao, Ya Zhang et al.
Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection
Yuguang Yang, Tongfei Chen, Haoyu Huang et al.
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration
Heyang Zhao, Xingrui Yu, David Bossens et al.
ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences
Yuxin Wang, Xiaomeng Zhu, Weimin Lyu et al.
Enhancing Document Understanding with Group Position Embedding: A Novel Approach to Incorporate Layout Information
Yuke Zhu, Yue Zhang, Dongdong Liu et al.
Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency
Qixin Zhang, Zongqi Wan, Yu Yang et al.
A Conditional Independence Test in the Presence of Discretization
Boyang Sun, Yu Yao, Guang-Yuan Hao et al.
Prototype antithesis for biological few-shot class-incremental learning
Binghao Liu, Han Yang, Fang Wan et al.
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
Guojun Xiong, Ujwal Dinesha, Debajoy Mukherjee et al.
Language-Assisted Feature Transformation for Anomaly Detection
EungGu Yun, Heonjin Ha, Yeongwoo Nam et al.
DS-LLM: Leveraging Dynamical Systems to Enhance Both Training and Inference of Large Language Models
Ruibing Song, Chuan Liu, Chunshu Wu et al.
FlashRNN: I/O-Aware Optimization of Traditional RNNs on modern hardware
Korbinian Pöppel, Maximilian Beck, Sepp Hochreiter
UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition
Xiao Lin, Yuge Huang, Jianqing Xu et al.
Optimizing importance weighting in the presence of sub-population shifts
Floris Holstege, Bram Wouters, Noud Giersbergen et al.
Multimodal Lego: Model Merging and Fine-Tuning Across Topologies and Modalities in Biomedicine
Konstantin Hemker, Nikola Simidjievski, Mateja Jamnik
Implicit Bias of Mirror Flow for Shallow Neural Networks in Univariate Regression
Shuang Liang, Guido Montufar
Algorithmic Stability Based Generalization Bounds for Adversarial Training
Runzhi Tian, Yongyi Mao
A Generalist Hanabi Agent
Arjun V Sudhakar, Hadi Nekoei, Mathieu Reymond et al.
Matrix Product Sketching via Coordinated Sampling
Majid Daliri, Juliana Freire, Danrong Li et al.
Learning Spatiotemporal Dynamical Systems from Point Process Observations
Valerii Iakovlev, Harri Lähdesmäki
TimeInf: Time Series Data Contribution via Influence Functions
Yizi Zhang, Jingyan Shen, Xiaoxue Xiong et al.
Efficient Biological Data Acquisition through Inference Set Design
Ihor Neporozhnii, Julien Roy, Emmanuel Bengio et al.
KinPFN: Bayesian Approximation of RNA Folding Kinetics using Prior-Data Fitted Networks
Dominik Scheuer, Frederic Runge, Jörg Franke et al.
Spectro-Riemannian Graph Neural Networks
Karish Grover, Haiyang Yu, Xiang Song et al.
Efficient Sparse PCA via Block-Diagonalization
Alberto Del Pia, Dekun Zhou, Yinglun Zhu
Minimal Impact ControlNet: Advancing Multi-ControlNet Integration
Shikun Sun, Min Zhou, Zixuan Wang et al.
Interference Among First-Price Pacing Equilibria: A Bias and Variance Analysis
Luofeng Liao, Christian Kroer, Sergei Leonenkov et al.
Bonsai: Gradient-free Graph Condensation for Node Classification
Mridul Gupta, Samyak Jain, Vansh Ramani et al.
Differentiable Causal Discovery for Latent Hierarchical Causal Models
Parjanya Prashant, Ignavier Ng, Kun Zhang et al.
Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions
Jinyoung Choi, Junoh Kang, Bohyung Han
CarbonSense: A Multimodal Dataset and Baseline for Carbon Flux Modelling
Matthew Fortier, Mats L. Richter, Oliver Sonnentag et al.
Learned Reference-based Diffusion Sampler for multi-modal distributions
Maxence Noble, Louis Grenioux, Marylou Gabrié et al.
Equivariant Denoisers Cannot Copy Graphs: Align Your Graph Diffusion Models
Najwa Laabid, Severi Rissanen, Markus Heinonen et al.
Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex
Jea Kwon, Sungjun Lim, Kyungwoo Song et al.
Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectives
Marcel Hirt, Domenico Campolo, Victoria Leong et al.
LevAttention: Time, Space and Streaming Efficient Algorithm for Heavy Attentions
Ravindran Kannan, Chiranjib Bhattacharyya, Praneeth Kacham et al.
Long-time asymptotics of noisy SVGD outside the population limit
Victor Priser, Pascal Bianchi, Adil Salim
Mining your own secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Saurav Jha, Shiqi Yang, Masato Ishii et al.
InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma
Xiaoxuan Hou, Jiayi Yuan, Joel Z Leibo et al.
Solving Differential Equations with Constrained Learning
Viggo Moro, Luiz Chamon