Most Cited 2025 "masked autoencoder paradigm" Papers
CaMiT: A Time-Aware Car Model Dataset for Classification and Generation
Frédéric Lin, Biruk Abere Ambaw, Adrian Popescu et al.
Toward Real-world Text Image Forgery Localization: Structured and Interpretable Data Synthesis
Zeqin Yu, Haotao Xie, Jian Zhang et al.
Massive Sound Embedding Benchmark (MSEB)
Georg Heigold, Ehsan Variani, Tom Bagby et al.
OpenLex3D: A Tiered Benchmark for Open-Vocabulary 3D Scene Representations
MONITRS: Multimodal Observations of Natural Incidents Through Remote Sensing
Shreelekha Revankar, Utkarsh Mall, Cheng Perng Phoo et al.
BADGR: Bundle Adjustment Diffusion Conditioned by Gradients for Wide-Baseline Floor Plan Reconstruction
Yuguang Li, Ivaylo Boyadzhiev, Zixuan Liu et al.
NS-Gym: A Comprehensive and Open-Source Simulation Framework for Non-Stationary Markov Decision Processes
Nathaniel S. Keplinger, Baiting Luo, Yunuo Zhang et al.
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction
Yuan Wang, Yali Li, Lixiang Li et al.
EVAAA: A Virtual Environment Platform for Essential Variables in Autonomous and Adaptive Agents
Sungwoo Lee, Jungmin Lee, Sohee Kim et al.
DermaCon-IN: A Multiconcept-Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research
Shanawaj Sahebpatel Madarkar, Mahajabeen Madarkar, Madhumitha Venkatesh et al.
TaiwanVQA: Benchmarking and Enhancing Cultural Understanding in Vision-Language Models
Hsin Yi Hsieh, Shang-Wei Liu, Chang-Chih Meng et al.
DGCBench: A Deep Graph Clustering Benchmark
Benyu Wu, Yue Liu, Qiaoyu Tan et al.
Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control
Basim Azam, Naveed Akhtar
TIDMAD: Time Series Dataset for Discovering Dark Matter with AI Denoising
Jessica Fry, Xinyi Fu, Zhenghao Fu et al.
AGI-Elo: How Far Are We From Mastering A Task?
Shuo Sun, Yimin Zhao, Christina Lee et al.
Semantic-KG: Using Knowledge Graphs to Construct Benchmarks for Measuring Semantic Similarity
Qiyao Wei, Edward R Morrell, Lea Goetz et al.
OffsetOPT: Explicit Surface Reconstruction without Normals
Huan Lei
Online Task-Free Continual Learning via Dynamic Expansionable Memory Distribution
Fei Ye, Adrian Bors
COGNAC: Cooperative Graph-based Networked Agent Challenges for Multi-Agent Reinforcement Learning
Jules Sintes, Ana Busic
ClinBench: A Standardized Multi-Domain Framework for Evaluating Large Language Models in Clinical Information Extraction
Ismael Villanueva Miranda, Zifan Gu, Donghan Yang et al.
Active Hyperspectral Imaging Using an Event Camera
Bohan Yu, Jinxiu Liang, Zhuofeng Wang et al.
A Temporal Difference Method for Stochastic Continuous Dynamics
Haruki Settai, Naoya Takeishi, Takehisa Yairi
Torch-Uncertainty: Deep Learning Uncertainty Quantification
Adrien Lafage, Olivier Laurent, Firas Gabetni et al.
FEEL: Quantifying Heterogeneity in Physiological Signals for Generalizable Emotion Recognition
Pragya Singh, Ankush Gupta, Somay Jalan et al.
NoBOOM: Chemical Process Datasets for Industrial Anomaly Detection
Dennis Wagner, Fabian Hartung, Justus Arweiler et al.
Contrastive Learning with Data Misalignment: Feature Purity, Training Dynamics and Theoretical Generalization Guarantees
Jiawei Sun, Shuai Zhang, Hongkang Li et al.
STARC-9: A Large-scale Dataset for Multi-Class Tissue Classification for CRC Histopathology
Barathi Subramanian, Rathinaraja Jeyaraj, Mitchell Peterson et al.
Quantifying Generalisation in Imitation Learning
Nathan Gavenski, Odinaldo Rodrigues
Automated Proof of Polynomial Inequalities via Reinforcement Learning
Banglong Liu, Niuniu Qi, Xia Zeng et al.
TAPVid-360: Tracking Any Point in 360 from Narrow Field of View Video
Finlay Hudson, James Gardner, William Smith
FLiP: Towards Comprehensive and Reliable Evaluation of Federated Prompt Learning
Dongping Liao, Xitong Gao, Cheng-Zhong Xu
Easy-editable Image Vectorization with Multi-layer Multi-scale Distributed Visual Feature Embedding
Ye Chen, Zhangli Hu, Zhongyin Zhao et al.
MMOT: The First Challenging Benchmark for Drone-based Multispectral Multi-Object Tracking
Tianhao Li, Tingfa Xu, Ying Wang et al.
Learning Theory for Kernel Bilevel Optimization
Fares El Khoury, Edouard Pauwels, Samuel Vaiter et al.
ReinAD: Towards Real-world Industrial Anomaly Detection with a Comprehensive Contrastive Dataset
Xu Wang, Jingyuan Zhuo, Zhiyuan You et al.
LexiCon: a Benchmark for Planning under Temporal Constraints in Natural Language
Periklis Mantenoglou, Rishi Hazra, Pedro Zuidberg Dos Martires et al.
Risk-Averse Total-Reward Reinforcement Learning
Xihong Su, Jia Lin Hau, Gersi Doko et al.
MathArena: Evaluating LLMs on Uncontaminated Math Competitions
Mislav Balunovic, Jasper Dekoninck, Ivo Petrov et al.
Bridging Equivariant GNNs and Spherical CNNs for Structured Physical Domains
Colin Kohler, Purvik Patel, Nathan Vaska et al.
PSI: A Benchmark for Human Interpretation and Response in Traffic Interactions
Taotao Jing, Tina Chen, Renran Tian et al.
ML4CO-Bench-101: Benchmark Machine Learning for Classic Combinatorial Problems on Graphs
Jiale Ma, Wenzheng Pan, Yang Li et al.
Uncertainty-Sensitive Privileged Learning
Fan-Ming Luo, Lei Yuan, Yang Yu
Spik-NeRF: Spiking Neural Networks for Neural Radiance Fields
Gang Wan, Qinlong Lan, Zihan Li et al.
BrainMoE: Cognition Joint Embedding via Mixture-of-Expert Towards Robust Brain Foundation Model
Ziquan Wei, Tingting Dan, Tianlong Chen et al.
Uncover Governing Law of Pathology Propagation Mechanism Through A Mean-Field Game
Tingting Dan, Zhihao Fan, Guorong Wu
Q-PART: Quasi-Periodic Adaptive Regression with Test-time Training for Pediatric Left Ventricular Ejection Fraction Regression
Jie Liu, Tiexin Qin, Hui Liu et al.
Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping
Justin Lazarow, Kai Kang, Afshin Dehghan
RidgeLoRA: Matrix Ridge Enhanced Low-Rank Adaptation of Large Language Models
Junda Zhu, Jun Ai, Yujun Li et al.
Distributional LLM-as-a-Judge
Luyu Chen, Zeyu Zhang, Haoran Tan et al.
Refinement Methods for Distributed Distribution Estimation under $\ell^p$-Losses
Deheng Yuan, Tao Guo, Zhongyi Huang
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Hang Zhou, Xinxin Zuo, Rui Ma et al.
Deep Legendre Transform
Aleksey Minabutdinov, Patrick Cheridito
Dual-Space Semantic Synergy Distillation for Continual Learning of Unlabeled Streams
Donghao Sun, Xi Wang, Xu Yang et al.
Sample-Efficient Multi-Round Generative Data Augmentation for Long-Tail Instance Segmentation
Byunghyun Kim, Minyoung Bae, Jae-Gil Lee
CF-VLM: CounterFactual Vision-Language Fine-tuning
Jusheng Zhang, Kaitong Cai, Yijia Fan et al.
Learning CAD Modeling Sequences via Projection and Part Awareness
Yang Liu, Daxuan Ren, Yijie Ding et al.
Gaze-VLM: Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding
Anupam Pani, Yanchao Yang
SurfelSplat: Learning Efficient and Generalizable Gaussian Surfel Representations for Sparse-View Surface Reconstruction
Chensheng Dai, Shengjun Zhang, Min Chen et al.
Towards a Geometric Understanding of Tensor Learning via the t-Product
Andong Wang, Yuning Qiu, Haonan Huang et al.
Fully Autonomous Neuromorphic Navigation and Dynamic Obstacle Avoidance
Xiaochen Shang, Pengwei Luo, Xinning Wang et al.
DISCO: DISCrete nOise for Conditional Control in Text-to-Image Diffusion Models
Longquan Dai, Wu Ming, Dejiao Xue et al.
AniGrad: Anisotropic Gradient-Adaptive Sampling for 3D Reconstruction From Monocular Video
Noah Stier, Alex Rich, Pradeep Sen et al.
CVGL: Causal Learning and Geometric Topology
Songsong Ouyang, Yingying Zhu
Recurrent Attention-based Token Selection for Efficient Streaming Video-LLMs
Evangelos Dorovatas, Soroush Seifi, Gunshi Gupta et al.
PaceLLM: Brain-Inspired Large Language Models for Long-Context Understanding
Kangcong Li, Peng Ye, Chongjun Tu et al.
RUAGO: Effective and Practical Retain-Free Unlearning via Adversarial Attack and OOD Generator
SangYong Lee, Sangjun Chung, Simon Woo
No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes
Jasmine Bayrooti, Sattar Vakili, Amanda Prorok et al.
PLD: A Choice-Theoretic List-Wise Knowledge Distillation
Ejafa Bassam, Dawei Zhu, Kaigui Bian
Streaming Audio Generation from Discrete Tokens via Streaming Flow Matching
Ha-Yeong Choi, Sang-Hoon Lee
Shadow Generation Using Diffusion Model with Geometry Prior
Haonan Zhao, Qingyang Liu, Xinhao Tao et al.
Flow Field Reconstruction with Sensor Placement Policy Learning
Ruoyan Li, Guancheng Wan, Zijie Huang et al.
UMU-Bench: Closing the Modality Gap in Multimodal Unlearning Evaluation
Chengye Wang, Yuyuan Li, XiaoHua Feng et al.
VLMs-Guided Representation Distillation for Efficient Vision-Based Reinforcement Learning
Haoran Xu, Peixi Peng, Guang Tan et al.
Learning Multi-Source and Robust Representations for Continual Learning
Fei Ye, Yongcheng Zhong, Qihe Liu et al.
DreamTrack: Dreaming the Future for Multimodal Visual Object Tracking
Mingzhe Guo, Weiping Tan, Wenyu Ran et al.
Spectral Estimation with Free Decompression
Siavash Ameli, Chris van der Heide, Liam Hodgkinson et al.
Asymptotically Stable Quaternion-valued Hopfield-structured Neural Network with Periodic Projection-based Supervised Learning Rules
Tianwei Wang, Xinhui Ma, Wei Pang
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR
Xugong Qin, Peng Zhang, Jun Jie Ou Yang et al.
CoUn: Empowering Machine Unlearning via Contrastive Learning
Yasser Khalil, Mehdi Setayesh, Hongliang Li
MODfinity: Unsupervised Domain Adaptation with Multimodal Information Flow Intertwining
Shanglin Liu, Jianming Lv, Jingdan Kang et al.
Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning
Jeongryong Lee, Yejee Shin, Geonhui Son et al.
Understanding outer learning rates in Local SGD
Ahmed Khaled, Satyen Kale, Arthur Douillard et al.
Path-specific effects for pulse-oximetry guided decisions in critical care
Kevin Zhang, Yonghan Jung, Divyat Mahajan et al.
CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder
Yongmin Lee, Hye Won Chung
Direct Fisher Score Estimation for Likelihood Maximization
Sherman Khoo, Yakun Wang, Song Liu et al.
Denoising Trajectory Biases for Zero-Shot AI-Generated Image Detection
Yachao Liang, Min Yu, Gang Li et al.
Bridging Scales: Spectral Theory Reveals How Local Connectivity Rules Sculpt Global Neural Dynamics in Spatially Extended Networks
Yuhan Huang, Keren Gao, Dongping Yang et al.
Adversarial Robustness of Nonparametric Regression
Parsa Moradi, Hanzaleh Nodehi, Mohammad Maddah-Ali
Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning
Haolei Xu, Yuchen Yan, Yongliang Shen et al.
Prediction with expert advice under additive noise
Alankrita Bhatt, Victoria Kostina
RCCDA: Adaptive Model Updates in the Presence of Concept Drift under a Constrained Resource Budget
Adam Piaseczny, Md Kamran Chowdhury Shisher, Shiqiang Wang et al.
MESC-3D: Mining Effective Semantic Cues for 3D Reconstruction from a Single Image
Shaoming Li, Qing Cai, Songqi KONG et al.
When Data Can't Meet: Estimating Correlation Across Privacy Barriers
Abhinav Chakraborty, Arnab Auddy, T. Tony Cai
ENMA: Tokenwise Autoregression for Continuous Neural PDE Operators
Armand Kassaï Koupaï, Lise Le Boudec, Louis Serrano et al.
RobSense: A Robust Multi-modal Foundation Model for Remote Sensing with Static, Temporal, and Incomplete Data Adaptability
Minh Kha Do, Kang Han, Phu Lai et al.
A Physics-preserved Transfer Learning Method for Differential Equations
Hao-Ran Yang, Chuan-Xian Ren
ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning
Timo Kaufmann, Yannick Metz, Daniel Keim et al.
VLMs can Aggregate Scattered Training Patches
Zhanhui Zhou, Lingjie Chen, Chao Yang et al.
Decoding Causal Structure: End-to-End Mediation Pathways Inference
Yulong Li, Xiwei Liu, Feilong Tang et al.
Token-Level Self-Play with Importance-Aware Guidance for Large Language Models
Tue Le, Hoang Tran, Quyen Tran et al.
Time-uniform and Asymptotic Confidence Sequence of Quantile under Local Differential Privacy
Leheng Cai, Qirui Hu, Juntao Sun et al.
Structural Causal Bandits under Markov Equivalence
Min Woo Park, Andy Arditi, Elias Bareinboim et al.
Tortoise and Hare Guidance: Accelerating Diffusion Model Inference with Multirate Integration
Yunghee Lee, Byeonghyun Pak, Junwha Hong et al.
When Causal Dynamics Matter: Adapting Causal Strategies through Meta-Aware Interventions
Moritz Willig, Tim Woydt, Devendra Singh Dhami et al.
TARFVAE: Efficient One-Step Generative Time Series Forecasting via TARFLOW based VAE
Jiawen Wei, Jiang Lan, Pengbo Wei et al.
RespoDiff: Dual-Module Bottleneck Transformation for Responsible & Faithful T2I Generation
Silpa Vadakkeeveetil Sreelatha, Sauradip Nag, Muhammad Awais et al.
HyRF: Hybrid Radiance Fields for Memory-efficient and High-quality Novel View Synthesis
Zipeng Wang, Dan Xu
Rationalized All-Atom Protein Design with Unified Multi-Modal Bayesian Flow
Hanlin Wu, Yuxuan Song, Zhe Zhang et al.
Correlated Low-Rank Adaptation for ConvNets
Wu Ran, Weijia Zhang, ShuYang Pang et al.
Bayesian Ego-graph inference for Networked Multi-Agent Reinforcement Learning
Wei Duan, Jie Lu, Junyu Xuan
A Multimodal BiMamba Network with Test-Time Adaptation for Emotion Recognition Based on Physiological Signals
Ziyu Jia, Tingyu Du, Zhengyu Tian et al.
Tree-Sliced Entropy Partial Transport
Viet-Hoang Tran, Thanh Tran, Thanh Chu et al.
Robust Explanations of Graph Neural Networks via Graph Curvatures
Yazheng Liu, Xi Zhang, Sihong Xie et al.
Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcement Learning
Wenchang Duan, Yaoliang Yu, Jiwan He et al.
PCM: Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
Junhyuk So, Jiwoong Shin, Chaeyeon Jang et al.
Embedding Principle of Homogeneous Neural Network for Classification Problem
Jiahan Zhang, Yaoyu Zhang, Tao Luo
Gradient-Guided Epsilon Constraint Method for Online Continual Learning
Song Lai, Changyi Ma, Fei Zhu et al.
NoiseCtrl: A Sampling-Algorithm-Agnostic Conditional Generation Method for Diffusion Models
Longquan Dai, He Wang, Jinhui Tang
Finding separatrices of dynamical flows with Deep Koopman Eigenfunctions
Kabir Dabholkar, Omri Barak
Seeing the Wind from a Falling Leaf
Zhiyuan Gao, Jiageng Mao, Hong-Xing "Koven" Yu et al.
Yo’Chameleon: Personalized Vision and Language Generation
Thao Nguyen, Krishna Kumar Singh, Jing Shi et al.
Theoretical Insights into In-context Learning with Unlabeled Data
Yingcong Li, Xiangyu Chang, Muti Kara et al.
Exploring Tradeoffs through Mode Connectivity for Multi-Task Learning
Zhipeng Zhou, Ziqiao Meng, Pengcheng Wu et al.
Efficient Safe Meta-Reinforcement Learning: Provable Near-Optimality and Anytime Safety
Siyuan Xu, Minghui Zhu
Towards a Pairwise Ranking Model with Orderliness and Monotonicity for Label Enhancement
Yunan Lu, Xixi Zhang, Yaojin Lin et al.
Spiking Transformer: Introducing Accurate Addition-Only Spiking Self-Attention for Transformer
Yufei Guo, Xiaode Liu, Yuanpei Chen et al.
Brain-tuning Improves Generalizability and Efficiency of Brain Alignment in Speech Models
Omer Moussa, Mariya Toneva
RAPTR: Radar-based 3D Pose Estimation using Transformer
Sorachi Kato, Ryoma Yataka, Pu Wang et al.
PRIMT: Preference-based Reinforcement Learning with Multimodal Feedback and Trajectory Synthesis from Foundation Models
Ruiqi Wang, Dezhong Zhao, Ziqin Yuan et al.
SINR: Sparsity Driven Compressed Implicit Neural Representations
Dhananjaya Jayasundara, Sudarshan Rajagopalan, Yasiru Ranasinghe et al.
MJ-Video: Benchmarking and Rewarding Video Generation with Fine-Grained Video Preference
Haibo Tong, Zhaoyang Wang, Zhaorun Chen et al.
IDOL: Meeting Diverse Distribution Shifts with Prior Physics for Tropical Cyclone Multi-Task Estimation
HantingYan Yan, Pan Mu, Shiqi Zhang et al.
Masked Gated Linear Unit
Yukito Tajima, Nakamasa Inoue, Yusuke Sekikawa et al.
FedRAM: Federated Reweighting and Aggregation for Multi-Task Learning
Fan Wu, Xinyu Yan, Jiabei Liu et al.
Memory by accident: a theory of learning as a byproduct of network stabilization
Basile Confavreux, William Dorrell, Nishil Patel et al.
Consistency Conditions for Differentiable Surrogate Losses
Drona Khurana, Anish Thilagar, Dhamma Kimpara et al.
Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
Yuxin Pan, Zhiguang Cao, Chengyang GU et al.
DLoFT: Gradient-Decoupled Fine-Tuning for Generalizable Long Chain-of-Thought Reasoning
Sitong Wu, Haoru Tan, Jingyao Li et al.
Smart Surrogate Losses for Contextual Stochastic Linear Optimization with Robust Constraints
Hyungki Im, Wyame Benslimane, Paul Grigas
Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch
Aneeshan Sain, Subhajit Maity, Pinaki Nath Chowdhury et al.
Advancing Adversarial Robustness in GNeRFs: The IL2-NeRF Attack
Nicole Meng, Caleb Manicke, Ronak Sahu et al.
Learning-enabled Polynomial Lyapunov Function Synthesis via High-Accuracy Counterexample-Guided Framework
Hanrui Zhao, Niuniu Qi, Mengxin Ren et al.
Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics
Deep Patel, Emmanouil-Vasileios Vlatakis-Gkaragkounis
Preference-driven Knowledge Distillation for Few-shot Node Classification
Xing Wei, Chunchun Chen, Rui Fan et al.
Bit-swapping Oriented Twin-memory Multi-view Clustering in Lifelong Incomplete Scenarios
Shengju Yu, Pei Zhang, Siwei Wang et al.
Multi-Kernel Correlation-Attention Vision Transformer for Enhanced Contextual Understanding and Multi-Scale Integration
Hongkang Zhang, Shao-Lun Huang, Ercan KURUOGLU et al.
Efficient Fairness-Performance Pareto Front Computation
Mark Kozdoba, Binyamin Perets, Shie Mannor
Eliciting Reasoning in Language Models with Cognitive Tools
Brown Wilfried Ebouky Doualla Dina, Andrea Bartezzaghi, Mattia Rigotti
Personalized Visual Content Generation in Conversational Systems
Xianquan Wang, Zhaocheng Du, Huibo Xu et al.
CheXwhatsApp: A Dataset for Exploring Challenges in the Diagnosis of Chest X-rays through Mobile Devices
Mariamma Antony, Rajiv Porana, Sahil M. Lathiya et al.
Explaining the Law of Supply and Demand via Online Learning
Stratis Skoulakis
Inference of Whole Brain Electrophysiological Networks Through Multimodal Integration of Simultaneous Scalp and Intracranial EEG
Shihao Yang, Feng Liu
RFMPose: Generative Category-level Object Pose Estimation via Riemannian Flow Matching
Wenzhe Ouyang, Qi Ye, Jinghua Wang et al.
DiskVPS: Vanishing Point Detector via Hough Transform in a Disk Region
Jianping Wu
Delving into Large Language Models for Effective Time-Series Anomaly Detection
Jun Woo Park, Kyudan Jung, Dohyun Lee et al.
Enhancing Compositional Reasoning in CLIP via Reconstruction and Alignment of Text Descriptions
Jihoon Kwon, Kyle Min, Jy-yong Sohn
Covariances for Free: Exploiting Mean Distributions for Training-free Federated Learning
Dipam Goswami, Simone Magistri, Kai Wang et al.
Understanding Parametric and Contextual Knowledge Reconciliation within Large Language Models
Jun Zhao, Yongzhuo Yang, Xiang Hu et al.
EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights
Zhenghao Xing, Hao Chen, Binzhu Xie et al.
On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation
Liyao Tang, Zhe Chen, Dacheng Tao
Functional Matching of Logic Subgraphs: Beyond Structural Isomorphism
Ziyang Zheng, Kezhi Li, Zhengyuan Shi et al.
Sea-ing in Low-light
Nisha Varghese, A. N. Rajagopalan
Smoothed Differentiation Efficiently Mitigates Shattered Gradients in Explanations
Adrian Hill, Neal McKee, Johannes Maeß et al.
LAL: Enhancing 3D Human Motion Prediction with Latency-aware Auxiliary Learning
Xiaoning Sun, Dong Wei, Huaijiang Sun et al.
EventPSR: Surface Normal and Reflectance Estimation from Photometric Stereo Using an Event Camera
Bohan Yu, Jin Han, Boxin Shi et al.
Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks
Yixuan Xu, Antoine Bosselut, Imanol Schlag
Flick: Empowering Federated Learning with Commonsense Knowledge
Ran Zhu, Mingkun Yang, Shiqiang Wang et al.
MDReID: Modality-Decoupled Learning for Any-to-Any Multi-Modal Object Re-Identification
Yingying Feng, Jie Li, Jie Hu et al.
Discovering Important Experts for Mixture-of-Experts Models Pruning Through a Theoretical Perspective
Weizhong Huang, Yuxin Zhang, Xiawu Zheng et al.
MR. Video: MapReduce as an Effective Principle for Long Video Understanding
Ziqi Pang, Yu-Xiong Wang
Miss-ReID: Delivering Robust Multi-Modality Object Re-Identification Despite Missing Modalities
Xi Ruida
Structure-from-Motion with a Non-Parametric Camera Model
Yihan Wang, Linfei Pan, Marc Pollefeys et al.
PointTruss: K-Truss for Point Cloud Registration
Yue Wu, Jun Jiang, Yongzhe Yuan et al.
GMM-based VAE model with Normalising Flow for effective stochastic segmentation
Conghui Li, Chern Hong Lim, Xin Wang
A Cautionary Tale on Integrating Studies with Disparate Outcome Measures for Causal Inference
Harsh Parikh, Trang Nguyen, Elizabeth Stuart et al.
VideoTitans: Scalable Video Prediction with Integrated Short- and Long-term Memory
Young-Jae Park, Minseok Seo, Hae-Gon Jeon
SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning
Ren Wang, Haoliang Sun, Yuxiu Lin et al.
Deep Taxonomic Networks for Unsupervised Hierarchical Prototype Discovery
Zekun Wang, Ethan Haarer, Tianyi Zhu et al.
SNAP: Low-Latency Test-Time Adaptation with Sparse Updates
Hyeongheon Cha, Dong Min Kim, Hye Won Chung et al.
Rebalancing Contrastive Alignment with Bottlenecked Semantic Increments in Text-Video Retrieval
Jian Xiao, Zijie Song, Jialong Hu et al.
RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models
Zukang Xu, Xing Hu, Qiang Wu et al.
ZeroS: Zero-Sum Linear Attention for Efficient Transformers
Jiecheng Lu, Xu Han, Yan Sun et al.
Faster Generic Identification in Tree-Shaped Structural Causal Models
Yasmine Briefs, Markus Bläser
Bounds on the computational complexity of neurons due to dendritic morphology
Anamika Agrawal, Michael Buice
HEIR: Learning Graph-Based Motion Hierarchies
Cheng Zheng, William Koch, Baiang Li et al.
AdaMSS: Adaptive Multi-Subspace Approach for Parameter-Efficient Fine-Tuning
Jingjing Zheng, Wanglong Lu, Yiming Dong et al.
Knowledge Memorization and Rumination for Pre-trained Model-based Class-Incremental Learning
Zijian Gao, Wangwang Jia, Xingxing Zhang et al.
Certifying Concavity and Monotonicity in Games via Sum-of-Squares Hierarchies
Vincent Leon, Iosif Sakos, Ryann Sim et al.
Variational Inference with Mixtures of Isotropic Gaussians
Marguerite Petit-Talamon, Marc Lambert, Anna Korba
TF-MAS: Training-free Mamba2 Architecture Search
Yi Fan, Yu-Bin Yang
Learning from Disjoint Views: A Contrastive Prototype Matching Network for Fully Incomplete Multi-View Clustering
Yiming Wang, Qun Li, Dongxia Chang et al.
Efficiently Maintaining the Multilingual Capacity of MCLIP in Downstream Cross-Modal Retrieval Tasks
Fengmao Lyu, Jitong Lei, Guosheng Lin et al.
Prompted Policy Search: Reinforcement Learning through Linguistic and Numerical Reasoning in LLMs
Yifan Zhou, Sachin Grover, Mohamed El Mistiri et al.
Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning
Xiangtao Zhang, Sheng Li, Ao Li et al.
For Better or for Worse, Transformers Seek Patterns for Memorization
Madhur Panwar, Gail Weiss, Navin Goyal et al.
GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters
Wanjia Zhao, Jiaqi Han, Siyi Gu et al.
Hierarchical Demonstration Order Optimization for Many-shot In-Context Learning
Yinhan He, Wendy Zheng, Song Wang et al.
ACAttack: Adaptive Cross Attacking RGB-T Tracker via Multi-Modal Response Decoupling
Xinyu Xiang, Qinglong Yan, HAO ZHANG et al.
Agnostic Continuous-Time Online Learning
Pramith Devulapalli, Changlong Wu, Ananth Grama et al.
KeeA*: Epistemic Exploratory A* Search via Knowledge Calibration
Dengwei Zhao, Shikui Tu, Yanan Sun et al.