Most Cited 2025 "cross-device fl" Papers
22,274 papers found • Page 104 of 112
Conference
ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation
Ali Athar, Xueqing Deng, Liang-Chieh Chen
Sekai: A Video Dataset towards World Exploration
Zhen Li, Chuanhao Li, Xiaofeng Mao et al.
Don’t Let It Fade: Preserving Edits in Diffusion Language Models via Token Timestep Allocation
Woojin Kim, Jaeyoung Do
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
Jialong Zuo, Yongtai Deng, Mengdan Tan et al.
Simple and Efficient Heterogeneous Temporal Graph Neural Network
Yili Wang, Tairan Huang, Changlong He et al.
Provable Meta-Learning with Low-Rank Adaptations
Jacob Block, Sundararajan Srinivasan, Liam Collins et al.
A4A: Adapter for Adapter Transfer via All-for-All Mapping for Cross-Architecture Models
Keyu Tu, Mengqi Huang, Zhuowei Chen et al.
Towards Precise Embodied Dialogue Localization via Causality Guided Diffusion
Haoyu Wang, Le Wang, Sanping Zhou et al.
Exponential Dynamic Energy Network for High Capacity Sequence Memory
Arjun Karuvally, Pichsinee Lertsaroj, Terrence Sejnowski et al.
TS-MOF: Two-Stage Multi-Objective Fine-tuning for Long-Tailed Recognition
Zhe Zhao, Zhiheng Gong, Pengkun Wang et al.
Disentangling Safe and Unsafe Image Corruptions via Anisotropy and Locality
Ramchandran Muthukumar, Ambar Pal, Jeremias Sulam et al.
Efficient Data Driven Mixture-of-Expert Extraction from Trained Networks
Uranik Berisha, Jens Mehnert, Alexandru Paul Condurache
Understanding Softmax Attention Layers:\\ Exact Mean-Field Analysis on a Toy Problem
Elvis Dohmatob
Deciphering the Extremes: A Novel Approach for Pathological Long-tailed Recognition in Scientific Discovery
Zhe Zhao, HaiBin Wen, Xianfu Liu et al.
Doppelgängers and Adversarial Vulnerability
George Kamberov
LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis
Berkay Döner, Thorir Mar Ingolfsson, Luca Benini et al.
FEEL: Quantifying Heterogeneity in Physiological Signals for Generalizable Emotion Recognition
Pragya Singh, Ankush Gupta, Somay Jalan et al.
Towards Multi-Table Learning: A Novel Paradigm for Complementarity Quantification and Integration
Zhang Junyu, Lizhong Ding, MinghongZhang et al.
Fantastic Bugs and Where to Find Them in AI Benchmarks
Sang Truong, Yuheng Tu, Michael Hardy et al.
FSEO: Few-Shot Evolutionary Optimization via Meta-Learning for Expensive Multi-Objective Optimization
Xunzhao Yu
SplashNet: Split‑and‑Share Encoders for Accurate and Efficient Typing with Surface Electromyography
Nima Hadidi, Jason Chan, Ebrahim Feghhi et al.
Unlocking hidden biomolecular conformational landscapes in diffusion models at inference time
Daniel D. Richman, Jessica Karaguesian, Carl-Mikael Suomivuori et al.
Impartial Selection with Predictions
Rethinking PCA Through Duality
Jan Quan, Johan Suykens, Panagiotis Patrinos
Nonlinearly Preconditioned Gradient Methods: Momentum and Stochastic Analysis
Konstantinos Oikonomidis, Jan Quan, Panagiotis Patrinos
Energy-based generator matching: A neural sampler for general state space
Dongyeop Woo, Minsu Kim, Minkyu Kim et al.
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions
Cheng Luo, Jianghui Wang, Bing Li et al.
Seemingly Redundant Modules Enhance Robust Odor Learning in Fruit Flies
HaiYang Li, Liao Yu, Qiang Yu et al.
Differentially Private Federated Low Rank Adaptation Beyond Fixed-Matrix
Ming Wen, Jiaqi Zhu, Yuedong Xu et al.
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
Yuhan Wang, Fangzhou Hong, Shuai Yang et al.
Generalization Bounds for Rank-sparse Neural Networks
Antoine Ledent, Rodrigo Alves, Yunwen Lei
On Agnostic PAC Learning in the Small Error Regime
Julian Asilis, Mikael Møller Høgsgaard, Grigoris Velegkas
RAG-IGBench: Innovative Evaluation for RAG-based Interleaved Generation in Open-domain Question Answering
Rongyang Zhang, Yuqing Huang, Chengqiang Lu et al.
AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking
Xiangqi Wang, Yue Huang, Yanbo Wang et al.
LooGLE v2: Are LLMs Ready for Real World Long Dependency Challenges?
Ziyuan He, Yuxuan Wang, Jiaqi Li et al.
Matrix-Free Shared Intrinsics Bundle Adjustment
Daniel Safari
Learning Theory for Kernel Bilevel Optimization
Fares El Khoury, Edouard Pauwels, Samuel Vaiter et al.
Seeing More with Less: Human-like Representations in Vision Models
Andrey Gizdov, Shimon Ullman, Daniel Harari
DCcluster-Opt: Benchmarking Dynamic Multi-Objective Optimization for Geo-Distributed Data Center Workloads
Antonio Guillen-Perez, Avisek Naug, Vineet Gundecha et al.
The Leaderboard Illusion
Shivalika Singh, Yiyang Nan, Alex Wang et al.
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
Jianyi Wang, Zhijie Lin, Meng Wei et al.
Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding
Jiaxin Shi, Mingyue Xiang, Hao Sun et al.
CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation
Xinran Wang, Songyu Xu, Shan Xiangxuan et al.
Look-Ahead Reasoning on Learning Platforms
Haiqing Zhu, Tijana Zrnic, Celestine Mendler-Dünner
Comprehensive Assessment and Analysis for NSFW Content Erasure in Text-to-Image Diffusion models
Die Chen, Zhiwen Li, Cen Chen et al.
Robust Sampling for Active Statistical Inference
Puheng Li, Tijana Zrnic, Emmanuel Candes
AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
Lingteng Qiu, Shenhao Zhu, Qi Zuo et al.
Fuzzy Multimodal Learning for Trusted Cross-modal Retrieval
Siyuan Duan, Yuan Sun, Dezhong Peng et al.
FineGRAIN: Evaluating Failure Modes of Text-to-Image Models with Vision Language Model Judges
Kevin Hayes, Micah Goldblum, Vikash Sehwag et al.
NVILA: Efficient Frontier Visual Language Models
Zhijian Liu, Ligeng Zhu, Baifeng Shi et al.
Conditional Forecasts and Proper Scoring Rules for Reliable and Accurate Performative Predictions
Philip Boeken, Onno Zoeter, Joris Mooij
MAESTRO : Adaptive Sparse Attention and Robust Learning for Multimodal Dynamic Time Series
Payal Mohapatra, Yueyuan Sui, Akash Pandey et al.
FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis
Wonjoon Jin, Qi Dai, Chong Luo et al.
UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation
Yinqiao Wang, Hao Xu, Pheng-Ann Heng et al.
SaFiRe: Saccade-Fixation Reiteration with Mamba for Referring Image Segmentation
Zhenjie Mao, Yang Yuhuan, Chaofan Ma et al.
Stochastically Dominant Peer Prediction
Yichi Zhang, Shengwei Xu, Grant Schoenebeck et al.
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images
Jie Mei, Chenyu Lin, Yu Qiu et al.
No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather
Junsung Park, HwiJeong Lee, Inha Kang et al.
Learning Partonomic 3D Reconstruction from Image Collections
Xiaoqian Ruan, Pei Yu, Dian Jia et al.
LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning
Peng Wu, Xiankai Lu, Hao Hu et al.
CIDD: Collaborative Intelligence for Structure-Based Drug Design Empowered by LLMs
Bowen Gao, Yanwen Huang, Yiqiao Liu et al.
AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation
Wenyu Zhu, Jianhui Wang, Bowen Gao et al.
Self-Supervised Learning of Graph Representations for Network Intrusion Detection
Lorenzo Guerra, Thomas Chapuis, Guillaume Duc et al.
BO4Mob: Bayesian Optimization Benchmarks for High-Dimensional Urban Mobility Problem
Seunghee Ryu, Donghoon Kwon, Seongjin Choi et al.
Towards Realistic Earth-Observation Constellation Scheduling: Benchmark and Methodology
Luting Wang, Yinghao Xiang, Hongliang Huang et al.
Stability and Oracle Inequalities for Optimal Transport Maps between General Distributions
Shubo Li, Yizhe Ding, Lingzhou Xue et al.
$\texttt{AVROBUSTBENCH}$: Benchmarking the Robustness of Audio-Visual Recognition Models at Test-Time
Sarthak Kumar Maharana, Saksham Singh Kushwaha, Baoming Zhang et al.
EPFL-Smart-Kitchen: An Ego-Exo Multi-Modal Dataset for Challenging Action and Motion Understanding in Video-Language Models
Andy Bonnetto, Haozhe Qi, Franklin Leong et al.
OCTDiff: Bridged Diffusion Model for Portable OCT Super-Resolution and Enhancement
Ye Tian, Angela McCarthy, Gabriel Gomide et al.
3D Student Splatting and Scooping
Jialin Zhu, Jiangbei Yue, Feixiang He et al.
LEDiff: Latent Exposure Diffusion for HDR Generation
Chao Wang, Zhihao Xia, Thomas Leimkuehler et al.
RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning
Kaiwen Zha, Zhengqi Gao, Maohao Shen et al.
Abstain Mask Retain Core: Time Series Prediction by Adaptive Masking Loss with Representation Consistency
Renzhao Liang, Sizhe Xu, Chenggang Xie et al.
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes
Christodoulos Constantinides, Dhaval Patel, Shuxin Lin et al.
PSMBench: A Benchmark and Dataset for Evaluating LLMs Extraction of Protocol State Machines from RFC Specifications
Zilin Shen, Xinyu Luo, Imtiaz Karim et al.
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
Cheng Zhang, Haofei Xu, Qianyi Wu et al.
Geometry of Decision Making in Language Models
Abhinav Joshi, Divyanshu Bhatt, Ashutosh Modi
UMU-Bench: Closing the Modality Gap in Multimodal Unlearning Evaluation
Chengye Wang, Yuyuan Li, XiaoHua Feng et al.
Ridge Boosting is Both Robust and Efficient
David Bruns-Smith, Zhongming Xie, Avi Feller
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
Jiacong Xu, Shao-Yuan Lo, Bardia Safaei et al.
GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction
Jinguang Tong, Xuesong li, Fahira Afzal Maken et al.
Distributionally Robust Learning for Multi-source Unsupervised Domain Adaptation
Zhenyu Wang, Peter Bühlmann, Zijian Guo
HotSpot: Signed Distance Function Optimization with an Asymptotically Sufficient Condition
Zimo Wang, Cheng Wang, Taiki Yoshino et al.
Locality-Aware Zero-Shot Human-Object Interaction Detection
Sanghyun Kim, Deunsol Jung, Minsu Cho
Functional Scaling Laws in Kernel Regression: Loss Dynamics and Learning Rate Schedules
Binghui Li, Fengling Chen, Zixun Huang et al.
DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks
Canyu Zhao, Yanlong Sun, Mingyu Liu et al.
SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
chenkai zhang, Yiming Lei, Zeming Liu et al.
DSAS: A Universal Plug-and-Play Framework for Attention Optimization in Multi-Document Question Answering
Jiakai Li, Rongzheng Wang, Yizhuo Ma et al.
Diversifying Parallel Ergodic Search: A Signature Kernel Evolution Strategy
Sreevardhan Sirigiri, Christian Hughes, Ian Abraham et al.
Unified Algorithms for RL with Decision-Estimation Coefficients: PAC, Reward-Free, Preference-Based Learning, and Beyond
Fan Chen, Song Mei, Yu Bai
AION-1: Omnimodal Foundation Model for Astronomical Sciences
Liam Parker, Francois Lanusse, Jeff Shen et al.
Predictive Preference Learning from Human Interventions
Haoyuan Cai, Zhenghao (Mark) Peng, Bolei Zhou
Kernel Learning with Adversarial Features: Numerical Efficiency and Adaptive Regularization
Antonio Ribeiro, David Vävinggren, Dave Zachariah et al.
A Difference-of-Convex Functions Approach to Energy-Based Iterative Reasoning
Daniel Tschernutter, David Diego Castro, Maciej Kasiński
Best-of-N Jailbreaking
John Hughes, Sara Price, Aengus Lynch et al.
HouseLayout3D: A Benchmark and Training-free Baseline for 3D Layout Estimation in the Wild
Valentin Bieri, Marie-Julie Rakotosaona, Keisuke Tateno et al.
Efficient Safe Meta-Reinforcement Learning: Provable Near-Optimality and Anytime Safety
Siyuan Xu, Minghui Zhu
Understanding Fairness and Prediction Error through Subspace Decomposition and Influence Analysis
Enze Shi, Pankaj Bhagwat, Zhixian Yang et al.
Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection
Jiangyi Wang, Na Zhao
Text-Driven Fashion Image Editing with Compositional Concept Learning and Counterfactual Abduction
Shanshan Huang, Haoxuan Li, Chunyuan Zheng et al.
Number it: Temporal Grounding Videos like Flipping Manga
Yongliang Wu, Xinting Hu, Yuyang Sun et al.
Autoregressive Sequential Pretraining for Visual Tracking
Shiyi Liang, Yifan Bai, Yihong Gong et al.
A Selective Re-learning Mechanism for Hyperspectral Fusion Imaging
Yuanye Liu, jinyang liu, Renwei Dian et al.
CADRef: Robust Out-of-Distribution Detection via Class-Aware Decoupled Relative Feature Leveraging
Zhiwei Ling, Yachen Chang, Hailiang Zhao et al.
Imputation-free and Alignment-free: Incomplete Multi-view Clustering Driven by Consensus Semantic Learning
yuzhuo dai, Jiaqi Jin, Zhibin Dong et al.
DeepDiver: Adaptive Web-Search Intensity Scaling via Reinforcement Learning
Wenxuan Shi, Haochen Tan, Chuqiao Kuang et al.
Beyond Expectations: Quantile-Guided Alignment for Risk-Calibrated Language Models
Xinran Wang, Jin Du, Azal Khan et al.
Minority-Focused Text-to-Image Generation via Prompt Optimization
Soobin Um, Jong Chul Ye
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
Yuxuan Wang, Yueqian Wang, Bo Chen et al.
Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision
Chenshuang Zhang, Kang Zhang, Joon Son Chung et al.
DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images
Ozgur Kara, Harris Nisar, James Rehg
Variational Polya Tree
Lu Xu, Tsai Hor Chan, Lequan Yu et al.
PhysDiff: A Physically-Guided Diffusion Model for Multivariate Time Series Anomaly Detection
Long Li, Wanghu Chen, Wencheng Zhang et al.
Situat3DChange: Situated 3D Change Understanding Dataset for Multimodal Large Language Model
Ruiping Liu, Junwei Zheng, Yufan Chen et al.
Occlusion-aware Text-Image-Point Cloud Pretraining for Open-World 3D Object Recognition
Khanh Nguyen, Ghulam Mubashar Hassan, Ajmal Mian
Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?
Paul Gölz, Nika Haghtalab, Kunhe Yang
Absence Bench: Language Models Can’t See What’s Missing
Harvey Yiyun Fu, Aryan Shrivastava, Jared Moore et al.
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors
Yifan Yu, Shaohui Liu, Rémi Pautrat et al.
Unseen Visual Anomaly Generation
HAN SUN, Yunkang Cao, Hao Dong et al.
Mamba-Reg: Vision Mamba Also Needs Registers
Feng Wang, Jiahao Wang, Sucheng Ren et al.
Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution
Bozhou Zhang, Nan Song, jingyu li et al.
Flexible Realignment of Language Models
Wenhong Zhu, Ruobing Xie, Weinan Zhang et al.
LABridge: Text–Image Latent Alignment Framework via Mean-Conditioned OU Process
Huiyang Shao, Xin Xia, Yuxi Ren et al.
BikeBench: A Bicycle Design Benchmark for Generative Models with Objectives and Constraints
Lyle Regenwetter, Yazan Abu Obaideh, Fabien Chiotti et al.
Efficiently Maintaining the Multilingual Capacity of MCLIP in Downstream Cross-Modal Retrieval Tasks
Fengmao Lyu, Jitong Lei, Guosheng Lin et al.
Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference
Hao Yin, Guangzong Si, Zilei Wang
Not Just Text: Uncovering Vision Modality Typographic Threats in Image Generation Models
Hao Cheng, Erjia Xiao, Jiayan Yang et al.
Can Multi-Modal LLMs Provide Live Step-by-Step Task Guidance?
Apratim Bhattacharyya, Bicheng Xu, Sanjay Haresh et al.
Auto-Connect: Connectivity-Preserving RigFormer with Direct Preference Optimization
jingfeng Guo, Jian Liu, Jinnan Chen et al.
Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection
Fuyun Wang, Tong Zhang, Yuanzhi Wang et al.
Progressive Inference-Time Annealing of Diffusion Models for Sampling from Boltzmann Densities
Tara Akhound-Sadegh, Jungyoon Lee, Joey Bose et al.
Breaking the Frozen Subspace: Importance Sampling for Low-Rank Optimization in LLM Pretraining
Haochen Zhang, Junze Yin, Guanchu Wang et al.
Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition
ZHANG LINTONG, Kang Yin, Seong-Whan Lee
How many measurements are enough? Bayesian recovery in inverse problems with general distributions
Ben Adcock, Zi Yuan (Nick) Huang
RAEncoder: A Label-Free Reversible Adversarial Examples Encoder for Dataset Intellectual Property Protection
Fan Xing, Zhuo Tian, Xuefeng Fan et al.
ELECTRA: A Cartesian Network for 3D Charge Density Prediction with Floating Orbitals
Jonas Elsborg, Luca Thiede, Alan Aspuru-Guzik et al.
CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models
Shristi Das Biswas, Arani Roy, Kaushik Roy
URB - Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles
Ahmet Onur Akman, Anastasia Psarou, Michał Hoffmann et al.
Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start
Fuyang Liu, Jiaqi Xu, Xiaowei Hu
Posterior Contraction for Sparse Neural Networks in Besov Spaces with Intrinsic Dimensionality
Kyeongwon Lee, Lizhen Lin, Jaewoo Park et al.
SECODEPLT: A Unified Benchmark for Evaluating the Security Risks and Capabilities of Code GenAI
Yuzhou Nie, Zhun Wang, Yu Yang et al.
Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights
Ondrej Tybl, Lukas Neumann
Deep Edge Filter: Return of the Human-Crafted Layer in Deep Learning
Dongkwan Lee, JunHoo Lee, Nojun Kwak
UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting
Kai He, Ruofan Liang, Jacob Munkberg et al.
Shift the Lens: Environment-Aware Unsupervised Camouflaged Object Detection
Ji Du, Fangwei Hao, Mingyang Yu et al.
KeeA*: Epistemic Exploratory A* Search via Knowledge Calibration
Dengwei Zhao, Shikui Tu, Yanan Sun et al.
MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos
Zhengqi Li, Richard Tucker, Forrester Cole et al.
Reinforced Active Learning for Large-Scale Virtual Screening with Learnable Policy Model
Yicong Chen, Jiahua Rao, Jiancong Xie et al.
MAD: Memory-Augmented Detection of 3D Objects
Ben Agro, Sergio Casas, Patrick Wang et al.
Dynamic Pseudo Labeling via Gradient Cutting for High-Low Entropy Exploration
Jae Hyeon Park, Joo Hyeon Jeon, Jae Yun Lee et al.
Vocabulary-Guided Gait Recognition
Panjian Huang, Saihui Hou, Chunshui Cao et al.
Is Your Diffusion Model Actually Denoising?
Daniel Pfrommer, Zehao Dou, Christopher Scarvelis et al.
Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
Sihan Zeng, Benjamin Patrick Evans, Sujay Bhatt et al.
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs
Zicheng Zhang, Ziheng Jia, Haoning Wu et al.
RF-Agent: Automated Reward Function Design via Language Agent Tree Search
Ning Gao, Xiuhui Zhang, Xingyu Jiang et al.
Alignment, Mining and Fusion: Representation Alignment with Hard Negative Mining and Selective Knowledge Fusion for Medical Visual Question Answering
Yuanhao Zou, Zhaozheng Yin
NeuroRenderedFake: A Challenging Benchmark to Detect Fake Images Generated by Advanced Neural Rendering Methods
Chengdong Dong, B. V. K. Vijaya Kumar, Zhenyu Zhou et al.
MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation
Meilong Xu, Xiaoling Hu, Shahira Abousamra et al.
Mamba Modulation: On the Length Generalization of Mamba Models
Peng Lu, Jerry Huang, QIUHAO Zeng et al.
Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning
Kunyu Wang, Xueyang Fu, Xin Lu et al.
Streaming Stochastic Submodular Maximization with On-Demand User Requests
Honglian Wang, Sijing Tu, Lutz Oettershagen et al.
MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research
Hui Chen, Miao Xiong, Yujie Lu et al.
Image Super-Resolution with Guarantees via Conformalized Generative Models
Eduardo Adame, Daniel Csillag, Guilherme Tegoni Goedert
FLAME: Fast Long-context Adaptive Memory for Event-based Vision
Biswadeep Chakraborty, Saibal Mukhopadhyay
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
Jiahe Li, Jiawei Zhang, Xiao Bai et al.
Bridging Human and LLM Judgments: Understanding and Narrowing the Gap
Felipe Maia Polo, Xinhe Wang, Mikhail Yurochkin et al.
LaX: Boosting Low-Rank Training of Foundation Models via Latent Crossing
Ruijie (Ray) Zhang, Ziyue (Alvin) Liu, Zhengyang Wang et al.
The Fragile Truth of Saliency: Improving LLM Input Attribution via Attention Bias Optimization
Yihua Zhang, Changsheng Wang, Yiwei Chen et al.
Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits
Shaoang Li, Jian Li
MUniverse: A Simulation and Benchmarking Suite for Motor Unit Decomposition
Pranav Mamidanna, Thomas Klotz, Dimitrios Chalatsis et al.
Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video
Hoang Chuong Nguyen, Wei Mao, Jose M. Alvarez et al.
PMNI: Pose-free Multi-view Normal Integration for Reflective and Textureless Surface Reconstruction
Mingzhi Pei, Xu Cao, Xiangyi Wang et al.
3D Interaction Geometric Pre-training for Molecular Relational Learning
Namkyeong Lee, Yunhak Oh, Heewoong Noh et al.
LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
Hadi Askari, Shivanshu Gupta, Fei Wang et al.
Can NeRFs "See" without Cameras?
Chaitanya Amballa, Yu-Lin Wei, Sattwik Basu et al.
Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models
Lexiang Xiong, Liu Chengyu, Jingwen Ye et al.
ControlFusion: A Controllable Image Fusion Network with Language-Vision Degradation Prompts
Linfeng Tang, Yeda Wang, Zhanchuan Cai et al.
Video-Bench: Human-Aligned Video Generation Benchmark
Hui Han, Siyuan Li, Jiaqi Chen et al.
Dual Consolidation for Pre-Trained Model-Based Domain-Incremental Learning
Da-Wei Zhou, Zi-Wen Cai, Han-Jia Ye et al.
Counterfactual Identifiability via Dynamic Optimal Transport
Fabio De Sousa Ribeiro, Ainkaran Santhirasekaram, Ben Glocker
MFogHub: Bridging Multi-Regional and Multi-Satellite Data for Global Marine Fog Detection and Forecasting
Mengqiu XU, Kaixin Chen, Heng Guo et al.
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Adaptation
Arnav Mohanty Das, Gantavya Bhatt, Lilly Kumari et al.
On the Edge of Memorization in Diffusion Models
Sam Buchanan, Druv Pai, Yi Ma et al.
All-directional Disparity Estimation for Real-world QPD Images
Hongtao Yu, Shaohui Song, Lihu Sun et al.
CLIP-driven Coarse-to-fine Semantic Guidance for Fine-grained Open-set Semi-supervised Learning
Xiaokun Li, Yaping Huang, Qingji Guan
Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Observation Delays
Songchen Fu, Siang Chen, Shaojing Zhao et al.
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
Yuan Zhou, Qingshan Xu, Jiequan Cui et al.
Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction
Marzieh Ajirak, Oded Bein, Ellen Bowen et al.
The Indra Representation Hypothesis
Jianglin Lu, Hailing Wang, Kuo Yang et al.
VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding
Chaoyu Li, Eun Woo Im, Pooyan Fazli
Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis
Tim Büchner, Christoph Anders, Orlando Guntinas-Lichius et al.
MyoChallenge 2024: A New Benchmark for Physiological Dexterity and Agility in Bionic Humans
Huiyi Wang, Chun Kwang Tan, Balint Hodossy et al.
Predictive Coding Enhances Meta-RL To Achieve Interpretable Bayes-Optimal Belief Representation Under Partial Observability
Po-Chen Kuo, Han Hou, Will Dabney et al.
VL2Lite: Task-Specific Knowledge Distillation from Large Vision-Language Models to Lightweight Networks
Jinseong Jang, Chunfei Ma, Byeongwon Lee
Mind the Gap: Confidence Discrepancy Can Guide Federated Semi-Supervised Learning Across Pseudo-Mismatch
Yijie Liu, Xinyi Shang, Yiqun Zhang et al.
Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning
Yash Jhaveri, Harley Wiltzer, Patrick Shafto et al.
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
Bardia Safaei, Faizan Siddiqui, Jiacong Xu et al.
MixSignGraph: A Sign Sequence is Worth Mixed Graphs of Nodes
Shiwei Gan, Yafeng Yin, Zhiwei Jiang et al.
Black Swan: Abductive and Defeasible Video Reasoning in Unpredictable Events
Aditya Chinchure, Sahithya Ravi, Raymond Ng et al.
Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition
Anqi Zhu, Jingmin Zhu, James Bailey et al.