FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation

9 citations

arXiv:2503.17940

Citations: #485 of 2873 papers in CVPR 2025

Authors

Dong Zhao, Jinlong Li, Shuang Wang, Mengyao Wu, Qi Zang, Nicu Sebe, Zhun Zhong

Topics

vision foundation models, domain generalized segmentation, fisher information matrix, parameter sensitivity analysis, variational inference, semantic segmentation, cross-domain adaptation, robust fine-tuning

Abstract

Vision Foundation Models (VFMs) excel in generalization due to large-scale pretraining, but fine-tuning them for Domain Generalized Semantic Segmentation (DGSS) while maintaining this ability remains challenging. Existing approaches either selectively fine-tune parameters or freeze the VFMs and update only the adapters, both of which may underutilize the VFMs' full potential in DGSS tasks. We observe that domain-sensitive parameters in VFMs, arising from task and distribution differences, can hinder generalization. To address this, we propose FisherTune, a robust fine-tuning method guided by the Domain-Related Fisher Information Matrix (DR-FIM). DR-FIM measures parameter sensitivity across tasks and domains, enabling selective updates that preserve generalization and enhance DGSS adaptability. FisherTune incorporates variational inference to stabilize DR-FIM estimation, treating parameters as Gaussian-distributed variables and leveraging pre-trained priors. Extensive experiments show that FisherTune achieves superior cross-domain segmentation while maintaining generalization, outperforming selective-parameter and adapter-based methods.
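
To make the idea of Fisher-based parameter sensitivity concrete, here is a minimal sketch in plain Python. It is not the paper's DR-FIM, whose exact definition is not given in the abstract. It assumes a toy two-weight logistic regression, computes the standard empirical diagonal Fisher information (the mean squared gradient of the log-likelihood) separately on two hypothetical domains, and ranks weights by how much that value differs between them. The function names, the toy data, and the gap-based ranking rule are all illustrative assumptions, and the variational-inference stabilization mentioned above is not modeled.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def diagonal_fisher(weights, samples):
    """Empirical diagonal Fisher for logistic regression.

    For each weight, average the squared gradient of log p(y | x)
    over the samples. The gradient w.r.t. weight i is (y - p) * x_i.
    """
    fisher = [0.0] * len(weights)
    for x, y in samples:
        p = sigmoid(sum(w * xi for w, xi in zip(weights, x)))
        for i, xi in enumerate(x):
            grad = (y - p) * xi
            fisher[i] += grad * grad
    return [f / len(samples) for f in fisher]


def rank_by_domain_gap(fisher_a, fisher_b, k):
    """Hypothetical rule: indices of the k weights whose Fisher value
    differs most between two domains (an assumption, not the paper's rule)."""
    gaps = [abs(a - b) for a, b in zip(fisher_a, fisher_b)]
    order = sorted(range(len(gaps)), key=lambda i: gaps[i], reverse=True)
    return sorted(order[:k])


# Toy data (assumed): feature 1 has a larger scale in domain B than in domain A.
domain_a = [([1.0, 1.0], 1), ([-1.0, 1.0], 0), ([1.0, -1.0], 1), ([-1.0, -1.0], 0)]
domain_b = [([1.0, 3.0], 1), ([-1.0, 3.0], 0), ([1.0, -3.0], 1), ([-1.0, -3.0], 0)]
weights = [0.0, 0.0]

fisher_a = diagonal_fisher(weights, domain_a)
fisher_b = diagonal_fisher(weights, domain_b)
domain_sensitive = rank_by_domain_gap(fisher_a, fisher_b, k=1)

print("Fisher (domain A):", fisher_a)
print("Fisher (domain B):", fisher_b)
print("Most domain-sensitive weight indices:", domain_sensitive)
```

In this toy setup the second weight stands out because only its input feature changes scale between the two domains. How FisherTune turns such sensitivity scores into selective updates is described in the paper itself, not here.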

Citation History

Jan 25, 2026 to Jan 31, 2026: 9 citations (+1)