NEURIPS "fine-tuning attacks" Papers
2 papers found
CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment
Qinfeng Li, Tianyue Luo, Xuhong Zhang et al.
NEURIPS 2025, poster, arXiv:2410.13903
7 citations
ErrorTrace: A Black-Box Traceability Mechanism Based on Model Family Error Space
Chuanchao Zang, Xiangtao Meng, Wenyu Chen et al.
NEURIPS 2025, spotlight