NEURIPS 2025 "influence functions" Papers
6 papers found
Better Training Data Attribution via Better Inverse Hessian-Vector Products
Andrew Wang, Elisa Nguyen, Runshi Yang et al.
NEURIPS 2025, poster, arXiv:2507.14740
3 citations
Enhancing Training Data Attribution with Representational Optimization
Weiwei Sun, Haokun Liu, Nikhil Kandpal et al.
NEURIPS 2025, spotlight, arXiv:2505.18513
IF-Guide: Influence Function-Guided Detoxification of LLMs
Zachary Coalson, Juhan Bae, Nicholas Carlini et al.
NEURIPS 2025, poster, arXiv:2506.01790
1 citation
LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
Hadi Askari, Shivanshu Gupta, Fei Wang et al.
NEURIPS 2025, poster, arXiv:2505.23811
Understanding Fairness and Prediction Error through Subspace Decomposition and Influence Analysis
Enze Shi, Pankaj Bhagwat, Zhixian Yang et al.
NEURIPS 2025, poster, arXiv:2510.23935
Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions
Siqi Kou, Qingyuan Tian, Hanwen Xu et al.
NEURIPS 2025, poster, arXiv:2505.19949
4 citations