"internal representations" Papers
3 papers found
Beyond the Surface: Enhancing LLM-as-a-Judge Alignment with Human via Internal Representations
Peng Lai, Jianjie Zheng, Sijie Cheng et al.
NeurIPS 2025posterarXiv:2508.03550
2
citations
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
Hadas Orgad, Michael Toker, Zorik Gekhman et al.
ICLR 2025posterarXiv:2410.02707
118
citations
Robust Hallucination Detection in LLMs via Adaptive Token Selection
Mengjia Niu, Hamed Haddadi, Guansong Pang
NeurIPS 2025posterarXiv:2504.07863
4
citations