Shafiq Joty
Papers: 4
Total Citations: 78
Papers (4)
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
ICLR 2025, 45 citations
Preference Optimization for Reasoning with Pseudo Feedback
ICLR 2025, 33 citations
Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
NeurIPS 2025, 0 citations
Diffusion Model Alignment Using Direct Preference Optimization
CVPR 2024, 0 citations