Ruoxi Jia
21
Papers
20
Total Citations
Papers (21)
LLMs Can Plan Only If We Tell Them
ICLR 2025
16
citations
Detecting Adversarial Data Using Perturbation Forgery
CVPR 2025
2
citations
Just Enough Shifts: Mitigating Over-Refusal in Aligned Language Models with Targeted Representation Fine-Tuning
ICML 2025
2
citations
The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes
CVPR 2024
0
citations
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
ICML 2024
0
citations
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models
ICML 2024
0
citations
Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits
ICML 2024
0
citations
Position: A Safe Harbor for AI Evaluation and Red Teaming
ICML 2024
0
citations
Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification?
CVPR 2021
0
citations
Label-Only Model Inversion Attacks via Boundary Repulsion
CVPR 2022
0
citations
Knowledge-Enriched Distributional Model Inversion Attacks
ICCV 2021
0
citations
Rethinking the Backdoor Attacks' Triggers: A Frequency Perspective
ICCV 2021
0
citations
Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study
ICCV 2023
0
citations
The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks
CVPR 2020
0
citations
Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation
ICCV 2025
0
citations
Probing Hidden Knowledge Holes in Unlearned LLMs
NeurIPS 2025
0
citations
CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks
NeurIPS 2022
0
citations
Renyi Differential Privacy of Propose-Test-Release and Applications to Private and Robust Machine Learning
NeurIPS 2022
0
citations
A Randomized Approach to Tight Privacy Accounting
NeurIPS 2023
0
citations
A Privacy-Friendly Approach to Data Valuation
NeurIPS 2023
0
citations
Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources
NeurIPS 2023
0
citations