Ruoxi Jia

21
Papers
20
Total Citations

Papers (21)

LLMs Can Plan Only If We Tell Them

ICLR 2025
16
citations

Detecting Adversarial Data Using Perturbation Forgery

CVPR 2025
2
citations

Just Enough Shifts: Mitigating Over-Refusal in Aligned Language Models with Targeted Representation Fine-Tuning

ICML 2025
2
citations

The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes

CVPR 2024
0
citations

RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content

ICML 2024
0
citations

Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models

ICML 2024
0
citations

Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits

ICML 2024
0
citations

Position: A Safe Harbor for AI Evaluation and Red Teaming

ICML 2024
0
citations

Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification?

CVPR 2021arXiv
0
citations

Label-Only Model Inversion Attacks via Boundary Repulsion

CVPR 2022arXiv
0
citations

Knowledge-Enriched Distributional Model Inversion Attacks

ICCV 2021arXiv
0
citations

Rethinking the Backdoor Attacks' Triggers: A Frequency Perspective

ICCV 2021arXiv
0
citations

Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study

ICCV 2023
0
citations

The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks

CVPR 2020arXiv
0
citations

Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation

ICCV 2025
0
citations

Probing Hidden Knowledge Holes in Unlearned LLMs

NeurIPS 2025
0
citations

CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks

NeurIPS 2022
0
citations

Renyi Differential Privacy of Propose-Test-Release and Applications to Private and Robust Machine Learning

NeurIPS 2022
0
citations

A Randomized Approach to Tight Privacy Accounting

NeurIPS 2023
0
citations

A Privacy-Friendly Approach to Data Valuation

NeurIPS 2023
0
citations

Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources

NeurIPS 2023
0
citations