Dawn Song
34
Papers
116
Total Citations
Papers (34)
Latent Attention For If-Then Program Synthesis
NeurIPS 2016arXiv
72
citations
Data Shapley in One Training Run
ICLR 2025arXiv
44
citations
Position: Evolving AI Collectives Enhance Human Diversity and Enable Self-Regulation
ICML 2024
0
citations
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
ICML 2024
0
citations
GRATH: Gradual Self-Truthifying for Large Language Models
ICML 2024
0
citations
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
ICML 2024
0
citations
C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models
ICML 2024
0
citations
SHINE: Shielding Backdoors in Deep Reinforcement Learning
ICML 2024
0
citations
Agent Instructs Large Language Models to be General Zero-Shot Reasoners
ICML 2024
0
citations
Position: On the Societal Impact of Open Foundation Models
ICML 2024
0
citations
Robust Physical-World Attacks on Deep Learning Visual Classification
CVPR 2018
0
citations
Fooling Vision and Language Models Despite Localization and Attention Mechanism
CVPR 2018arXiv
0
citations
The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks
CVPR 2020arXiv
0
citations
Natural Adversarial Examples
CVPR 2021arXiv
0
citations
Model-Contrastive Federated Learning
CVPR 2021arXiv
0
citations
Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification?
CVPR 2021arXiv
0
citations
PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
CVPR 2022arXiv
0
citations
AdvIT: Adversarial Frames Identifier Based on Temporal Consistency in Videos
ICCV 2019
0
citations
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
ICCV 2021arXiv
0
citations
TrojDiff: Trojan Attacks on Diffusion Models With Diverse Targets
CVPR 2023arXiv
0
citations
CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification
AAAI 2025
0
citations
Improving Neural Program Synthesis with Inferred Execution Traces
NeurIPS 2018
0
citations
Tree-to-tree Neural Networks for Program Translation
NeurIPS 2018
0
citations
Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty
NeurIPS 2019
0
citations
Compositional Generalization via Neural-Symbolic Stack Machines
NeurIPS 2020
0
citations
Towards practical differentially private causal graph discovery
NeurIPS 2020
0
citations
Synthesize, Execute and Debug: Learning to Repair for Neural Program Synthesis
NeurIPS 2020
0
citations
Adversarial Examples for k-Nearest Neighbor Classifiers Based on Higher-Order Voronoi Diagrams
NeurIPS 2021
0
citations
Latent Execution for Neural Program Synthesis Beyond Domain-Specific Languages
NeurIPS 2021
0
citations
How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios
NeurIPS 2022
0
citations
Forecasting Future World Events With Neural Networks
NeurIPS 2022
0
citations
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
NeurIPS 2023
0
citations
BIRD: Generalizable Backdoor Detection and Removal for Deep Reinforcement Learning
NeurIPS 2023
0
citations
DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification
NeurIPS 2023
0
citations