Dawn Song

34
Papers
116
Total Citations

Papers (34)

Latent Attention For If-Then Program Synthesis

NeurIPS 2016 (arXiv)
72
citations

Data Shapley in One Training Run

ICLR 2025 (arXiv)
44
citations

Position: Evolving AI Collectives Enhance Human Diversity and Enable Self-Regulation

ICML 2024
0
citations

RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content

ICML 2024
0
citations

GRATH: Gradual Self-Truthifying for Large Language Models

ICML 2024
0
citations

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

ICML 2024
0
citations

C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models

ICML 2024
0
citations

SHINE: Shielding Backdoors in Deep Reinforcement Learning

ICML 2024
0
citations

Agent Instructs Large Language Models to be General Zero-Shot Reasoners

ICML 2024
0
citations

Position: On the Societal Impact of Open Foundation Models

ICML 2024
0
citations

Robust Physical-World Attacks on Deep Learning Visual Classification

CVPR 2018
0
citations

Fooling Vision and Language Models Despite Localization and Attention Mechanism

CVPR 2018 (arXiv)
0
citations

The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks

CVPR 2020 (arXiv)
0
citations

Natural Adversarial Examples

CVPR 2021 (arXiv)
0
citations

Model-Contrastive Federated Learning

CVPR 2021 (arXiv)
0
citations

Scalability vs. Utility: Do We Have To Sacrifice One for the Other in Data Importance Quantification?

CVPR 2021 (arXiv)
0
citations

PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures

CVPR 2022 (arXiv)
0
citations

AdvIT: Adversarial Frames Identifier Based on Temporal Consistency in Videos

ICCV 2019
0
citations

The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization

ICCV 2021 (arXiv)
0
citations

TrojDiff: Trojan Attacks on Diffusion Models With Diverse Targets

CVPR 2023 (arXiv)
0
citations

CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification

AAAI 2025
0
citations

Improving Neural Program Synthesis with Inferred Execution Traces

NeurIPS 2018
0
citations

Tree-to-tree Neural Networks for Program Translation

NeurIPS 2018
0
citations

Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty

NeurIPS 2019
0
citations

Compositional Generalization via Neural-Symbolic Stack Machines

NeurIPS 2020
0
citations

Towards Practical Differentially Private Causal Graph Discovery

NeurIPS 2020
0
citations

Synthesize, Execute and Debug: Learning to Repair for Neural Program Synthesis

NeurIPS 2020
0
citations

Adversarial Examples for k-Nearest Neighbor Classifiers Based on Higher-Order Voronoi Diagrams

NeurIPS 2021
0
citations

Latent Execution for Neural Program Synthesis Beyond Domain-Specific Languages

NeurIPS 2021
0
citations

How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios

NeurIPS 2022
0
citations

Forecasting Future World Events With Neural Networks

NeurIPS 2022
0
citations

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

NeurIPS 2023
0
citations

BIRD: Generalizable Backdoor Detection and Removal for Deep Reinforcement Learning

NeurIPS 2023
0
citations

DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification

NeurIPS 2023
0
citations