gradient science

Mar 4, 2024

We study the robustness benefits of pre-training and characterize failure modes that pre-training can and cannot address.

Jan 24, 2024

Selecting better data by approximating how models learn from data.

Dec 12, 2023

We introduce a new framework for data attribution in generative settings, and propose an efficient method to attribute diffusion models.

Jul 20, 2023

We introduce a new perspective on backdoor attacks and defenses in deep learning.

Mar 27, 2023

We introduce TRAK, a new data attribution method that scales to large(r) models!

Feb 16, 2023

We introduce dataset interfaces, a scalable framework that synthesizes counterfactual examples under user-specified shifts.

Dec 14, 2022

We demonstrate how to use Stable Diffusion to target a model's failure modes.

Nov 23, 2022

We introduce a framework for comparing ML models trained with different learning algorithms.