GSM8K-Platinum: Revealing Performance Gaps in Frontier LLMs

We present GSM8K-Platinum, a revised version of the GSM8K benchmark that reveals meaningful differences in frontier model capabilities.

Do Large Language Model Benchmarks Test Reliability?

We introduce the concept of platinum benchmarks to better quantify model reliability.

D3M: Improving Group Robustness via Dataset Selection

Using ContextCite for LLM Reliability

We use our method, ContextCite, to detect unverified statements and discover poisoned documents.

ContextCite: Attributing Model Generation to Context

We present ContextCite, a method for attributing statements generated by language models back to specific information provided in-context.

Editing Predictions by Modeling Model Computation

We use our component modeling framework to design targeted model edits.

Decomposing Predictions by Modeling Model Computation

We introduce a framework called component modeling for studying how model components collectively shape ML predictions.

How Can We Harness Pre-Training to Develop Robust Models?

We explore a simple principle for harnessing pre-training to develop robust models.