Benchmarks

With Superficial, models from OpenAI, Google, xAI, and Anthropic achieve 100% factual accuracy, as measured on Google DeepMind’s FACTS benchmark.

Models are evaluated using Google DeepMind’s FACTS methodology. When FACTS marks a response as inaccurate, we enhance it once (one-shot) using Superficial’s audit results, then re-score the enhanced response with FACTS, so the accuracy gain is measured independently of Superficial. Real-world results may vary with source availability and domain complexity.
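
The score, enhance, re-score loop described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not Superficial's or FACTS's actual code: `is_accurate` and `enhance` are hypothetical stand-ins for the FACTS scorer and the audit-based enhancement step, and `Outcome`, `evaluate`, and `accuracy_gain` are names invented for this example.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Outcome:
    baseline_accurate: bool  # verdict on the model's original response
    final_accurate: bool     # verdict after at most one enhancement pass


def evaluate(
    responses: List[str],
    is_accurate: Callable[[str], bool],  # hypothetical stand-in for the FACTS scorer
    enhance: Callable[[str], str],       # hypothetical stand-in for audit-based enhancement
) -> List[Outcome]:
    """Score each response; enhance and re-score only those marked inaccurate."""
    outcomes = []
    for response in responses:
        baseline = is_accurate(response)
        final = baseline
        if not baseline:
            # One-shot: enhance exactly once, then re-score with the same scorer.
            final = is_accurate(enhance(response))
        outcomes.append(Outcome(baseline, final))
    return outcomes


def accuracy(flags: List[bool]) -> float:
    """Fraction of True verdicts; 0.0 for an empty list."""
    return sum(flags) / len(flags) if flags else 0.0


def accuracy_gain(outcomes: List[Outcome]) -> Tuple[float, float, float]:
    """Return (accuracy before, accuracy after, gain)."""
    before = accuracy([o.baseline_accurate for o in outcomes])
    after = accuracy([o.final_accurate for o in outcomes])
    return before, after, after - before
```

Because the same scorer judges both the original and the enhanced response, the reported gain reflects the scorer's verdicts and not the enhancer's own assessment of its output.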