BLXBench

Test fixture

Speed Perf Throughput Essay

Speed · hard · scorer: contains_all

Latency-sensitive tasks where concise correct output matters.

How it is scored

The model receives the user prompt (and an optional system message). The run is scored by contains_all using the JSON configuration below. Pass/fail and partial credit are determined entirely by that scorer against the model output; there is no human grading.

User prompt
Write a long reflective essay in English on how software teams balance shipping speed with quality and reliability. Use exactly these Markdown headings in order, each followed by several substantial paragraphs:

## Introduction
## Historical context
## Trade-offs
## Practical recommendations
## Conclusion

Aim for breadth and depth of prose (not bullet lists). Continue writing until you have fully developed each section.
Scorer config
{
  "expected_contains": [
    "## Introduction",
    "## Trade-offs",
    "## Conclusion",
    "quality",
    "team"
  ]
}
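
The check described above can be sketched in a few lines of Python. This is a minimal illustration, not BLXBench's actual implementation: the function name `contains_all`, the case-sensitive substring matching, and the partial-credit rule (fraction of expected strings found) are all assumptions. Only the `expected_contains` key and its values come from the scorer config.

```python
import json

# Scorer config copied from the fixture.
CONFIG = json.loads("""
{
  "expected_contains": [
    "## Introduction",
    "## Trade-offs",
    "## Conclusion",
    "quality",
    "team"
  ]
}
""")


def contains_all(output: str, expected_contains: list[str]) -> dict:
    """Hypothetical scorer: pass only if every expected string is a substring.

    Assumption: matching is case-sensitive, and partial credit is the
    fraction of expected strings that were found.
    """
    missing = [s for s in expected_contains if s not in output]
    total = len(expected_contains)
    score = (total - len(missing)) / total if total else 1.0
    return {"passed": not missing, "score": score, "missing": missing}


# A truncated essay that never reaches its conclusion.
sample = (
    "## Introduction\nEvery team feels the pull between speed and quality.\n"
    "## Trade-offs\nShipping faster usually means accepting more risk.\n"
)
result = contains_all(sample, CONFIG["expected_contains"])
print(result)
```

Note that the config checks only three of the five headings requested in the prompt, so under these assumptions an output that omits "## Historical context" or "## Practical recommendations" would still pass.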
Run parameters

temperature: 0.3
max_tokens: 2048
timeout (s): 300
type: throughput
file: speed_perf_throughput_essay.json
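
As a rough illustration of how a runner might use these parameters for a throughput-type test, the sketch below computes output tokens per second and checks the timeout. The page does not state how BLXBench measures throughput, so the `RunParams` class, both helper functions, and the tokens-per-second formula are assumptions. Only the three default values come from the run parameters above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunParams:
    """Hypothetical container; default values are taken from the fixture."""
    temperature: float = 0.3
    max_tokens: int = 2048
    timeout_s: float = 300.0


def tokens_per_second(output_tokens: int, elapsed_s: float) -> float:
    """Assumed throughput metric: generated tokens divided by wall-clock time."""
    if elapsed_s <= 0:
        raise ValueError("elapsed_s must be positive")
    return output_tokens / elapsed_s


def timed_out(elapsed_s: float, params: RunParams) -> bool:
    """True if the run took longer than the configured timeout."""
    return elapsed_s > params.timeout_s


params = RunParams()
# Example: a run that used the full token budget in 64 seconds.
rate = tokens_per_second(params.max_tokens, 64.0)
print(f"{rate:.1f} tokens/s, timed out: {timed_out(64.0, params)}")
```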


BLXBench

Community driven leaderboard

Public benchmark runner: run in your environment, share results with the community.

© 2026 BLXBench by bitslix.com

Provenance: Aggregated from user runs
Scope: 6 / 7 / 372
Latest: run_fa781e / 7 / $0.02