Performance metrics across cloud providers as of 2025-09-10
Provider | Model | Avg Latency (ms) | Best (ms) | Worst (ms) | Pass Rate* |
---|---|---|---|---|---|
`max_tokens` limit across multiple samples. (Some reasoning models failed by initially outputting `<think>` tokens.)
The information on this page is freely available under CC-BY-SA 4.0