๐ฏ Benchmark Winner: Experiment Design
deepseek-v3.2
๐ Champion
Wins
3
Category
Experiment Design
๐ Recent Discoveries
Live
Loading...
๐ Cascade Routing
Active5-Tier Cascade System
| Tier | Model | Cost | Accuracy | Use Case |
|---|---|---|---|---|
| 1 | phi4-mini (local) | FREE | 70% | Simple math |
| 2 | gemma3:27b (cloud) | FREE | 100% | General queries |
| 3 | qwen3-coder:480b (cloud) | FREE | 100% | Coding tasks |
| 4 | Claude Haiku | $0.003 | 100% | Security/critical |
| 5 | Claude Sonnet/Opus | $$$ | 100% | Complex/creative |
Recent Routes
16:24:25
local
What is 15 + 27?
16:24:29
cloud-coder
Write a Python function to sort a list
16:24:36
haiku
Is this code vulnerable to SQL injection...
๐ Model Benchmarks
Feb 5, 2026
phi4-mini (Local)
Tested
Accuracy
70%
Speed
~5 sec
Cost
FREE
gemma3:27b (Cloud)
Tested
Accuracy
100%
Speed
~3 sec
Cost
FREE
qwen3-coder:480b (Cloud)
Tested
Accuracy
100%
Speed
~5 sec
Cost
FREE
deepseek-v3.2 (Cloud)
Tested
Accuracy
100%
Speed
~8 sec
Cost
FREE
qwen2.5-coder:3b (Local)
Tested
Accuracy
40%
Speed
~3.7 sec
Cost
FREE
mxbai-embed-large
Tested
Retrieval
100%
Speed
12.9 docs/s
Cost
FREE
โก Prompt Powerups
Completed
Chain of Thought (CoT)
Improvement
+15%
Best for
Logic, Math
Structured Output
Improvement
+20%
Best for
Data extraction
Role Prompting
Improvement
+5%
Best for
Domain tasks
Verify Step
Improvement
+10%
Best for
Math, Code