Name: Step 3.7 Flash
Price: 0.20 USD
Author: StepFun

Question 1

What is Step 3.7 Flash?

Accepted Answer

Step 3.7 Flash is StepFun's open-weight 196B-parameter sparse Mixture-of-Experts vision-language model, released on May 29, 2026 under the Apache 2.0 license. It activates ~11B parameters per token, supports a 256K-token context window, and understands images and video natively.

Question 2

How much does Step 3.7 Flash cost to use via API?

Accepted Answer

Input tokens cost $0.20 per million (cache miss) or $0.04 per million (cache hit). Output tokens cost $1.15 per million. The model weights are free to download and self-host under Apache 2.0.

Question 3

How much memory do I need to run Step 3.7 Flash locally?

Accepted Answer

At least 128 GB of unified memory is required for local deployment — for example a Mac Studio M3 Ultra, NVIDIA DGX Station, or AMD Ryzen AI Max+ 395. For cloud or datacenter deployment use H100 / A100 nodes via vLLM, SGLang, or NVIDIA NIM. Quantized GGUF variants are available for lower-memory setups.

Question 4

Is Step 3.7 Flash compatible with the OpenAI API?

Accepted Answer

Yes. Step 3.7 Flash exposes an OpenAI-compatible /v1/chat/completions endpoint. You only need to change base_url and model in your existing OpenAI SDK code.

Question 5

Is Step 3.7 Flash compatible with the Anthropic API?

Accepted Answer

Yes. StepFun also exposes an Anthropic Messages-compatible endpoint, so you can reuse Anthropic SDK code by swapping base_url and model.

Question 6

What is Advisor Mode?

Accepted Answer

Advisor Mode is StepFun's implementation of the advisor strategy described by Anthropic. Step 3.7 Flash handles the full agentic loop and escalates to a larger frontier model only at critical inflection points (planning, recovery from failures). This delivers 97% of Claude Opus 4.6's coding performance on SWE-Bench Verified at roughly 1/9th the per-task cost ($0.19 vs $1.76).

Question 7

Which agent frameworks does Step 3.7 Flash support?

Accepted Answer

It works out of the box with Claude Code, Cline, Roo Code, KiloCode, OpenCode, Hermes Agent, and OpenClaw. No workflow rewiring is required.

Question 8

What license is Step 3.7 Flash released under?

Accepted Answer

Apache 2.0 — free for commercial and non-commercial use.

Benchmark	Step 3.7 Flash	DeepSeek V4 Flash	GPT 5.5	Claude Opus 4.7
SWE-Bench Pro	56.3	55.6	58.6	64.3
Terminal-Bench 2.1	59.6	62.0	82.7	69.4
ClawEval-1.1	67.1 ★	57.8	60.3	70.8
HLE w/ Tool	47.2	45.1	52.2	54.7
SimpleVQA w/ Search	79.2 ★	—	79.1	—
Toolathlon	49.5	52.8	60.2	65.4

The high-efficiency Flash model for real-world agents

97% of Claude Opus 4.6 coding performance. 1/9th the cost.

Built for production agents

Native multimodal

Agentic coding

Reliable tool use & orchestration

Adaptive reasoning

Works with your stack

Two API flavors, one model

Benchmarks

Deploy in minutes

Minimum requirements

Pricing

Input (cache miss)

Input (cache hit)

Output

Frequently asked questions