Story points
are dead.
A framework for estimating software work in dollars of LLM inference — not hours, not abstract points. Honest, falsifiable, calibrated to 2026.
Hours lied. Story points were a costume.
For 20 years we estimated software in points — an abstract, unfalsifiable unit invented to dodge the obvious failures of hour estimates. It worked, sort of, when humans wrote every line of code.
That world is gone. In 2026, agents write the first draft of most production code. The bottleneck is no longer "how long will a developer type this?" — it's "how many turns of inference, with which model, against which codebase, will it take to ship?"
That question has a real answer in dollars. The token cost is on the API invoice every month. Ignored. TokenPoints is a planning vocabulary built around using it.
A sizing scale anchored in USD.
Five buckets, plus a "spike first" escape hatch for unknown work. The numbers are starting anchors — every team recalibrates with its own data after two sprints.
Anything above XL must be decomposed. If you can't decompose it, you don't understand it yet — that's a spike.
FAQ
Real questions from the community. If yours isn't here, open an issue.
?? size is for. If you can't decompose the work, you don't know its shape yet — forcing a number is fiction. Time-box a spike, produce sized sub-tasks, then estimate. Spike-first is the framework's release valve for genuine uncertainty.
Three ways to contribute.
TokenPoints is v0.1 and community-driven. The most valuable thing you can give back is real data from your team.
Share calibration data
Anonymized aggregate numbers from your team. The most valuable contribution — turns this from one person's idea into an empirical reference.
Open issue → 🗳Vote on the name
TokenPoints, TASK$, Cost-Anchored Sizing, or something better? The name is open until ~30 votes. Drop a number and a sentence.
Cast your vote → 🌐Translate
PT-BR, ES, FR, DE, JP — make the framework accessible to non-English teams. Good first issue, no architectural decisions required.
Pick a language →