Multi-Step Word Problem IEP Goals: Examples and Data Collection Tips
Multi-step word problems sit at the top of the difficulty pyramid for most students with math IEPs — and they deserve their own goal, not just a harder version of a single-step goal. When a student can solve a one-step problem but falls apart the moment a problem requires two operations in sequence, those are two different skill sets requiring two different instructional targets.
This post gives you ready-to-use IEP goal examples for multi-step word problems across Grades 1–4, a benchmark sequence that builds the skill progressively, and a step-level data collection system that shows you exactly where students get stuck — not just whether the final answer was right.
Quick Summary
- Why multi-step problems need their own IEP goal separate from single-step goals
- Ready-to-use goal language for 2-step and 3-step problems in Grades 1–4
- A four-benchmark sequence that builds multi-step skill progressively
- Exactly what to track at the step level — not just whether the final answer was correct
- A sample data sheet and real-sounding progress notes you can adapt right away
Why Multi-Step Word Problems Need Their Own IEP Goal
A single-step word problem asks a student to identify one operation and execute it. A multi-step problem asks something categorically harder: read the whole problem, identify that more than one operation is needed, decide the correct sequence, carry information forward between steps, and verify the final answer. These are not on the same continuum.
Yet most IEP goals treat them as if they are — as if "word problems at 80% accuracy" covers both cases. It doesn't. A student can score 85% on single-step problems and 38% on two-step problems in the same week with the same teacher. The chart below shows exactly how common that gap is.
Single-Step vs. Multi-Step Accuracy: Same Students, Same Week
The skills that break down in multi-step problems — identifying the hidden question, sequencing operations, holding intermediate values — are each worth targeting explicitly. When you bundle them under one broad goal, you lose the ability to see which skill is blocking the student and what to do about it.
What Makes a Multi-Step Word Problem Goal Measurable
Single-step word problem goals are relatively straightforward to write. Multi-step goals are harder because there are more places to be vague — and vague goals produce vague data.
"Given grade-level word problems, [Student] will solve multi-step problems with 80% accuracy across 3 consecutive probes." — This goal scores only the final answer. A student who consistently gets Step 1 right but fails Step 2 will hover around 50% accuracy forever, and you will never know why.
Here's what to do instead:
- Specify the number of steps — "two-step" and "three-step" are meaningfully different; use the right one for where your student is and where they're going
- Name the operations — "addition and subtraction" is clearer than "mixed operations," especially in Grades 1–2
- Define the scaffold level — "with a graphic organizer," "with visual step prompts," or "independently" changes what the data means
- Target step-level accuracy — write your criterion as "correctly completing both steps" rather than "correct final answer only"
- Anchor to a problem set — consistent problem formatting removes layout as a confounding variable in your weekly probes
IEP Goal Examples by Grade Level (Grades 1–4)
Each goal below includes the condition, behavior, and criterion. Adjust the accuracy threshold and trial count to match your student's baseline and your team's standards.
Grade 1 — Two-Step Addition and Subtraction
- Condition: visual step organizer provided
- Skill: identify operation per step + solve both steps correctly
- Criterion: 4 of 5 problems, both steps correct
- Consistency: 3 consecutive weekly probes
Grade 2 — Two-Step with CUBES Strategy
- Condition: CUBES strategy graphic organizer provided
- Skill: complete both steps + correct final answer
- Criterion: 4 of 5 problems
- Consistency: 4 consecutive weekly probes
Grade 3 — Two-Step Mixed Operations
- Condition: CUBES reference card available
- Skill: identify both operations + complete steps in order + correct final answer
- Criterion: 4 of 5 problems
- Consistency: 4 consecutive weekly probes
Grade 4 — Multi-Step with All Operations
- Condition: no visual supports (independent)
- Skill: identify operations, set up equations, correct final answer
- Criterion: 4 of 5 problems
- Consistency: 4 consecutive weekly probes
How to Write Benchmarks for Multi-Step Word Problem Goals
Benchmarks are where multi-step IEP goals really earn their value. A well-structured sequence tells you not just whether the student hit the annual goal, but exactly which part of the skill chain they mastered along the way. For multi-step problems, that granularity matters — the fix for "can't identify Step 2" is completely different from "can identify it but can't compute it."
| Benchmark | Timing | Target Skill | Criterion |
|---|---|---|---|
| Benchmark 1 | ~10 weeks | Identify that two steps are needed; state or circle both questions embedded in the problem | 4 of 5 problems, 3 consecutive probes |
| Benchmark 2 | ~20 weeks | Select the correct operation for both steps (computation accuracy not yet required) | 4 of 5 problems, 3 consecutive probes |
| Benchmark 3 | ~28 weeks | Complete Step 1 accurately (correct operation + correct calculation); Step 2 attempted but not scored for criterion | 4 of 5 problems, 3 consecutive probes |
| Annual Goal | ~36 weeks | Complete both steps accurately; correct final answer; independent (no visual supports) | 4 of 5 problems, 4 consecutive probes |
Notice that Benchmarks 1 and 2 don't require correct computation. This is intentional. If a student can't yet identify that a problem needs two steps, drilling computation is the wrong intervention. Sequencing the benchmarks this way lets you target instruction exactly where the skill breaks down.
Also build the scaffold fade into your benchmarks explicitly. If your annual goal says "independently" but all your benchmarks say "with graphic organizer," there's no bridge between them. Plan the fade: full CUBES organizer at Benchmark 1, simplified checklist at Benchmark 2, student-created reference at Benchmark 3, fully independent at the annual goal.
What to Track in Your Data — Step by Step
If you score only the final answer, you will miss most of what matters. A student who consistently gets Step 1 right but fails Step 2 has a different problem than a student who gets Step 1 wrong — and these require different interventions. Here is what to track and why each data point tells you something the final answer can't.
- Step 1 vs. Step 2 accuracy (separately). Track each step's accuracy independently so you can see where the chain breaks.
- Operation selection vs. computation accuracy. A student can select the right operation and still compute incorrectly. Use a two-column tally: "Correct Op?" and "Correct Calc?" — don't let computation errors mask good operation identification.
- Sequence errors. Does the student attempt Step 2 before completing Step 1, or skip Step 1 entirely? Mark sequence errors separately in your data (e.g., "SE"). A student who sequences wrong needs different instruction than one who sequences correctly but computes wrong.
- Prompt level. Was the student independent, or did they need a verbal redirect to move from Step 1 to Step 2? Track the highest prompt level needed per session — a drop from 2 verbal cues to independent over 8 weeks is meaningful progress even if the final accuracy score looks similar.
Sample Step-Level Data Sheet
| Problem | Op 1 Correct? | Calc 1 Correct? | Op 2 Correct? | Calc 2 Correct? | Final Answer | Prompt Level |
|---|---|---|---|---|---|---|
| Problem 1 | Yes | Yes | Yes | No | Incorrect | Independent |
| Problem 2 | Yes | Yes | No | — | Incorrect | 1 Verbal |
| Problem 3 | Yes | Yes | Yes | Yes | Correct | Independent |
| Problem 4 | Seq. Error | — | — | — | Incorrect | 2 Verbal |
| Problem 5 | Yes | Partial | Yes | Yes | Incorrect | Independent |
From this single session: Step 1 operation identification is solid (4/5), Step 2 operation selection is emerging (3/4 attempted), and there is one sequencing error. The final answer score is 1/5 — which dramatically understates where this student is. You would not get any of this from scoring only the final answer.
Step Completion Rate by Stage — One Student, 8 Weeks
For a deeper look at setting up a complete data collection system for word problem IEP goals — including how often to probe, what to record at each benchmark, and how to translate weekly data into progress notes — see our post on tracking data for word problems in IEP goals.
Frequently Asked Questions
Can I use one IEP goal to cover both single-step and multi-step word problems?
You can, but it usually isn't wise — especially if multi-step problems are where the skill deficit lives. A combined goal almost always gets met on single-step probes, which inflates your data and hides the multi-step gap. Write separate goals if your baseline shows a meaningful accuracy difference between problem types. The chart in Section 1 shows what that gap typically looks like.
How do I set the baseline for a multi-step word problem IEP goal?
Administer 5–10 two-step problems at the target grade level under normal classroom conditions. Score each problem at the step level — was Step 1 attempted, was the operation correct, was the calculation correct, was Step 2 attempted? Your baseline should reflect what the student does across 2–3 probes before new instruction begins, not a single data point taken on one day.
My student scores 80% with a graphic organizer but 30% without one. Which number goes in the IEP?
Both — and that contrast is exactly what your IEP needs. Write your present level as: "With CUBES graphic organizer: 80% accuracy on both steps. Without graphic organizer: 30% accuracy." Then write your annual goal to target either fully independent performance or a progressively faded scaffold. Hiding the scaffold in your baseline produces a misleading picture of where the student actually is.
Do I need to mention the CUBES strategy by name in the IEP goal?
You don't have to, but if students are expected to use the CUBES strategy as part of their instruction, naming it in the condition makes data collection conditions explicit — "with CUBES graphic organizer provided" is a specific, defensible condition. You can then plan to fade that support across benchmarks as the student internalizes the process.
How often should I collect data on a multi-step word problem goal?
Weekly probes with 5 problems give you a reliable trend within 4–6 weeks. Administer probes without instructional support so the data reflects independent skill, not assisted performance. You can note observations on non-probe days, but don't count coached practice sessions as IEP goal data.
What if my student can do two-step problems with addition but not when multiplication is involved?
Write the goal at the operation level your student is ready for, and note the limitation explicitly in present levels. "Student solves two-step addition and subtraction problems at 75% accuracy but cannot yet correctly identify multiplication as an operation in a word problem" is a valid present level. Write a separate or subsequent goal that adds multiplication once the foundational skill is solid.