Multi-Step Word Problem IEP Goals: Examples and Data Collection Tips

Multi-Step Word Problem IEP Goals: Examples and Data Collection Tips

Multi-step word problems sit at the top of the difficulty pyramid for most students with math IEPs — and they deserve their own goal, not just a harder version of a single-step goal. When a student can solve a one-step problem but falls apart the moment a problem requires two operations in sequence, those are two different skill sets requiring two different instructional targets.

This post gives you ready-to-use IEP goal examples for multi-step word problems across Grades 1–4, a benchmark sequence that builds the skill progressively, and a step-level data collection system that shows you exactly where students get stuck — not just whether the final answer was right.

What You'll Find Here

Quick Summary

  • Why multi-step problems need their own IEP goal separate from single-step goals
  • Ready-to-use goal language for 2-step and 3-step problems in Grades 1–4
  • A four-benchmark sequence that builds multi-step skill progressively
  • Exactly what to track at the step level — not just whether the final answer was correct
  • A sample data sheet and real-sounding progress notes you can adapt right away

Why Multi-Step Word Problems Need Their Own IEP Goal

A single-step word problem asks a student to identify one operation and execute it. A multi-step problem asks something categorically harder: read the whole problem, identify that more than one operation is needed, decide the correct sequence, carry information forward between steps, and verify the final answer. These are not on the same continuum.

Yet most IEP goals treat them as if they are — as if "word problems at 80% accuracy" covers both cases. It doesn't. A student can score 85% on single-step problems and 38% on two-step problems in the same week with the same teacher. The chart below shows exactly how common that gap is.

2–3×
more cognitive steps required vs. single-step problems
~50%
typical accuracy drop from single-step to multi-step for students with math IEPs
#1
reason multi-step goals fail: measuring only the final answer
Chart 1 of 2 — The Accuracy Gap

Single-Step vs. Multi-Step Accuracy: Same Students, Same Week

Four students with math IEPs — same teacher, same math program. The gap is why a separate goal is needed.
0% 20% 40% 60% 80% 100% 85% 38% Student A 78% 31% Student B 82% 42% Student C 90% 52% Student D Single-Step Accuracy Multi-Step Accuracy
What this means for IEP writing: Student A scores 85% on single-step but only 38% on multi-step. A goal written at "80% accuracy on word problems" would appear met on single-step probes while completely hiding the multi-step deficit.

The skills that break down in multi-step problems — identifying the hidden question, sequencing operations, holding intermediate values — are each worth targeting explicitly. When you bundle them under one broad goal, you lose the ability to see which skill is blocking the student and what to do about it.

What Makes a Multi-Step Word Problem Goal Measurable

Single-step word problem goals are relatively straightforward to write. Multi-step goals are harder because there are more places to be vague — and vague goals produce vague data.

Common Goal-Writing Mistake

"Given grade-level word problems, [Student] will solve multi-step problems with 80% accuracy across 3 consecutive probes." — This goal scores only the final answer. A student who consistently gets Step 1 right but fails Step 2 will hover around 50% accuracy forever, and you will never know why.

Here's what to do instead:

  • Specify the number of steps — "two-step" and "three-step" are meaningfully different; use the right one for where your student is and where they're going
  • Name the operations — "addition and subtraction" is clearer than "mixed operations," especially in Grades 1–2
  • Define the scaffold level — "with a graphic organizer," "with visual step prompts," or "independently" changes what the data means
  • Target step-level accuracy — write your criterion as "correctly completing both steps" rather than "correct final answer only"
  • Anchor to a problem set — consistent problem formatting removes layout as a confounding variable in your weekly probes

IEP Goal Examples by Grade Level (Grades 1–4)

Each goal below includes the condition, behavior, and criterion. Adjust the accuracy threshold and trial count to match your student's baseline and your team's standards.

Grade 1 — Two-Step Addition and Subtraction

Grade 1 Goal Example — 2-Step
"Given a set of 5 two-step word problems involving addition and subtraction within 20, with a visual step organizer provided, [Student Name] will correctly identify the operation needed for each step and produce the correct answer for both steps in at least 4 out of 5 problems, across 3 consecutive weekly probes as measured by teacher-recorded step-level data."
  • Condition: visual step organizer provided
  • Skill: identify operation per step + solve both steps correctly
  • Criterion: 4 of 5 problems, both steps correct
  • Consistency: 3 consecutive weekly probes

Grade 2 — Two-Step with CUBES Strategy

Grade 2 Goal Example — 2-Step with CUBES
"Given a set of 5 two-step word problems involving addition and subtraction within 100, using the CUBES strategy graphic organizer, [Student Name] will correctly complete both steps and arrive at the correct final answer in 4 of 5 problems across 4 consecutive weekly probes as measured by step-scored data sheets."
  • Condition: CUBES strategy graphic organizer provided
  • Skill: complete both steps + correct final answer
  • Criterion: 4 of 5 problems
  • Consistency: 4 consecutive weekly probes
Multi-Step Word Problems with CUBES Strategy – Grade 2

Multi-Step Word Problems with CUBES Strategy — Grade 2

$6.99

Structured two-step addition and subtraction word problems built specifically for the Grade 2 goal above. Consistent problem formatting means your data reflects student skill — not variability in problem layout. Includes graphic organizers aligned to the CUBES strategy.

Get the Grade 2 Resource →

Grade 3 — Two-Step Mixed Operations

Grade 3 Goal Example — 2-Step Mixed Operations
"Given 5 two-step word problems involving addition, subtraction, and multiplication within 1,000, with a CUBES strategy reference card available, [Student Name] will correctly identify both operations, complete both steps in order, and arrive at the correct final answer on 4 of 5 problems across 4 consecutive weekly probes as measured by step-level scoring."
  • Condition: CUBES reference card available
  • Skill: identify both operations + complete steps in order + correct final answer
  • Criterion: 4 of 5 problems
  • Consistency: 4 consecutive weekly probes
Multi-Step Word Problems with CUBES Strategy – Grade 3 Mixed Operations

Multi-Step Word Problems with CUBES Strategy — Grade 3 Mixed Operations

$6.99

Covers addition, subtraction, and multiplication in two-step and three-step formats — exactly the progression the Grade 3 goal requires. Consistent problem structure means you can score step-by-step without hunting for matching problems every week.

Get the Grade 3 Resource →

Grade 4 — Multi-Step with All Operations

Grade 4 Goal Example — 2-Step All Operations
"Given 5 two-step word problems involving any combination of addition, subtraction, multiplication, and division with whole numbers, [Student Name] will independently identify the correct operations for both steps, set up both equations correctly, and produce a correct final answer in 4 of 5 problems across 4 consecutive weekly probes as measured by teacher-scored step data."
  • Condition: no visual supports (independent)
  • Skill: identify operations, set up equations, correct final answer
  • Criterion: 4 of 5 problems
  • Consistency: 4 consecutive weekly probes

How to Write Benchmarks for Multi-Step Word Problem Goals

Benchmarks are where multi-step IEP goals really earn their value. A well-structured sequence tells you not just whether the student hit the annual goal, but exactly which part of the skill chain they mastered along the way. For multi-step problems, that granularity matters — the fix for "can't identify Step 2" is completely different from "can identify it but can't compute it."

Benchmark Timing Target Skill Criterion
Benchmark 1 ~10 weeks Identify that two steps are needed; state or circle both questions embedded in the problem 4 of 5 problems, 3 consecutive probes
Benchmark 2 ~20 weeks Select the correct operation for both steps (computation accuracy not yet required) 4 of 5 problems, 3 consecutive probes
Benchmark 3 ~28 weeks Complete Step 1 accurately (correct operation + correct calculation); Step 2 attempted but not scored for criterion 4 of 5 problems, 3 consecutive probes
Annual Goal ~36 weeks Complete both steps accurately; correct final answer; independent (no visual supports) 4 of 5 problems, 4 consecutive probes

Notice that Benchmarks 1 and 2 don't require correct computation. This is intentional. If a student can't yet identify that a problem needs two steps, drilling computation is the wrong intervention. Sequencing the benchmarks this way lets you target instruction exactly where the skill breaks down.

Also build the scaffold fade into your benchmarks explicitly. If your annual goal says "independently" but all your benchmarks say "with graphic organizer," there's no bridge between them. Plan the fade: full CUBES organizer at Benchmark 1, simplified checklist at Benchmark 2, student-created reference at Benchmark 3, fully independent at the annual goal.

What to Track in Your Data — Step by Step

If you score only the final answer, you will miss most of what matters. A student who consistently gets Step 1 right but fails Step 2 has a different problem than a student who gets Step 1 wrong — and these require different interventions. Here is what to track and why each data point tells you something the final answer can't.

  • Step 1 vs. Step 2 accuracy (separately). Track each step's accuracy independently so you can see where the chain breaks.
  • Operation selection vs. computation accuracy. A student can select the right operation and still compute incorrectly. Use a two-column tally: "Correct Op?" and "Correct Calc?" — don't let computation errors mask good operation identification.
  • Sequence errors. Does the student attempt Step 2 before completing Step 1, or skip Step 1 entirely? Mark sequence errors separately in your data (e.g., "SE"). A student who sequences wrong needs different instruction than one who sequences correctly but computes wrong.
  • Prompt level. Was the student independent, or did they need a verbal redirect to move from Step 1 to Step 2? Track the highest prompt level needed per session — a drop from 2 verbal cues to independent over 8 weeks is meaningful progress even if the final accuracy score looks similar.

Sample Step-Level Data Sheet

Problem Op 1 Correct? Calc 1 Correct? Op 2 Correct? Calc 2 Correct? Final Answer Prompt Level
Problem 1 Yes Yes Yes No Incorrect Independent
Problem 2 Yes Yes No Incorrect 1 Verbal
Problem 3 Yes Yes Yes Yes Correct Independent
Problem 4 Seq. Error Incorrect 2 Verbal
Problem 5 Yes Partial Yes Yes Incorrect Independent

From this single session: Step 1 operation identification is solid (4/5), Step 2 operation selection is emerging (3/4 attempted), and there is one sequencing error. The final answer score is 1/5 — which dramatically understates where this student is. You would not get any of this from scoring only the final answer.

Chart 2 of 2 — Progress Over Time

Step Completion Rate by Stage — One Student, 8 Weeks

Which steps were completed at each checkpoint — not just whether the final answer was correct
0% 25% 50% 75% 100% 60% 20% 10% Week 1 80% 40% 30% Week 4 95% 60% 55% Week 8 Step 1 Complete Step 2 Complete Checked Answer
How to read this: By Week 8, this student completes Step 1 in nearly every problem and checks their answer most of the time. Step 2 is still the growth edge. Final-answer-only scoring would show roughly 40% accuracy at Week 8 — but step-level data tells you the student has mastered Step 1 and needs targeted support for Step 2 specifically.

For a deeper look at setting up a complete data collection system for word problem IEP goals — including how often to probe, what to record at each benchmark, and how to translate weekly data into progress notes — see our post on tracking data for word problems in IEP goals.


Frequently Asked Questions

Can I use one IEP goal to cover both single-step and multi-step word problems?

You can, but it usually isn't wise — especially if multi-step problems are where the skill deficit lives. A combined goal almost always gets met on single-step probes, which inflates your data and hides the multi-step gap. Write separate goals if your baseline shows a meaningful accuracy difference between problem types. The chart in Section 1 shows what that gap typically looks like.

How do I set the baseline for a multi-step word problem IEP goal?

Administer 5–10 two-step problems at the target grade level under normal classroom conditions. Score each problem at the step level — was Step 1 attempted, was the operation correct, was the calculation correct, was Step 2 attempted? Your baseline should reflect what the student does across 2–3 probes before new instruction begins, not a single data point taken on one day.

My student scores 80% with a graphic organizer but 30% without one. Which number goes in the IEP?

Both — and that contrast is exactly what your IEP needs. Write your present level as: "With CUBES graphic organizer: 80% accuracy on both steps. Without graphic organizer: 30% accuracy." Then write your annual goal to target either fully independent performance or a progressively faded scaffold. Hiding the scaffold in your baseline produces a misleading picture of where the student actually is.

Do I need to mention the CUBES strategy by name in the IEP goal?

You don't have to, but if students are expected to use the CUBES strategy as part of their instruction, naming it in the condition makes data collection conditions explicit — "with CUBES graphic organizer provided" is a specific, defensible condition. You can then plan to fade that support across benchmarks as the student internalizes the process.

How often should I collect data on a multi-step word problem goal?

Weekly probes with 5 problems give you a reliable trend within 4–6 weeks. Administer probes without instructional support so the data reflects independent skill, not assisted performance. You can note observations on non-probe days, but don't count coached practice sessions as IEP goal data.

What if my student can do two-step problems with addition but not when multiplication is involved?

Write the goal at the operation level your student is ready for, and note the limitation explicitly in present levels. "Student solves two-step addition and subtraction problems at 75% accuracy but cannot yet correctly identify multiplication as an operation in a word problem" is a valid present level. Write a separate or subsequent goal that adds multiplication once the foundational skill is solid.

Shop SER Resources

Multi-Step Word Problem Resources

Multi-Step Grade 2
Multi-Step Word Problems — Grade 2
$6.99
Shop Now →
Multi-Step Grade 3 Mixed Ops
Multi-Step Word Problems — Grade 3 Mixed Ops
$6.99
Shop Now →
Multi-Step Grade 3 Bundle
Multi-Step Word Problems — Grade 3 Bundle
$15
Shop Now →
MEGA BUNDLE Grades 1–4
MEGA BUNDLE — Full Year Word Problems Gr. 1–4
$150
Shop Now →
Back to blog