Why K–12 Staff Training So Often Feels Overwhelming (And What to Do About It)
By Kalin Schoephoerster | KShep Creative
Picture a full-day professional development. Back-to-back sessions. A packed agenda. Presenters who clearly know their material. By session three, half the room has stopped taking notes. By lunch, people are checking their phones. By the end of the day, nobody can tell you the three most important things they were supposed to learn.
The content was good. The facilitators were prepared. Staff weren't disengaged on purpose. So what happened?
In most cases, what happened was cognitive load. And it shows up in K–12 professional development more than almost anywhere else.
If you've already read Why Your K–12 Staff Training Isn't Changing Practice →, this post goes a layer deeper into why training design choices matter so much — and what the science says about how people actually learn.
What cognitive load actually is
Cognitive load theory was developed by educational psychologist John Sweller and is grounded in a straightforward insight: working memory is limited. People can only hold and process a small amount of new information at one time. When that limit is exceeded, learning breaks down. Information doesn't get processed deeply enough to move into long-term memory, and it doesn't transfer to practice.
There are three types worth knowing.
Intrinsic load is the inherent complexity of the content itself. Some topics are genuinely harder than others, and that complexity can't be designed away — only managed thoughtfully.
Extraneous load is the unnecessary cognitive effort created by poor design. Confusing slides, unclear instructions, irrelevant information, too many things happening at once. This type of load doesn't contribute to learning at all. It just uses up mental bandwidth that could have gone toward understanding the content.
Germane load is the productive mental effort that actually leads to learning — making connections, applying concepts, building understanding over time. This is the load worth investing in.
The goal of good training design is to keep intrinsic load manageable, minimize extraneous load, and protect the space for germane load. In practice, that almost always means covering less, more deliberately. Most K–12 PD does the opposite.
How cognitive load shows up in K–12 staff training
These are the patterns district leaders tend to recognize immediately once they have a name for them.
Too much content in too little time. A two-hour session covering seven objectives is not a well-designed training. It's seven topics that didn't get their own session. When staff are asked to absorb more new information than working memory can hold, they absorb less of everything. The session feels productive because a lot was covered. But coverage and learning are not the same thing.
New content with no connection to prior knowledge. Cognitive load increases sharply when learners have no existing framework to attach new information to. A training that introduces a new framework, new vocabulary, and new procedures all at once — without connecting any of it to what staff already know and do — is asking working memory to do an enormous amount of work with very little support. Even a brief activation of prior knowledge at the start of a session reduces load significantly.
Competing demands during the training itself. Slide decks that require reading while someone is also talking. Chat boxes in virtual sessions pulling attention while a presenter is explaining something. Multiple simultaneous inputs split attention and drive up extraneous load without adding any learning value. Every additional thing asking for attention is a reduction in what's available for the content that actually matters.
No time to process before moving on. Learning requires consolidation. A training that moves from topic to topic without pausing for reflection, discussion, or application doesn't give working memory time to do its job. Staff leave with a vague sense of having covered a lot — and retain very little of it by the following week.
What lower cognitive load looks like in practice
These are design choices that make a real difference. None of them require starting from scratch. Most of them require doing less.
Narrow the focus. One training, one clear outcome. Not "staff will understand the new framework and be able to apply it across all contexts." Just: "staff will be able to use the three-step protocol in their first week back." Narrowing the outcome naturally limits the content, which limits the load on working memory. This is also the core of backward design — define the outcome first, then build only what serves it.
Chunk and sequence deliberately. Break content into small, logical units. Introduce one concept before building on it with the next. Give staff time to practice or apply each piece before adding complexity. The sequence matters as much as the content itself. A well-sequenced training covering four things will outperform a poorly sequenced training covering ten.
Use worked examples before asking for independent application. Cognitive load research consistently shows that worked examples — walking through a completed scenario step by step — are more effective than asking learners to solve problems before they have a solid foundation. In training, this means modeling before practicing, not the other way around. Show staff how the protocol works in a real scenario before asking them to apply it themselves.
Reduce split attention. If you're showing a diagram, explain it verbally at the same time rather than asking people to read it while you talk about something else. In virtual training, limit chat prompts to moments when you're not also presenting new information. The principle is simple: design for one input at a time wherever possible. Every split in attention is a cost.
Build in consolidation time. Pauses for reflection, partner discussion, or brief application aren't padding — they're where learning actually happens. Working memory needs time to process before it can move information into long-term storage. A training with less content and more consolidation time will produce better outcomes than a packed agenda nearly every time. This feels counterintuitive, but the research on it is consistent.
The counterintuitive truth about comprehensive training
The instinct in K–12 professional development is often to be comprehensive. Cover everything staff might need to know. Address every possible question. Include all the context and background. That instinct comes from a genuine desire to be thorough and to respect staff time by giving them everything at once.
But comprehensiveness and learning are frequently in tension. A training that covers twelve things gives staff twelve things they partially understood under pressure. A training that covers three things, deeply and with practice, gives staff three things they can actually do on Monday.
The goal of professional development is not to have covered the material. It's to change what staff do when they're back in their buildings. Less coverage, more learning. That shift in priority — from comprehensive to purposeful — is one of the most meaningful changes a district can make to its professional development.
It's also what connects cognitive load directly to backward design. When you start with a clear outcome and build only what serves it, you're already managing cognitive load — because you've given yourself a principled reason to leave things out.
Where cognitive load fits into training design
Cognitive load isn't something you check at the end of a training to see if you went too long. It's a lens that shapes design decisions from the beginning.
How many objectives can this session realistically support? What prior knowledge can be activated before introducing new content? Where does this training need to pause so people can process before moving on? Which elements of this slide deck are adding clarity and which are adding noise?
These are questions an instructional designer asks before a single slide gets built. For K–12 districts designing their own professional development, they're worth building into the planning process at the start — not as a checklist to run through at the end, but as a frame for every decision along the way.
If you're designing instructor-led or virtual training for your district and want support thinking through the structure before you build, explore training design services → or learn more about K–12 eLearning development →.
Less overwhelm, more learning
If your district's professional development is leaving staff exhausted rather than equipped, the design is usually where the fix starts. Not the content, not the facilitators, not the staff.
The fix is building training that respects the limits of working memory — covering less, sequencing better, building in time to process, and designing for one thing at a time.
That's what cognitive load-informed design looks like. And it's one of the most reliable ways to make professional development worth the time it takes.
Book a free 30-minute intro call →
Or explore all K–12 instructional design and training services → to see what this looks like in practice.
Kalin Schoephoerster is a CPACC-certified instructional designer and accessibility consultant based in St. Paul, MN. KShep Creative partners with K–12 districts, higher education institutions, and EdTech organizations to develop accessible eLearning, instructor-led training, curriculum, SOPs, and website accessibility audits aligned with WCAG 2.2 and ADA Title II requirements.