Kirkpatrick's Four Levels of Evaluation Explained for CIPD Students

Training is easy to deliver and hard to evaluate. A workshop might receive positive feedback, but did it improve capability? Did employees change what they do at work? Did the organisation benefit?
Kirkpatrick's four levels of evaluation help HR and L&D professionals answer those questions more carefully. For CIPD students, the model is useful because it links learning activity to evidence, behaviour and organisational value.
This guide explains each level, what evidence you might collect, and how to use the model critically in an assignment.
What Is Kirkpatrick's Four-Level Model?
Kirkpatrick's model is a framework for evaluating learning and development. It is usually explained as four levels:
- Reaction.
- Learning.
- Behaviour.
- Results.
The levels move from immediate participant response to longer-term organisational outcomes. That movement matters. Many organisations measure attendance and satisfaction, but fewer evaluate whether learning changed workplace behaviour or improved performance.
For CIPD assignments, the model can help you show evidence-based thinking, especially when writing about learning effectiveness, people practice value or 5CO02 Evidence-Based Practice.
Level 1: Reaction
Reaction evaluates how participants respond to the learning experience. This is the easiest level to measure, which is why it is so common.
Typical questions include:
- Was the session useful?
- Was the content relevant to the role?
- Was the facilitator effective?
- Did participants feel more confident?
- Was the learning format accessible?
Evidence might come from feedback forms, pulse surveys, ratings, comments or quick reflection prompts.
Reaction data is useful, but limited. Participants can enjoy a session without learning much. They can also dislike a challenging session that later improves performance. In a CIPD assignment, do not treat satisfaction as proof of effectiveness. Explain what it can and cannot show.
Level 2: Learning
Learning evaluates whether participants gained knowledge, skills, confidence or changed attitudes. This level asks: did people actually learn what the intervention intended?
Evidence might include:
- Pre- and post-training quizzes.
- Skills demonstrations.
- Scenario-based assessments.
- Reflective statements.
- Manager-observed practice tasks.
- Confidence ratings before and after learning.
For example, if line managers attend training on absence conversations, Level 2 evaluation might test whether they can explain the policy, identify supportive questions and apply the process to a case study.
This level is stronger than reaction because it checks capability. However, it still does not prove transfer to the workplace. A manager might pass a case study in training but avoid difficult conversations in real life.
Level 3: Behaviour
Behaviour evaluates whether participants apply learning back at work. This is where the model becomes especially valuable for HR.
Typical evidence might include:
- Manager or peer observation.
- Follow-up conversations.
- Work samples.
- Behaviour checklists.
- Coaching notes.
- Employee feedback.
- Changes in process compliance.
Returning to the absence management example, Level 3 evaluation would ask whether managers are now holding timely, supportive and consistent conversations with employees. Evidence might include case review, manager self-reflection and employee feedback.
Level 3 is often harder to measure because behaviour depends on context. Employees may fail to apply learning because they lack time, manager support, confidence, systems or authority. That is why L&D evaluation should consider the wider work environment, not just the training event.
Level 4: Results
Results evaluates whether learning contributed to meaningful outcomes. These outcomes may be financial, operational, people-related or service-focused.
Examples include:
- Reduced absence.
- Improved customer satisfaction.
- Lower error rates.
- Faster onboarding.
- Improved retention.
- Better compliance.
- Higher productivity.
- Reduced employee relations escalation.
For the absence management example, Level 4 might examine whether absence cases are handled earlier, long-term absence is better supported, or employee relations issues reduce over time.
This level is powerful but risky if you overclaim. Results are influenced by many factors. If absence falls after training, the training may have contributed, but seasonal patterns, policy changes, workload shifts or management attention may also have played a role.
A strong CIPD answer acknowledges this. Use careful wording such as "may have contributed to" or "evaluation should consider whether there is evidence of contribution." Avoid claiming direct causation unless the evidence supports it.
Why the Model Matters for Evidence-Based Practice
Kirkpatrick's model supports evidence-based HR because it encourages better questions.
Instead of asking only "Did people attend?" or "Did they like it?", the model asks:
- Did the learning feel relevant?
- Did participants gain knowledge or skill?
- Did workplace behaviour change?
- Did this contribute to valuable outcomes?
That progression links well to evidence-based practice. It pushes HR professionals to gather evidence from multiple sources and think critically about what the evidence really proves.
The model is also useful when discussing the value of people practice. HR and L&D budgets are often under pressure. Evaluation helps show whether learning activity is aligned with organisational priorities rather than delivered because it is traditional, popular or easy to arrange.
A CIPD Assignment Example
Imagine an organisation introduces coaching skills training for line managers because employee survey results show low-quality one-to-one conversations.
A Kirkpatrick evaluation might look like this:
Reaction: Managers rate the session as relevant and say they feel more confident holding coaching conversations.
Learning: Managers complete a role-play assessment showing they can use open questions, active listening and action planning.
Behaviour: Three months later, employees report more useful one-to-one meetings and HR observes better quality development objectives.
Results: Engagement scores for "my manager supports my development" improve, internal movement increases and fewer employees cite lack of development in exit interviews.
This example shows how the levels connect. It also shows why evaluation takes time. Level 1 can happen immediately. Level 3 and Level 4 may need follow-up weeks or months later.
Strengths of Kirkpatrick's Model
Kirkpatrick's model remains popular because it is practical and memorable. Its strengths include:
Simple structure: The four levels are easy to understand and explain.
Broader evaluation: It encourages more than satisfaction surveys.
Workplace focus: Behaviour and results connect learning to actual practice.
Flexible use: It can apply to workshops, digital learning, coaching, onboarding and CPD.
Useful for conversations: It helps HR ask managers what success should look like before training is designed.
In an assignment, these strengths help you explain why the model is widely used in L&D evaluation.
Limitations to Discuss Critically
To show stronger analysis, you should also discuss limitations.
Common limitations include:
It can look linear: In practice, evaluation planning should begin before training, not after.
Higher levels are harder to measure: Behaviour and results need time, access and good data.
Causation is difficult: Business outcomes are affected by many factors besides learning.
It may underplay context: Workplace systems, culture and line manager support influence transfer.
It can be used superficially: A quick feedback form is not a full evaluation strategy.
These limitations do not make the model useless. They mean you should use it thoughtfully.
How to Use Kirkpatrick in Your Writing
When writing about Kirkpatrick's model, avoid simply listing the four levels and stopping there. A stronger answer explains application and evaluation.
Use this structure:
- Define the model briefly.
- Explain each level in plain language.
- Give a workplace example.
- Identify evidence that could be collected.
- Discuss strengths and limitations.
- Link the model to people practice value.
You can also connect the model to wider learning topics, such as accidental and deliberate learning, digital learning and collaboration, or the CIPD CPD cycle.
Final Thoughts
Kirkpatrick's four levels are useful because they shift attention from learning activity to learning impact. For CIPD students, that makes the model a strong choice when discussing training evaluation, evidence-based practice or the value of HR.
Use the model to ask better questions, not to make unsupported claims. Reaction and learning data can show whether a programme was received well and built capability. Behaviour and results data can show whether it made a difference at work. The strongest answers recognise both the value and the limits of that evidence.
If you are writing about evaluation, start with the outcome you want to understand, then work backwards to the evidence you need. That is how Kirkpatrick becomes more than a four-level diagram. It becomes a practical evaluation plan.