
How to Evaluate the Impact of Capacity Building

How Catalyst Exchange evaluates capacity building across levels of engagement, from project completion to long-term mission impact

Capacity building can help organizations become stronger, more resilient, and better equipped to serve their communities. But evaluating its impact is inherently more complex than traditional program evaluation.

Unlike many programs, capacity building is rarely a standardized intervention. One organization may need a clearer strategy, another stronger data systems, and another improved fundraising or talent practices. Even when two organizations receive the same type of support, the work often looks different because it is shaped by context, leadership, stage, and existing systems – and the duration and intensity of that support can vary significantly as well.

The pathway from support to outcomes is also indirect. Capacity building typically improves the underlying conditions that enable performance (such as stronger teams, clearer priorities, or better systems) rather than producing immediate changes in mission outcomes. Those downstream outcomes may take time to emerge, and many other factors influence them along the way, making attribution especially difficult.

At Catalyst Exchange, we work closely with nonprofits receiving capacity-building support, providers delivering it, and funders investing in it. That vantage point has shaped a practical approach to evaluation: one that is rigorous enough to guide decisions, realistic about attribution, and grounded in organizational reality.

Key Principles for Impact Measurement

We design evaluation approaches to generate useful evidence without creating unnecessary burden for organizations. In practice, that means our approach is:

  • Built for decision-making – Evidence should help organizations, funders, and providers make better decisions about where and how to invest in capacity building, providing actionable guidance that leads to stronger outcomes.
  • Rigorous but realistic – Capacity-building efforts vary significantly across organizations, providers, and contexts. Rather than trying to isolate a single causal effect, we draw on multiple forms of evidence to understand contribution and identify meaningful patterns.
  • Low burden on organizations – Whenever possible, we use data that organizations already collect, and we compensate partners when additional participation or data collection is needed.
  • Transparent about attribution – Organizational outcomes are shaped by many factors. We are clear about what the evidence can and cannot tell us.
  • Balanced across tradeoffs – Strong evaluation requires balancing rigor, feasibility, participation burden, and equity. The goal is evidence that is credible, practical, and useful in real-world settings.

Evaluation Framework

A leveled approach to measuring impact across levels of engagement

Organizations engage with capacity-building support in different ways, and the depth of evidence we can generate depends on the type of engagement. We assess capacity-building impact across four connected dimensions (usage, experience, results, and long-term impact), using a leveled approach that matches depth of evidence to available data and engagement intensity.

Importantly, this is not a single uniform evaluation applied to all work. Level 1 is universal across all projects. Level 2 applies to the subset of Level 1 engagements where pre- and post-engagement diagnostic data are available; in the future, we anticipate supplementing this with qualitative data from interviews and focus groups. Level 3 is an emerging approach that builds on a smaller subset of Level 2 engagements where we have sufficient outcome data and sustained implementation to support deeper analysis. It will draw on case studies and focus groups to enrich that evidence.
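
For readers who think in code, the nesting of the three levels can be sketched as a simple rule. This is a minimal illustration under stated assumptions, not a description of Catalyst Exchange's actual tooling: the `Engagement` fields and the `evidence_level` function are hypothetical names invented for this example, and real eligibility decisions would involve more judgment than three yes/no flags.

```python
from dataclasses import dataclass


@dataclass
class Engagement:
    """One capacity-building engagement (all fields are hypothetical)."""
    name: str
    has_pre_post_diagnostics: bool = False
    has_outcome_data: bool = False
    sustained_implementation: bool = False


def evidence_level(e: Engagement) -> int:
    """Return the deepest level of analysis this engagement can support."""
    if not e.has_pre_post_diagnostics:
        # Level 1 applies to every project, so it is the floor.
        return 1
    if e.has_outcome_data and e.sustained_implementation:
        # Level 3 draws only on Level 2 engagements with enough outcome data
        # and evidence that changes were sustained.
        return 3
    return 2


engagements = [
    Engagement("Strategy refresh"),
    Engagement("Data systems upgrade", has_pre_post_diagnostics=True),
    Engagement("Fundraising overhaul", True, True, True),
]

for e in engagements:
    print(f"{e.name}: Level {evidence_level(e)}")
```

Because each level is a subset of the one before it, an engagement with outcome data but no pre- and post-engagement diagnostics still sits at Level 1 in this sketch.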

[Figure: measurement graphic]

Level 1: Project-Level Performance

Did individual projects meet their stated goals?

At the broadest level, we examine whether projects were completed successfully and whether organizations found the support valuable. This level applies to all capacity-building engagements.

We analyze:

  • Project completion and goal attainment
  • Sentiment and feedback from participating organizations
  • Patterns across providers, project types, and common capacity needs

We also incorporate a small number of case studies each year to provide context behind the quantitative trends and highlight how organizations experienced the work in practice.

Level 2: Organizational Capacity Shift

Did capacity building strengthen the organization?

This level looks beyond individual projects to assess whether organizations experienced meaningful shifts in capacity over time — driven by factors such as stronger leadership, improved systems, clearer priorities, and more effective team dynamics. It applies to a subset of Level 1 engagements where we have pre- and post-engagement diagnostic data.

We examine:

  • Pre- and post-engagement diagnostic data (when available)
  • Reflections from organizational leaders
  • Evidence of changes in systems, processes, decision-making, or ways of working

The goal is to understand whether support is associated with measurable improvements in organizational functioning, and which types of support appear most strongly linked to those changes.
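
As a rough illustration of what a pre- and post-engagement comparison might involve, the sketch below computes the change in each diagnostic dimension for one organization. It is a simplified example under stated assumptions: the dimension names, the scores, and the `capacity_shift` function are hypothetical and do not reflect any specific diagnostic tool. The result is descriptive only; a positive shift shows an association with support, not proof that the support caused it.

```python
from statistics import mean


def capacity_shift(pre: dict[str, float], post: dict[str, float]) -> dict[str, float]:
    """Change (post minus pre) for each dimension scored at both time points."""
    shared = pre.keys() & post.keys()  # ignore dimensions missing from either survey
    return {dim: post[dim] - pre[dim] for dim in sorted(shared)}


# Hypothetical diagnostic scores for a single organization.
pre = {"leadership": 2.0, "systems": 3.0, "priorities": 2.5}
post = {"leadership": 3.0, "systems": 3.0, "priorities": 3.5, "team_dynamics": 4.0}

shift = capacity_shift(pre, post)
average_shift = mean(shift.values())

for dim, change in shift.items():
    print(f"{dim}: {change:+.1f}")
print(f"average shift: {average_shift:+.2f}")
```

Dimensions scored at only one time point (here, `team_dynamics`) are left out rather than treated as zero, so that missing data does not masquerade as change.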

Level 3: Improved Organizational Outcomes

Did stronger capacity improve mission delivery?

This is the most demanding level of our framework, and it is still emerging. It will be applied to a smaller subset of Level 2 engagements where we have sufficient outcome data and evidence of sustained change.

The strongest test of capacity building is whether it ultimately helps organizations better serve their communities. In practice, answering this is especially difficult because many other factors influence outcomes, and changes in capacity may take time to translate into observable mission results.

We will draw on:

  • Organization-level performance metrics, such as reach or service quality
  • Qualitative insights from staff and community members
  • Evidence that organizational changes were implemented and sustained

This work relies on mixed-methods evaluation and contribution analysis. The goal is not to claim that capacity building alone caused an outcome, but to understand whether the weight of evidence consistently suggests it played a meaningful contributing role.


Across all three levels, we track change over time to better understand not only whether capacity-building efforts work, but how they work, for whom, and under what conditions.

Capacity building is complex, and evaluation should reflect that reality. The goal is to learn what helps organizations get stronger and to use that insight to help organizations, providers, and funders make better decisions about how to strengthen nonprofit capacity over time.

 

Catalyst Exchange
