How to Build a Monitoring and Evaluation Framework for Health Projects | SPHERES, Inc.

Tools and Practice

How to Build a Monitoring and Evaluation Framework for Health Projects

📅 June 2, 2026 ✍️ SPHERES, Inc.

A health project that cannot demonstrate results will not be funded a second time. Monitoring and evaluation is the system that turns activities into evidence, and evidence into decisions. For organizations working in Philippine public health — whether with the Department of Health, local government units, UN agencies, or bilateral donors — a functional M&E framework is not optional. It is a core deliverable and a condition of continued credibility in this space.

Monitoring and Evaluation: Two Different Things

The terms are consistently used together but they serve distinct purposes. Monitoring is the continuous, systematic collection of data on specified indicators throughout a project's life. It tracks whether activities are being implemented as planned and whether outputs are being delivered on schedule. Evaluation is a periodic, structured assessment that goes deeper — examining whether the project actually worked, why it produced the results it did, and what should be done differently.

Dimension	Monitoring	Evaluation
Timing	Continuous, throughout implementation	Periodic — mid-term, end-of-project, post-project
Primary question	Are we doing what we planned to do?	Did it work? Why or why not?
Focus	Inputs, activities, outputs	Outcomes, impact, sustainability
Who leads	Project management team	Can be internal or independent evaluators
Output	Progress reports, dashboards, tracking tables	Evaluation reports with findings and recommendations

Together, monitoring and evaluation form a feedback loop. Monitoring data feeds into evaluations. Evaluation findings improve the next planning cycle. Without both operating in tandem, a project team is working without feedback on whether its efforts are producing change.

The Results Chain: From Inputs to Impact

Every M&E framework is built on a results chain, also called a logic model. The logic model maps the causal sequence from resources invested to change achieved. It is the structural backbone of any M&E system because each point in the chain must be measured, and the connections between points must be explained.

Inputs

Staff, funds, equipment, data systems, partner organizations

Activities

Training, service delivery, advocacy, construction, community mobilization

Outputs

Number of health workers trained, facilities equipped, materials distributed

Outcomes

Improved coverage, behavior change, policy adoption, strengthened systems

Impact

Reduced morbidity, mortality, improved health equity at population level

The distinction between outputs and outcomes is where most project teams go wrong. Outputs are the direct products of activities: the number of health workers trained, the number of informational materials distributed, the number of facilities equipped. Outcomes are what changes as a result: whether trained workers improved their clinical practice, whether informational materials shifted community behavior, whether equipped facilities saw increased utilization. Outputs are under the project's direct control. Outcomes are influenced by the project but shaped by many other factors as well.

Impact sits further along the chain. It refers to long-term changes in health status at the population level — reductions in maternal mortality, declines in child stunting, or improvements in TB case notification rates. Most health projects contribute to impact rather than directly cause it, and any honest M&E framework should reflect that distinction. The W.K. Kellogg Foundation formalized the logic model components in its development guide, and the approach has since been adopted by the CDC, USAID, UNDP, and most major funders as the standard structure for results-based programming.

Theory of Change: The Narrative Behind the Chain

A logic model shows the chain. A Theory of Change (ToC) explains why each step should lead to the next — it makes the assumptions explicit. A ToC answers: under what conditions do trained health workers actually change their practice? What needs to be true about the health system for improved practice to translate into better patient outcomes? What external factors could prevent this from happening?

A project in Eastern Visayas trained community health workers to counsel pregnant women on antenatal care. The assumption was that counseled women would attend at least four ANC visits. Monitoring data showed counseling sessions were delivered as planned. But utilization did not improve. The mid-term evaluation traced the problem back to the ToC: the assumption was wrong. Transportation cost — not knowledge — was the primary barrier to attendance. The M&E system caught this at mid-term, allowing the project to redesign its approach before the end of implementation.

The ToC becomes the governing document of the M&E system. Each outcome in the causal chain gets its own indicator, data source, collection method, and reporting frequency. Without a ToC, the M&E plan has no conceptual foundation, and the indicator list becomes a set of measurements without logic connecting them to the project's theory of how change happens.

A logic model fits on a single page. The narrative ToC document that supports it typically runs three to six pages, covering the problem statement, the causal logic, the key assumptions, and the external factors that could affect results. Annexes carry indicator definitions, data collection methods, and baseline values.

Developing SMART Indicators

An indicator is a specific, measurable variable that tracks whether a result is being achieved. Vague indicators produce useless data. Every indicator in an M&E framework must meet the SMART criteria: Specific, Measurable, Achievable, Relevant, and Time-bound.

Criterion	What it means	Example
Specific	Clearly defined with no ambiguity in what is being measured	"Percentage of pregnant women with four or more ANC visits" — not "improved antenatal care"
Measurable	Can be quantified or objectively assessed	Has a defined numerator and denominator, or a clear yes/no criterion
Achievable	Realistic given project scope, timeframe, and resources	Target is grounded in baseline data and comparable program experience
Relevant	Directly linked to the project objective it is meant to measure	If the goal is reduced maternal mortality, measure skilled birth attendance — not only facility visits
Time-bound	Measurement has a defined reference period	"By end of Year 2" or "within 12 months of training completion"

Each indicator also requires a data source, a responsible person or unit, a collection method, and a reporting frequency. These are compiled in the indicator reference sheet — sometimes called the indicator tracking table or performance monitoring plan depending on the donor. This document is the operational core of any M&E plan and must be maintained throughout implementation, not filed after project design.

Baseline, Midline, and Endline

An indicator without a baseline is not useful. A baseline is the value of an indicator at the start of the project, before any intervention is delivered. It establishes the starting point against which all future progress is measured. Without a baseline, it is impossible to determine whether any change actually occurred as a result of the project.

For most health projects, three data collection points are standard. The baseline is conducted before or at the very start of implementation. The midline assessment, typically at the halfway point of the project, checks whether the project is on track and creates space for course corrections. The endline assessment, conducted at or near project close, measures the total change achieved and serves as the primary input for the final evaluation.

A project without a baseline is a project without a story. It can report what it did. It cannot prove what it changed. — Adapted from UNAIDS M&E Fundamentals Series

The same data collection method must be used at baseline, midline, and endline to ensure comparability. Switching from a household survey at baseline to facility register data at endline produces figures that cannot be compared. Consistency of method is not optional — it is the condition that makes the data defensible when presented to a donor, a government partner, or an independent evaluator.

In the Philippines, Lot Quality Assurance Sampling (LQAS) is widely used as a resource-efficient alternative to full household surveys for coverage monitoring. LQAS uses a sample of 19 respondents per supervision area and has been applied extensively in Eastern Visayas health programs, including the KOICA-funded MNCH project in Samar and Southern Leyte, where it was used for cross-sectional surveys at baseline and midline.

Types of Evaluation

Evaluation is not a single activity conducted once at the end of a project. Different evaluation types serve different purposes and are conducted at different stages, each answering a different set of questions about program performance.

Type	When conducted	Primary purpose
Formative	Early implementation or during pilot phase	Tests feasibility and acceptability; identifies design problems before they scale
Process	During implementation	Assesses how well activities are being delivered; identifies operational gaps and quality issues
Mid-term	Midway through the project	Checks progress toward outcomes; enables major course corrections while time remains
Summative / End-of-project	At or near project close	Judges whether stated objectives were met; produces evidence of effectiveness for accountability and learning
Impact	After project ends, often years later	Assesses long-term population-level change; often uses a comparison group to strengthen attribution

For most health projects funded by international development agencies, a mid-term and a final evaluation are the minimum requirements. The terms of reference for these evaluations should be agreed upon with the donor at the start of the project and should form part of the original M&E plan — not commissioned as an afterthought when the project is already winding down.

Formative evaluations are particularly valuable for pilot programs. They help assess whether a new health intervention is feasible and acceptable before it is scaled, identifying early what is working, what needs adjustment, and what data will be needed for subsequent evaluations. Programs in the early stages of development may not yet be delivering all intended services or reaching all target populations, and formative evaluation captures that reality honestly.

The M&E Plan: The Document That Holds It Together

The M&E plan is the primary governing document for a project's monitoring and evaluation system. It is not an annex to be filed and forgotten after project design. It is a working document that guides every data collection activity from launch to close, and it must be updated as the project evolves.

A complete M&E plan contains the following elements:

Programme Summary and Theory of Change

A concise description of the project, its objectives, and the causal logic connecting activities to intended outcomes. This frames all subsequent M&E design and ensures that indicators are anchored to a coherent theory of how change is expected to occur.

Indicator Reference Sheet

A table listing every indicator with its definition, data source, collection method, frequency, responsible unit, baseline value, and targets at each measurement point. This is the operational core of the M&E plan. Every indicator on this sheet must be SMART and traceable back to a project objective.

Data Collection and Management

Describes how data will be collected, by whom, using what tools, and how it will be stored, cleaned, and protected. In the Philippines, this section must address compliance with Republic Act 10173, the Data Privacy Act of 2012, including consent procedures and data security protocols for any personal health information collected from program participants.

Data Quality Assurance

Specifies the procedures for verifying accuracy and completeness, including routine data quality assessments, supervisory verification visits, and triangulation between data sources. Data quality assurance is a standard requirement for Global Fund, UNDP, UNICEF, and most bilateral donor programs. It is what makes reported progress credible when presented to an external audience.

Evaluation Schedule

Specifies the evaluations to be conducted, their timing, who will conduct them, the level of independence required, and how findings will be disseminated and used. Independent evaluations must be planned and budgeted at the outset, not commissioned in the final months of implementation when terms of reference cannot be properly developed.

Reporting Plan

Defines what reports will be produced, for which audiences, at what frequency, and in what format. Donor-required reports have fixed schedules set out in the agreement. Internal management reports can be more frequent and less formal, but they must reach the people who can act on the findings — not just the M&E officer.

M&E Budget

M&E activities must be costed and included in the project budget from the start. The standard guidance from UNAIDS and other international bodies is that M&E should account for 5 to 10 percent of the total programmatic budget, with the higher end reserved for projects with complex evaluation designs, large geographic coverage, or multi-agency reporting requirements.

The Philippine Policy Context

Health projects in the Philippines operate within a national policy framework that increasingly demands results-based management at all levels — from government agencies to implementing partners and technical assistance providers.

At the national level, the National Economic and Development Authority and the Department of Budget and Management jointly issued the National Evaluation Policy Framework (NEPF) under Joint Memorandum Circular No. 2015-01. The NEPF provides the standard for evaluation conduct across all government programs and projects, including those implemented with official development assistance. Its stated purpose is to support good governance, transparency, accountability, and evidence-based decision-making in the public sector. In practice, this means that any consulting organization or implementing partner working with DOH or LGU-implemented programs must treat evaluation not as an optional add-on but as a built-in accountability requirement.

In 2016, DBM introduced the Results-Based Monitoring, Evaluation, and Reporting Policy Framework to further standardize how performance information is generated and used across government. Together, these frameworks mean that results-based management is now the language of public sector health programming in the Philippines — and any organization that cannot speak it will struggle to engage credibly with government counterparts or development partners.

NEDA's Results Matrix, derived from the Philippine Development Plan, is the primary instrument for monitoring progress toward national development goals. It follows the logical framework approach and serves as the guide for planning, programming, and budgeting across all implementing and oversight agencies. Health project indicators at the program level should, where applicable, align with the corresponding outcomes tracked in the Results Matrix. Misalignment between project-level and national-level indicators creates reporting complications and weakens the case for project relevance during evaluation.

IDinsight's engagement with NEDA and four Philippine government departments — including the Department of Health — to develop learning and evaluation roadmaps was part of a broader effort to operationalize the NEPF. The engagement included building theories of change, conducting evaluability assessments, and identifying evidence needs, demonstrating the kind of technical assistance the framework envisions for all major government programs.

For organizations preparing to engage with WHO, UNFPA, UNICEF, or other UN agencies on health programs in the Philippines, it is also worth noting that all UN agencies are expected to follow the common evaluation standards of the United Nations Evaluation Group. These standards cover independence, impartiality, credibility, and the utility of evaluation findings — and they apply to implementing partners through their contractual agreements with the respective agency.

Data Collection Methods in Philippine Health Programs

No single data collection method is sufficient on its own. Strong M&E systems combine quantitative and qualitative approaches so that numbers are always accompanied by an explanation of what they mean and why they moved in the direction they did.

Household Surveys

Household surveys are the standard instrument for measuring population-level outcomes such as vaccination coverage, ANC utilization, contraceptive prevalence, or skilled birth attendance. They produce statistically representative data but require substantial planning, trained enumerators, and dedicated budget. For programs with limited resources, LQAS provides a resource-efficient alternative. LQAS uses a sample of 19 respondents per supervision area and a decision rule that allows programs to classify whether coverage in a given area meets or falls below a defined threshold. It has been used extensively across Philippine maternal and child health programs.

Health Facility Data

Routine data from facility registers, the Field Health Service Information System (FHSIS), and the Philippine Health Information System provides continuous administrative data on service delivery. It is available without additional survey costs but is often incomplete or inconsistently recorded, particularly at the barangay health station level. Any M&E system that relies heavily on facility data must include explicit data quality checks and triangulation with other sources.

Qualitative Methods

Key informant interviews and focus group discussions provide the qualitative layer of understanding: why utilization is low, what barriers communities face, how health workers experience a program in practice, and what the numbers do not capture. These methods are particularly valuable for process evaluation, for interpreting unexpected findings in quantitative data, and for documenting community perspectives on program relevance and acceptability.

Digital Data Collection

Mobile data platforms using tools such as KoBoToolbox, ODK Collect, and DHIS2 are increasingly standard in donor-funded health programs. UNICEF's handover of DigiVacc to the DOH in 2025 — a digital immunization suite funded by the Government of Japan — reflects the broader push toward real-time digital monitoring of health program coverage. Organizations proposing digital data collection should include a connectivity assessment and an offline data capture protocol, given the infrastructure realities in many target communities.

Common Failures and How to Avoid Them

Most M&E frameworks fail not in design but in implementation. The failure modes are well documented and largely preventable.

Starting M&E After Implementation Begins

Once activities are underway, baseline data cannot be collected. This single error makes it impossible to demonstrate change at endline, regardless of how strong the project results actually were. The M&E plan must be developed before implementation begins, and baseline data collection must be completed before the first activity reaches beneficiaries.

Indicator Overload

Projects with forty or fifty indicators cannot realistically collect quality data on all of them within available resources. A disciplined M&E framework selects fewer indicators and collects them well. A small number of high-quality, well-verified measurements is more defensible to a donor or evaluator than a large dataset with inconsistent collection and doubtful accuracy.

Separating M&E from Program Management

When the M&E function is isolated from the program team, monitoring data does not reach the people who can act on it. M&E findings should feed directly into management decisions on a regular basis through a defined learning and adaptation process — not only at mid-term or final evaluation when course correction is no longer possible.

Neglecting Data Quality

Routine data quality assessments, supervisory spot-checks, and triangulation between data sources are not optional add-ons. They are what make monitoring data credible when presented externally. Harmonized Approach to Cash Transfers micro-assessments, which UN agencies conduct before awarding implementing partner agreements, specifically examine financial management and data quality systems. A weak data quality assurance process is a known risk indicator for implementing partner performance.

Attribution vs. Contribution

One of the most contested questions in health project M&E is whether a project caused the change it is reporting. In most implementation contexts, the honest answer is that the project contributed to change but did not exclusively cause it. Other programs, government initiatives, demographic trends, and external factors all operate in the same environment simultaneously.

Experimental designs with randomized control groups can establish stronger attribution but are expensive, ethically complex in health settings, and rarely feasible for standard implementation projects. Most health programs use contribution analysis instead — a structured approach to building a credible case that the project's activities were a significant contributing factor to the observed change, while acknowledging the role of other forces. A well-reasoned contribution narrative, supported by consistent monitoring data and a coherent Theory of Change, is more credible than inflated attribution claims that cannot withstand scrutiny.

Experienced funders operating in the Philippines — including KOICA, JICA, UNFPA, and the World Bank — understand the limitations of attribution in complex health systems. What they look for is not proof of exclusive causation but a credible, evidence-backed argument for the project's contribution to the results observed. That argument is built through the M&E system, over the life of the project, and documented in the final evaluation report.

Building an M&E Framework for Your Health Project?

SPHERES, Inc. provides technical assistance in M&E framework design, indicator development, data quality assessment, and evaluation for health programs across the Philippines.

Get in Touch

Sources and References

NEDA and DBM. Joint Memorandum Circular No. 2015-01: National Evaluation Policy Framework of the Philippines. National Economic and Development Authority and Department of Budget and Management, July 2015.
NEDA. Guidelines on Evaluation in the National Government. National Economic and Development Authority, 2020. Available at nep.depdev.gov.ph.
NEDA. National Evaluation Policy Framework. neda.gov.ph, July 2015.
DBM. Results-Based Monitoring, Evaluation, and Reporting Policy Framework. Department of Budget and Management, 2016.
Asia Pacific Evaluation Association. Fostering Evaluation Culture in the Philippines. asiapacificeval.org, June 2023.
IDinsight. Evaluation and Monitoring Planning with Four Government Departments in the Philippines. idinsight.org, 2021.
UNDP Health Implementation Manual. Monitoring and Evaluation. healthimplementation.undp.org.
UNDP Health Implementation Manual. M&E Plan. healthimplementation.undp.org.
UNAIDS. Basic Terminology and Frameworks for Monitoring and Evaluation. UNAIDS M&E Fundamentals Series. Geneva: UNAIDS.
EvalCommunity. SMART Indicators in Monitoring and Evaluation. evalcommunity.com, March 2026.
EvalCommunity. Monitoring and Evaluation Guide: Framework, Tools and Best Practices. evalcommunity.com, December 2025.
Sopact. Theory of Change in Monitoring and Evaluation. sopact.com, May 2026.
Sopact. Logic Model: Components, Examples, and How to Build One. sopact.com, May 2026.
The Compass for SBC. How to Develop a Monitoring and Evaluation Plan. thecompassforsbc.org, November 2023.
NCBI Bookshelf. Evaluation Types and Data Requirements. National Academies Press, March 2023.
Frontiers in Public Health. Evaluation Planning for the Timed and Targeted Care for Families Program in Eastern Visayas, Philippines. June 2025.
ENDVAWNOW. Monitoring and Evaluation Frameworks. endvawnow.org.
WHO Philippines. Technical Assistance in the Formulation of the Philippine Council for Mental Health Strategic Plan with M&E Framework for 2024-2028. World Health Organization Western Pacific, April 2023.
Springer Nature. Evaluation in the Philippines. In: Evaluation for Agenda 2030. 2023.

How to Build a Monitoring and Evaluation Framework for Health Projects

Monitoring and Evaluation: Two Different Things

The Results Chain: From Inputs to Impact

Theory of Change: The Narrative Behind the Chain

Developing SMART Indicators

Baseline, Midline, and Endline

Types of Evaluation

The M&E Plan: The Document That Holds It Together

The Philippine Policy Context

Data Collection Methods in Philippine Health Programs

Household Surveys

Health Facility Data

Qualitative Methods

Digital Data Collection

Common Failures and How to Avoid Them

Starting M&E After Implementation Begins

Indicator Overload

Separating M&E from Program Management

Neglecting Data Quality

Attribution vs. Contribution

Sources and References

More from SPHERES