A health project that cannot demonstrate results will not be funded a second time. Monitoring and evaluation is the system that turns activities into evidence, and evidence into decisions. For organizations working in Philippine public health โ whether with the Department of Health, local government units, UN agencies, or bilateral donors โ a functional M&E framework is not optional. It is a core deliverable and a condition of continued credibility in this space.
The terms are consistently used together but they serve distinct purposes. Monitoring is the continuous, systematic collection of data on specified indicators throughout a project's life. It tracks whether activities are being implemented as planned and whether outputs are being delivered on schedule. Evaluation is a periodic, structured assessment that goes deeper โ examining whether the project actually worked, why it produced the results it did, and what should be done differently.
| Dimension | Monitoring | Evaluation |
|---|---|---|
| Timing | Continuous, throughout implementation | Periodic โ mid-term, end-of-project, post-project |
| Primary question | Are we doing what we planned to do? | Did it work? Why or why not? |
| Focus | Inputs, activities, outputs | Outcomes, impact, sustainability |
| Who leads | Project management team | Can be internal or independent evaluators |
| Output | Progress reports, dashboards, tracking tables | Evaluation reports with findings and recommendations |
Together, monitoring and evaluation form a feedback loop. Monitoring data feeds into evaluations. Evaluation findings improve the next planning cycle. Without both operating in tandem, a project team is working without feedback on whether its efforts are producing change.
Every M&E framework is built on a results chain, also called a logic model. The logic model maps the causal sequence from resources invested to change achieved. It is the structural backbone of any M&E system because each point in the chain must be measured, and the connections between points must be explained.
The distinction between outputs and outcomes is where most project teams go wrong. Outputs are the direct products of activities: the number of health workers trained, the number of informational materials distributed, the number of facilities equipped. Outcomes are what changes as a result: whether trained workers improved their clinical practice, whether informational materials shifted community behavior, whether equipped facilities saw increased utilization. Outputs are under the project's direct control. Outcomes are influenced by the project but shaped by many other factors as well.
Impact sits further along the chain. It refers to long-term changes in health status at the population level โ reductions in maternal mortality, declines in child stunting, or improvements in TB case notification rates. Most health projects contribute to impact rather than directly cause it, and any honest M&E framework should reflect that distinction. The W.K. Kellogg Foundation formalized the logic model components in its development guide, and the approach has since been adopted by the CDC, USAID, UNDP, and most major funders as the standard structure for results-based programming.
A logic model shows the chain. A Theory of Change (ToC) explains why each step should lead to the next โ it makes the assumptions explicit. A ToC answers: under what conditions do trained health workers actually change their practice? What needs to be true about the health system for improved practice to translate into better patient outcomes? What external factors could prevent this from happening?
A project in Eastern Visayas trained community health workers to counsel pregnant women on antenatal care. The assumption was that counseled women would attend at least four ANC visits. Monitoring data showed counseling sessions were delivered as planned. But utilization did not improve. The mid-term evaluation traced the problem back to the ToC: the assumption was wrong. Transportation cost โ not knowledge โ was the primary barrier to attendance. The M&E system caught this at mid-term, allowing the project to redesign its approach before the end of implementation.
The ToC becomes the governing document of the M&E system. Each outcome in the causal chain gets its own indicator, data source, collection method, and reporting frequency. Without a ToC, the M&E plan has no conceptual foundation, and the indicator list becomes a set of measurements without logic connecting them to the project's theory of how change happens.
A logic model fits on a single page. The narrative ToC document that supports it typically runs three to six pages, covering the problem statement, the causal logic, the key assumptions, and the external factors that could affect results. Annexes carry indicator definitions, data collection methods, and baseline values.
An indicator is a specific, measurable variable that tracks whether a result is being achieved. Vague indicators produce useless data. Every indicator in an M&E framework must meet the SMART criteria: Specific, Measurable, Achievable, Relevant, and Time-bound.
| Criterion | What it means | Example |
|---|---|---|
| Specific | Clearly defined with no ambiguity in what is being measured | "Percentage of pregnant women with four or more ANC visits" โ not "improved antenatal care" |
| Measurable | Can be quantified or objectively assessed | Has a defined numerator and denominator, or a clear yes/no criterion |
| Achievable | Realistic given project scope, timeframe, and resources | Target is grounded in baseline data and comparable program experience |
| Relevant | Directly linked to the project objective it is meant to measure | If the goal is reduced maternal mortality, measure skilled birth attendance โ not only facility visits |
| Time-bound | Measurement has a defined reference period | "By end of Year 2" or "within 12 months of training completion" |
Each indicator also requires a data source, a responsible person or unit, a collection method, and a reporting frequency. These are compiled in the indicator reference sheet โ sometimes called the indicator tracking table or performance monitoring plan depending on the donor. This document is the operational core of any M&E plan and must be maintained throughout implementation, not filed after project design.
An indicator without a baseline is not useful. A baseline is the value of an indicator at the start of the project, before any intervention is delivered. It establishes the starting point against which all future progress is measured. Without a baseline, it is impossible to determine whether any change actually occurred as a result of the project.
For most health projects, three data collection points are standard. The baseline is conducted before or at the very start of implementation. The midline assessment, typically at the halfway point of the project, checks whether the project is on track and creates space for course corrections. The endline assessment, conducted at or near project close, measures the total change achieved and serves as the primary input for the final evaluation.
A project without a baseline is a project without a story. It can report what it did. It cannot prove what it changed. โ Adapted from UNAIDS M&E Fundamentals Series
The same data collection method must be used at baseline, midline, and endline to ensure comparability. Switching from a household survey at baseline to facility register data at endline produces figures that cannot be compared. Consistency of method is not optional โ it is the condition that makes the data defensible when presented to a donor, a government partner, or an independent evaluator.
In the Philippines, Lot Quality Assurance Sampling (LQAS) is widely used as a resource-efficient alternative to full household surveys for coverage monitoring. LQAS uses a sample of 19 respondents per supervision area and has been applied extensively in Eastern Visayas health programs, including the KOICA-funded MNCH project in Samar and Southern Leyte, where it was used for cross-sectional surveys at baseline and midline.
Evaluation is not a single activity conducted once at the end of a project. Different evaluation types serve different purposes and are conducted at different stages, each answering a different set of questions about program performance.
| Type | When conducted | Primary purpose |
|---|---|---|
| Formative | Early implementation or during pilot phase | Tests feasibility and acceptability; identifies design problems before they scale |
| Process | During implementation | Assesses how well activities are being delivered; identifies operational gaps and quality issues |
| Mid-term | Midway through the project | Checks progress toward outcomes; enables major course corrections while time remains |
| Summative / End-of-project | At or near project close | Judges whether stated objectives were met; produces evidence of effectiveness for accountability and learning |
| Impact | After project ends, often years later | Assesses long-term population-level change; often uses a comparison group to strengthen attribution |
For most health projects funded by international development agencies, a mid-term and a final evaluation are the minimum requirements. The terms of reference for these evaluations should be agreed upon with the donor at the start of the project and should form part of the original M&E plan โ not commissioned as an afterthought when the project is already winding down.
Formative evaluations are particularly valuable for pilot programs. They help assess whether a new health intervention is feasible and acceptable before it is scaled, identifying early what is working, what needs adjustment, and what data will be needed for subsequent evaluations. Programs in the early stages of development may not yet be delivering all intended services or reaching all target populations, and formative evaluation captures that reality honestly.
The M&E plan is the primary governing document for a project's monitoring and evaluation system. It is not an annex to be filed and forgotten after project design. It is a working document that guides every data collection activity from launch to close, and it must be updated as the project evolves.
A complete M&E plan contains the following elements:
Health projects in the Philippines operate within a national policy framework that increasingly demands results-based management at all levels โ from government agencies to implementing partners and technical assistance providers.
At the national level, the National Economic and Development Authority and the Department of Budget and Management jointly issued the National Evaluation Policy Framework (NEPF) under Joint Memorandum Circular No. 2015-01. The NEPF provides the standard for evaluation conduct across all government programs and projects, including those implemented with official development assistance. Its stated purpose is to support good governance, transparency, accountability, and evidence-based decision-making in the public sector. In practice, this means that any consulting organization or implementing partner working with DOH or LGU-implemented programs must treat evaluation not as an optional add-on but as a built-in accountability requirement.
In 2016, DBM introduced the Results-Based Monitoring, Evaluation, and Reporting Policy Framework to further standardize how performance information is generated and used across government. Together, these frameworks mean that results-based management is now the language of public sector health programming in the Philippines โ and any organization that cannot speak it will struggle to engage credibly with government counterparts or development partners.
NEDA's Results Matrix, derived from the Philippine Development Plan, is the primary instrument for monitoring progress toward national development goals. It follows the logical framework approach and serves as the guide for planning, programming, and budgeting across all implementing and oversight agencies. Health project indicators at the program level should, where applicable, align with the corresponding outcomes tracked in the Results Matrix. Misalignment between project-level and national-level indicators creates reporting complications and weakens the case for project relevance during evaluation.
IDinsight's engagement with NEDA and four Philippine government departments โ including the Department of Health โ to develop learning and evaluation roadmaps was part of a broader effort to operationalize the NEPF. The engagement included building theories of change, conducting evaluability assessments, and identifying evidence needs, demonstrating the kind of technical assistance the framework envisions for all major government programs.
For organizations preparing to engage with WHO, UNFPA, UNICEF, or other UN agencies on health programs in the Philippines, it is also worth noting that all UN agencies are expected to follow the common evaluation standards of the United Nations Evaluation Group. These standards cover independence, impartiality, credibility, and the utility of evaluation findings โ and they apply to implementing partners through their contractual agreements with the respective agency.
No single data collection method is sufficient on its own. Strong M&E systems combine quantitative and qualitative approaches so that numbers are always accompanied by an explanation of what they mean and why they moved in the direction they did.
Household surveys are the standard instrument for measuring population-level outcomes such as vaccination coverage, ANC utilization, contraceptive prevalence, or skilled birth attendance. They produce statistically representative data but require substantial planning, trained enumerators, and dedicated budget. For programs with limited resources, LQAS provides a resource-efficient alternative. LQAS uses a sample of 19 respondents per supervision area and a decision rule that allows programs to classify whether coverage in a given area meets or falls below a defined threshold. It has been used extensively across Philippine maternal and child health programs.
Routine data from facility registers, the Field Health Service Information System (FHSIS), and the Philippine Health Information System provides continuous administrative data on service delivery. It is available without additional survey costs but is often incomplete or inconsistently recorded, particularly at the barangay health station level. Any M&E system that relies heavily on facility data must include explicit data quality checks and triangulation with other sources.
Key informant interviews and focus group discussions provide the qualitative layer of understanding: why utilization is low, what barriers communities face, how health workers experience a program in practice, and what the numbers do not capture. These methods are particularly valuable for process evaluation, for interpreting unexpected findings in quantitative data, and for documenting community perspectives on program relevance and acceptability.
Mobile data platforms using tools such as KoBoToolbox, ODK Collect, and DHIS2 are increasingly standard in donor-funded health programs. UNICEF's handover of DigiVacc to the DOH in 2025 โ a digital immunization suite funded by the Government of Japan โ reflects the broader push toward real-time digital monitoring of health program coverage. Organizations proposing digital data collection should include a connectivity assessment and an offline data capture protocol, given the infrastructure realities in many target communities.
Most M&E frameworks fail not in design but in implementation. The failure modes are well documented and largely preventable.
Once activities are underway, baseline data cannot be collected. This single error makes it impossible to demonstrate change at endline, regardless of how strong the project results actually were. The M&E plan must be developed before implementation begins, and baseline data collection must be completed before the first activity reaches beneficiaries.
Projects with forty or fifty indicators cannot realistically collect quality data on all of them within available resources. A disciplined M&E framework selects fewer indicators and collects them well. A small number of high-quality, well-verified measurements is more defensible to a donor or evaluator than a large dataset with inconsistent collection and doubtful accuracy.
When the M&E function is isolated from the program team, monitoring data does not reach the people who can act on it. M&E findings should feed directly into management decisions on a regular basis through a defined learning and adaptation process โ not only at mid-term or final evaluation when course correction is no longer possible.
Routine data quality assessments, supervisory spot-checks, and triangulation between data sources are not optional add-ons. They are what make monitoring data credible when presented externally. Harmonized Approach to Cash Transfers micro-assessments, which UN agencies conduct before awarding implementing partner agreements, specifically examine financial management and data quality systems. A weak data quality assurance process is a known risk indicator for implementing partner performance.
One of the most contested questions in health project M&E is whether a project caused the change it is reporting. In most implementation contexts, the honest answer is that the project contributed to change but did not exclusively cause it. Other programs, government initiatives, demographic trends, and external factors all operate in the same environment simultaneously.
Experimental designs with randomized control groups can establish stronger attribution but are expensive, ethically complex in health settings, and rarely feasible for standard implementation projects. Most health programs use contribution analysis instead โ a structured approach to building a credible case that the project's activities were a significant contributing factor to the observed change, while acknowledging the role of other forces. A well-reasoned contribution narrative, supported by consistent monitoring data and a coherent Theory of Change, is more credible than inflated attribution claims that cannot withstand scrutiny.
Experienced funders operating in the Philippines โ including KOICA, JICA, UNFPA, and the World Bank โ understand the limitations of attribution in complex health systems. What they look for is not proof of exclusive causation but a credible, evidence-backed argument for the project's contribution to the results observed. That argument is built through the M&E system, over the life of the project, and documented in the final evaluation report.
Building an M&E Framework for Your Health Project?
SPHERES, Inc. provides technical assistance in M&E framework design, indicator development, data quality assessment, and evaluation for health programs across the Philippines.
Get in Touch