Putting Solutions on Trial

Impact Evaluation and the Millennium Villages Experiment in Africa


Update: The United Nations Development Programme has responded to the article. See below.


Villagers of Bar Sauri in western Kenya, showing a crop of maize. / CIMMYT / flickr (cc)

The Millennium Villages project (MVP) is an ambitious program to break targeted African communities free from extreme poverty and thereby demonstrate how poverty in general can be eradicated. Run by the United Nations Development Programme [*] and Columbia University’s Earth Institute—headed by renowned economist Jeffrey Sachs—the MVP began its pilot phase in a Kenyan village in 2004 and has grown to include numerous rural village clusters in nine other countries. It hopes to scale up its approach across much of Africa, and has plans on other continents as well.

The MVP’s strategy employs an intense, multi-pronged aid package for agriculture, education, health, and infrastructure—with a total cost of about $1.5 million per village cluster spread out over five years. It seeks to disrupt, all at once, the factors that together create the conditions for destitution. The MVP treatment simultaneously deploys a variety of aid-financed interventions, including fertilizer, anti-malaria bed nets, and school meals, among many others. Where a single intervention might leave villagers trapped in poverty by other problems, the MVP asserts that its simultaneous execution of several interventions can break villages out of poverty traps for good.

Although its aims are noble and its leaders well-intentioned, the MVP suffers from a glaring omission: it is not conducting an objective and independent evaluation of its impacts. The MVP downplays the importance of impact evaluation, since it accepts each element of its intervention package as already “proven.” According to the MVP’s view, as long as basic science has already shown that fertilizer can raise crop yields, bed nets reduce malaria transmission, and so on, there is no need to prove that its multi-pronged package of interventions can, in context, do what it claims. For example, it claims that this package can spark “self-sustaining economic growth” in isolated rural villages.

By ignoring the crucial step of objectively and independently evaluating its interventions’ impacts in context, the MVP cannot assure its donors that it has succeeded in achieving the promised development impact. Skipping this step means possibly wasting scarce aid money on one intervention, while a more effective one goes ignored.



• • •


Impact evaluation is a set of applied scientific tools to measure exactly how a particular community improved due to a given anti-poverty project, relative to how that community would have faired without that project. With so many solutions on offer for eliminating global poverty—population control, girls’ schooling, debt forgiveness, high-yielding seeds, free trade, microcredit, property rights, public-private partnerships, and information technology, to name a few—impact evaluation provides a basis for adjudicating among them and determining which package of solutions is most effective for a particular context. What science has shown to be effective generally may not work in a particular milieu; what succeeded in a Malawian village, for example, may not work in Burundi.

Despite its promise, there is surprisingly little impact evaluation of this kind performed in global-development circles, but there could easily be more. The MVP is no exception.

Since the MVP views the “solution to extreme poverty” as essentially technological, it is concerned primarily with how to deliver technology. But this is profoundly problematic as development policy. A clear scientific finding in one context may tell us little about what makes a sensible development project.

The Millennium Villages project cannot assure its donors that it has achieved the promised development impact.

Take fertilizer. The basic scientific result is that fertilizer raises crop yields. But, first, this does not mean that any given aid package that includes fertilizer can end poverty and unleash self-sustaining economic growth in a small village, even in combination with other efforts. Second, that finding does not mean that any given project promoting fertilizer use will actually influence use. There may be contextual reasons why farmers in a given place and time dislike fertilizer. Third, numerous forces other than the project can affect crop yields: that crop yields went up after a village received fertilizer does not mean that the fertilizer was primarily responsible, or that lack of fertilizer is the binding constraint on crop yields in villages elsewhere.

Just as the soundness of the basic science does not prove the success of an intervention, so too does in-house, qualitative evaluation obscure results. Yet this has been the MVP’s approach.

Together with my co-author, Gabriel Demombynes of the World Bank’s Kenya office, I have shown how wrong a project can go when it skips rigorous impact evaluation. Last year, the MVP issued a report (PDF) showing improvement in various development indicators at its sites. This work, described by the project’s leaders as a “major scientific report,” shows increased cell-phone ownership, access to improved drinking water, and crop yields. The report describes all of these advancements and many others as the “impacts” of the intervention.

In one instance, the report describes an increase in mobile phone penetration from 4 percent to 30 percent as among the project’s “biggest impacts” in the Ghanaian village of Bonsaaso.



[From “Harvests of Development in Rural Africa” (PDF), p. 91]



Yet the same changes have been going on all around Bonsaaso, in areas untouched by the project.





Considering all of the indicators and countries that public data allow, Demombynes and I have found that the MVP claims as its “impacts” several increases in indicators that are on the rise throughout the regions and countries where their sites are located—that is, in places untouched by the project.

In some cases, increases away from the intervention sites have been smaller than increases at the intervention sites, suggesting that the MVP is having an effect. But while the MVP does intend to go beyond before-and-after analysis and observe outcomes at “comparison” villages that did not receive the intervention, its approach falls short of genuine impact evaluation. For example, it is unclear how the current intervention sites were chosen, which makes it difficult to know if they were unusually likely to respond well to the project. And the MVP only chose comparison villages and started to collect data on them years after the project began, rendering them an unreliable control group.

Though development organizations frequently make overly optimistic claims about their impacts, the MVP is supposed to be different. Its publications, including the report referenced above, are prominently labeled as a product of Columbia University—one of the world’s leading institutions of scientific research. The MVP often claims unique credibility due to its ostensible focus on science. And its special status as a flagship initiative of the United Nations [*] gives it special responsibilities in this regard. We have a right to expect higher standards.



• • •


United Nations Development Programme Administrator Helen Clark and economist Jeffrey Sachs / Maureen Lynch, UNDP / Wikimedia Commons

There are many possible ways to evaluate the MVP’s impact more credibly. One way depends on the kind of analysis mentioned above: comparing the intervention sites to trends happening around those sites. An even better way, feasible but more demanding, requires a randomized treatment design that has become standard in the research literature of development economics. When money is available to intervene at several more sites—only twenty would be needed, far fewer than the MVP ultimately intends—an independent agency should randomly pick which sites receive the intervention. It would pick one from each of twenty matched pairs of candidate sites. That agency should then follow both members of each pair for several years after the intervention ends, to evaluate long-term impact.

Such engaged, rigorous, objective experimentation would resolve the problems with settling for basic science and technology. It would judge the effect of the project against its goals, clearly document that the project was their cause, and be free of potential conflicts of interest. And as we have shown, the price would be less than a sixth of the intervention cost per village—a tiny investment that is surely worthwhile if the intervention is going to spread across the continent, affecting millions of hopeful Africans.

Rigorous impact evaluation has many limitations. It can be hard to know if the effect measured in a small experiment would hold in a much larger intervention, hard to know on what factors the measured effect depends, and hard to measure all of the desired impacts comprehensively. These issues are now the subject of intense debate among the best minds in development economics. Whatever the standard, it is apparent that rigorous and independent impact evaluation should not be required for every project. Few would claim, for example, that emergency shipments of drinking water after an earthquake require rigorous evaluation.

But careful impact evaluation should be used when the benefits outweigh the costs. This is likely to happen when, first, the cost of properly collecting evidence is relatively low; second, a policy decision is not needed more quickly than rigorous analysis can be performed; third, the consequences of applying the wrong policy are particularly bad; fourth, resource constraints make it impossible for everyone to receive the intervention at once; fifth, strong interests on different sides of a policy decision necessitate an objective criterion of success; and finally, the carefully evaluated setting is similar enough to the scaled-up setting.

Many development projects, including the MVP, meet all of these conditions, so their impacts should be carefully and independently evaluated. A model in this regard is Mexico’s PROGRESA (the Education, Health, and Nutrition Program)—later renamed Oportunidades (Opportunities)—which rewarded low-income families with cash for enrolling children in school. Its initial phase was built on a careful, randomized impact evaluation executed by researchers outside the Mexican government. After this independent evaluation demonstrated large positive impacts, similar policies were greatly expanded in Mexico, survived a major change of government, and spread throughout Latin America. In many of the countries where the idea was later applied, it was rigorously evaluated anew, in context, and independently.

The challenges of impact evaluation are many and can frustrate the development of urgently needed delivery technologies in a world in which aid funding is scarce and has countless plausibly beneficial uses. Yet it is essential to assuring that promised aid is in fact doing more good for more people. The long, troubled history of efforts to help the poor suggests that we have much to learn.



UPDATE (June 23, 2011): Since publication of the article, a spokesperson for the United Nations Development Programme (UNDP) has told us the following:

(1) The UNDP has not co-run or co-led the Millennium Villages project. The role of the UNDP has been to provide project management services (e.g. logistics, recruitment, etc.) within a technical and scientific framework led by the Earth Institute of Columbia University. This participation has been focused primarily on the first five-year phase of work, which is coming to an end this month. Since the UNDP provides these services to a large number of other projects, the MVP doesn’t not enjoy a special status as one of UNDP’s flagship initiatives.

(2) The UNDP believes strongly in the role of evaluation, including the recent advances cited in the article, in identifying what works and what doesn’t across a wide variety of contexts in the developing world. The focus of UNDP’s engagement with the Earth Institute over the past several months has been on the rigorous identification of policy-relevant lessons learnt from the MVP that can help inform the wider development community, and also be used by UNDP itself.

Boston Review’s sources of information regarding the relationship between UNDP and the MVP included public websites, such as the Millennium Villages website, that identified UNDP as the “lead implementing partner” of the project. As of June 23, the Millennium Villages website has been corrected.


Post this page to: del.icio.us Yahoo! MyWeb Digg reddit Furl Blinklist Spurl

Comments

1 |
poverty reduction measures
Loved the article topic. Important conversation. But please get your work checked by a development expert. Population control and free trade aren't poverty interventions like girls access to school. By stating population control, you further the misconception that the developing world is suffering a population explosion. Increase access to education, health care & family planning is what reduces family size. Also it isn't just about free trade, but access to credit
— posted 06/17/2011 at 00:00 by bob
2 |
coko
Poverty in one country has a lot to do with leadership style. Read this to understand what leadership style means: http://www.flockee.com/opinion/adolf-hitler-leadership-style-vs-modern-leaders-style-1/
— posted 06/17/2011 at 16:58 by cokolol
3 |
No problem with BR's fact-checking
Bob: Population control and free trade have been frequently discussed in international policy circles as being key to long-term poverty reduction --- by development experts. There is no problem with BR's fact-checking. You must not have been around in the 1970s and 1980s.
— posted 06/19/2011 at 14:10 by Carlos Coelho
4 |
Achieving the Millennium Development Goals by 2015
Michael Clemens continues to misstate the objectives, methods, and results of the Millennium Villages Project (MVP). The MVP is a systematic and carefully documented program of community-led rural development to achieve the Millennium Development Goals in impoverished regions of rural Africa. The Millennium Villages are achieving great progress towards the Millennium Development Goals, much faster than is typical in rural Africa. The project is being documented in great detail and it uses many means of evaluation appropriate to the design, goals and conditions of the project, including a peer-reviewed protocol filed at The Lancet. The MVP is also publishing scientific reports across a range of disciplines, which may be found at http://millenniumvillages.org/resources/science-documents/.

The objectives of the ten-year Millennium Villages Project to 2015 include but are not limited to the following: (1) to achieve the MDGs in ten distinct agro-ecological zones representing a wide array of rural Africa; (2) to document the achievements and the methods of the project in detail; and (3) to create a number of tools and operational systems that can be used widely in Africa and other low-income rural settings. The project is on track to achieve all three objectives, reflected in the rapid progress towards the MDGs (with more than four years remaining to achieve the goals); the detailed information that is being collected and monitored to measure and document the project at all stages of the ten-year program; and the many tools in health, education, infrastructure, farming, and other areas that are finding wide application in Africa beyond the village sites themselves (such as new tools in e-health, soil mapping, ICT-based mapping systems, and many other applications). We invite readers to please check the MVP website (www.millenniumvillages.org) for the latest updates and information.
— posted 06/23/2011 at 15:57 by Kyu Lee
5 |
Response to: Achieving the Millennium Development Goals by 2015
Kyu: You state that “The Millennium Villages are achieving great progress towards the Millennium Development Goals, MUCH FASTER than is typical in rural Africa”. I wonder, what evidence do you base this statement on? Surely not the Millennium Villages Progress Report, which makes no effort to compare changes in the Millennium Villages to others that are not part of the project. You cannot make such a statement by simple before and after comparisons within the SAME village. As Michael says, just because mobile phone usage in Bonsaaso increased from 4% to 30% does not mean that Bonsaaso saw a greater rate of increase in mobile phones “than is typical in rural Africa”. In fact, it seems from the figure above that mobile phone usage in the rural areas of the same region as the millennium village grew at the same or faster pace without any assistance from the Millennium Villages project.

This is a typical example of the inadequate, even misleading evaluation that the Millennium Villages project has done so far. Furthermore, since the project did not collect baseline data on any comparison villages, it is not possible to know whether the millennium villages have gotten CLOSER to achieving objective (1), which you state is “to achieve the MDGs in ten distinct agro-ecological zones representing a wide array of rural Africa” than it would have in the absence of the Millennium Villages intervention.

Rigorous evaluation becomes all the more important here when you consider the magnitude of the project – it costs $150 per villager over 5 years. Moreover, this large short-term burst of funding might be crowding out what might have happened in its place. What will happen to the computers, schools, hospitals once the Millennium Villages project pulls out? Who will pay to keep the doctors in these villages?
— posted 06/23/2011 at 21:56 by Tejaswi Velayudhan
Name
E-mail (Will not appear online)
Title
Comment
To prevent automated Bots from spamming, please enter the text you see in the image below in the appropriate input box. Your comment will only be submitted if the strings match. Please ensure that your browser supports and accepts cookies, or your comment cannot be verified correctly.



Powered by Comment Script
del.ici.ous  stumbleUpon  Reddit  Facebook    Digg   RSS Feed Icon

About the Author

Michael A. Clemens is Senior Fellow at the Center for Global Development.

Small Changes, Big Results, a forum on applying behavioral economics to global development.

Can Technology End Poverty?, a forum on the potential of ICT for global development.


http://www.facebookloginhut.com/facebook-login/  http://www.facebookloginhut.com/facebook-login/ http://www.facebookloginhut.com/facebook-login/ 



Boston Review Newsletter