12.12.2006   

EN

Official Journal of the European Union

C 302/1


SPECIAL REPORT No 10/2006

on ex post evaluations of Objectives 1 and 3 programmes 1994 to 1999 (Structural Funds), together with the Commission's replies

(pursuant to Article 248(4), second subparagraph, EC)

(2006/C 302/01)

TABLE OF CONTENTS

Glossary
SUMMARY  I-XI
INTRODUCTION  1-9
  Framework for the evaluation  6-9
  The basic objectives established for each evaluation  7-9
AUDIT SCOPE  10-14
EX POST ASSESSMENT OF OBJECTIVE 1 BY DG REGIO  15-80
  The Court’s assessment of the quality of the evaluation  16-66
    Preliminary remarks  16
    Evolution of issues confronting the regions since 1994  17-20
    Appropriateness of strategies adopted  21-28
    Evaluation of effectiveness  29-40
    Evaluation of efficiency  41-44
    Evaluation of the impact of Structural Funds  45-53
    Evaluation of the management systems  54-66
  Adequacy of the framework for evaluation  67-75
    The terms of reference  67-70
    Adequacy of supervision by the Commission  71-75
  Relevance of evaluation recommendations for future programming periods  76-80
EX POST ASSESSMENT OF ESF OPERATIONS (UNDER OBJECTIVES 1 AND 3) BY DG EMPL  81-113
  The Court’s assessment of the quality of the evaluation  82-113
    Review of shortcomings in the data and in the analysis  82-87
    Assessing the impact of ESF operations  88
    Evaluation of equal opportunities  89
    Other aspects examined  90-93
    Comments on the overall assessment by the Commission  94-96
    Other insights from the national reports  97-102
  Adequacy of the framework for evaluation  103-110
  Recommendations concluding the evaluation  111-113
CONCLUSIONS  114-118
RECOMMENDATIONS  119-125
ANNEXES:
Annex I — Breakdown of aid under the Structural Funds
Annex II — GDP and the employment rate
Annex III — GDP growth and the Structural Funds
Annex IV — HERMIN simulations and unemployment
The Commission's replies

GLOSSARY

ADAPT: Adaptation of the Workforce to Industrial Change (a Community initiative to promote employment and the adaptation of the workforce to industrial change)

ALMP: Active Labour Market Policies

CSF: Community Support Framework

Deadweight: Change observed among direct addressees following the public intervention, or reported by direct addressees as a consequence of the public intervention, that would have occurred even without the intervention

Displacement effect: Effect obtained in an eligible area at the cost of another area

ERDF: European Regional Development Fund

ESF: European Social Fund

FIFG: Financial Instrument for Fisheries Guidance

HERMIN model: Macroeconomic model used to simulate the impact of SF interventions

Inception report: Report containing methodological questions such as the evaluation objectives, the methodological framework, the questionnaires and the organisations to be contacted

Monitoring Committee: Committee to oversee the strategic implementation of the funding programme

OP: Operational programme

RTD: Research and technical development

SFs: Structural Funds

SME: Small and medium-sized enterprises

Soft outcome: Result difficult to measure quantitatively

SPD: Single programming document

Synthesis Report: Final reports of DG REGIO (concerning Objective 1) and of DG EMPL (concerning Objective 3) about the ex post evaluation of operations financed by the ERDF and the ESF

ToR: Terms of reference

SUMMARY

I.

The audit concerned the Commission's ex post evaluations of Structural Funds interventions in Objective 1 and 3 regions over the 1994 to 1999 programming period, totalling in excess of 245 billion euro, including matching private and public sector investments.

II.

Ex post evaluation seeks to compare actual expenditure outcomes against the original objectives. It is the mechanism whereby insights gained during one programming period are exploited to make future SF expenditure more effective. The ex post evaluations were carried out by the Commission, which relied on contracted external consultants.

III.

The audit objectives were to examine whether, at the conclusion of the ex post evaluation exercise, the Commission was able to estimate the effects of the interventions relative to the objectives, and whether the results and recommendations were helpful for the revision of the programmes from 2000 onwards.

IV.

The starting point for the audit was the final Synthesis Report for each fund. These reports summarise the overall assessment of the impact of the funds and capture the elements that should be of relevance when taking decisions concerning subsequent planning periods. The next step in the audit was to assess the extent to which any identified deficiencies could be traced back to weaknesses in the ToR or problems during implementation of the evaluation process.

V.

Evaluating Structural Fund expenditure is an intrinsically complex process. However, the audit identified weaknesses in the evaluation process that cannot be attributed merely to such inherent constraints.

VI.

The Court identified the following weaknesses in the evaluation of SF interventions in Objective 1 regions:

there were discrepancies in the underlying data,

the assessment of the effectiveness of SF expenditure was not robust enough due inter alia to limited data on outputs and results collected by Member States,

the evaluation did not fully succeed in comparing the unit costs of projects receiving assistance under the SFs,

the evaluation was largely based on a macroeconomic model (HERMIN) which suffered from significant limitations.

VII.

The Court found that the evaluation of ESF operations under Objectives 1 and 3:

was in part based on unreliable or incomplete data,

reached some conclusions which were not adequately supported by the evidence, thereby casting doubt on the methodology employed to reach them.

VIII.

The Commission does not appear to have anticipated some of the difficulties involved in the evaluations in the terms of reference which it formulated for them. Nor did the Commission fully address the shortcomings in the ex post evaluation exercise. The Court’s findings indicate that there were weaknesses in the oversight of the evaluations.

IX.

On the other hand the evaluations yielded useful insights concerning the management systems for ERDF and ESF-funded interventions.

X.

However, while some worthwhile and useful recommendations were made as input for future programming periods, the Commission could not benefit fully from the experiences of the 1994 to 1999 programmes.

XI.

The Court recommends that

the Commission should, for future evaluation exercises, ensure that available data and allocated resources are in line with the objectives set, which must be realistic, and that the terms of reference are established accordingly,

the Commission should give particular attention to the techniques used for measuring the economic impact of the funds. If a macro-economic model is used it must take proper account of the characteristics of the economies under review. Ideally it should be able to benefit from micro-data generated at the project level,

proper account should be taken of linkages with other studies relevant to the evaluation of the Structural Funds,

a number of specific issues might usefully be included in future evaluations, relating for example to private sector contributions, ‘financial engineering’ measures, project costs, project application and approval processes, and internal evaluations by national or regional authorities,

the Commission's oversight of the evaluation process should be improved inter alia through collaboration with research institutes and universities.

INTRODUCTION

1.

Since 1996, the Commission has been developing the evaluation of Structural Funds within a single framework. The term ‘evaluation’ is defined by the Commission as the ‘judgement of interventions according to their results, impacts and needs they aim to satisfy’ (1). The main purpose of the evaluation activity is to support decision making by providing information and judgements about the relevance, economy, efficiency, effectiveness, utility and sustainability of the impacts of EU interventions. Such information is crucial for improving existing, or designing new, interventions and for setting political priorities and allocating resources.

2.

Evaluations are carried out at various levels and stages in the programming cycle (six years), the most important of which are:

(a)

ex ante evaluations, conducted prior to the implementation of an intervention to contribute to its design and cost-effectiveness;

(b)

interim evaluations, conducted during implementation to improve the management of the intervention;

(c)

ex post evaluations, conducted after the completion of an intervention to assess the results achieved and lessons to be learned.

3.

The Court’s audit concerned ex post evaluations which the Commission carried out in connection with SF interventions in Objective 1 and 3 regions over the 1994 to 1999 programming period. For the purpose of this audit, the scope of ex post evaluation is taken as that defined in the MEANS collection (2), namely: ‘Ex post evaluation recapitulates and judges the entire programme, particularly its impacts. Its aim is to account for the use of resources and to report on the effectiveness and efficiency of interventions and the extent to which expected effects were achieved. It focuses on factors of success or failure, and on the sustainability of results and impacts. It tries to draw conclusions that can be generalised and applied to other programmes and regions’ (3). It is against this background that the ex post evaluation exercise has been assessed.

4.

The ex post evaluation task is part of a ‘process’ that also draws on the results of ex ante assessment and mid-term review. Furthermore, evaluations are undertaken at various levels both national and thematic and culminate in a Synthesis Report. Together they constitute an important element in the management of the Structural Funds and, in particular, the programming and review of Community assistance. Responsibility for the evaluation process rests with the Commission (in partnership with the Member States). It is up to the Commission to decide which aspects of the evaluation to carry out in-house and which it needs to subcontract. In either case, overall and final responsibility still rests with the Commission.

5.

Over the 1994 to 1999 period, the aid scheduled under the Structural Funds (4) amounted to 136,4 billion euro for ‘objectives’ and to 14,5 billion euro for ‘Community initiatives’. Annex I provides a breakdown of the former amount by Member State and objective and of the latter by Community initiative. These funds were supplemented by at least a further 95 billion euro of matching private or public sector investment (5) bringing the total investment to over 245 billion euro.

Framework for the evaluation

6.

The ex post evaluations were carried out on behalf of and under the direction of the Commission. Following a public tendering procedure, a principal consultant was selected for each specified set of objectives and Community initiatives. In most cases, the consultant was supported by a team of evaluators in the individual Member States (6) whose role was to undertake the necessary evaluations of the respective programmes on the basis of a common methodological framework. Moreover, the evaluation process was monitored by a steering group. The evaluations resulted in national reports prepared by the national evaluators and a Synthesis Report prepared by the contracted consultants for each specified set of objectives. These reports were completed between early 2003 and mid-2004.

The basic objectives established for each evaluation

7.

The DGs involved opted for a wide-ranging approach covering a broad list of issues. The overall objectives of the evaluations were to:

(a)

assess the results and impacts of the operations in the Member States vis-à-vis their objectives and impacts on persons and systems;

(b)

analyse the impact and effectiveness of the Community actions on specific structural problems and, as far as possible, to ascertain the Community value added of assistance.

8.

To meet these overall objectives, the terms of reference (ToRs) established a number of key questions around the themes of appropriateness of strategy, effectiveness, efficiency, impact, management and implementation systems, Community added value and lessons from current and for future programming. To structure the work, the terms also established a number of specified tasks and identified a certain number of techniques and sources of information.

9.

The ToR for each objective, as compiled by the Commission, thus provided the basis for the work of the evaluators. These were complemented by the inception report and the methodology guide for Objective 1, in January 2002. For ESF operations, they were complemented by the methodological progress report, in December 2002, and the evaluation questions, included in each national evaluation report. For this purpose, the evaluators relied on literature reviews, questionnaires, stakeholder interviews, the analysis of case studies and other field research as deemed necessary.

AUDIT SCOPE

10.

The general audit objectives were to examine whether the evaluations provided the Commission with adequate information to estimate the effects of the interventions relative to the objectives, and whether the results and recommendations were helpful for the revision of the programmes for subsequent programming periods.

11.

The audit focused on the ex post evaluations concerning ERDF interventions under Objective 1 and ESF interventions under Objectives 1 and 3, undertaken respectively by the Directorate-General for Regional Policy (DG REGIO) and the Directorate-General for Employment, Social Affairs and Equal Opportunities (DG EMPL). The relevant Synthesis Reports and a sample of national evaluations corresponding to these selected objectives and instruments (7) were examined; together, these accounted for 124 billion euro, that is 82 % of the total scheduled aid under the SFs for the 1994 to 1999 programming period (8).

12.

For each evaluation, the audit sought to answer three questions:

(a)

Was the evaluation of good quality?

(b)

To what extent were identified weaknesses due to deficiencies in the framework and supervision of the evaluation exercise?

(c)

Were the resulting recommendations well-founded, useful and well-suited to being taken into account in the following programming periods?

13.

In particular, the audit assessed the quality and completeness of the reports in terms of their evaluation of the appropriateness of the policies, and the effectiveness, efficiency and impact of the measures, as well as how the measures impacted on the management systems. It also assessed the adequacy of the Commission's role in the preparation, monitoring and supervision of the evaluations.

14.

The first part of this report is dedicated to the ex post evaluation of ERDF interventions in Objective 1 regions (paragraphs 15 to 80) while the second part covers the ex post evaluation of ESF operations under Objectives 1 and 3 (paragraphs 81 to 113).

EX POST ASSESSMENT OF OBJECTIVE 1 BY DG REGIO

15.

Before undertaking the detailed assessment of the quality of the evaluation, it is useful to provide a brief overview of the conclusions of the Commission's evaluation. These are summarised hereunder (see Text box 1).

Text box 1

The overall conclusions of the evaluation by DG REGIO

The Synthesis Report recognised that the Structural Funds have had a positive impact on the GDP of the Objective 1 regions and that their overall performance, relative to the EU as a whole, has improved. It considered that the extent of the beneficial impact has been heavily dependent on both institutional capacities and factors such as the structure and openness of the economy and that ‘these aspects are clearly crucial in particular to the capacity of the economy to respond to the challenges and opportunities of the process of economic integration’. It noted, however, that given the disparity of countries and contexts and because of limitations of the available monitoring data and lack of clearly defined objectives, it was not possible to aggregate the results to give a reliable picture of the overall Objective 1 achievements.

As an overall assessment of the appropriateness of the strategies adopted, the report concludes that the strategic approach to Objective 1 taken in 1994 must be ‘viewed as broadly appropriate in each of the Member States’. In its view: ‘The most significant criticism of the strategies was that they did not constitute much more than the sum of their parts. Whilst each was an adequate response to the identified needs in particular fields, there was less indication as to how these different elements contributed towards an overarching development strategy.’

In terms of effectiveness, the report indicated that the analysis relied to a substantial extent on ‘qualitative judgements rather than quantified comparisons of results with targets’ due to shortcomings in the available data. As a general comment, the report states that: ‘Many of the apparently most effective actions have been those of a largely “mainstream character.” Funding has largely tended to reinforce existing patterns of economic development related activity, rather than providing a basis for developing new activities to meet identified objectives or stimulating more innovative approaches.’

In terms of impact, the report relied, for most regions, on a macro-modelling approach to assess the impact of the SFs on economic and social cohesion. It recognised the fact that: ‘The emerging results inevitably flow to some extent from assumptions made within the modelling process.’ The report noted that the scale of their impact, in different cases, reflected the relative importance and allocation of the CSF resources and factors such as initial factor endowments, economic structures, levels of competitiveness and the degree of ‘openness’ of the economy concerned. It drew general conclusions concerning the impact of the Structural Funds relative to other external factors and indicated that, while job creation has been substantial, deadweight and displacement was also likely to be substantial.

The Synthesis Report also included a series of observations concerning management and implementation. Its overall conclusion concerning these aspects was that: ‘Management and implementation arrangements, though often characterised by weaknesses, were generally fit for the purpose, and significant benefits were achieved through the partnership approach. Decision-making structures typically failed to establish strong strategic direction. However, important benefits in terms of wider “added value” were achieved through the programming approach and the development of public sector management capacity and practices.’

The Court’s assessment of the quality of the evaluation

Preliminary remarks

16.

It is important to acknowledge that the broad approach adopted faced some intrinsic constraints, the principal ones being: time limitations, a limited range of quantified objectives and a lack of detailed result/impact indicators. Furthermore, the impact that can be attributed directly to the programmes and related measures was often difficult to isolate since the target regions frequently benefited from other operations that are instrumental in achieving similar objectives. It was also difficult to identify and measure the indirect effects of the Structural Funds expenditure. However, the Court considers that certain weaknesses in the evaluation process cannot be attributed merely to inherent constraints but were due to deficiencies in the evaluation process which need to be addressed, particularly so before the ex post assessment of the 2000 to 2006 programming period.

Evolution of issues confronting the regions since 1994

17.

The Synthesis Report starts with a brief analysis of developments in GDP per capita, unemployment and employment rates in the various regions concerned (9). The report concludes that: ‘Increased levels of GDP per head have generally not been the result of increased employment rates resulting from job creation’ (10). Such a conclusion would be potentially important for policy makers, suggesting that the main goal of cohesion policy of closing the GDP gap has not been significantly helped by improving labour participation rates (11), a point of major relevance to the SFs.

18.

From the information presented in the Synthesis Report, it is difficult to judge the validity of the statement quoted above. There are many regions where GDP per capita went up, as did the participation rates. However, there are some regions where either GDP per capita or the participation rate declined or increased only marginally. The Court has sought to assess the claim quoted in paragraph 17 by applying statistical techniques to the data published in the Synthesis Report. The technical details of this analysis are given in Annex II and the essential points are briefly summarised hereunder.

19.

The results obtained indicate that changes in employment had a statistically significant impact on real GDP per capita. In particular, a one percentage point increase in the employment rate was associated with a 0,24 percentage point increase in GDP per capita over the period under assessment. This result is inconsistent with the conclusion of the Synthesis Report quoted above. It also suggests that there could indeed be some potential to close the GDP gap by improving labour participation rates — a relevant result from the policy perspective.
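The relationship described above can be sketched as a simple ordinary least-squares fit of the change in GDP per capita on the change in the employment rate. The sketch below is purely illustrative: the regional figures are hypothetical, not the data used by the Court (whose full analysis is given in Annex II).

```python
# Illustrative sketch of the regression in paragraphs 18-19: OLS fit of the
# change in real GDP per capita (percentage points) on the change in the
# employment rate (percentage points). All figures below are hypothetical.

def ols_fit(x, y):
    """Return the OLS slope and intercept of y regressed on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    var = sum((xi - mean_x) ** 2 for xi in x)
    slope = cov / var
    return slope, mean_y - slope * mean_x

# Hypothetical changes over the period for six regions.
d_employment = [1.0, 2.5, -0.5, 3.0, 0.0, 1.5]
d_gdp_per_capita = [0.30, 0.55, -0.10, 0.80, 0.05, 0.35]

slope, intercept = ols_fit(d_employment, d_gdp_per_capita)
# A one percentage point rise in the employment rate is associated with a
# rise of 'slope' percentage points in GDP per capita.
print(round(slope, 2))  # → 0.24 for these illustrative figures
```

A statistically significant positive slope of this kind is what underlies the conclusion that changes in employment contributed to GDP per capita growth.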

20.

To explore further the robustness of the above result, the Court carried out an identical analysis of 145 regions for which data was consistently available (i.e. not just Objective 1 regions). The results, also reported in Annex II, are very similar across the Union. They indicate that the information in the basic data was not sufficiently analysed and point to an important relationship between increases in GDP and increases in the employment rate which the Synthesis Report did not highlight.

Appropriateness of strategies adopted

21.

The second chapter of the Synthesis Report examines the appropriateness of strategies adopted in terms of geographical and programme balance, as well as the balance between the funds. The planned and actual programme expenditures (12) of the Structural Funds are analysed by Member State and by type of fund but not by the relevant regions — in contrast to what is done in the report on GDP per capita and employment and unemployment rates in the various Objective 1 regions. This limits the analysis that can be performed, as is noted by the report itself in giving the results of a regression analysis which attempts to relate EU funds per capita by Member State to the GDP per head of population (13). Accordingly, the results given in the report are not statistically significant and are of limited value.

22.

However, the Court applied slightly different statistical techniques, using the same data as in the report. The technical details of this analysis are given in Annex III and the essential points are briefly summarised hereunder.

23.

The results obtained support the intuitive conclusion that the higher the EU funds provided per capita, the higher the GDP growth per capita in Member States. This result shows that, with a broader approach, relevant assessments can be made even from the unnecessarily limited database used in this important area.

24.

A central part of the assessment in this section of the Synthesis Report is based on a comparison of various categories of planned expenditure in 1994 and the estimated actual expenditure in 1999. However, the data presented suffers from a number of serious errors or discrepancies which limit the validity of the conclusions which can be drawn. In particular, in the case of five Member States, the figure for actual expenditure given under the ‘total’ column (in the relevant table published in the Synthesis Report) is different from the sum of the figures which appear under the individual columns that denote EU, public and private sector expenditure respectively (as they appear in the very same table). These discrepancies are summarised in Table 1. The percentage discrepancy, which is shown in the last column of this table, is significant and implies an inadequate quality control in the formulation of the report content and results.

25.

Notwithstanding these discrepancies, transforming the data given on a per capita basis (14) provides some useful observations which in some contexts differ from what was concluded in the Synthesis Report. Table 2 gives the actual per capita expenditure ranked from highest to lowest for each category.

Table 1

Summary of discrepancies noted in paragraph 24

                  Sum of columns (15)   Published total (15)   Difference (15)   Percentage discrepancy
Germany                41 015                48 243                 7 228                  –15
Italy                  34 804                30 547                –4 257                   14
United Kingdom          3 961                 4 803                   842                  –18
France                  3 339                 4 019                   680                  –17
Austria                 1 116                 1 040                   –76                     7
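The discrepancy check in Table 1 can be reproduced directly from the printed figures: the published ‘total’ column is compared with the sum of the EU, public and private columns, and the percentage discrepancy expresses the gap relative to the published total.

```python
# Recomputation of Table 1 from the figures printed in the report:
# (sum of columns, published total) per Member State.
rows = {
    "Germany":        (41_015, 48_243),
    "Italy":          (34_804, 30_547),
    "United Kingdom": (3_961, 4_803),
    "France":         (3_339, 4_019),
    "Austria":        (1_116, 1_040),
}

for country, (col_sum, published) in rows.items():
    difference = published - col_sum
    pct = round(100 * (col_sum - published) / published)
    print(f"{country}: difference {difference}, discrepancy {pct} %")
```

The computed percentages reproduce the last column of Table 1 (–15, 14, –18, –17 and 7 respectively).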


Table 2

Actual expenditure (16)

Total Actual/capita        EU per capita            Public Actual/capita     Private Actual/capita
Austria          3 866     Ireland       1 653      Netherlands    1 822     Austria        2 458
Netherlands      3 438     Greece        1 580      Austria        1 033     Germany        1 299
Germany          2 933     Portugal      1 403      Greece           739     Netherlands    1 039
Ireland          2 926     Spain         1 143      Ireland          703     Belgium          705
Greece           2 716     Germany         884      Portugal         621     Ireland          570
Portugal         2 472     Austria         657      Spain            582     Portugal         448
Spain            1 725     Italy           652      France           485     Greece           396
Belgium          1 587     France          621      Belgium          456     France           205
France           1 579     UK              605      UK               381     UK               174
Italy            1 445     Netherlands     578      Germany          311     Spain             na
United Kingdom   1 407     Belgium         427      Italy             na     Italy             na
Average/capita   2 104                   1 033                       419                     377

26.

The table shows that the total actual funds average was 2 104 euro per person. The EU Structural Funds contributed approximately half of this, while three of the four cohesion Member States managed around 50 % more than the average of 1 033 euro per person. Of particular importance is the contribution of the private sector, especially in the light of the emphasis on long-run supply side impacts on the regions. It appears that some Objective 1 regions in Member States, such as Austria, Germany and the Netherlands, have been particularly successful in combining private investment with the Structural Funds, even surpassing, significantly in the case of Austria and Germany, the average contribution from the EU Structural Funds.
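The ratios cited in this paragraph follow directly from the per capita figures printed in Table 2:

```python
# Checking the reading of Table 2 in paragraph 26 (figures in euro per person).
average_total = 2104   # average total actual expenditure per capita
average_eu = 1033      # average EU Structural Funds contribution per capita

# The EU Structural Funds contributed roughly half of the total:
print(round(average_eu / average_total, 2))  # → 0.49

# EU funding per person in the four cohesion Member States, relative to
# the 1 033 euro average:
eu_per_capita = {"Ireland": 1653, "Greece": 1580, "Portugal": 1403, "Spain": 1143}
for country, value in eu_per_capita.items():
    print(country, round(100 * (value / average_eu - 1)), "% above the average")
```

Ireland, Greece and Portugal come out above the average by roughly 60 %, 53 % and 36 % respectively, and Spain by about 11 %.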

27.

However, the factors contributing to a higher participation by the private sector remain unexplored in the report. For example, was this a result of the nature of the projects chosen, the skills of the human resources available or indeed a different kind of assistance that is given to the private sector from the funds? In the Court’s view, these aspects merited further examination.

28.

The assessment of appropriateness was seldom based on formalised techniques (17), and the practices followed by the national evaluators differed widely and were generally largely descriptive. The degree to which SF interventions address ‘market failures’ was seldom the explicit subject of ex post evaluation. Similarly, the changes in strategy during the programming period were not thoroughly examined for their appropriateness.

Evaluation of effectiveness

29.

Specific chapters in the national evaluations and in Chapter 3 of the Synthesis Report tackle the effectiveness of Structural Funds, based on the extent to which expected effects have been obtained and objectives have been achieved. Five themes are explored, namely Transport, SMEs, R&D, Education and Training, and Rural Development. Two horizontal themes dealing with the environment and equal opportunities are also examined.

30.

The analysis is based on a sample of 70 % of the value of Objective 1 programmes. However, a great deal of emphasis was placed on qualitative appraisals, for example rating Community added value on a ‘yes’ or ‘no’ basis. Such appraisals and opinions are often classified too broadly and are not always objectively verifiable. There was no attempt to set more concrete objectives by identifying criteria which could have been useful for this purpose, and no additional data was gathered for the purpose of micro analysis. Important considerations for a fuller assessment of effectiveness, such as deadweight and displacement effects, were often not considered, and when these aspects were raised, they were not analysed in sufficient detail.

31.

Although there are a number of sections with valid and useful observations, in general the Court finds this crucial chapter (18) in the Synthesis Report lacking in robustness, with missing data, often weak conclusions, and an insufficient or absent assessment of the results of other thematic studies commissioned by DG REGIO for the SFs for the 1994 to 1999 programme.

32.

For example, in the ex post assessment section on transport, in the tables dealing with actual and target kilometres of roads improved by country, about half the target information is either missing or incomplete, making it difficult to reach a robust conclusion on achievements as compared to targets. Yet the Thematic Evaluation (19) of the Impact of Structural Funds on Transport Infrastructures for the same programme period, which is quoted frequently in the report, provides a more complete and rigorous assessment of the results by Member State.

33.

Of particular relevance to the ex post assessment is the estimated employment impact of transport programmes given in the Thematic Evaluation Report on Transport. The total direct and indirect employment impact is estimated at 2,3 million person years (20). Furthermore, the economic benefits of the transport programmes given in the study are judged to be significant. The economic rate of return reported for road projects is 13 % in Ireland and 23 % in Spain (21).

34.

These two relevant conclusions of the Thematic Evaluation Report on Transport are not taken into consideration in the Synthesis Report. Furthermore, the employment impact estimate in the thematic evaluation is inconsistent with that of the ex post study. Based on calculations made by the Court, the total employment impact of Structural Funds over the period 1994 to 1999, implied by the study (22) (using the HERMIN model and the assessment of the other regions) is 1,62 million jobs, which is inconsistent with the estimate of 2,3 million jobs in the thematic report, especially since transport only accounts for approximately 20 % of all Structural Fund expenditure.
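The inconsistency can be seen with simple arithmetic on the two published estimates; the pro rata figure at the end is a purely illustrative benchmark, not a figure from either report.

```python
# Arithmetic behind the inconsistency flagged in paragraph 34.
total_sf_impact = 1_620_000      # jobs: the Court's calculation from the ex post study
transport_impact = 2_300_000     # person years: thematic transport evaluation
transport_share = 0.20           # transport's approximate share of all SF expenditure

# The transport estimate alone exceeds the implied total for all SF spending:
print(transport_impact > total_sf_impact)  # → True

# For comparison, a naive pro rata share of the total attributable to
# transport if employment impact scaled with spending (illustrative only):
print(round(transport_share * total_sf_impact))  # → 324000
```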

35.

In contrast, the results of the Thematic Evaluation of the Impact of the Structural Funds on SMEs are used in the Synthesis Report, which states: ‘The Thematic Evaluation of the Impact of Structural Funds on SMEs recommends a shift from grant expenditure to financial engineering measures such as seed and venture capital funds, loans, interest rate subsidies in the future, arguing that these methods are more sustainable in the long term’ (23). This is an important conclusion within the thematic study which the ex post assessment does well to bring out (24). However, the report does not develop this issue further despite the important implications that could be drawn for future programmes.

36.

The Synthesis Report quotes the Thematic Evaluation on the Impact of Structural Funds on SMEs as having estimated that ‘in the absence of Community support, 70 % of investment projects would have either not taken place at all, or been smaller in scale or postponed’. The thematic evaluation ‘further estimates that such assistance contributed to creating more than 300 000 additional jobs, even after taking account of “deadweight” and substitution effects. The ex post evaluation of Objective 1 is not able to replicate these figures owing to the lack of comparable and robust data on which to build such estimates’ (25). Given the significance of the different results and that the two studies were prepared for the same contractor, treated the same subject, dealt with the same period and were published only two or three years apart, a fuller treatment of their differences was warranted.

37.

Following the above statement, in a section dealing with an ‘overall analysis’, the Synthesis Report does give an estimate of the number of jobs created in SMEs as a result of the Structural Funds. ‘From the evidence available it appears that Objective 1 supported the development of some 800 000 jobs. This is in terms of gross jobs and does not take into account substitution, deadweight or multiplier effects. The overall effectiveness in terms of net effects is unknowable’ (26). This conclusion is not given sufficient attention, especially in the light of the estimate provided in the thematic study on SMEs. Furthermore, the table in the ex post study, giving a breakdown of employment created by Member State, provides no details for Spain, so that the estimate given is clearly incomplete (27). This aspect merited a more in-depth examination in the ex post evaluation.
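The difference between a gross figure and a net one can be sketched with a simple adjustment. The 800 000 gross jobs figure is the Synthesis Report's; the deadweight and substitution rates below are invented purely for illustration.

```python
# Hypothetical sketch of a gross-to-net adjustment. The 800 000 gross jobs
# figure is the Synthesis Report's; the deadweight and substitution rates
# below are invented purely for illustration.

gross_jobs = 800_000
deadweight_rate = 0.30    # assumed share of jobs that would have arisen anyway
substitution_rate = 0.15  # assumed share displaced from elsewhere

net_jobs = gross_jobs * (1 - deadweight_rate - substitution_rate)
print(f"Net jobs under these assumptions: {net_jobs:,.0f}")
```

Even with modest assumed rates, the net figure falls well below the gross one, which is why a gross-only conclusion deserved closer attention.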

38.

In the ‘other topics’ which the Synthesis Report covers under Effectiveness, there is a useful if brief summary of various pertinent issues under a number of headings. However, one underlying limitation is ‘weak comparative data which has forced a qualitative assessment rather than a detailed project-based assessment’ (28). Some of this qualitative assessment could have been better linked to other thematic studies undertaken, such as the thematic evaluation on RTD. The ToR state that: ‘A broad scoped literature review will be undertaken in order to identify key issues in the implementation of the Structural Funds’ (29). However, there are shortcomings in the way such implications are reported and linked to subsequent findings. In the few cases where there is such integration with the assessment, the benefits for the quality of the report are clearly evident.

39.

For example, the ex post assessment reports that the thematic evaluation on RTD notes that the focus of activities would be better directed at developing region-specific innovation strategies to exploit existing RTD capacity, rather than being based on perceived ‘Technology Gaps’. The ex post study reports that this was so in a number of cases, with results being much more successful when focused on local strengths (30). This example clearly highlights the relevance of building on what other reports and studies have demonstrated, as emphasised in the criteria developed in the MEANS programme and its update, which was included as part of the ToR of this evaluation exercise (31).

40.

The national evaluations were seldom based on a formalised or quantitative analysis of effects. Furthermore, the national evaluators rarely used the results of quantitative analysis carried out by other parties.

Evaluation of efficiency

41.

The aim of the Synthesis Report under this heading was the examination of the efficiency in the implementation of large projects supported in Objective 1 regions. The analysis was based on a geographically and thematically representative sample of approximately 60 large projects (32). Efficiency is understood to refer to the effects being obtained at reasonable cost. ‘The main objective is to establish unit costs, i.e. the cost per output achieved. The analysis should also address the quality of the management of large projects by the final beneficiaries. At EU level, unit costs should be analysed across Objective 1 regions. Significant differences in unit costs across regions should be explained where possible and a range of appropriate costs should be established, which may be used as the basis for benchmarking exercises in the future’ (33). These objectives are relevant and form a critical element in the efficient management of projects across the EU.

42.

However, these objectives were largely not met by the ex post assessment. This is admitted in the Synthesis Report, with the explanation that project managers found it virtually impossible to determine the individual component parts of the projects and consequently could not calculate unit costs (34). Such a broad statement implies that the state of knowledge on major projects funded by the SFs is so poor as to make this objective in the ToR unattainable. As a minimum, it should have been possible to arrive at comparable costs for typical outputs, e.g. the cost per square metre of building a factory, or per kilometre of road built.
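A minimal sketch of the comparable-cost computation suggested here, using invented road-construction projects (all regions, costs and outputs are hypothetical):

```python
# Minimal sketch of the unit-cost benchmarking the ToR called for, using
# invented road-construction projects (costs in million euro, outputs in km).

projects = [
    {"region": "A", "output_km": 40, "cost_meur": 120},
    {"region": "B", "output_km": 25, "cost_meur": 95},
    {"region": "C", "output_km": 60, "cost_meur": 150},
]

for p in projects:
    p["unit_cost"] = p["cost_meur"] / p["output_km"]  # million euro per km

unit_costs = sorted(p["unit_cost"] for p in projects)
print(f"Unit-cost range: {unit_costs[0]:.2f}-{unit_costs[-1]:.2f} MEUR/km")
```

Even such a rudimentary tabulation would have provided the range of appropriate costs that the ToR envisaged as a basis for future benchmarking.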

43.

Some partial results are presented in three tables dealing with road construction in Member States, environment projects in Spain and costs per job created in industrial projects (35). However, there is little evaluation or analysis of these results, for example to factor in the impact of different labour costs and of particular geographic characteristics in, say, road construction.

44.

The Synthesis Report reached firm conclusions about the timing of projects and respect for budget limits. First, only one third of the 60 projects reviewed were completed within the originally planned timescale, with over a third finishing a year late. Secondly, approximately two thirds of the projects examined ran over budget, with 20 % costing over 30 % more than originally planned. However, the analysis is limited to a few general observations, providing little or no assessment of the quality of management by the final beneficiaries as required in the ToR (36). Furthermore, neither the relative efficiency of public-private partnerships nor that of the tendering procedures employed was evaluated.
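In absolute terms, the proportions quoted translate into roughly the following counts for the sample of 60 projects (a simple tabulation of the stated shares):

```python
# Simple tabulation of the proportions quoted above for the sample of
# 60 large projects; the counts follow directly from the stated shares.

n_projects = 60
on_time = n_projects // 3                  # only one third completed on schedule
over_budget = round(2 * n_projects / 3)    # approximately two thirds over budget
severe_overrun = round(0.20 * n_projects)  # 20 % cost over 30 % more than planned

print(on_time, over_budget, severe_overrun)  # prints: 20 40 12
```

Roughly 20 projects on schedule, 40 over budget and 12 with severe overruns: figures that would normally prompt a closer examination of management quality.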

Evaluation of the impact of SFs

45.

The Synthesis Report claims that the primary aim of Structural Funds is to influence the long-run supply potential of the economy (37). These supply side effects work through a number of channels that are designed to: (a) increase investment in order to improve physical infrastructure as an input to private sector productive activity; (b) increase human capital, due to investment in training, as an input to private sector productive activity; and (c) channel public funding assistance to the private sector to stimulate investment, thus increasing factor productivity and reducing sectoral costs of production and of capital (38).

46.

The report goes on to discuss the importance of what are called positive externalities — ‘conditions where private firms enjoy the use of additional productive factors at no cost to themselves’, thereby improving the productivity and cost competitiveness of the economy. These considerations are relevant to an ex post assessment. However, private sector co-financing was excluded from the analysis because of uncertainties about its ‘driving mechanisms’ (39). This limited approach leads to a severe understatement of the impact of certain Structural Fund expenditures by excluding one of the major long-term supply side effects. One clear demonstration of this is the example of a development company presented in the Synthesis Report (40) whose co-financed expenditures have attracted significant additional private sector investment. Instead of focusing on the more directly discernible impact of SF expenditure on private sector investment, the study concentrates on attempting to assess the so-called externalities, with rather limited estimation techniques owing to the lack of studies providing measures of elasticities relevant to the regions under consideration.

47.

To assess the impact of the Structural Funds on economic and social cohesion, a macroeconomic modelling approach based on the HERMIN model was applied to four Member States (Greece, Ireland, Portugal and Spain) (41) as well as to the regions of the new German Länder and Northern Ireland (42). This approach was intended to measure the impact of Objective 1 programmes in terms of the effects on GDP and employment.

48.

An alternative approach would have been to use microeconomic data from projects to build up the total or aggregate impact of the Structural Funds on a region or country basis. However, despite the use of standardised data requirements across Objective 1 regions, there are difficulties in collecting microeconomic data. As a result, attempts to measure impacts rely solely on macroeconometric models, giving rise to many restrictions and assumptions which, for the reasons set out below, can significantly restrict the assessment and hence the policy implications that can be derived.

49.

The principal difficulties identified with the specific application of the HERMIN macromodel used in the ex post assessment under review are:

(a)

The structure of the model puts too much emphasis on the manufacturing sector while the economies of most of the regions or countries under consideration have seen greater development in the past 10 years in traded market services, such as tourism and, in some cases, the financial sector. The driving mechanisms of these services are different from manufacturing, which has become more susceptible to the impact of globalisation. As a result, the orientation of the macro model needs to be shifted towards this aspect. The model used did not provide for such a consideration and there is little evidence in the report that this aspect has been recognised as a serious deficiency despite the fact that tourism is so important to the economies of quite a number of the regions concerned.

(b)

The model used is based on annual data starting in 1980. However, due to the fast-changing structure of the regions concerned over the intervening time frame, structural instability of the parameters estimated may be a significant limitation of the macro model-based approach adopted. This aspect was also not taken into account.

(c)

Use is made of planned rather than actual expenditures (43), a practice more reasonably to be expected in an ex ante projection than in an ex post evaluation.

(d)

Impact estimates are significantly biased downwards due to the exclusion of the private sector co-financing from the model (see also paragraph 46) as well as the flow of private sector investment that results from the Structural Fund interventions. For example, if the SFs provide assistance in the building of a factory, the impact on the region's economy is not limited to the construction of the building but relates also to the permanent jobs it gives rise to as a result of the investment by the private sector that takes place following the construction of the factory.

(e)

The impact that Structural Funds have in the long term on the stock of public infrastructure and human capital (positive externalities) plays a crucial role in assessing the impact of the Structural Funds in the Synthesis Report. The HERMIN model requires externality elasticities to be supplied from other specific studies. However, as these elasticities are not available for any of the relevant regions in the EU, use was made of empirical literature largely based on USA regions which, as noted within the Report itself, ‘is somewhat ambiguous about the appropriate magnitude of the externalities’ (44). As a consequence it is not clear to what extent the estimated impacts are based on judgement or on fact.

(f)

As regards additionality, the HERMIN model simulations compare the situations with and without CSF funding. The scenario without CSF funding assumes that neither the EU funds nor the co-financed amounts are spent. The validity of such an assumption is debatable (45).
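Point (e) above can be illustrated with a simple sensitivity sketch. This is not the HERMIN model itself; the baseline impact figure and the elasticity values are hypothetical, chosen only to show how strongly the result depends on the assumed externality elasticity.

```python
# Illustrative sensitivity sketch, NOT the HERMIN model itself: the baseline
# impact figure and the elasticity values below are hypothetical, chosen only
# to show how strongly the result depends on the assumed externality elasticity.

base_gdp_impact_pct = 2.0   # assumed GDP impact (%) at a reference elasticity
reference_elasticity = 0.20

for elasticity in (0.05, 0.20, 0.40):
    # simple proportional scaling of the supply-side impact with the elasticity
    scaled = base_gdp_impact_pct * elasticity / reference_elasticity
    print(f"elasticity {elasticity:.2f} -> GDP impact ~{scaled:.1f} %")
```

Under proportional scaling, moving across the range of elasticities found in the literature changes the estimated impact severalfold, which is precisely why the choice of elasticity largely predetermines the result.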

50.

The HERMIN model, in the manner in which it has been applied, thus appears unsuited to the task of estimating the effects of SF interventions in Objective 1 regions.

51.

The Court analysed the results of HERMIN simulations concerning changes in the rate of unemployment, against a set of pre-established hypotheses (46). The outcome is shown in Annex IV. The results show significant variations from the unemployment figures available from other sources.

52.

Finally, in many cases, the degree to which the intermediate objectives were attained was not evaluated.

53.

As regards the strategic objective of convergence, the only mention in the Synthesis Report is ‘that the overall convergence between Member States is progressing but that this is not occurring to the same extent between regions across Europe. Whether the convergence is due to the Structural Funds is a debatable point’ (47).

Evaluation of the management systems

54.

The management and implementation arrangements adopted for the Objective 1 programmes have a crucial role to play in contributing to the effectiveness, efficiency and impact of the Structural Funds. This aspect was addressed in both the ToR and the Methodology Guide and was taken up in the national evaluations (48). The national evaluation reports described the impact on management systems but also identified situations where the effects on management systems could have been more pronounced.

55.

The importance of this topic is also recognised in the Synthesis Report and, in the recommendations section (49), no less than 13 out of 19 recommendations deal with management issues.

56.

The Synthesis Report provides a number of relevant insights into the management of EU Structural Funds and refers to several conclusions of the Thematic Evaluation of the Partnership Principle that was conducted in 1999. This provides for continuity in assessment, and some evidence in support or otherwise of observations made in the earlier study. However, in a number of areas, the conclusions reached are in conflict with the thematic evaluation.

57.

The Synthesis Report provides a summary of the average size of membership of Monitoring Committees (MCs) by Member State. The overall average was 44, but there was wide variation, from a low of 15 to a high of 85. Indeed, the report concludes that the high membership level was regarded as a key constraint on the efficiency of the Monitoring Committees, and that this was common across all Member States (50). In contrast, the thematic study concluded that there was no evidence to indicate a loss in efficiency through extending inclusiveness. Given the significant difference between the conclusions of the two reports on this aspect, it would have been appropriate to analyse it more thoroughly in the Synthesis Report, and to draw more concrete recommendations on the way forward.

58.

Another area which was linked to the thematic evaluation concerns the management of different funds. Both evaluations concluded that the funds operated according to different principles and with different financial requirements and that this restricted synergy between the funds. This important observation echoes the views already expressed by the Court (51) and underlines the need to strive for greater synergy between the funds.

59.

The assessment that is made about the effectiveness of the overall management systems also demonstrates some weaknesses. The Synthesis Report states that: ‘The management systems were thought to be adequate by the national evaluations in just under half of the eligible member states’. This is an important result and a detailed assessment of both the success cases and the inadequacies identified could have been expected. Although some points raised are developed, there is an overall lack of clarity in the Report about what was actually found.

60.

The Thematic Evaluation of the Partnership Principle in relation to the Structural Funds (1999) found that this principle provided a number of positive advantages to Structural Funds implementation (52). The Synthesis Report states that ‘Broadly, these advantages were also recognised by the Member States involved in the implementation of the 1994 to 1999 Objective 1 programme, and horizontal partnership working was welcomed as one of the “added value” aspects of European programming’ (53). This is a relevant result, and although some highlights of various cases are presented, including brief comments on negative experiences or difficulties encountered, the discussion could have been more directed at highlighting improvements that could be made, on the basis of the experiences mentioned, so as to make the partnership principle more effective.

61.

The project application/selection process is identified as one of the two most frequently cited processes that caused administrative problems. A number of examples are given, indicating a lack of clarity in the procedures and an urgent need for the Commission to issue a best-practice benchmark. A table is provided giving the duration of the project selection process by country, although no data are provided for three Member States. The duration of the process for the remaining Member States ranges from two to nine months, with an average of five months. The evidence provided implies that there is still work to be done to streamline the application process. In this context, no thorough analysis has been conducted of the respective merits and effectiveness of the competitive process as against the queuing process.

62.

Due consideration must also be given to the monitoring and control process. It is claimed, in the Synthesis Report, that the ‘Financial control systems appear to have functioned relatively well within the Member States under the Objective 1 1994 to 1999 although some adjustments needed to be made to accommodate the new Financial Regulations in 1997. In the majority of Member States financial control systems were felt to be fit for the purpose and operated relatively efficiently. Financial control appears to have been generally reliable and respected the 5 % rule’ (54). This finding does not coincide with the various reports of the Court (55), which indicate that the 1994 to 1999 Structural Funds programme suffered from significant financial control deficiencies.

63.

The other element of the monitoring process that gives rise to administrative problems relates to the non-financial monitoring and evaluation systems. Monitoring comes in for heavy criticism. The Synthesis Report concludes that ‘monitoring remained a major weakness despite the emphasis on developing its role since the first programming period. The available data is fragmented, inconsistent, overly focused on physical outputs and often of doubtful reliability’ (56). It also states: ‘Across all of the Member States there was a lack of quantification to the objectives defined at the start of the programme’ (57). Of the 10 Member States listed, seven are classified as having a poor level of quantification, one as variable and two as adequate only at the start of the programme. Brief summaries are given for each Member State. However, no further insight is given.

64.

It is also stated that the computerised systems set up for monitoring the 1994 to 1999 programme ‘were frequently criticised due to their lack of integration across programmes and between funds. This had an impact on the ongoing ability of managing authorities to achieve an oversight on progress within the programmes, as well as posing a problem for later evaluations’ (58). These findings must also cast doubt on the validity of the earlier conclusion in the report that the financial management of the programme performed well if monitoring information, which is such an important input into the management process, was deemed to have been ‘fragmented, inconsistent and of doubtful reliability’ (56).

65.

The Synthesis Report also provides some brief comments on so-called ‘internal evaluations’ (59). The following observation is made: ‘These internal evaluations were carried out by the ministries or regional authorities, with assistance from universities and external agencies. The degree to which evaluation was carried out may, perhaps, be surprising in the light of the weakness of the monitoring systems. This perhaps is a reflection of the perceived requirement to undertake evaluation activity, although the results did often feed through into programme management’ (60). In view of this statement, it would seem that it would have been useful to focus more on these internal evaluations, and to build on the results achieved by the relevant ministries or regional authorities. In contrast, the Synthesis Report only gives some very general observations with limited relevance to the overall ex post assessment.

66.

With regard to the 2000 to 2006 programming period, the Synthesis Report notes that attempts have been made to address the monitoring problem experienced in the 1994 to 1999 period (61).

Adequacy of the framework for evaluation

The terms of reference

67.

In the light of the various problems identified in the analysis of the Synthesis Report it is necessary to focus specifically on the original terms of reference. These, together with the inception report and the Methodology Guide, suffered from certain weaknesses. In particular:

(a)

They did not address the issue of the lack of quantified objectives. Furthermore, the ToR and the Methodology Guide did not sufficiently address the paucity of adequate indicators.

(b)

Similarly, they did not require the evaluator to collect additional data on performance indicators. The ToR (62) did foresee the carrying out of field review ‘as necessary, if data is not available in the Member States’. However, this point was not further specified in the Methodology Guide (63).

(c)

Insufficient emphasis was put on the need to assess net effects (64), with the exception of some economic impacts.

(d)

The ToR under-emphasised the question of effectiveness, and deadweight effects in particular. This is regrettable given that deadweight is often considered a relevant factor in important areas such as business investment grants, direct aid for employment and vocational training facilitating access to employment.

(e)

Insufficient attention was paid to the analysis of the potential links between strategy appropriateness, effectiveness and impact (65). Similarly, there was little attempt to relate the identified objectives, even if not quantified, to some ranking of the needs which the CSFs aim to address.

(f)

Except for the application of the HERMIN model and the reference to the critical incidence analysis which was suggested for the ‘case studies’ (66), no other evaluation techniques were proposed within the ToR and inception report.

(g)

With regard to the ‘country-specific questions’ identified in the ToR (67), no clear indication was given concerning the methodology to be used.

68.

In addition, the strengthening of economic and social cohesion in the EU and the convergence of the less-developed Member States were not accorded sufficient priority in the ToR even though they are amongst the main objectives of structural measures. The inception report did include some questions about the development models used, the constraints on growth which are to be addressed and the channels through which an impact on the regional economy can be expected (68). However, the ex post evaluation should ideally have included empirical studies on the convergence between regions and the reduction of disparities as a result of such factors as infrastructure investment, developments in the education system and in R&D capacity.

69.

Furthermore, as an evaluation objective, it would have been appropriate to identify and analyse specific case-studies representing benchmarks for regional development policies, for instance, in the area of clustering or growth hubs.

70.

At the launch of the evaluation exercise, the question of appropriateness of the strategy was addressed by means of a list of issues and questions. However, the evaluation guidelines provided by the ToR (69) and the Methodology Guide were not sufficiently precise or concrete.

Adequacy of supervision by the Commission

71.

The (general) contract between the Commission and the consultants selected to carry out the evaluation stipulated that technical assistance with the evaluation was to be provided under the responsibility and supervision of Commission officials. The quality of the evaluation was to be assessed by the Commission on the basis of the criteria developed under the MEANS programme.

72.

The steering group chaired by DG REGIO, which included representatives of other DGs, was involved in drafting the ToR, approving the methodology and detailed work plan (in the Inception Report) and examining the national reports and the Synthesis Report. A panel of four experts was appointed to assist, their principal objective being to ensure that high-quality evaluations were produced. However, their intervention came so late that the process was unable to deliver significant improvements in evaluation quality.

73.

The contractually agreed schedule was not adhered to. The submission of the different reports was delayed, with the result that the final Synthesis Report and the national reports were not submitted to the Commission within the planned 12-month period, i.e. by 12 November 2002, but only in April 2003, and were published in May 2003. It would appear that the timetable was excessively tight from the outset and did not include sufficient time reserves for handling the relatively large number of changes and amendments to the reports requested by the experts and the representatives in the steering group. This has important implications for future ex post evaluations.

74.

The ToR (67) required the examination of two specific themes per Member State, and a list of such themes was proposed. However, none of the national evaluation reports included a specific focus on this aspect. Although individual chapters sometimes provide statistics or a partial exploration of these themes, the themes have not been subject, as required by the ToR, to specific research.

75.

As a general conclusion, the Commission's oversight of the evaluation process needs to be improved. In particular, given that the lack of reliable data, performance indicators and precise objectives was foreseeable from the outset, the Commission should have ensured that appropriate measures were taken to address this critical constraint. Furthermore, given the tight timetable and the inherent constraints, the evaluation should have concentrated on essential elements rather than attempting such a wide overall assessment of the programming period.

Relevance of evaluation recommendations for future programming periods

76.

Given the problems identified in the preceding sections of this report, the ability of the ex post evaluation process to yield worthwhile recommendations was limited. While the process yielded few recommendations about programme content, it delivered a number of relevant and sensible suggestions on how to improve the management and implementation of the SFs (70).

77.

The report recommends that: ‘The role of the Monitoring Committees needs to be shifted towards one of strategic direction rather than issues of financial management’ (71). However, the Monitoring Committees should be able to perform both functions, as required by the regulations.

78.

The evaluations have led to recommendations for the programming period 2000 to 2006 which mainly address the questions of management systems. A wide range of themes is covered and these are generally well suited to the specific situation of the country concerned.

79.

With respect to the 2000 to 2006 programmes, the key lessons to be drawn from the ex post evaluation of the preceding period relate to implementation, in particular with reference to the case study work undertaken to evaluate the effectiveness and efficiency of Structural Fund operations.

80.

As regards post 2006, the recommendations proposed within the Synthesis Report are rather limited and of a general nature. However, they are still relevant and, as such, are useful. For example, it is stated that ‘More use could be made of independent, ongoing evaluations, providing an objective external input’ (72). While this is a sensible recommendation, it would have been of benefit if it had been adopted before the launch of the ex post evaluation exercise under review.

EX POST ASSESSMENT OF ESF OPERATIONS (UNDER OBJECTIVES 1 AND 3) BY DG EMPL

81.

Together with the evaluation of Objective 1 regions, under the direction of DG REGIO, another evaluation exercise was carried out, in this case under the direction of DG EMPL, to cover ESF operations under Objectives 1, 3 and 4 and the Community Initiatives Employment and ADAPT. The ToR for the latter ex post evaluation anticipated the utilisation of the results obtained from the evaluation of the Objective 1 regions. However, despite organisational arrangements intended to facilitate effective coordination between the two evaluations, there is little evidence that this was achieved in practice.

Text box 2

The overall conclusions of the evaluation by DG Employment

The evaluation aimed at assessing the impact of ESF operations in the Member States, at the level of individuals and of systems. It also sought to analyse the impact and effectiveness of Community action on specific structural problems and to ascertain the Community value added. In the methodology adopted to address these objectives, evaluators were asked inter alia to indicate whether the cases which they were asked to assess provided support for or contradicted a number of hypotheses.

The report noted a number of strengths, which can be summarised as follows. Much of the ESF funding was targeted at the long-term unemployed, which was deemed appropriate given the persistence of long-term unemployment. The improvements in the labour market position of the beneficiaries were commensurate with the resources involved. There were major improvements in employment services and in educational and training provision in several national contexts. The resources used enabled a strengthening of certain policy priorities, such as equal opportunities, adaptation of the workforce and targeting those with disabilities. Through the ESF, there was an increased involvement of social partners and regional authorities. The funds contributed to improvements in capacities to manage and implement labour market interventions at all levels.

However, the report also noted a number of weaknesses. Deployment was largely driven by the availability of resources rather than by policy, and took place in parallel with, and separate from, the relevant policy debates. Interventions were largely concentrated on training and mainly allocated to service providers, which is not, in isolation, the most effective means of helping the long-term unemployed. Consequently, it is likely that there was a high degree of deadweight associated with the interventions. Finally, administrative arrangements were perceived as complex while evaluation arrangements were weak and unstructured.

The Court’s assessment of the quality of the evaluation

Review of shortcomings in the data and in the analysis

82.

The influence of ESF interventions often takes the form of a contribution or of a potential effect that is intangible and therefore not easily quantifiable (73). Notwithstanding this intrinsic problem, a prime objective of an evaluation exercise must still be to assess the impact and, in conjunction with it, effectiveness and efficiency. This is what the ex post evaluation exercise was required to do.

83.

In seeking to analyse the impact of ESF interventions, the Synthesis Report provides estimations of the number of ESF beneficiaries per country and per objective (74). As a complement to the national evaluation reports, the ex post evaluation also attempted to provide evidence for a number of pre-established hypotheses (75). These considerations are discussed in the following paragraphs.

84.

One of the most basic requirements of the ex post assessment of ESF operations relates to the number of beneficiaries of the programme over the period 1994 to 1999. The report provides a table which gives the estimated number of beneficiaries as 52 million. However, the additional notes to this table cast serious doubt on the figures presented (76). The uncertainty about the accuracy of the data detracts from some of the useful and relevant observations made in other parts of the assessment.

85.

The Synthesis Report (75) provides estimates of the impacts of ESF interventions on individuals. This information is potentially important in a number of respects, including the calculation of the unit costs of programmes and estimates of the beneficiaries still in employment after 12 months. However, there are factors which seriously limit the usefulness of the information, notably the lack of explanation as to how the data has been derived. The report simply states that: ‘These ratios applied to generate these estimates have been informed by the individual cases examined and the review of the literature’ (74). Such a statement cannot be considered sufficient; more detailed information on the derivation of the various estimates is necessary in order to understand better the applicability of these numbers, such as benchmarks of unit costs and the success rates of programmes.
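The kind of derivation that the report leaves unexplained can be made explicit. On the (hypothetical) assumption that unit costs were obtained by dividing programme expenditure by the estimated number of beneficiaries, and costs per placed worker by additionally applying a placement ratio, the calculation would take the form:

$$
\text{unit cost per beneficiary} = \frac{\text{programme expenditure}}{\text{number of beneficiaries}}, \qquad
\text{cost per worker in employment} = \frac{\text{programme expenditure}}{\text{number of beneficiaries} \times \text{placement rate}}
$$

With purely illustrative figures (not taken from the report), a measure spending 60 million euro on 20 000 participants with a 50 % placement rate after 12 months would yield 3 000 euro per participant but 6 000 euro per worker in employment; the two figures differ by a factor equal to the placement ratio, which is precisely why the ratios applied need to be disclosed.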

86.

A second unsatisfactory aspect is the analysis accompanying the data. There is a mismatch between what is stated in the assessment and what is found in the accompanying table. For example, the text states that ‘in broad terms unit costs were lower for the unemployed and young unemployed, around 6 000 euro per worker in employment, and considerably higher and wide-ranging for those with disabilities and long-term unemployed’ (74). Yet the table gives the unit costs for training the unemployed as the highest (at 6 000 euro) and those for the integration of the disabled as among the lowest (at 4 000 euro). Furthermore, it is common practice to give the range of variation when there is what is claimed to be ‘wide variation’ between and within programmes. The analysis in this section is of limited value because it relies too heavily on reducing the assessment to a single number, rather than defining a relevant range of values and providing a more comprehensive analysis of the ranges found in the various cases examined.

87.

This mismatch between the commentary and the figures given occurs in other parts of Chapter 3 (see Text box 3 for some examples).

Text box 3

For instance, it is stated that: ‘The ESF per capita expenditure was highest in Ireland, Portugal and Greece (fully Objective 1 countries) and in Spain (1)’. Yet Table 3.1 gives an ESF expenditure per capita per year for Portugal of 2 086 euro and for Germany of 405 euro, while Ireland is given as 12 euro and Greece as 17 euro. Clearly, the figures in the table do not match the discussion in the text. Even more significant is the statement: ‘The ESF expenditure on education relative to national expenditure was in all countries lower than the relative expenditure to ALMP. It was highest in Greece and Portugal (2)’. Yet in Table 3.1, Greece is classified as having no ESF expenditure in the relevant column giving ESF expenditure on education relative to national expenditure.

For example, in paragraph 3.2.3 (3) it is stated that: ‘The national expenditure on labour market training as a proportion of GDP ranged from 0,2 % in Luxembourg to 0,58 % in Sweden (4)’. Yet in Table 3.3, the expenditure on training in Denmark is shown as 0,94 % of GDP, almost twice that of Sweden. As another example, consider paragraph 3.2.6 (5), dealing with expenditure on the integration of the disabled. In the text, one finds the following statement: ‘The ESF was particularly important in Portugal, Spain, Greece and Austria.’ Yet in Table 3.6, Spain and Greece are listed as having no ESF expenditure on the disabled. A further example is found in paragraph 3.2.7, dealing with expenditure on education. It is stated that: ‘The ESF contribution was most significant in Greece, Ireland and Portugal (6)’. But Greece is listed in Table 3.7 as having no ESF expenditure on education and training.

Synthesis Report: (1) p. 78; (2) p. 79; (3) p. 81; (4) p. 81; (5) p. 84; (6) p. 85.

Assessing the impact of ESF operations

88.

The evaluators who took part in the study were asked to indicate whether the cases assessed provided support for or contradicted a number of hypotheses that had been formulated on the basis of an initial review of the literature and the findings of the inception phase of the evaluation. The study claims ‘strong supporting evidence for four hypotheses’ (see Table 3) (77). However, the data presented does not support the claim of strong supporting evidence. For example, only seven of the 30 evaluators said there was ‘strong supporting evidence’ for the first hypothesis. A more objective evaluation of these results would have been warranted.

Table 3

Observations on the hypotheses addressed in the examination of cases of impact on individuals

(Number of evaluators reporting, in order: strong supporting evidence / some supporting evidence / no evidence to support or contradict / some evidence to contradict / strong evidence to contradict / no opinion at this stage)

1. The level of impact achieved is related to the underlying national and regional market conditions (e.g. results depend upon the economic cycle and the level of demand and unemployment): 7 / 14 / 8 / 0 / 1 / 0

2. The level of impact is related to the characteristics of the beneficiary groups targeted (e.g. training measures are more effective for women re-entering the labour market and for educated immigrants than for low-skilled workers and youth): 8 / 18 / 3 / 0 / 0 / 1

3. The level of impact is related to the type of measure. Some measures show high levels of deadweight and some are effective on a small scale but do not provide general solutions: 5 / 17 / 6 / 1 / 0 / 1

4. The level of impact is closely related to aspects of the structure and management of programmes, including the quality of the analyses that define the precise scope of interventions (e.g. small targeted programmes may be more effective than large-scale programmes): 3 / 17 / 7 / 1 / 0 / 1

Evaluation of equal opportunities

89.

The terms of reference concerning the impact of the ESF 1994 to 1999 also required an evaluation of the extent to which ESF operations supported the ‘equal opportunities’ (78) objective. This aspect is tackled in Chapter 7 of the report. However, it is claimed that: ‘A major finding is that gender disaggregated data were not collected systematically for each of the ESF Objectives and Community Initiatives across the EU’ (79). This has significantly limited the ability of the study to evaluate this topic. It is difficult to accept this result as a major finding of the report, since a preliminary analysis would have revealed the shortcoming, and field work and sample surveys could then have been undertaken to overcome it, rather than simply concluding that disaggregated data is not available.

Other aspects examined

90.

As regards the impact of ESF operations on individuals, the ‘combined measures’ interventions (80) were not specifically examined, in spite of the substantial share of ESF resources (18 %) allocated to them. As a result, the conclusions of the Synthesis Report are rather modest: ‘Most of the evaluation work points to positive outcomes. However, the evaluation methods tend to focus on identifying the components of the programmes that are most effective rather than judging the performance of combined measures compared with single measure interventions’ (81).

91.

As regards the impact of Active Labour Market Policies (ALMP), especially with regard to Objective 1 countries, the Synthesis Report draws some conclusions without adequately explaining how they were reached.

92.

As regards the relative financial contribution of the ESF, the Synthesis Report (82) concludes ‘that the combined allocation of national expenditure and ESF resources does not appear to have led to a convergence of the patterns of expenditure between the countries, nor the adoption of norms linking specific economic and labour market conditions to resource allocations’ (83). It is difficult to see how this conclusion can be sustained given the level of errors in the relevant tables and/or the mistakes in the accompanying discussion. Similarly, another important conclusion in this part of the report is that: ‘These circumstances suggest that not all of the ESF interventions are likely to have been additional. Additionality, in the sense of the ESF co-financed interventions being unlikely to otherwise have taken place, is more likely to have occurred where ESF resources were focused and concentrated on particular types of interventions and where the related national expenditure and programmes were relatively small and less developed compared with countries with similar labour market conditions.’ Here again, it is difficult to see how this conclusion can be reached from the evidence presented in this chapter, especially given the mismatch between the analysis and the data given in the tables.

93.

With regard to the hypotheses advanced on the impact of ESF on systems, a problem arises similar to that described earlier. Strong supporting evidence is claimed this time for seven hypotheses (84). However, an analysis of the data presented also indicates that the conclusion is not warranted.

Comments on the overall assessment by the Commission

94.

Giving due weight to the constraints faced by the evaluators, the following critical comments must nevertheless be made with regard to the Commission's own assessment of the ex post evaluation, which concluded with a satisfactory overall appraisal.

95.

The evaluation of the report by DG EMPL is based on eight performance criteria and four possible quality levels, namely: excellent, good, acceptable and unacceptable. None of the criteria applied is rated as excellent or unacceptable. Four criteria are rated as good, namely: meeting needs, relevant scope, defensible design and clear report. The four other criteria considered are rated as acceptable. These relate to reliable data, sound analysis, credible results and impartial conclusions.

96.

It is significant that DG EMPL assessed the latter four criteria as being less satisfactory than the previous four. However, given the problems encountered as regards the reliability of the data and the inconsistencies of the commentary thereon, as highlighted in this report, the Court considers that the evaluation report was unsatisfactory, especially with regard to reliable data, sound analysis, credible results and impartial conclusions.

Other insights from the national reports

97.

Some pertinent comments can also be made with reference to the national evaluation reports, as regards the evaluation objectives and results:

(a)

The evaluation results in relation to the impact on individuals were often of a qualitative nature; deadweight was not always estimated.

(b)

Impact aspects of a more qualitative nature (85), the so-called soft outcomes, have not been systematically examined.

(c)

Except for integrated or mixed approaches, few general statements concerning the performance of individual measures can be deduced from the evaluations.

(d)

Important factors such as the contribution to sustainable development and to socio-economic cohesion have not been systematically assessed.

(e)

Target groups such as the long-term unemployed and young people seeking to enter the labour force have not systematically been the subject of specific conclusions in the evaluation of the co-financed measures.

(f)

Only in a few cases was it possible to compare expectations with results; the evaluation was generally hampered by the lack of specific initial objectives and indicators and the unavailability of accurate information.

(g)

The complementarity between ESF and other actions has only rarely been taken into consideration in order to explain certain results. The significance of the productivity increase of individual ESF beneficiaries, in terms of the benefits to the firms employing them, was almost never examined directly.

98.

The sampled national reports do not always present a thorough analysis of the impact of ESF interventions on national policies. They generally do suggest, however, that the ESF had an impact on the development of ALMPs and, in some cases, a concrete, direct effect was stated. In other instances, a different assessment was made, namely that it is difficult to show that the ESF has led to changes in the ALMPs.

99.

Each national evaluation report and the Synthesis Report describe, in a more or less extended way, the impacts on systems as regards employment services; initial education and training; continuing education and training; accreditation systems and mechanisms for anticipating needs. The impacts on systems often concerned a more specific policy context. It was generally difficult to identify the net impacts of ESF operations (86); only in exceptional cases was the net impact effectively delimited (87).

100.

Concerning the European Employment Strategy, the evaluation reports highlighted the positive influence of the ESF in some Member States such as Greece, France, Italy, Portugal and Spain. However, it was noted in the case of Germany that the top-down approach associated with the European Employment Strategy had the potential to conflict with the ESF bottom-up approach (88). At the more general level, the Synthesis Report concludes that the EES has provided a stronger strategic framework within which to programme ESF interventions (89).

101.

As regards the Community value added, the national evaluation reports have contributed some relevant results (90), although some of the specified questions have not been dealt with in some national reports.

102.

The financial implications of policy choices have rarely been addressed in detail. The Synthesis Report includes a calculation of unit costs for some measures, but this is based on unsatisfactory data. Where cost issues have been addressed, they would seem to highlight potentially serious findings that merit further analysis. For example, for Spain, it is claimed that ‘This roughly doubling of resources due to the ESF has led, according to some experts, to a substantial amount of wasteful spending. The larger source of funds has been used to expand operations and not so much to introduce new measures.’ Similarly for Italy, it is claimed that there were relevant transaction costs associated with the ESF, referring to implementation issues and the costs associated with partnership building and new management practices (91).

Adequacy of the framework for evaluation

103.

The following main evaluation topics were specified in the ToR: the principal characteristics and accomplishments of the 1994 to 1999 programming; the impact of the ESF vis-à-vis national policies; and the effectiveness and impact of ESF operations on individuals and on systems.

104.

A methods report was compiled by the Commission and published a year later to provide a critical review of the methodologies adopted for the assessment of results and of their impacts on individuals and systems. This report was intended to prepare the ground for future evaluations, but it is questionable whether this objective has been effectively achieved. The report suffers from various weaknesses. Its preparation was not based on a sufficiently thorough enquiry as regards the approaches applied in assessing the net impact on individuals. As concerns the evaluation of the impact on systems, it does propose a model for the classification of ESF expenditures and of their anticipated associated impacts (capacity, capability and responsiveness) (92), though some issues remain to be addressed (see page 35 of the report).

105.

In the Methods Report, within the context of the ‘Methods for policy and strategic evaluation in the ESF field’ (93), no concrete recommendations have been presented as regards policy analysis, with the report merely referring to MEANS. Similarly, there was no reference to the approach to be followed for the selection of appropriate indicators at different strategic levels.

106.

It should be remarked that the relevant elements for some evaluation tasks are not always clearly specified in the ToR. For example, the description of the tasks concerning effectiveness is rather abstract, especially with regard to the underlying concepts, the selection of mechanisms for impact measurement and the evaluation techniques to be used. In particular, no concrete indication has been given as regards the methods of generalising, at EU level, the direct impact on individuals from the results of the activities in the different Member States. The formulation of the tasks did not give due regard to the possible mismatch between quantitative objectives and the data available. Furthermore, no concrete action was foreseen to evaluate the impact on individuals when indicators had not been concretely established, or when specific statistical evaluations had not been carried out recently. The other methodological documents, compiled subsequent to the ToR, do not contain the desired explanations either.

107.

Changes in programming have not always been adequately examined. For example, this subject was not addressed in the studies concerning Northern Ireland, Merseyside and Spain. Even where changes in programming have been indicated, their treatment is far from optimal. In some cases the evaluator did not provide a clear assessment of the relevant changes and of their appropriateness (94).

108.

The Commission did not carry out a pre-evaluation survey or reconnaissance exercise, and this made it more difficult for it to anticipate the problems encountered by the evaluators.

109.

There is no clear evidence that the steering group was sufficiently aware of the lack of data available to the national evaluators, of the impact that this would have on the quality of the final evaluations, or of the problems regarding delays. No remedial action was taken by the steering group to address these issues.

110.

As was the case for the Objective 1 evaluation, the Commission's oversight of the evaluation process needs to be improved (see paragraph 75).

Recommendations concluding the evaluation

111.

For most Member States, the recommendations derived from the 1994 to 1999 ex post assessment have mainly dealt with the management systems and procedures. They relate to a variety of issues and are generally well adapted to the specific situations.

112.

Fewer recommendations for 2000 to 2006 have concerned the content of the co-financed policies. Where such recommendations have been made, they appear to be of relevance (95).

113.

The Synthesis Report set out a list of strengths and weaknesses for each Objective (96) and main recommendations for the 2000 to 2006 period and post 2006 (97). These statements are rather general, although they do appear to be valid in certain cases, e.g. in highlighting potential differences between the EU-15 and the EU-10 and in the policy response to territorial differences.

CONCLUSIONS

114.

The evaluation process is of critical importance in terms of providing information and judgements about the appropriateness, relevance, economy, efficiency, effectiveness and impacts of EU interventions. By doing so, it can support the decision-making process for setting political priorities and allocating resources. In order for the process to be successful, however, certain prerequisites must be respected, namely clear objectives for policies and programmes, and performance indicators which are produced on a regular basis and which are complete, reliable and relevant to the objectives set. In addition, it is necessary to ensure that the resources available to carry out the process are adequate for the purpose, both in terms of time and of technical competencies.

115.

The Court’s audit of the ex post evaluations under review identified a number of significant shortcomings in the overall approach adopted and in the quality of the assessments made:

(a)

Objectives set during the 1994 to 1999 programming period lacked clarity and coherence.

(b)

Although few performance indicators were available, little was done to overcome this deficiency by, for example, collecting additional quantitative data.

(c)

As a result, an appropriate balance between quantitative and qualitative analysis was not always achieved.

(d)

This situation resulted in conclusions being drawn which were not supported by adequate analysis.

(e)

At the same time, a number of themes which merited being treated in an ex post evaluation, such as deadweight, convergence and the proposed themes per Member State, were not always dealt with in the evaluation.

116.

With regard to the various themes of the evaluations, the following conclusions can be drawn concerning the Objective 1 evaluation:

(a)

In assessing the appropriateness of the strategies adopted, the analysis suffers from some serious discrepancies in the data, as published in the Synthesis Report, concerning the comparison between planned and estimated actual expenditure. Even allowing for limitations in the data available, a more in-depth analysis could have been attempted to shed more light on private sector investment linked to SF expenditure.

(b)

The examination of effectiveness lacked robustness, inter alia due to inadequate or insufficient analysis and a failure to take into account some important and relevant conclusions in thematic studies.

(c)

The examination of efficiency was confined to some general observations, while the evaluation objectives of establishing cost benchmarks and addressing the quality of the management of large projects were not met.

(d)

As regards the evaluation of the impact of the measures, the evaluation was based primarily on the use of aggregate macro-models, which presents certain limitations. The specific macroeconomic model used (HERMIN) did not permit the necessary adjustments to take account of the specificities of the economies being evaluated (such as the existence of an important tourism sector). In addition, there were critical limitations in the estimation of certain key variables (such as the use of proxy data taken from studies of the US economy to calculate external elasticities, in lieu of actual data for the relevant European regions, which were not available).

As regards the Objective 3 evaluation, it was in part based on unreliable or incomplete data, while there were significant mismatches between the data provided and the commentary thereon. Furthermore, there was a lack of convincing support for some of the hypotheses advanced, thereby casting doubt on the methodology employed to reach them.

117.

Many of the shortcomings of the evaluations can be traced to the circumstances in which they were carried out, notably the limited information available and the tight time constraints set. It should have been possible to anticipate these constraints and the evaluation exercise should either have been given the necessary resources or the scale of the exercise should have been limited to what was more feasible. Furthermore, the Commission's oversight of the evaluation process needs to be improved.

118.

The ex post evaluations have still provided some worthwhile insights, notably into the management systems underlying the implementation of the SFs. However, the Commission encountered serious difficulties in delivering the desired in-depth analysis of the effectiveness, efficiency and impact of the funds, largely due to limitations in the approach adopted and restrictions in the availability of data. The latter constraint had not been adequately anticipated at the start of the exercise. These problems were compounded by certain weaknesses in the Commission's oversight of the process. Consequently, while some worthwhile and useful recommendations were made as input for future programming periods, the Commission could not benefit fully from the experience of the 1994 to 1999 programming.

RECOMMENDATIONS

119.

A reappraisal of the scope, procedures and approach used in ex post assessments needs to be undertaken with some urgency, before the next batch of contracts for ex post evaluations is issued at the end of the current programming period (2000 to 2006).

120.

To improve the evaluation process, better quality control procedures need to be introduced and effectively applied by the Commission so that the problems encountered in the evaluations under review do not recur in future ex post assessments. Such procedures should ensure:

(a)

that relevant and reliable data is collected regularly and is available at each stage of the evaluation process;

(b)

that terms of reference are adequate and provide for the application of appropriate methodologies by the contractors;

(c)

that adequate resources and time are allocated to the evaluation processes;

(d)

that there is appropriate monitoring and supervision by the Commission so as to ensure the overall quality of the process.

121.

Secondly, particular attention needs to be paid to the choice of appropriate techniques for assessing and measuring the economic impact. If a macroeconomic modelling approach is used to analyse the impact of the Structural Funds, it must be adapted to take due account of the characteristics of the economies under review. This consideration is particularly relevant given that the evaluation will need to be extended to cover the 10 new Member States. In this light, the Court considers that there should be a reappraisal of the appropriateness of an aggregate macroeconomic modelling approach (such as one based on the HERMIN model). The Commission should examine whether greater emphasis could be placed on models that rely on the use of micro-data derived from projects. It is also likely that a more in-depth evaluation of certain specific themes could better lead to the identification of best practices and provide additional insight into particular issues, such as the impact of clustering or the significance of growth hubs in regional development vis-à-vis the overall objective of convergence.

122.

Thirdly, greater emphasis needs to be placed on establishing linkages between the ex post assessments and thematic studies, and on ensuring that their results are consistent with each other. If results appear to be incompatible, this aspect needs to be specifically tackled. In the evaluations under review, where linkages were developed by the evaluators, the quality of the assessment improved markedly.

123.

Fourthly, a number of recommendations may be made for future assessments, more particularly for Objective 1 regions, namely:

(a)

There needs to be a specific focus on the reasons why private sector contributions to Structural Funds projects differ markedly between the Member States. This may cast important light on the different drivers of economic development in the various Objective 1 regions.

(b)

Another worthwhile topic to explore is whether there should be a shift in the future from grant expenditure to financial engineering measures, such as seed and venture capital funds, loans and interest rate subsidies, which may prove to be more sustainable and efficient in the long term.

(c)

Greater attention needs to be given to the establishment of unit costs and benchmarks for various types of projects.

(d)

Additional attention also needs to be given to the project application and approval process as the evaluators pointed out.

(e)

There should be a greater focus on internal evaluations by the relevant ministries or regional authorities in future ex post assessments.

124.

Finally, it is important to strengthen the capacity of the Commission to oversee the evaluation process, including the application of complex macroeconometric models. One possibility to consider, in this context, would be for the relevant DGs to seek a closer collaboration with research institutes and universities specialised in this area of assessment. Appropriate and timely supervision of contracted consultants is crucial in ensuring that objectives are met and that value for money is derived from such ex post assessments.

125.

In conclusion, the ex post evaluation exercise must go beyond the compilation of distinct assessments carried out at given intervals of time, and should instead be envisaged and managed as an integral part of an overall process. This is another aspect where a closer, long-term collaboration with research institutes or universities could be of significant benefit in providing a more effective framework within which to integrate the specific reports and recommendations generated at each stage of the evaluation process.

This report was adopted by the Court of Auditors in Luxembourg at its meeting of 20 July 2006.

For the Court of Auditors

Hubert WEBER

President


(1)  SEC(2000) 1051 of 26 July 2000: Communication to the Commission ‘Focus on Results: Strengthening Evaluation of Commission Activities.’

(2)  The MEANS programme was initiated by the Commission as a response to the demand for developing evaluation methodology. The MEANS Collection consists of six volumes on the evaluation of socio-economic programmes.

(3)  MEANS Collection, No 1, p. 53.

(4)  Comprising the European Regional Development Fund (ERDF), the European Social Fund (ESF), the European Agriculture Guidance and Guarantee Fund (EAGGF) and the Financial Instrument for Fisheries Guidance (FIFG).

(5)  Synthesis Report, p. 53.

(6)  Hereafter referred to as ‘national evaluators’.

(7)  National evaluations examined relate to: Objective 1 Belgium, Greece, Spain, Germany, Italy, Ireland, Portugal, and Objectives 1 and 3 (ESF): Spain, France, Italy, Germany, United Kingdom.

(8)  See paragraph 5 for the overall figures.

(9)  Synthesis Report, p. 61.

(10)  Synthesis Report, p. 62.

(11)  Here referring to the active labour force as a percentage of the working age population.

(12)  Synthesis Report, Table 2.1, p. 75, Tables 2.5 and 2.6, p. 88 and 89.

(13)  Synthesis Report, p. 75.

(14)  Per capita figures, in this context, have been calculated with reference to the whole national population.

(15)  Million euro.

(16)  The same data as published in the report has been used in the compilation of Table 2. As a result, it can be seen that, for the Member States listed in Table 1, the sum of the three columns in this table do not add up to the total column.

(17)  This assessment needs identification and examination within a global framework of the causal links between the needs, the strategic objectives, the objectives of programmes/sub-programmes/measures and the allocation of resources. It needs also in principle a ranking of the needs and unsatisfactory socio-economic situations which the CSF aim to address and a hierarchy of inherent factors to be considered in order to ensure that resources are optimally targeted towards meeting needs.

(18)  Chapter 3.

(19)  A thematic evaluation is an evaluation which transversally analyses a particular point (a theme) in the context of several interventions within a single programme or of several programmes implemented in different countries or regions.

(20)  Thematic Evaluation of the Impact of Structural Funds on Transport Infrastructures, Final Report, November 2000, p. 8.

(21)  For example, in the case of Ireland, this result means that for every 100 euro invested in terms of construction and operating costs, the value of journey time savings, vehicle operating cost savings and safety benefits, was 13 euro for each year. Thus, the investment effectively paid for itself in eight years. (Thematic Evaluation of the Impact of Structural Funds on Transport Infrastructures, p. 8).

(22)  See section on impact.

(23)  Synthesis Report, p. 105.

(24)  The specific comment made was: ‘Whilst this would seem a logical recommendation, the context in which it is applied emerges as an important consideration’. Synthesis Report, p. 105.

(25)  Synthesis Report, p. 108.

(26)  Synthesis Report, p. 130.

(27)  To quote again from the study: ‘The most effective mechanism for the creation and maintenance of jobs is argued to be direct support for productive investment, although as this takes no account of deadweight and substitution effects, its overall effectiveness cannot be judged’. Synthesis Report, p. 130.

(28)  The implications of weak data and some general comments on this will be discussed in a later section.

(29)  ToR, p. 90.

(30)  Synthesis Report, p. 111.

(31)  ToR, p. 11.

(32)  Greater than 25 million euro for infrastructure and greater than 15 million euro for productive investment projects.

(33)  ToR, p. 13.

(34)  Synthesis Report, p. 143.

(35)  Synthesis Report, Tables 4.6, 4.7 and 4.8.

(36)  As noted above.

(37)  Synthesis Report, p. 71.

(38)  Synthesis Report, p. 72.

(39)  ‘Inclusion of private sector co-financing is of questionable value for an impact evaluation that is based on the use of a formal macro model and since considerable uncertainty and ambiguity surrounds the driving mechanisms behind the private sector CSF co-finance expenditures, we exclude them from our analysis’. Synthesis Report, p. 71.

(40)  UK-Speke Garston project; in this particular example, additional private sector investment of 223 million GBP was attracted, generating 4 600 jobs in direct employment. Synthesis Report, p. 204.

(41)  The four countries which qualify, in their totality, as Objective 1 regions.

(42)  In the case of Italy, the construction of a HERMIN model foreseen in the ToR was not carried out due to ‘contracting difficulties’. For the remaining regions, which did not have a HERMIN model available, a so-called ‘bottom-up’ estimation was undertaken.

(43)  The Report states: ‘Ideally we should use the actual ex post realised CSF expenditures. But these were not available for every country or region, disaggregated by priority and on an annual basis. In the interests of uniformity, we have used the planned CSF expenditure data as contained in the CSF 94 to 99 treaty documents. While these give a fairly accurate total for expenditure, they do not always give an accurate picture of the ex post scheduling of expenditures.’ (Synthesis Report, p. 150).

(44)  Synthesis Report, p. 161.

(45)  As a matter of fact, some studies claim that EU funding crowds out national investment such that the growth path of the economy without CSF funding would look better than the picture obtained from the HERMIN simulations.

(46)  This sensitivity analysis was carried out for eastern Germany, Spain, Greece, Ireland and Portugal.

(47)  Synthesis Report, p. 183.

(48)  These demonstrate the positive influence of the SFs on: the development of multiannual programming; the application of the programme concept and of objective-based management; the establishment of management structures and procedures, in particular systems of monitoring and indicators; systems of quality assurance in public works; tendering procedures for government aid; the development of evaluation systems, capacity and culture; project selection procedures; and control systems. The SFs stimulate the development of more appropriate structures and capacities to handle development projects.

(49)  Synthesis Report, section 9.1.

(50)  Synthesis Report, p. 196.

(51)  Opinion No 2/2005 on the proposal for a Council Regulation laying down general provisions on the European Regional Development Fund, the European Social Fund and the Cohesion Fund (COM(2004) 492 final of 14 July 2004) (OJ C 121, 20.5.2005, p. 18); Special Report No 7/2003 on the implementation of assistance programming for the period 2000 to 2006 within the framework of the Structural Funds, paragraph 90 (OJ C 174, 23.7.2003, p. 22).

(52)  These included: greater effectiveness in programme development and monitoring, more effective project selection, greater legitimacy and transparency in decisions and decision-making processes, greater commitment and ownership of programme outputs, etc. See Thematic Evaluation of the Partnership Principle, p. II.

(53)  Synthesis Report, p. 188.

(54)  Synthesis Report, p. 218.

(55)  Annual Report concerning the financial year 2003, paragraphs 5.22, 5.32(d), 5.33 and 5.34 (OJ C 293, 30.11.2004, p. 162). Annual Report concerning the financial year 2004, paragraph 5.30 (OJ C 301, 30.11.2005, p. 143).

(56)  Synthesis Report, p. 224.

(57)  Synthesis Report, p. 219.

(58)  Synthesis Report, p. 221.

(59)  Dealing with ongoing evaluations in this context.

(60)  Synthesis Report, p. 222.

(61)  For example: ‘There is clear evidence that the partnership arrangements have deepened and widened and that there has been increasing delegation of responsibility for decision making to the programme partnership’ (Synthesis Report, p. 246).

(62)  ToR, p. 16.

(63)  Methodology Guide, p. 16.

(64)  ‘Net’ effects are estimated by subtracting from gross effects any deadweight and displacement/substitution effects.

(65)  Especially the links between the results of the analyses of impact and the analysis of effectiveness and appropriateness of the strategy.

(66)  Inception Report, p. 7.

(67)  ToR, p. 102.

(68)  Inception Report, p. 13. Similarly, the Methodology Guide includes key questions such as: ‘What was the overall impact of the structural funds on supporting regional convergence during the period 1994 to 1999? What were the economic effects of the Structural Funds investments in terms of convergence of income levels?’ (p. 24).

(69)  ToR, p. 94.

(70)  For example, a case was made for: ‘Providing much greater discretion to those Member States which have effective domestic monitoring systems to utilise their own monitoring requirements in relation to EU-supported initiatives. This would reduce the costs involved and the data which emerges would probably be more meaningful’. Synthesis Report, p. 249.

(71)  Synthesis Report, p. 249.

(72)  Synthesis Report, p. 251.

(73)  For example, increased labour force ‘flexibility’ as a result of training.

(74)  Synthesis Report, p. 124.

(75)  Synthesis Report, p. 125.

(76)  For instance, one can read in these notes that the figures for France, which amount to 17 million, were being ‘double-checked’ for ‘double counting’, while the figures for the Netherlands, which amount to 8,7 million, were also being ‘double checked with national teams’. Furthermore, it is indicated that data are largely missing for Germany, Ireland and the United Kingdom — although the latter is listed as having 7,9 million beneficiaries and Ireland as having 680 000 beneficiaries.

(77)  Table 3 reproduces part of Table 4.8 in the Synthesis Report.

(78)  This refers primarily to gender equality.

(79)  Synthesis Report, p. 173.

(80)  Combined measures comprise intensive counselling plus training, work experience plus training, and employment incentives plus training.

(81)  Synthesis Report, p. 122.

(82)  Paragraph 3.2.8.

(83)  Synthesis Report, p. 86.

(84)  Synthesis Report, p. 144.

(85)  For example, the achievement of qualifications and the soft outcomes presented in the UK evaluation report (representing ‘distance travelled toward the labour market’).

(86)  This was also stated by some evaluators (Spain and Italy).

(87)  For example, in Spanish cases.

(88)  One such example was quoted in the case of Germany.

(89)  In particular, the EES promotes stronger links between EU programme funding mechanisms and other policy instruments, regulations and national programmes including passive labour market policies.

(90)  For example, in the case of Germany: support to the objectives that target disadvantaged sections of the public, sustainable development, equal opportunities, financial leverage effects for the new Länder. Similarly, various other topics were highlighted in the other national reports.

(91)  National report, p. 95.

(92)  Methods Report, p. 24.

(93)  Methods Report, p. 44.

(94)  See evaluations concerning France, Spain and Italy.

(95)  Examples are: the need for a stronger link between the planned objectives and the local socio-economic context (Italy); a better targeting of beneficiaries (France); the expansion of integrated intervention approaches (United Kingdom); to widen the scope of the programmes to more innovative and risky actions (Spain); to strengthen anticipation actions in the training domain (Spain); to pay more attention to the less-favoured regions by putting more weight on variables such as unemployment, long term unemployment and social exclusion (Spain).

(96)  Synthesis Report, p. 188 for Objectives 1 and 3.

(97)  Synthesis Report, p. 198.


ANNEX I

Breakdown of aid under the Structural Funds — Available resources related to CSF 1994-1999

(million euro)

 

| Member State    | O1        | O2        | O3/O4     | O5a Fisheries | O5b      | O6     | Total      |
|-----------------|-----------|-----------|-----------|---------------|----------|--------|------------|
| Belgium         | 748,50    | 349,40    | 476,50    | 25,10         | 78,90    |        | 1 678,40   |
| Denmark         |           | 123,00    | 308,40    | 143,20        | 55,30    |        | 629,90     |
| Germany         | 13 985,30 | 1 604,80  | 1 990,00  | 76,30         | 1 257,30 |        | 18 913,70  |
| Greece          | 14 333,90 |           |           |               |          |        | 14 333,90  |
| Spain           | 26 965,80 | 2 474,80  | 1 888,60  | 122,20        | 680,40   |        | 32 131,80  |
| France          | 2 245,40  | 3 866,40  | 3 282,20  | 194,40        | 2 293,30 |        | 11 881,70  |
| Ireland         | 5 762,30  |           |           |               |          |        | 5 762,30   |
| Italy           | 15 236,20 | 1 498,20  | 1 757,40  | 137,60        | 923,30   |        | 19 552,70  |
| Luxembourg      |           | 15,40     | 23,60     | 1,10          | 6,10     |        | 46,20      |
| The Netherlands | 153,80    | 666,20    | 1 105,70  | 47,70         | 153,70   |        | 2 127,10   |
| Portugal        | 14 333,90 |           |           |               |          |        | 14 333,90  |
| United Kingdom  | 2 419,70  | 4 693,40  | 3 460,50  | 90,80         | 837,20   |        | 11 501,60  |
| Austria         | 171,50    | 104,40    | 408,30    | 2,10          | 424,90   |        | 1 111,20   |
| Finland         |           | 165,40    | 354,10    | 23,80         | 201,00   | 475,90 | 1 220,20   |
| Sweden          |           | 189,30    | 537,60    | 41,40         | 142,70   | 260,70 | 1 171,70   |
| Total           | 96 356,30 | 15 750,70 | 15 592,90 | 905,70        | 7 054,10 | 736,60 | 136 396,30 |

Source: Special Report No 16/98 (OJ C 347, 16.11.1998).


Breakdown of aid under the Structural Funds — Available resources related to Community Initiatives 1994 to 1999 (1)

(million euro)

| Community Initiative | Amount   |
|----------------------|----------|
| Interreg II          | 3 562,3  |
| Leader II            | 1 777,2  |
| ADAPT                | 1 646,4  |
| PME                  | 1 092,5  |
| URBAN                | 894,5    |
| PESCA                | 301,5    |
| Rechar II            | 465,0    |
| Resider II           | 610,9    |
| RETEX                | 609,2    |
| EMPLOI               | 1 858,5  |
| Konver               | 734,9    |
| REGIS II             | 615,0    |
| PEACE                | 303,1    |
| Total                | 14 471,0 |

Source: Special Report No 16/98 (OJ C 347, 16.11.1998).


(1)  Names of programmes related to Structural Funds; further references can be found in the Commission budget, subsection B2.



ANNEX II

GDP and the employment rate

1.

A multiple regression analysis was conducted to test the statement in the Synthesis Report that ‘increased levels of GDP per capita have generally not been the result of increased employment rates resulting from job creation’ (1).

2.

For strict comparability, this analysis uses the same data as published in the Synthesis Report. The dependent variable is the percentage change in GDP per capita for each Objective 1 region between 1993 and 2000; the main explanatory variables are the percentage change in the employment rate and in the unemployment rate for the same regions between 1993 and 1999, as published in the report. The results are given in Table 1.

Table 1 (2)

Regression of the change in GDP per capita on change in employment (CEMPL) and change in unemployment (CUNEMP)

| Regressor | Coefficient | Standard error | T-ratio | [Prob]  |
|-----------|-------------|----------------|---------|---------|
| C1        | 1,7615      | 1,2361         | 1,4251  | [0,161] |
| CEMPL     | 0,24563     | 0,14237        | 1,7252  | [0,092] |
| CUNEMP    | 0,051927    | 0,027418       | 1,8939  | [0,065] |
| BE        | –16,8803    | 6,2156         | –2,7158 | [0,009] |
| IR        | 38,2350     | 6,5314         | 5,8540  | [0,000] |
| DX        | 26,9660     | 2,9504         | 9,1398  | [0,000] |

| R-squared                    | 0,74449   | R-bar-squared              | 0,71546         |
|------------------------------|-----------|----------------------------|-----------------|
| S.E. of regression           | 6,1411    | F-stat. F(5,44)            | 25,6414 [0,000] |
| Mean of dependent variable   | 6,3647    | S.D. of dependent variable | 11,5126         |
| Residual sum of squares      | 1 659,4   | Equation log-likelihood    | –158,5015       |
| Akaike information criterion | –164,5015 | Schwarz Bayesian criterion | –170,2376       |
| DW-statistic                 | 2,3210    |                            |                 |

3.

From a preliminary assessment of the data, it emerged that three particular regions exhibited significantly different patterns of change in GDP per capita over the period compared with the other regions, and therefore required specific treatment in the regression, as otherwise it would be difficult to draw general conclusions about underlying links. These were: Ireland, which saw its GDP per capita rise from 81 % to 115 % of the EU average over the period; the Hainaut region in Belgium, which saw its GDP per capita decline from 82 % to 71 %; and the German Objective 1 regions, which saw GDP per capita rise from around 50 % to close to 70 % of the EU average by the end of the period (for example, Thuringen saw its GDP per capita rise from 52 % in 1993 to 70 % in 2000). To take account of such large relative changes in GDP per capita, specific statistical (or dummy) variables were included, which allowed the function to shift for these three broad regions.

4.

The regression results shown in Table 1 provide a number of points worth noting. Overall, 74 % of the variation in the change in GDP per capita is accounted for by the ‘explanatory’ variables. Both main explanatory variables, that is the change in employment rates (CEMPL) and the change in unemployment rates (CUNEMP), are statistically significant. In the case of CEMPL, the estimated coefficient is 0,24, suggesting that each one percentage point increase in the employment or participation rate is associated with an improvement in the GDP per capita of Objective 1 regions of a quarter of a percentage point — thereby, if nothing else changes, closing the GDP gap by this amount.
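The estimation described above, ordinary least squares with region-specific dummy variables, can be sketched as follows. The figures below are hypothetical illustrations, not the Court's data; the sketch solves the normal equations (X'X)b = X'y directly.

```python
# Illustrative OLS with a region dummy, solved via the normal equations
# (X'X) b = X'y.  All figures are hypothetical, not the Court's data.

def solve(a, b):
    """Gaussian elimination with partial pivoting for a small linear system."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def ols(rows, y):
    """rows: observations of regressors (incl. intercept); returns (coeffs, R2)."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    coeffs = solve(xtx, xty)
    fitted = [sum(b * ri for b, ri in zip(coeffs, r)) for r in rows]
    ybar = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - ybar) ** 2 for yi in y)
    return coeffs, 1 - ss_res / ss_tot

# Change in GDP per capita regressed on change in the employment rate,
# with a dummy flagging one atypical region (cf. Ireland in Table 1).
cempl = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]      # change in employment rate
dummy = [0.0, 0.0, 0.0, 0.0, 0.0, 1.0]      # 1 = atypical region
y     = [2.1, 2.4, 2.6, 2.9, 3.2, 9.0]      # change in GDP per capita
rows = [[1.0, c, d] for c, d in zip(cempl, dummy)]
coeffs, r2 = ols(rows, y)
```

Because the dummy is non-zero for a single observation, it absorbs that region entirely, and the employment coefficient is estimated from the remaining, more homogeneous observations, which is precisely the purpose of the dummy variables in Table 1.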

5.

This result directly contradicts the conclusion reached in the synthesis report, and provides a useful indicator of the potential there is, from an analysis of the results of Objective 1 regions over this period, to close the GDP gap by improving participation rates — a most relevant result from the policy perspective.

6.

A second feature of note in the results in Table 1 relates to the impact of changes in unemployment rates on changes in GDP per capita. This impact (also discussed in the next section) is also statistically significant, but is quantitatively very small, almost negligible. Unemployment declined only marginally overall in Objective 1 regions, and this may be the reason for this result.

7.

To explore further the robustness of the main result in Table 1, relating to the link between changes in the employment rate and changes in GDP per capita, Table 2 provides the regression results for all EU-15 regions for which data was consistently available from Eurostat, and not just for Objective 1 regions. The results from 145 regions were derived from slightly different data than in Table 1. Rather than taking just the rate of change between the beginning and end of the relevant period, the average rate of change was computed for the whole period. This was judged to give a more representative summary of developments over a number of years, in this case the five-year period between 1996 and 2000. (Due to data limitations in the available statistics, an earlier starting point was not feasible for a number of regions.)

Table 2 (3)

Regression of average change in GDP per capita on average change in employment (ACEMPL) and average change in unemployment (ACUNEMP)

| Regressor | Coefficient | Standard error | T-ratio | [Prob]  |
|-----------|-------------|----------------|---------|---------|
| C         | 5,2110      | 0,12999        | 40,086  | [0,000] |
| ACEMPL    | 0,24119     | 0,063540       | 3,7959  | [0,000] |
| ACUNEMP   | –0,0019479  | 0,012260       | –0,1588 | [0,874] |
| ID        | 4,7009      | 0,85873        | 5,4743  | [0,000] |
| LD        | 5,4309      | 0,84670        | 6,4142  | [0,000] |
| BED       | –0,64276    | 0,25885        | –2,4831 | [0,014] |
| DED       | –1,3747     | 0,17517        | –7,8479 | [0,000] |

| R-squared                    | 0,62600   | R-bar-squared              | 0,60974         |
|------------------------------|-----------|----------------------------|-----------------|
| S.E. of regression           | 0,84193   | F-stat. F(6,138)           | 38,4979 [0,000] |
| Mean of dependent variable   | 5,2359    | S.D. of dependent variable | 1,3477          |
| Residual sum of squares      | 97,8204   | Equation log-likelihood    | –177,2101       |
| Akaike information criterion | –184,2101 | Schwarz Bayesian criterion | –194,6286       |
| DW-statistic                 | 1,6353    |                            |                 |

8.

The results indicate that the coefficient for the average change in employment is practically the same as in the previous case, that is 0,24, indicating that for all regions a 1 % increase in the employment rate tends to be associated with a 0,24 % increase in GDP per capita. This result is statistically significant, as is the whole regression equation. (An additional dummy variable for Luxembourg has been added, again because the increase in GDP per capita from 161 % in 1996 to 199 % in 2000 is markedly different from the average performance of the other regions.)

9.

The impact of the change in the unemployment rate variable is again minimal, and in this case is statistically insignificant and negative. The implication of these results is that the impact of changes in the unemployment rate on changes in GDP per capita does not appear strong — possibly because the unemployed are already counted in the labour force and in the measure of GDP used, so that any move from unemployment to employment is less easily captured in statistical analysis of this type. Certainly, more research in this area seems warranted.


(1)  Synthesis Report, p. 62.

(2)  50 observations used for estimation. BE, IR and DX are dummy variables for Belgium, Ireland and Germany. C1 is the intercept.

(3)  145 observations used for estimation. ID, LD, BED, DED are dummy variables for Ireland, Luxembourg, Belgium and Germany. C is the intercept.


ANNEX III

GDP growth and the structural funds

1.

A regression analysis was conducted to explore the link between the average annual change in GDP per capita for the Objective 1 area (the dependent variable) and the actual EU funds expenditure per capita in the Objective 1 regions of each Member State. The same data and time period (1993 to 1999) are used as in the Synthesis Report.

2.

It should be noted that modelling rates of change is a statistically demanding exercise and often gives low levels of explained variation in the dependent variable. Nonetheless, the results in Table 1 show that 53 % of the variation in the dependent variable is accounted for by the different allocations of EU Funds on a per capita basis. This is a statistically significant result, and suggests that regions with a higher actual EU Structural Fund expenditure per capita also tended to have higher average GDP per capita growth over the period.

3.

Although the sample is small, which makes statistical assessment difficult, each of the 11 observations is an amalgam of the information from many regions and thus represents a wider information base than the mere 11 observations would suggest. This result contrasts with that given in the report and suggests that further research in this area is warranted. The plot of the actual and fitted values from this regression is shown in Figure 1.

Table 1

Regression of average change in GDP per capita on average change in EU structural funds per capita (EUSFPC)

| Regressor | Coefficient | Standard error | T-ratio | [Prob]  |
|-----------|-------------|----------------|---------|---------|
| C         | –2,0780     | 0,90194        | –2,3040 | [0,047] |
| EUSFPC    | 2,8192      | 0,88559        | 3,1834  | [0,011] |

| R-squared                    | 0,52963  | R-bar-squared              | 0,47737         |
|------------------------------|----------|----------------------------|-----------------|
| S.E. of regression           | 1,2355   | F-stat. F(1,9)             | 10,1340 [0,011] |
| Mean of dependent variable   | 0,53688  | S.D. of dependent variable | 1,7089          |
| Residual sum of squares      | 13,7371  | Equation log-likelihood    | –16,8304        |
| Akaike information criterion | –18,8304 | Schwarz Bayesian criterion | –19,2283        |
| DW-statistic                 | 1,8219   |                            |                 |

Figure 1: actual and fitted values from the regression (figure not reproduced)
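The two-variable regression underlying Table 1 can be reproduced with the closed-form least squares formulae (slope = Sxy/Sxx, intercept = ybar − slope·xbar). The eleven data points below are hypothetical stand-ins, not the actual Member State figures.

```python
# Closed-form simple least squares: slope = Sxy / Sxx,
# intercept = ybar - slope * xbar.
# The 11 observations are hypothetical stand-ins for the Member State data.

def simple_ols(x, y):
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = ybar - slope * xbar
    fitted = [intercept + slope * xi for xi in x]
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - ybar) ** 2 for yi in y)
    return slope, intercept, 1 - ss_res / ss_tot

# x: EU Structural Fund expenditure per capita (hypothetical units)
# y: average annual change in GDP per capita (hypothetical, %)
x = [0.2, 0.4, 0.5, 0.7, 0.8, 1.0, 1.1, 1.3, 1.5, 1.7, 2.0]
y = [-1.5, -0.9, -0.8, 0.1, -0.2, 0.6, 1.2, 1.4, 2.1, 2.6, 3.4]
slope, intercept, r2 = simple_ols(x, y)
```

A positive, statistically significant slope on data of this shape is what underlies the reported finding that higher per capita expenditure tended to accompany higher average GDP growth.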


ANNEX IV

HERMIN simulations and unemployment

1.

The HERMIN model was applied on the basis of an assumed ‘average’ hypothesis (medium-medium) concerning supply effects, or elasticities, with respect to human capital and infrastructure. The robustness of the results was analysed by means of a sensitivity analysis based on two further hypotheses:

zero-zero: minor neo-classical effects (via changes in relative prices); no supply effects (in terms of improvements in infrastructure and human capital), or only small output and factor productivity elasticities with respect to physical infrastructure and human capital;

high-high: supply effects much more pronounced (in terms of physical infrastructure and human capital) and continuing to apply after the CSF period.

2.

As regards unemployment, the HERMIN simulations for eastern Germany, Spain, Greece, Ireland and Portugal produce changes in the unemployment rate which vary according to the three hypotheses. To assess the appropriateness of the results better, the Court converted the falls (or increases) in the unemployment rate into net jobs created (or lost). The results are given in Table 1.

Table 1

HERMIN simulations

 

Net jobs created

|                 | Zero-zero hypothesis | Medium-medium hypothesis | High-high hypothesis |
|-----------------|----------------------|--------------------------|----------------------|
| Eastern Germany | 822 308              | 729 923                  | 635 755              |
| Spain           | 1 051 135            | 241 789                  | –806 026             |
| Greece          | 303 034              | 190 825                  | 65 969               |
| Ireland         | 86 679               | 64 621                   | 36 335               |
| Portugal        | 497 445              | 250 502                  | –253 299             |
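The conversion applied by the Court, from a simulated change in the unemployment rate to net jobs created or lost, amounts to multiplying the change in the rate by the size of the labour force, held constant. A minimal sketch with hypothetical figures:

```python
# Convert a simulated fall in the unemployment rate into net jobs created,
# holding the labour force constant.  The labour force size and the rate
# changes below are hypothetical, not those behind the actual Table 1.

def net_jobs(labour_force, rate_fall_points):
    """Net jobs created when the unemployment rate falls by the given
    number of percentage points (negative values mean jobs lost)."""
    return round(labour_force * rate_fall_points / 100.0)

labour_force = 4_000_000  # hypothetical labour force
falls = {"zero-zero": 2.5, "medium-medium": 1.5, "high-high": 0.8}
jobs = {name: net_jobs(labour_force, fall) for name, fall in falls.items()}
```

For example, a 1,5 percentage point fall in unemployment on a labour force of 4 million corresponds to 60 000 net jobs created under this conversion.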

3.

The validity of the results needs to be assessed in the light of the following observations:

(a)

In the case of the new German Länder, although the bottom-up data are still incomplete, it can be concluded that the figures produced using HERMIN for net jobs created appear, under all the hypotheses, substantially to exceed the data produced by monitoring the projects directly (about 500 000 jobs created or maintained) (1).

(b)

As regards Ireland, the comparison with other data shows that the net figure of 64 621 attributed to the CSF appears somewhat understated compared with the other evidence. Monitoring figures show that the ‘industry’ OP led to about 212 000 gross jobs created and 90 000 net jobs created. Furthermore, the total number of jobs across the whole economy increased by 395 075 (2). The evaluators also note that it is important to look beyond the CSF in order to take account of policy elements, such as incentives for FDI and the policy of curbing wage inflation, that accompany the CSF. In other words, the model does not take account of these elements, which inflates the impact attributed to the CSF.

(c)

Under the average hypothesis, the Spanish O1 CSF, which is the largest of the CSFs, is far less successful than the new Länder O1 CSF in terms of job creation, and outperforms the latter only if it is supposed that demand effects predominate. However, under that hypothesis, the net figure for jobs created during 1994 to 1999 as a result of the O1 CSF would account for around 40 % of employment growth in Spain between 1995 and 2000, which appears rather optimistic.

(d)

Applying the average hypothesis and the hypothesis of dominant demand effects to Portugal gives net job creation estimates that are far higher than those produced by the national authorities (3). However, this discrepancy has not been addressed.

(e)

In Greece, too, there is a large discrepancy, although in the opposite direction: the national authority's net job creation estimates substantially exceed the estimates obtained from the HERMIN model (under all hypotheses) concerning falls in unemployment. According to the national report, the CSF has contributed to an annual net increase in employment of 1,7 % of the labour force (4), which translates into a net figure of more than 400 000 jobs created during 1994 to 1999.

(f)

The figures for jobs lost under the high-high hypothesis for the Spanish and Portuguese CSFs do not appear credible.


(1)  Evaluation report for Germany (O1), (pp. 42, 48, 70, 72, 85, 86).

(2)  Evaluation report for Ireland (pp. 61, 92).

(3)  Department for Forecasts and Planning (DPP).

(4)  National report, p. 210.


THE COMMISSION'S REPLIES

SUMMARY

VI.

The Commission is fully aware of the limitations of the study, due to a large extent to data constraints, and will take steps to improve the feasibility of future ex post evaluations.

VII.

Although some difficulties with data were encountered, the evaluation used several types of information to reach its conclusions (Member State mid-term and final evaluation reports, interviews, national workshops, and case studies).

VIII.

The Commission considers that the terms of reference were adequate for the purpose, but the evaluation faced major challenges, especially of data availability. The Commission took into account both positive and negative aspects of the work carried out.

Nevertheless, the Commission accepts that a preparatory analysis would have helped to identify some of the information difficulties that the Court has described.

X.

Although difficulties were encountered, the results of the evaluations have been considered during the mid-term review process as foreseen.

XI.

The Commission considers that future ex post evaluations should be more focused and take full account of data and resource constraints.

The Commission considers that macroeconomic models are a relevant tool for measuring the economic impact of large scale interventions. The HERMIN models developed for the various countries concerned took account as far as possible of the national specificities of the recipient economies. However, as for any evaluation tool, further improvements can be made to address limitations, in particular on micro data.

The Commission agrees that ex post assessments should take full account of existing thematic studies.

Future evaluations should cover fewer evaluation questions. This will make it possible to deepen the analysis in certain fields and will facilitate the oversight carried out by the Commission.

INTRODUCTION

1.

The Court has made observations on the evaluation in general and ex post evaluation in particular on several previous occasions, notably in Special Report No 15/1998 on the assessment of Structural Fund intervention for the 1989 to 1993 and 1994 to 1999 periods and Special Report No 7/2003 on the implementation of assistance programming for the period 2000 to 2006 within the framework of the Structural Funds. In this earlier work the Court acknowledged the specific constraints of such evaluations and came to an overall satisfactory assessment, while pointing out aspects needing improvement. The Commission welcomes the Court's constructive criticism on this topic.

4.

For the future ex post evaluation the Commission will review the organisation of the exercise, with particular reference to data collection and the preparation of the synthesis report.

The ESF ex post evaluation used the evaluation reports carried out by Member States at both the mid term and ‘final’ stages of the programming period 1994 to 1999.

EX POST ASSESSMENT OF OBJECTIVE 1 BY DG REGIO

15. Text box 1.

The ex post evaluation is not an official report of the Commission. Nevertheless, the Commission supports the overall conclusion of the report, which is borne out by evidence from the Commission's Third Report on Economic and Social Cohesion.

16.

The Court acknowledges the particular challenges facing ex post evaluation, in particular the lack of quantified indicator data. The Commission agrees that there is room for improvement in future ex post evaluations. To this end, the Commission intends to initiate a broader methodological debate with Member States and the academic world before the next round of ex post evaluations. A major issue to be explored is whether to focus on a small number of relevant evaluation questions, which would make the exercise less subject to data constraints.

17.

Although the convergence rate differs between regions, growth in Objective 1 regions is driven not only by employment increases but also significantly by increases in productivity. Other empirical studies suggest that productivity is the key factor determining the growth rate that is sustainable in the long run (unless there is a permanent inflow of migrants).

18.

The Commission welcomes the regression analysis undertaken by the Court. However, it is based on only one of the explanatory variables. To avoid biased results, a control analysis for the other explanatory variables would be necessary.

19.

Other studies undertaken by the Commission, for example for the Third Cohesion Report, have also led to different conclusions from the Synthesis Report. The picture is in fact mixed, with increases in employment appearing to be the main factor in growth in some Member States, and productivity growth in other Member States (see Third Cohesion Report, p. 3).

20.

The Commission agrees that the data available on participation rates, gross domestic product (GDP) and productivity could have been better exploited by the evaluator. However, other studies have also explored these relationships.

21.

Planned and actual expenditure data cannot be broken down by region, as a significant share of the assistance is delivered through national programmes such as transport, research, national aid schemes or human resource programmes, and Member States do not keep statistics on how it is allocated by region. Therefore, the regression could necessarily only cover part of EU expenditure. This was an inherent constraint on this part of the evaluation.

23.

In its Third Cohesion Report (p. 147) the Commission showed that there was some relationship between the amount of structural aid provided and real growth of GDP in Objective 1 regions.

24.

As already pointed out, the evaluation was faced with severe data constraints, particularly regarding national data on both public and private co-financed expenditure. The data presented were mostly based on estimation procedures, which gave rise in some cases to errors or discrepancies.

26.

The participation of the private sector depends on the general economic conditions of the country as a whole and the region concerned. The leverage effect of EU Structural Funds varies greatly from one country to another.

27.

Examination of this issue was hampered by the uncertainty surrounding figures for the private sector.

28.

Market failure is a crucial issue in the context of ex ante evaluation. Although it is mentioned in the methodology guide drawn up by the evaluator, a formal and detailed examination of appropriateness was not required by the terms of reference.

30.

The evaluators had to base their judgements largely on information and data available in the Member States. If data, for example on deadweight, was not available, the terms of reference did not require the evaluators to collect such data themselves, given the limited budget available for the exercise.

Concerning the concept of Community value added, the Commission considers that the content of the concept depends on the context in the different Member States. For example, the programming approach can be new in one Member State but not in another. A qualitative assessment seems the most appropriate in these circumstances.

31.

The limitations are largely due to the data and budget constraints. While a consideration of the results of other thematic studies may well have given added weight to the conclusions, the ex post evaluation was necessarily broader in scope.

32.

The two evaluations have a different scope and coverage. For example, the thematic evaluation of transport is also based on the Cohesion Fund and covers all objectives.

33.

Estimating the impact of Structural Fund interventions on employment is notoriously difficult and, depending on the methodology, can produce different results. These should therefore be interpreted with caution. The Commission is doing further methodological work on the estimation of employment effects to increase the consistency of results.

34.

The methods used for estimating the employment effects in the Synthesis Report and the thematic evaluation report are not comparable. The HERMIN model is based on more robust assumptions and also captures supply-side effects associated with the investment in physical and human capital, while the study on transport estimates the direct and indirect short term effects.

35.-36.

As indicated above, estimating the impact of Structural Fund interventions on employment is notoriously difficult and, depending on the methodology, can produce different results. The thematic study on SMEs carried out a more in-depth analysis of SME interventions and also covered interventions outside Objective 1 areas.

37.

Monitoring data of the Member States was a necessary input for the study. Given the time and budget constraints, it would have been impossible to make up for the gaps and deficiencies in the data.

38.

Given the broad scope of the evaluation, it was impossible to address all issues in depth by undertaking ad hoc surveys or other field work.

39.

The Commission agrees that evaluations should use existing studies and literature as a starting point for the analysis. However, the thematic studies had been undertaken in another context, namely as an input for preparing 2000 to 2006 programmes, whereas the ex post evaluation came later.

40.

The use of formalised quantitative methods requires the existence of robust and sufficient data from the Member States. As the Court points out, such data was often missing for the 1994 to 1999 programming period.

41.

The objective set by the terms of reference (ToR) for this area was fairly limited in scope.

42.

Despite the difficulties encountered in obtaining data, owing to the interconnection of different types of building work, the evaluators attempted to establish comparable unit costs in some fields, such as road construction, environmental projects and jobs created in industrial projects.

43.

The Commission agrees that a more detailed analysis is desirable. The subsequent ex post evaluation on the Cohesion Fund delivered further insights on this subject. In 1998 the Commission prepared a guide entitled ‘Understanding and monitoring the cost-determining factors of infrastructure projects — A user's guide’.

44.

The wording of the ToR made it clear that these tasks were especially difficult and could only yield partial conclusions. An assessment of public-private partnerships and of tendering procedures was not required by the ToR. The issue of public-private partnerships has been addressed in the ex post evaluation of the Cohesion Fund.

46.

The evaluator excluded private cofinancing from the macroeconomic modelling, as the model sought to quantify the overall impact of EU and publicly cofinanced expenditure on the economy, including private investment. Private sector investment is regarded as a result and not as a separate policy input. The Commission refers to points 24 to 27 of its replies. The Commission shares the concerns of the Court on the difficulty of specifying externality parameters. The choice made was based on international studies from which a range of elasticity values could be derived. At the time the study was conducted, there was no real alternative for conducting the impact analysis.

48.

Some of the Court's observations in points 48 and 49 were made in its Special Report No 7/2003 on the implementation of assistance programming for the period 2000 to 2006 within the framework of the Structural Funds. The Commission refers to the replies given in that report.

The alternative suggested by the Court was not feasible. The requirements for collecting micro-data were not standardised to the extent that would have been necessary for this type of analysis. The Commission agrees that future macroeconomic evaluations should rely more on microeconomic data analysis.

49.

 

(a)

These were the assumptions of the HERMIN model at the time of the evaluation. However, further development of the model should pay greater attention to the importance of the service sector in Objective 1 economies.

(b)

The issue of structural instability of the model will be examined carefully in future macroeconomic work.

(c)

The Commission acknowledges that the modelling exercise would have been more accurate if data sets on actual expenditure had been available. The commitment data used were the only data available across all regions at the time of the evaluation.

(e)

The elasticity values associated with physical and human capital investment which are used in the HERMIN model are based on a broad range of academic studies. The model considers different scenarios based on high, medium and low values. This sensitivity test results in a range of impact values.

All macroeconomic models face data problems. The modelling exercise relied on the best data available. The Commission agrees that additional work is needed to further improve the elasticities data used by the model.

(f)

In the HERMIN model, the counterfactual scenario is defined as the situation with no CSF in order to be able to estimate the deviation relative to the baseline. However, other additionality scenarios can be envisaged, e.g. only structural and cohesion funds (without national co-financing).

50.

In the Commission's opinion, despite possible shortcomings, HERMIN-type models are a valuable instrument for understanding and measuring the effects of cohesion policy. The use of the model by a growing number of Member States and the absence of convincing alternative approaches confirm this view. The HERMIN model can be improved and the Commission is working on its development for the programming period 2007 to 2013.

51.

The Commission considers that it is difficult to establish any linear correspondence between a fall or increase in unemployment and the number of jobs created or lost. Labour market mechanisms are more complex and should be examined carefully before drawing firm conclusions.

52.

Again, the evaluation was dependent on the quality of data and setting of objectives by the Member States. The absence of such data made the evaluation in many cases difficult or impossible.

56.

Partially conflicting or differing recommendations in different evaluations with different questions and methods are a normal and productive phenomenon in economic and sociological research.

57.

The recommendation in the ex post evaluation to pay attention to an appropriate, workable size of monitoring committees is not inconsistent with the recommendation to ensure inclusiveness in their composition made in the thematic study.

59.

The ex post evaluation was based here only on partial studies. The Commission agrees that the findings concerning the management systems were interesting enough to deserve more attention. A specific evaluation of this issue was undertaken by the Commission services in 2003 (The efficiency of the implementation methods for Structural Funds, December 2003).

60.

Like the other thematic evaluation studies, that of the partnership principle was designed to inform the programming of the 2000 to 2006 period. The programming had been completed by the time of the ex post evaluation. Therefore, greater discussion of this subject in the ex post evaluation, while it may have been desirable, was not essential.

61.

The tasks and budget of the evaluation were necessarily limited.

The issues raised by the Court were the subject of the study referred to in the reply to point 59. Project selection procedures are also one of the aspects of management and control systems that are checked in audits, as they are an important condition of sound financial management.

The Commission tends to the view that future ex post evaluations should cover fewer questions.

62.

The Commission agrees that Member States' management and control systems for the Structural Funds during the 1994 to 1999 period contained weaknesses. However, they improved towards the end of the period with the introduction of the control regulation 2064/97, and the improvement has continued in the 2000 to 2006 period.

63.

The Member States and the Commission agree that the monitoring of Structural Funds programmes was a weak point in the 1994 to 1999 programming period. In the 2000 to 2006 period, substantial efforts were made to address this issue. For the first time, systematic evaluations (including ex ante and mid-term evaluations) were carried out for all Structural Funds programmes. This led to a substantial improvement of the indicator and monitoring system for the Structural Funds.

64.

The financial data produced by the systems were broadly reliable. This was not, however, the case for monitoring data, although the availability of such data improved in the later stages of the programme period.

65.

‘Internal evaluations’ were not legally required by the regulations but were performed in response to specific needs felt by the Member States. This was a positive by-product of the ex post evaluation, which is now being widely imitated.

66.

For the programming period 2000 to 2006 the regulation established for the first time detailed rules and requirements. Based on these, the Commission elaborated detailed working papers on the elements of monitoring and evaluation. The ex ante evaluations, carried out systematically for all programmes and objectives, were a major opportunity for the Member States to invest in the establishment of indicators and quantified objectives. Most Member States introduced IT solutions to support regular and systematic reporting based on quantified information. The mid-term evaluation was an opportunity to revise and improve the indicators and their use.

67.

 

(a)

The Terms of Reference stated that ‘data available in the Member States may not be complete, particularly if the Member State does not plan to undertake ex post evaluations…’.

(b)

The extent of further data collection was constrained by the time and resources available.

(c)

There are methodological difficulties in assessing net effects through microeconomic methods. However, further work should be undertaken in this area.

(d)

The Commission agrees that this is an important issue in assessing effectiveness. However, it would have required specific further studies, which are expensive. The Commission, however, encourages Member States to undertake such work.

(e)

The potential links between strategy, effectiveness and impacts are analysed in the ex ante evaluations.

(f) and (g)

In the Commission's view, it is better for the terms of reference to set only the evaluation questions, leaving the methodological proposal to the bidders.

68.

A great deal of research on convergence has been done. The results of recent such work are summarised in the Commission's Cohesion Reports.

69.

This could have been another interesting approach, but was not necessary. In the Commission's view, future evaluation must set clear priorities, limiting the scope of questions.

70.

The Commission considers that the ToRs and the methodological guide were adequate for the purpose.

72.

Learning from the ex post evaluation, the Commission now uses expert advice at an earlier stage in methodologically demanding evaluations.

73.

The ex post evaluation 1994 to 1999 faced specific challenges, especially of data availability.

75.

The Commission applied in this evaluation the usual management techniques such as a formalised interim reporting system, oversight by a steering committee, regular working meetings and the involvement of experts. A report of acceptable quality was the outcome of this process.

The complexity of the ex post evaluation was significant. The Commission will use the experience gained to address this issue better in future.

The Commission agrees with the view of the Court that future evaluations should cover fewer evaluation questions. This will make it possible to deepen the analysis in certain fields and will facilitate the oversight carried out by the Commission.

76.

The evaluation, both in the Executive Summary and in Chapter 9 (Recommendations), goes far beyond management recommendations. The recommendations are aimed at Structural Fund interventions in all Objective 1 regions and are necessarily of a general nature. The role of research and development is stressed, as are the need to pay more attention to differences within large Objective 1 regions, the need to coordinate Structural Funds interventions better across funds in rural areas, etc.

77.

The Commission agrees with the evaluator that Monitoring Committees should devote more attention to strategic issues. This must not detract from the other tasks Committees have to perform under the regulations.

80.

The recommendation specifically referred to by the Court has been taken up by the Commission for the 2007 to 2013 programming period.

EX POST ASSESSMENT OF ESF OPERATIONS (UNDER OBJECTIVES 1 AND 3) BY DG EMPL

81.

A coordination meeting between DG REGIO, DG EMPL and the evaluators for the ESF ex post evaluation took place in October 2002. At this meeting it was agreed that the ESF ex post evaluators would have access to material gathered by the ERDF ex post evaluators. This later took place. The two managers of the ex post evaluations were members of each other's steering groups and received documentation circulated to other steering group members. The ESF evaluators used the ERDF evaluation material available.

Text box 2.

A variety of sources of information was used by the ESF ex post evaluators.

82.

The synthesis report alone contains a great deal of information concerning the use of ESF resources (nature, quantity and impact) across different target groups and by types of measure across the Member States (see in particular Chapter 2 of the Synthesis Report).

84.

The number of beneficiaries estimated in the table mentioned is based on information provided by the Member States' monitoring systems. The difficulties encountered regarding the accuracy of some data did not significantly affect the useful observations made.

The beneficiary totals only play a role in relation to (i) accountability and (ii) the limited input/output relationship analysis.

The ex post evaluation did not rely on single sources of information in order to reach conclusions and make recommendations.

85.

The Commission recognises that there were problems in estimating the number of ESF beneficiaries. Nevertheless, it considers that the table contains useful data for policy makers.

While the ex post evaluator has already, quite correctly, cited the two sources of information which have been used to develop the table, a fuller explanation of how the figures were derived would nonetheless have been useful.

86.

The comments in the text refer to more detailed information than that presented in the accompanying table. The table presents averages while the text gives a breakdown by subcategory.

The Commission holds the view that the fact that a wide range of values is not provided does not detract from the value of the analysis.

87.

The aim of Chapter 3 is to provide contextual information for ESF actions and it is in fact Chapter 2 that provides the bulk of numerical information concerning the ESF.

Text box 3.

This limited number of textual mismatches does not invalidate the accuracy and usefulness of the data in the table.

88.

The table clearly shows overall support for the hypotheses identified.

Examination of Table 3 shows that:

(a)

the highest number of cases which contradict any hypothesis is one out of 30;

(b)

at least two thirds of the scores are in the categories ‘strong supporting evidence’ or ‘some supporting evidence’ (20/30, 21/30, 22/30 and 26/30).

89.

It is also a legitimate aspect of evaluation work to identify deficiencies in data.

The terms of reference clearly state that the evaluation should be primarily desk-based using secondary data (data and information available in the mid term and final evaluations and in the managing authorities' reports and monitoring systems) and that the evaluators should not engage in extensive primary information gathering. Significant additional field work and survey samples would have required extra resources and would have engendered significant additional costs especially given the need to collect information concerning the entire programming period. The forthcoming ESF ex post evaluation for 2000 to 2006 will be preceded by a preparatory analysis.

90.

The synthesis report contains several references to the significance of ‘combined measures’ in terms of effectiveness.

92.

Firstly, the evaluators chose their words with care and emphasised that the conclusions they drew are tentative: ‘The combined allocation of national expenditure and ESF resources does not appear to have led to a convergence …’. Secondly, the figures (tables in Chapter 3 and Annex 2) do show that there is no convergence of expenditure patterns between countries in the combined allocation of national expenditure. Thirdly, as mentioned previously in the reply to point 87 (including Text box 3), the data presented in the Chapter 3 tables mentioned by the Court are accurate and support the qualified observation provided by the evaluators. This observation is also based on the analysis of labour market trends presented in Annex 2 of the Synthesis Report (not only Chapter 3). The Commission has already indicated that the examples given concerned a limited number of textual mismatches which do not undermine the figures.

93.

As in the case of Table 3 on individuals mentioned in point 88, the data clearly shows overall support for the hypotheses identified.

95.

The Commission considers that its quality assessment was accurate and balanced, taking into account both positive and negative aspects of the work carried out (rather than only concentrating on a limited number of negative aspects).

Taking into account the above, and since the results and conclusions were based on a variety of sources of information, the Commission stands by its quality assessment.

97.

 

(a)

It is not surprising that some evaluation results are of a qualitative nature, as the evaluation makes use of secondary sources which have been summarised and of primary sources which are of a qualitative nature (interviews with stakeholders and national workshops on the evaluation findings). In some cases the description of results can only be of a qualitative nature. Estimation of deadweight was not a compulsory subject of analysis for Member States during the programming period 1994 to 1999, nor was it part of the evaluation terms of reference. The logical consequence is that the ex post evaluation itself could only report on deadweight estimates that were available to it.

(b)

The UK report provides a substantial amount of evidence on soft outcomes.

Other national reports also make references to the concept.

(c)

The synthesis report, and the national reports on which it is based, contain numerous observations about the content of ESF action (financial, by target group and by type of measure) and about performance, including comments derived from national evaluation reports and from interviews with stakeholders.

Information on performance is also available from the case studies.

(d)

Sustainable development was not yet a political priority for the Structural Funds at the time when the programmes for the 1994 to 1999 period were being established and therefore it was not identified as a theme in the terms of reference.

Socio-economic cohesion does not appear in the terms of reference: it was dealt with in the Cohesion Report.

(e)

Each of the national reports examined by the Court contains substantial material on the long-term unemployed and young people. The synthesis report also contains information on the above two ‘target groups’.

(f)

As has already been stated, the ex post evaluators were in some respects dependent on information provided by Member States. The fact that some programmes lacked ‘specific initial objectives’ was a further constraining factor for the evaluators.

(g)

The ESF and the ERDF ex post evaluators collaborated and the ESF ex post evaluators used reports provided to them by the Objective 1 evaluators. Productivity increases by ESF beneficiaries in terms of the benefits to the firms employing them were not a required subject for Member States in their evaluation work during the programming period 1994 to 1999, nor was it part of the terms of reference. As such it is not surprising that this subject occurs only occasionally.

98.

The synthesis report identified numerous factors impacting on the development of Active Labour Market Policies (ALMPs), such as the level of ESF resources, the Structural Funds planning processes and organisational arrangements for the ESF. It also described rather concretely the influence of the ESF in the context of the European Employment Strategy (EES).

As already pointed out, net impact analysis was not a compulsory subject for Member States during 1994 to 1999. Therefore, it is normal that the subject does not appear in all Member States' evaluation reports. Since the evaluators were requested not to carry out large scale primary data gathering, the ex post evaluators cannot be criticised for not having provided this material.

100.

The EES's stronger strategic framework provides targets against which progress can be judged. Improved institutional structures have been created to measure progress. Moreover, the EES promotes stronger links between EU programme funding mechanisms and other policy instruments, regulations and national programmes.

101.

The Commission believes that each national report has dealt with the aspects of Community added value identified in the terms of reference (financial and leverage effects, policy and institutional effects, socio-cultural effects, and variation across objectives). It recognises, however, that these aspects were sometimes dealt with in other sections of the national reports.

In addition, the national ex post evaluation reports (and the synthesis ex post evaluation report) integrate these various aspects into an overall analysis of Community added value.

102.

The national and synthesis reports contain a great deal of information concerning the financial resources that have been devoted to ESF action.

In the case of both Spain and Italy, the evaluators arrived at positive conclusions overall.

The ex post evaluator for Spain reports that: ‘The general picture that emerges is that the ESF funds have been adequately and efficiently used by Spanish authorities in the 1994 to 1999 period. Most of the programmes have been fully executed in the way that had been planned’ (p. 74).

In the case of Italy, the evaluator concludes that: ‘Some of these costs are necessary to improve performance and transparency in the implementation process; others could probably be reduced by simplification of procedure’. (The issue of simplification of procedures was taken up in the next programming period 2000 to 2006).

104.

The Commission believes that the report does indeed provide useful material for conducting future ex post evaluations.

The report contains rather comprehensive reviews of approaches used to evaluate ESF type actions, both in relation to actions targeted at individuals and actions targeted at systems.

With regard to approaches applied in assessing net impact on individuals, the report contains numerous references to techniques used to estimate net impact on individuals (see especially p. 14, although additional references can be found on p. 10, 12, 21, etc.).

105.

As regards the recommendations on policy analysis, the methods report clearly shows that the evaluators have identified a number of concrete recommendations linked to the area of policy analysis, for instance close monitoring of the relationships between ESF interventions and the evolving national and other regional level interventions.

Furthermore, the methods report discusses the ‘logic models’ for policy analysis at some length and develops evaluation questions relating to the justification for ESF action.

The ex post evaluators were not expected to propose an approach for the selection of appropriate indicators at different strategic levels because such indicators had already been set for the entire programming period in guidance documents issued by the Commission in 2000.

106.

The Commission considers that the evaluation tasks are adequately specified in the ToR. Conclusions in national ex post evaluation reports are based on diverse sources (interviews, case studies, etc.). The synthesis report provides an overall analysis of the results contained in the national reports. The contractor explained, in the proposal submitted to the Commission, how these sources of information were used to generate conclusions.

Describing all aspects of ESF action and fixing all aspects of the evaluation a priori would lead to unmanageable and inflexible terms of reference.

With regard to ‘concrete action’ to evaluate the impact on individuals, when the indicators are not concretely established or when specific statistical evaluations have not been carried out recently, as already stated, the entire evaluation approach followed by the ex post evaluator was designed to draw conclusions from a variety of sources of information (Member State evaluation reports, national workshops, face-to-face interviews with programme stakeholders, case studies, etc.).

The extent of the mismatch between quantitative objectives and available data was not known at the time of the drafting of the terms of reference; it therefore seems unreasonable to expect the terms of reference to have been drafted differently.

The main contractor developed, in cooperation with national contractors, operational guidelines for the work to be conducted at national level.

107.

The terms of reference did not require the evaluators to assess the ‘appropriateness of the reprogramming choices’, but only to describe them. In each of the cases cited the ex post evaluator has described reprogramming changes based on the available information.

108.

The possibility of a preparatory analysis is being considered for the next exercise.

109.

The Commission considers that the data difficulties faced by national evaluators did not detract from the overall quality of the evaluation. The evaluation design foresaw the use of a variety of sources of information. In addition to reviewing the content of Member State evaluation reports, the national ex post evaluation reports include material from closure reports, case studies, national workshops and face-to-face interviews.

In terms of delays, the steering group was not in a position to speed up the closure of mainstream ESF programmes, which are dependent on implementation by the managing authorities.

At the start of the evaluation, the ex post evaluator was provided with all evaluation reports produced by the Member States during 1994 to 1999. Furthermore, the ex post evaluator was provided with letters of introduction in order to facilitate contacts with national ESF administrations.

111.

In the future the Commission will take all necessary steps for the successful preparation and close monitoring and supervision of the exercise.

CONCLUSIONS

115.

 

(b)

The extent of further data collection was constrained by the time and resources available.

Nevertheless, a substantial amount of additional qualitative data was collected, e.g. from interviews and workshops. National closure reports and the evaluation reports conducted by Member States during the 1994 to 1999 programming period were also reviewed.

(c)

A variety of sources of information was used in order to arrive at balanced, well considered conclusions.

In some cases the description of results can only be of a qualitative nature. But this was not solely the case: the evaluation results also contain significant quantitative information.

(d)

The conclusions drawn were based on the evaluators' use of secondary sources (which they summarised) and primary sources of a qualitative nature (interviews with stakeholders and national workshops on the evaluation findings).

See also reply to points 88, 92 and 96.

(e)

An evaluation of deadweight was not foreseen in the ToR.

As a general remark, an evaluation exercise cannot cover all the topics and has to focus on priority objectives and areas taking into account regulatory obligations, time and resource constraints and the intended use of the results. The experience of this exercise and lessons learned will be used for the 2000 to 2006 ex post evaluation.

116.

 

(a)

Examination of this issue was hampered by the uncertainty surrounding figures for the private sector.

(b)

The analysis was constrained by the availability of data and lack of comparability resulting from differences in methods and scope.

(c)

Some attempt was made to assess efficiency on the basis of a limited number of projects.

(d)

The HERMIN models did take account of the national specificities of the economies; however, further work will be undertaken to address the limitations.

The Commission considers that the limited number of drafting mismatches in Chapter 3 of the report does not call into question the accuracy, validity and usefulness of the data supporting the conclusions drawn. The data contained in the tables commented upon by the Court in points 88 and 92 show that there is significant overall supporting evidence for the hypotheses identified.

The difficulties encountered regarding the completeness of some data did not significantly affect the useful observations made.

117.

The Commission considers that the evaluation relied on a robust design using a variety of information sources and that its conclusions are sound.

In the future, the Commission will take all necessary steps for the successful preparation and close monitoring and supervision of the exercise.

See reply to point 111 above.

118.

In the future the Commission will take all necessary steps for the successful preparation and close monitoring and supervision of the exercise.

RECOMMENDATIONS

119.

For future ex post evaluations preparatory analyses or feasibility studies will be carried out to assess the availability of data and what outputs can reasonably be delivered within the budget and timeframe.

120.

The Commission agrees that the quality control procedures applied in the 1994 to 1999 period ex post evaluations should be further improved.

The Commission recognises that for the forthcoming Structural Funds ex post evaluation exercises a preparatory analysis could play a useful role in facilitating the work of contractors during the evaluations proper.

(a)

The basic data used in the ex post evaluation are collected and provided by the Member States through the monitoring system. The availability of reliable data may be a constraint on the evaluation, but in the 2000 to 2006 period there has been progress in this area.

(b)

Terms of reference must enable the Commission to assess the ability of the candidate to conduct the evaluation and whether the particular combination of methods and techniques proposed can deliver the required outputs.

The Commission shares the Court's opinion that terms of reference play an important role in achieving good quality evaluations results.

(c)

Given budget and time constraints, future ex post evaluations will be more focused on key evaluation questions and will be prepared and launched earlier than the 1994 to 1999 exercise.

Based on the experience of 1994 to 1999, monitoring and supervision arrangements will be strengthened and the process launched earlier.

121.

The Commission considers that macroeconomic models are relevant for estimating the economic impact of large-scale interventions. The HERMIN model was designed specifically for this purpose in the early 1990s. However, as with any economic model, further improvements will need to be made, in particular regarding the micro-data for the estimation of the economic returns on physical and human capital investments. The Commission considers that future ex post evaluations should be more focused.

122.

The Commission agrees that ex post assessments should take full account of existing thematic studies. When using them, it should be borne in mind that ex post assessments and thematic studies serve different purposes and may therefore lead to different estimates.

123.

 

(a)

The Commission agrees that private investment is a key driver for economic development. However, experience shows that private sector leverage differs across countries depending on specific economic conditions.

(b)

This approach cannot be universally applied since economic conditions in Member States and regions differ greatly.

(c)

The Commission is aware that further systematic work is needed in this extremely complex area.

(d)

The Commission is aware that further work is also needed in this area.

(e)

Ex post evaluation reports commissioned by Member States were one of the primary sources of information in the 1994 to 1999 evaluations. Although the Commission is primarily responsible for ex post evaluation, cooperation with Member States should be strengthened in the future and internal evaluations could play a greater role.

124.

The Commission is taking steps to improve its oversight of the process.

125.

The Commission agrees that the SF ex post evaluations should make full use of knowledge gained during an entire programming period and for this reason accepts the idea that a preparatory analysis could contribute to facilitating the task of contractors and improving the quality of forthcoming SF ex post evaluations.

The Commission already collaborates extensively with research institutes and universities and is willing to envisage further assignments for the purposes referred to by the Court.