Data envelopment analysis

Data envelopment analysis (DEA) is a nonparametric method in operations research and economics for the estimation of production frontiers^{[clarification needed]}. It is used to empirically measure productive efficiency of decision making units (or DMUs). Although DEA has a strong link to production theory in economics, the tool is also used for benchmarking in operations management, where a set of measures is selected to benchmark the performance of manufacturing and service operations. In the circumstance of benchmarking, the efficient DMUs, as defined by DEA, may not necessarily form a “production frontier”, but rather lead to a “bestpractice frontier” (Cook, Tone and Zhu, 2014). DEA is referred to as "balanced benchmarking" by Sherman and Zhu (2013).Nonparametric approaches have the benefit of not assuming a particular functional form/shape for the frontier, however they do not provide a general relationship (equation) relating output and input. There are also parametric approaches which are used for the estimation of production frontiers (see Lovell & Schmidt 1988 for an early survey). These require that the shape of the frontier be guessed beforehand by specifying a particular function relating output to input. One can also combine the relative strengths from each of these approaches in a hybrid method (Tofallis, 2001) where the frontier units are first identified by DEA and then a smooth surface is fitted to these. This allows a bestpractice relationship between multiple outputs and multiple inputs to be estimated.
"The framework has been adapted from multiinput, multioutput production functions and applied in many industries. DEA develops a function whose form is determined by the most efficient producers. This method differs from the Ordinary Least Squares (OLS) statistical technique that bases comparisons relative to an average producer. Like Stochastic Frontier Analysis (SFA), DEA identifies a "frontier" which are characterized as an extreme point method that assumes that if a firm can produce a certain level of output utilizing specific input levels, another firm of equal scale should be capable of doing the same. The most efficient producers can form a 'composite producer', allowing the computation of an efficient solution for every level of input or output. Where there is no actual corresponding firm, 'virtual producers' are identified to make comparisons" (Berg 2010)
Contents
History
In microeconomic production theory a firm's input and output combinations are depicted using a production function. Using such a function one can show the maximum output which can be achieved with any possible combination of inputs, that is, one can construct a production technology frontier. (Seiford & Thrall 1990). Some 30 years ago DEA (and frontier techniques in general) set out to answer the question of how to use this principle in empirical applications while overcoming the problem that for actual firms (or other DMUs) one can never observe all the possible inputoutput combinations.
Building on the ideas of Farrell (1957), the seminal work "Measuring the efficiency of decision making units" by Charnes, Cooper & Rhodes (1978) applies linear programming to estimate an empirical production technology frontier for the first time. In Germany, the procedure was used earlier to estimate the marginal productivity of R&D and other factors of production (Brockhoff 1970). Since then, there have been a large number of books and journal articles written on DEA or applying DEA on various sets of problems. Other than comparing efficiency across DMUs within an organization, DEA has also been used to compare efficiency across firms. There are several types of DEA with the most basic being CCR based on Charnes, Cooper & Rhodes, however there are also DEA which address varying returns to scale, either CRS (constant returns to scale) or VRS (variable). The main developments of DEA in the 1970s and 1980s are documented by Seiford & Thrall (1990).
Techniques
This article is in need of a cleanup. You can help out Infogalactic by reorganizing parts of the article, checking grammar and spelling, and doing other helpful things to correct the article.

Data envelopment analysis (DEA) is a linear programming methodology to measure the efficiency of multiple decisionmaking units (DMUs) when the production process presents a structure of multiple inputs and outputs.
"DEA has been used for both production and cost data. Utilizing the selected variables, such as unit cost and output, DEA software searches for the points with the lowest unit cost for any given output, connecting those points to form the efficiency frontier. Any company not on the frontier is considered inefficient. A numerical coefficient is given to each firm, defining its relative efficiency. Different variables that could be used to establish the efficiency frontier are: number of employees, service quality, environmental safety, and fuel consumption. An early survey of studies of electricity distribution companies identified more than thirty DEA analyses—indicating widespread application of this technique to that network industry. (Jamasb, T. J., Pollitt, M. G. 2001). A number of studies using this technique have been published for water utilities. The main advantage to this method is its ability to accommodate a multiplicity of inputs and outputs. It is also useful because it takes into consideration returns to scale in calculating efficiency, allowing for the concept of increasing or decreasing efficiency based on size and output levels. A drawback of this technique is that model specification and inclusion/exclusion of variables can affect the results." (Berg 2010)
Under general DEA benchmarking, for example, "if one benchmarks the performance of computers, it is natural to consider different features (screen size and resolution, memory size, process speed, hard disk size, and others). One would then have to classify these features into “inputs” and “outputs” in order to apply a proper DEA analysis. However, these features may not actually represent inputs and outputs at all, in the standard notion of production. In fact, if one examines the benchmarking literature, other terms, such as “indicators”, “outcomes”, and “metrics”, are used. The issue now becomes one of how to classify these performance measures into inputs and outputs, for use in DEA." (Cook, Tone, and Zhu, 2014)
Some of the advantages of DEA are:
 no need to explicitly specify a mathematical form for the production function
 proven to be useful in uncovering relationships that remain hidden for other methodologies
 capable of handling multiple inputs and outputs
 capable of being used with any inputoutput measurement
 the sources of inefficiency can be analysed and quantified for every evaluated unit
Some of the disadvantages of DEA are:
 results are sensitive to the selection of inputs and outputs (Berg 2010).
 you cannot test for the best specification (Berg 2010).
 the number of efficient firms on the frontier tends to increase with the number of inputs and output variables (Berg 2010).
Sample Applications and Example
Sample Applications
DEA is commonly applied in the electric utilities sector. For instance a government authority can choose Data Envelopment Analysis as their measuring tool to design an individualized regulatory rate for each firm based on their comparative efficiency. The input components would include manhours, losses, capital (lines and transformers only), and goods and services. The output variables would include number of customers, energy delivered, length of lines, and degree of coastal exposure. (Berg 2010)
DEA is also regularly used to assess the efficiency of public and notforprofit organizations, e.g. hospitals (Kuntz, Scholtes & Vera 2007; Kuntz & Vera 2007; Vera & Kuntz 2007) or police forces (Thanassoulis 1995; Sun 2002; Aristovnik et al. 2013, 2014).
Example
In the DEA methodology, formally developed by Charnes, Cooper and Rhodes (1978), efficiency is defined as a ratio of weighted sum of outputs to a weighted sum of inputs, where the weights structure is calculated by means of mathematical programming and constant returns to scale (CRS) are assumed. In 1984, Banker, Charnes and Cooper developed a model with variable returns to scale (VRS).
Assume that we have the following data:
 Unit 1 produces 100 pieces of items per day, and the inputs are 10 dollars of materials and 2 labourhours
 Unit 2 produces 80 pieces of items per day, and the inputs are 8 dollars of materials and 4 labourhours
 Unit 3 produces 120 pieces of items per day, and the inputs are 12 dollars of materials and 1.5 labourhours
To calculate the efficiency of unit 1, we define the objective function as
 maximize efficiency = (u_{1} × 100) / (v_{1} × 10 + v_{2} × 2)
which is subject to all efficiency of other units (efficiency cannot be larger than 1):
 subject to the efficiency of unit 1: (u_{1} × 100) / (v_{1} × 10 + v_{2} × 2) ≤ 1
 subject to the efficiency of unit 2: (u_{1} * 80) / (v_{1} * 8 + v_{2} * 4) ≤ 1
 subject to the efficiency of unit 3: (u_{1} * 120) / (v_{1} * 12 + v_{2} * 1.5) ≤ 1
and nonnegativity:
 all u and v ≥ 0.
But since linear programming cannot handle fraction, we need to transform the formulation, such that we limit the denominator of the objective function and only allow the linear programming to maximize the numerator.
So the new formulation would be:
 maximize Efficiency = u_{1} * 100
 subject to the efficiency of unit 1: (u_{1} * 100)  (v_{1} * 10 + v_{2} * 2) ≤ 0
 subject to the efficiency of unit 2: (u_{1} * 80)  (v_{1} * 8 + v_{2} * 4) ≤ 0
 subject to the efficiency of unit 3: (u_{1} * 120)  (v_{1} * 12 + v_{2} * 1.5) ≤ 0
 subject to v_{1} * 10 + v_{2} * 2 = 1
 all u and v ≥ 0.
Inefficiency measuring with DEA
Data Envelopment Analysis (DEA) has been recognized as a valuable analytical research instrument and a practical decision support tool. DEA has been credited for not requiring a complete specification for the functional form of the production frontier nor the distribution of inefficient deviations from the frontier. Rather, DEA requires general production and distribution assumptions only. However, if those assumptions are too weak, inefficiency levels may be systematically underestimated in small samples. In addition, erroneous assumptions may cause inconsistency with a bias over the frontier. Therefore, the ability to alter, test and select production assumptions is essential in conducting DEAbased research. However, the DEA models currently available offer a limited variety of alternative production assumptions only.
References
 Aristovnik, A., et al. (2013). Relative efficiency of police directorates in Slovenia: A nonparametric analysis. Expert Systems with Applications 40(2), pp. 820–827 http://dx.doi.org/10.1016/j.eswa.2012.08.027
 Aristovnik, A., et al. (2014). Performance measurement of police forces at the local level: A nonparametric mathematical programming approach. Expert Systems with Applications 41(2), pp. 1647–1653 http://dx.doi.org/10.1016/j.eswa.2013.08.061
 Banker, R.D., R.F. Charnes, & W.W. Cooper (1984) "Some Models for Estimating Technical and Scale Inefficiencies in Data Envelopment Analysis, Management Science vol. 30, pp. 1078–1092.
 Berg, S. (2010). "Water Utility Benchmarking: Measurement, Methodology, and Performance Incentives." International Water Association.
 Brockhoff, K. (1970). "On the Quantification of the MArginal Productivity of Industrial Research by Estimating a Production Function for a Single Firm", German Economic Review vol. 8, pp. 202–229.
 Charnes, A., W. Cooper, & E., Rhodes (1978) "Measuring the efficiency of decisionmaking units," European Journal of Operational Research vol. 2, pp. 429–444.
 Coelli, T.J., D.P. Rao, C.J. O'Donnell, and G.E. Battese, An Introduction to Efficiency and Productivity Analysis, Springer, 2005.
 Cook, W.D., Tone, K., and Zhu, J., Data envelopment analysis: Prior to choosing a model, OMEGA, 2014, Vol. 44, 14.
 Emrouznejad A., Barnett R. Parker, Gabriel Tavares (2008) Evaluation of research in efficiency and productivity: A survey and analysis of the first 30 years of scholarly literature in DEA, SocioEconomic Planning Sciences, 42(3):151157.[1]
 Farrell, M.J. (1957) "The Measurement of Productive Efficiency," Journal of the Royal Statistical Society vol. 120, pp. 253–281.
 Kuntz L., Vera A. (2007): Modular organization and hospital performance. Health Services Management Research 20(1), 48–58.
 Kuntz, L., S. Scholtes, A. Vera (2007), Incorporating Efficiency in Hospital Capacity Planning in Germany, European Journal of Health Economics 8(3), 213223.
 Lovell, C.A.L., & P. Schmidt (1988) "A Comparison of Alternative Approaches to the Measurement of Productive Efficiency, in Dogramaci, A., & R. Färe (eds.) Applications of Modern Production Theory: Efficiency and Productivity, Kluwer: Boston.
 Ramanathan, R. (2003) An Introduction to Data Envelopment Analysis: A tool for Performance Measurement, Sage Publishing.
 Seiford, L.M., & R.M. Thrall (1990) "Recent Developments in DEA: The Mathematical Programming Approach to Frontier Analysis," Journal of Econometrics vol. 46: pp. 7–38.
 Sherman, H.D., and Zhu, J., Analyzing performance in service organizations, Sloan Management Review, Vol. 54, No. 4 (Summer 2013), 3742.
 Simar, L. & V. Zelenyuk, 2011. "Stochastic FDH/DEA estimators for frontier analysis," Journal of Productivity Analysis, Springer, vol. 36(1), pages 120, August.
 Simar, L. and V. Zelenyuk, 2007. "Statistical inference for aggregates of Farrelltype efficiencies," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 22(7), pages 13671394.
 Simar, L. and V. Zelenyuk, 2006. "On Testing Equality of Distributions of Technical Efficiency Scores," Econometric Reviews, Taylor & Francis Journals, vol. 25(4), pages 497522.
 Sun, S. (2002), Measuring the relative efficiency of police precincts using data envelopment analysis, SocioEconomic Planning Sciences, 36(1), 5171.
 Thanassoulis, E. (1995): Assessing police forces in England and Wales using data envelopment analysis, European Journal of Operational Research, 87(3), 641657.
 Tofallis, C. (2001) "Combining two approaches to efficiency assessment." Journal of the Operational Research Society 52 (11), 1225–1231.
 Vera, A. & Kuntz, L. (2007), Processbased organization design and hospital efficiency, Health Care Management Review, 32(1), 5565.
 Mayer, A. and Zelenyuk, V. 2014. "Aggregation of Malmquist productivity indexes allowing for reallocation of resources," European Journal of Operational Research, Elsevier, vol. 238(3), pages 774785.
 Zelenyuk, V. 2006. "Aggregation of Malmquist productivity indexes," European Journal of Operational Research, vol. 174(2), pages 10761086.
 Zelenyuk, V. (2015) "Aggregation of scale efficiency," European Journal of Operational Research, 240:1, pp 269277.
External links
 All You need to know about DEA at www.DataEnvelopment.com
 DEA Facebook
 Tweet DEA
 Open Source DEA, An open source software that solves 40 DEA models with no limitation of DMUs or variables.
 DEAFrontier software, DEA books..., DEA Excel AddIns software, DEA books and articles
 [2], DEA 2013 Workshop on Productivity, Regulation & Transportation
 DEA Zone, A comprehensive website on Data Envelopment Analysis
 Full Bibliography of DEA Part1 , Full bibliography of DEA with more than 4000 journal articles  Part 1
 Full Bibliography of DEA Part2 , Full bibliography of DEA with more than 4000 journal articles  Part 2
 DEA software, The DEA software (Performance Improvement Management Software)
 OR Notes by J E Beasley DEA
 COOPERframework: A unified process for nonparametric projects , COOPERframework: A unified process for nonparametric projects
 DEA Online Software, Solve your DEA models with a professional software
 [3], European Workshop on Efficiency and Productivity Analysis
 [4], Group on Efficiency and Productivity Analysis that is active within EURO
 [5], Journal of Productivity Analysis, Kluwer Publishers
 Body of Knowledge on Infrastructure Regulation  Principles of using efficiency measures for yardstick regulation
 WebdeA, Solve your DEA models with a free software.
 Articles needing expert attention with no reason or talk parameter
 Articles needing expert attention from February 2009
 Mathematics articles needing expert attention
 Economics articles needing expert attention
 Wikipedia articles needing clarification from October 2012
 Wikipedia articles needing clarification from July 2011
 Articles in need of cleanup
 Operations research