econometrics using r., handbook of statistics, c.r. rao

43
Econometric Analysis of Productivity: Theory and Implementation in R Robin C. Sickles * Wonho Song Valentin Zelenyuk § September 3, 2018 Abstract Our chapter details a wide variety of approaches used in estimat- ing productivity and efficiency based on methods developed to es- timate frontier production using Stochastic Frontier Analysis (SFA) and Data Envelopment Analysis (DEA). The estimators utilize panel, single cross section, and time series data sets. The R programs in- clude such approaches to estimate firm efficiency as the time invariant fixed effects, correlated random effects, and uncorrelated random ef- fects panel stochastic frontier estimators, time varying fixed effects, correlated random effects, and uncorrelated random effects estima- tors, semi-parametric efficient panel frontier estimators, factor models for cross-sectional and time-varying efficiency, bootstrapping methods to develop confidence intervals for index number-based productivity estimates and their decompositions, DEA and Free Disposable Hull estimators. The chapter provides the professional researcher, analyst, * Corresponding author. Professor of Economics, Department of Economics, Rice Uni- versity, 6100 Main Street, Houston, Texas 77005, USA; phone: +1 713-348-3322; email: [email protected] Professor of Economics, School of Economics, Chung-Ang University, 84 Heukseok- ro, Dongjak-gu, Seoul 06974, Republic of Korea, Phone: +82-2-820-5493, email: wh- [email protected] Professor of Economics, School of Economics, University of Queensland, 530 Colin Clark Building (39), St Lucia, Brisbane, Qld 4072, Australia; Phone:+ 61 7 3346 7054 e-mail:[email protected] § The authors would like to thank Kerda Varaku for her assistance in preparing our manuscript. All remaining errors are the responsibility of the authors. 1 Econometrics Using R., Handbook of Statistics, C.R. Rao and H.D. Vinod, eds., Elsevier, 41 (forthcoming, 2019) : 1-31.

Upload: others

Post on 25-Dec-2021

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Econometrics Using R., Handbook of Statistics, C.R. Rao

Econometric Analysis of Productivity: Theoryand Implementation in R

Robin C. Sickles ∗ Wonho Song † Valentin Zelenyuk ‡§

September 3, 2018

Abstract

Our chapter details a wide variety of approaches used in estimat-ing productivity and efficiency based on methods developed to es-timate frontier production using Stochastic Frontier Analysis (SFA)and Data Envelopment Analysis (DEA). The estimators utilize panel,single cross section, and time series data sets. The R programs in-clude such approaches to estimate firm efficiency as the time invariantfixed effects, correlated random effects, and uncorrelated random ef-fects panel stochastic frontier estimators, time varying fixed effects,correlated random effects, and uncorrelated random effects estima-tors, semi-parametric efficient panel frontier estimators, factor modelsfor cross-sectional and time-varying efficiency, bootstrapping methodsto develop confidence intervals for index number-based productivityestimates and their decompositions, DEA and Free Disposable Hullestimators. The chapter provides the professional researcher, analyst,

∗Corresponding author. Professor of Economics, Department of Economics, Rice Uni-versity, 6100 Main Street, Houston, Texas 77005, USA; phone: +1 713-348-3322; email:[email protected]

†Professor of Economics, School of Economics, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Republic of Korea, Phone: +82-2-820-5493, email: [email protected]

‡Professor of Economics, School of Economics, University of Queensland, 530 ColinClark Building (39), St Lucia, Brisbane, Qld 4072, Australia; Phone:+ 61 7 3346 7054e-mail:[email protected]

§The authors would like to thank Kerda Varaku for her assistance in preparing ourmanuscript. All remaining errors are the responsibility of the authors.

1

Econometrics Using R., Handbook of Statistics, C.R. Rao and H.D. Vinod, eds., Elsevier, 41 (forthcoming, 2019) : 1-31.

rsickles
Pencil
Page 2: Econometrics Using R., Handbook of Statistics, C.R. Rao

statistician, and regulator with the most up to date efficiency modelingmethods in the easily accessible open source programming languageR.

Keywords: Production (technical) efficiency; Stochastic frontieranalysis; Data envelopment analysis; Panel data; Index numbers; Non-parametric analysis; Bootstrapping

1 Introduction

Our chapter provides a discussion of various statistical and mathematicalprocedures that firms, regulators, and academics and policy makers utilizein order to better understand performance and production (technical) effi-ciency of the entities they are benchmarking against other competitors orpeers. As well we discuss, give examples of, and provide extensive links toR programs that implement these various methods and approaches. Thevarious methods and approaches distinguish themselves by leveraging the es-timation of relative performance and of production efficiency measures onregression-based methods and on linear programming methods, the formerreferred to as Stochastic Frontier Analysis (SFA) and the latter as Data En-velopment Analysis (DEA). We discuss the main approaches in turn, theirrelative strengths and weaknesses, and briefly touch on ways to aggregatethe various methods using model averaging approaches.

Our chapter is organized in the following way. We first briefly discuss themotivation for using such methods that rest on the presence of production ef-ficiency differences among units of production that are being compared. Thevarious sets of rationales for such a discipline as efficiency and productivityanalysis can be categorized into three main groups of motivating factors.They are based on varying management practices, which deliver heteroge-neous outcomes, behavioral economics, and the presence of X-efficiency. Wethen discuss the two main classes of production efficiency estimators, the SFAand the DEA estimators. We next discuss examples of R programs that im-plement these various classical estimators and various extensions that havebeen introduced in the recent literature as well as protocols for accessing Rprograms on well-documented websites and other open source sites. We thenprovide concluding remarks.

2

Page 3: Econometrics Using R., Handbook of Statistics, C.R. Rao

2 Why Estimate Production (Technical) Ef-

ficiency?

One of the most compelling rationales for the study of production efficiencyis substantial heterogeneity in management practices and resulting changesin the operating efficiency of a firm. A consensus in the empirical literatureexists that indicates substantial productivity differences both within a firm1

over time and among firms (Lieberman et al. (1990); Foster et al. (2008);Hsieh and Klenow (2009); Hall and Jones (1999)). Glaister (2014), amongothers, have noted that management practices are a key factor in explain-ing such productivity differences. Other factors, such as expenditures onR&D, utilization of capacity, and technology adoption, which are key deci-sions of management (Nallari and Bayraktar (2010)), are typically controlledby management practices. Bloom et al. (2012) regressed gross domestic prod-uct (GDP) per capita on a set of indicators of management practices among17 countries. These indicators of management practices explained 87% ofthe variation in per capita GDP. These findings are corroborated in a moremicro oriented study of Indian firms by Bloom et al. (2013). The engineer-ing mechanism, blueprints, formal structural statistical model, or economicmodel that explains how a firm’s productivity is linked to management skillsand practices has not been developed in a way that lends itself to empiri-cal analysis and to the generation of relative technical efficiency differencesamong peer firms. We thus tend to view such a factor as an unobservable la-tent factor and assume that management practices are one of the key factorsin firm productivity. Another factor is innovation but innovation is often-times facilitated by decisions and practices of management. The literature onthe effects of management practices on production efficiency, labor efficiency,and related measures of a firm’s financial success is quite dense.2 Behavioraleconomics provides another motivation for why firms may not operate at thefrontier of production efficiency. The assumptions of efficient markets andrational decision makers have been leveraged with substantial success by neo-classical economists, but the footing on which these assumptions rest may bea bit loose and slippery.

1We will use the term “firm” to denote any generic unit whose production efficiency isbeing measured and estimated.

2Much of this literature is summarized and referenced Grifell-Tatje and Lovell (2013,2015) and Grifell-Tatje et al. (2018).

3

whson
강조
whson
텍스트 상자
referenced in
rsickles
Pencil
Page 4: Econometrics Using R., Handbook of Statistics, C.R. Rao

X-efficiency theory is a pragmatic paradigm that admits to the prevalenceof various sources on inefficiency in economics. It was introduced by Leiben-stein (1966, 1975, 1987), who pointed out that agency problems, asymmetricinformation, and monitoring by regulators were all factors that generatedincentives to engage in sub-optimal decision making, absent these factorsand constraints. Drawing from studies of the health care industry, telecom-munications, airlines, and education, Frantz (1997, 2007) have documentedthe ubiquitous existence of levels of inefficiency with the predictions from X-efficiency theory and have also concluded that such production inefficiency ismuch more significant than inefficiencies due to incorrect output and inputallocations (allocative inefficiency). Other studies that have documentedsuch inefficiency levels in the banking system can be found in Kwan (2006);Jiang et al. (2009); Fu and Heffernan (2009); Yao et al. (2008); Rezvanianet al. (2011); Bauer and Hancock (1993); Mester (1993); DeYoung (1998).And as emphasized by Frantz (1997)

“...what becomes of the word maximize if non-maximize is not possible?Is the concept of efficiency important if the possibility of inefficiency is ruledout a priori? The importance of efficiency remains as long as economicsremains important...”

X-efficiency theory and the methods that we present in our chapter isbased on an interdisciplinary approach that combines psychology, manage-ment, statistics, applied mathematics, and engineering.

Finally, the usefulness and importance of SFA and DEA methods havepassed the market test as they are required to be used in a wide varietyof regulatory decision-making settings in Europe and elsewhere (Bogetoft(2013), Agrell et al. (2017)).

3 Regression-Based Methods to Estimate Pro-

duction Efficiency

We begin our formal discussion of production efficiency measurement by firstdefining a few sets and functions that are necessary for our presentationof methods and how to implement them. To facilitate further discussion,

4

Page 5: Econometrics Using R., Handbook of Statistics, C.R. Rao

let inputs and outputs be represented by x = (x1, ..., xN)′ ∈ RN+ and y =

(y1, ..., yM)′ ∈ RM+ , respectively and the production process characterized by

a technology set T that satisfies standard regularity conditions (Sickles andZelenyuk (2018)). The output set is defined as

P (x) ≡ {y ∈ <M+ : y is producible from x ∈ <N+}. (1)

The production function f : <N+ → <1+ is defined as f(x) ≡ max{y: y ∈

P (x)}. The maximum of the production function exists and is unique owingto the fact that P(x) is a compact set. Details on the regularity conditionsthat assure the maximum is unique and exists also can be found in Sicklesand Zelenyuk (2018). When there is more than one output one can usethe generalization of the production function, output orientated (Shephard(1970)) distance function Do : <N+ × <M+ → <1

+ ∪ {+∞} as Do(x, y) ≡inf{θ > 0 : y/θ ∈ P (x)}. For a technology with only one output and holdingthe inputs constant, Do is the ratio of actual output to potential (maximal)output. Then frontier production is f(xo) = yo/Do(x

o, yo). When thereare multiple-outputs, the Shephard output distance function is the smallestscalar required to radially expand all outputs to the output set’s boundary,again at a fixed level of inputs.

3.1 The Stochastic Frontier Paradigm

In cross-sectional Stochastic Frontier Analysis (SFA) there are at least twosources of error (panel extensions may extend the identifiable error compo-nents for four distinct sources of error). In addition to the error that isappended to the parametric or nonparametric production or distance func-tion to account for standard statistical noise (assumed to have mean zero) anadditional source of error is added to address the asymmetry in the errors dis-played by most production or distance function estimates that is due to theboundary property of the function being estimated. Such a one-sided erroris utilized additively as a placeholder for production or technical inefficiency,which diminishes the level of observed output from the frontier productionor distance function specified parametrically or nonparametrically. An ex-ample of such an SFA model for a linear in logs (Cobb-Douglas) productionfunction is,

ln(y) = β0 +N∑k=1

βk ln(xk) + ε, (2)

5

Page 6: Econometrics Using R., Handbook of Statistics, C.R. Rao

ε = v − u. (3)

Here v is a random term with mean zero and variance σ2v . The error term

u represents the latent production inefficiency term. The latent productioninefficiency term has positive support with a mean µ > 0 and a variance σ2

u.3

Although these assumptions have been modified to account for more realisticempirical settings, this forms the basis for the canonical model used in SFA.Most of the literature in SFA has worked to generalize the distributions ofu and v, their moment properties, stochastic properties of the inputs (andfor multi-output distance functions the outputs as well), and different de-pendency patterns among the disturbances and the inputs and outputs, aswell as possibly distinguishing between variables that are used in developingthe production boundary and environmental or confounding factors that mayinfluence production efficiency, that is the deviations between the boundaryand non-boundary observations, as opposed to influencing the boundary perse.

3.1.1 Corrected OLS

One of the first approaches to develop measures of production efficiency isthe corrected ordinary least squares. The steps employed in such an exer-cise involve first estimating the production (or distance) function by ordinaryleast squares to get estimates of the average production relationship. In thesecond step one simply shifts the intercept (the example here is for the singleoutput production function) to ensure that the residuals are all non-positive(Olson et al. (1980)). With only mild assumptions, OLS will yield mini-mum variance unbiased linear estimators. The step with the Cobb-Douglasproduction function involves estimating

ln(yi) = β0 + β1 ln(xi1) + · · ·+ βN ln(xiN) + εi, i ∈ {1, ..., n}. (4)

The second step uses the intercept correction

βcols0 := β0 + maxi{εi}, i = 1, ..., n, (5)

where εi are OLS residuals. The last step estimates production efficiency fora firm as

Technical Inefficiency = εcolsi := maxi{εi} − εi, i ∈ {1, ..., n}. (6)

3Cost inefficiency can be modeled with a one-sided error with only negative support.

6

whson
강조
whson
강조
whson
텍스트 상자
below, we denote i=1,...,n
Page 7: Econometrics Using R., Handbook of Statistics, C.R. Rao

Thus the corrected production function that estimates the production frontieris

ln(yi) = βcols0 + β1 ln(xi1) + · · ·+ βN ln(xiN)− εcolsi , i ∈ {1, ..., n}. (7)

3.1.2 Stochastic Frontier Model

Aigner et al. (1977) (hereafter ALS) and Meeusen and van den Broeck (1977)pursued a parametric model of the stochastic frontier based via maximumlikelihood. Average inefficiency is defined in terms of the performance of afirm to the firm identified as having the best-practices, as measured by itslevel of efficiency. The original ALS model used a half-normal distributionfor the efficiency term and a normal error for the idiosyncratic disturbance.Many other distributions have been considered, usually for the inefficiencyterm. These include the exponential, truncated normal, gamma, and dou-bly truncated normal (Stevenson (1980), Greene (1980a,b),Qian and Sickles(2008), Almanidis and Sickles (2012), and Almanidis et al. (2014)). In thecanonical SFA the composite error terms are assumed to be independent. Ifthe production function is linear in logs then a convenient parameterizationis

yi = f(xi|β) exp(εi), i = 1, · · · , n (8)

and for ε = v − u the inefficiency of firm i is measured as

exp(−ui) ≡yi

f(xi) exp(vi), i = 1, · · · , n. (9)

The model is usually specified after log-transforming the production re-lationship

ln yi = ln f(xi|β) + vi − ui, i = 1, · · · , n (10)

and assumingvi ∼ N (0, σ2

v)

andui ∼ |N (0, σ2

u)|

and that ui and vi are independent and i.i.d., it follows that the density of εiis:

fεi(ε) =2

σφ( εσ

)[1− Φ

(ελ

σ

)], −∞ ≤ ε ≤ +∞ (11)

7

whson
강조
whson
텍스트 상자
below, we denote i=1,...,n
whson
강조
whson
강조
whson
강조
whson
텍스트 상자
Below, sometimes we use i=1,....,n or i=1,2,...,n with lower dots .... It would be better to use same notations. Please check time index t as well.
Page 8: Econometrics Using R., Handbook of Statistics, C.R. Rao

where Φ(·) is the standard normal distribution function, σ2 = (σ2v +σ2

u) , andλ = σu/σv. The composite errors, ε1, · · · , εn, are of course not observed, butfrom our model we know that

εi = ln yi − ln f(xi), i = 1, · · · , n. (12)

The log-likelihood function is

`(y1, ..., yn|β, λ, σ2) =n

2ln( 2

π

)− n

2ln(σ2)− 1

2σ2

n∑i=1

[ln yi − ln f(xi)

]2+

n∑i=1

ln[1− Φ

( [ln y − ln f(xi)]λ

σ

)].

(13)Inefficiency has a mean and variance given by

E(ui) ≡ µ =

√2√πσu (14)

and

V (ui) =

(π − 2

π

)σ2u (15)

and thus the composed error’s mean and variance is

E(εi) = E(vi − ui) = E(−ui) = −µ = −√

2√πσu (16)

V (εi) = V (vi − ui) = V (v) + V (u) = σ2v +

(π − 2

π

)σ2u (17)

andcov(εi, εj) = 0,∀i 6= j. (18)

Estimation of individual inefficiencies Cross-sectional SFA, unfortu-nately, does not provide a consistent estimate of the efficiency of each firm.A simple solution by Materov (1981) and further developed in Jondrow et al.(1982) uses E(ui|εi) as a point estimate of ui. This estimator has its draw-back as it is contaminated by statistical noise but it is appealing in that,from the law of iterated expectations,

E(ui) = Eε[E(ui|εi)] (19)

8

Page 9: Econometrics Using R., Handbook of Statistics, C.R. Rao

and thus to develop a consistent estimator of E(ui|εi) we can write

E(ui|εi) ≡∞∫0

ufui|εi(u|ε)du (20)

where, from the definition of the conditional density,

fui|εi(u|ε) ≡fui,εi(u, ε)

fεi(ε). (21)

Based on the normal/half normal composed error structure typically used inSFA we can write

E(ui|εi) = µ∗ + σ∗1

1− Φ(−µ∗/σ∗)φ

(−µ∗σ∗

)(22)

or

E(ui|εi) = σ∗

[µ∗σ∗

+1

1− Φ(−µ∗/σ∗)φ

(−µ∗σ∗

)]. (23)

Using the parameterization −µ∗σ∗

= σ2uε

σ2 σvσuσ

= σuεσσv

= ελσ

, the conditional expec-

tation is:

E(ui|εi) =σvσuσ

[−εiλσ

+φ (εiλ/σ)

1− Φ(εiλ/σ)

](24)

for which a consistent estimate can be obtained based on consistent estimatesof the model parameters (e.g., obtained via MLE or COLS, or the ModifiedOLS procedures of Olson et al. (1980)). Thus a (conditional) consistentestimator of each firm’s efficiency level can be based on this last equation.

3.1.3 Panel Stochastic Production Frontiers

The cross-sectional stochastic frontier model has a number of drawbacks thathave been addressed over the forty years since it was introduced. As wepointed out, a consistent estimate for a firm’s technical efficiency is notavailable, only a consistent estimate of the conditional mean of the firm’sefficiency level. Of course the canonical ALS model is fully parametric andinputs are assumed to be uncorrelated with the regressors, which is partic-ularly troubling when the regressors are inputs and the latent factor thattypically is assumed to account for inefficiency is unobservable managerialexpertise, an input whose independence from the levels of capital and labor

9

Page 10: Econometrics Using R., Handbook of Statistics, C.R. Rao

used in production is a questionable assumption, and of course if a firm isaware of its level of technical efficiency this information, although unknownto the analyst, should not be independent of the firm’s input choices.

These problems are potentially avoidable if one has panel data (Pitt andLee (1981); Schmidt and Sickles (1984)), although the possible endogeneityof input choice must be addressed with care and its treatment using fixedeffects type estimators may not be completely satisfactory. We will returnto this issue shortly. If we have a panel of firms, then if the unobservedfirm effects represent technical efficiency they can be estimated consistentlyfor large T (assuming that our sample of firms n is large) after controllingfor inputs and other environmental and observable factors that may impactproduction. To show how this is accomplished with panel data we use thegeneral treatment discussed in Schmidt and Sickles (1984) (SS), which alsoconsiders the Pitt and Lee (1981) parametric random effects model. Themodel is

yit = α + x′itβ + vit − ui, i = 1, ..., n; t = 1, ..., T, (25)

which can be rewritten as

yit = α∗ + x′itβ + vit − u∗i (26)

where α∗ = α − µ; u∗i = ui − µ; E(ui) = µ ≥ 0, and where xit is a vector ofN inputs. If we let αi = α∗ − u∗i then the model becomes the usual paneldata model

yit = αi + x′itβ + vit (27)

and cross-sectional effects that can be viewed as random, fixed, or simply ig-nored. Five estimators of the classical panel data model with time-invarianteffects are discussed in SS. These are the pooled OLS model, the fixed effectswithin estimator, the random effects model, the Hausman-Taylor estimator,and the fully parametric random effects MLE model (Pitt and Lee (1981)).Schmidt and Sickles (1984) discuss the asymptotics of each of these estima-tors and the assumptions needed in order for the parameter estimates andestimates of the technical efficiency level to be consistently estimated.

Technical efficiency effects estimates with the SS estimators do not changeover time and this is a strong and often unreasonable assumption that is notnecessary. The suite of panel stochastic frontier estimators developed by

10

Page 11: Econometrics Using R., Handbook of Statistics, C.R. Rao

Cornwell et al. (1990) (CSS) had a parameterization that allowed for time-varying heterogeneity and is based on the model:

yit = x′itβ + z′iγ + w′itδi + vit i = 1, · · · , n; t = 1, · · · , T, (28)

where xit, zi and wit are N ×1, J×1 and L×1 vectors, respectively, and theparameter vectors β, γ and δi are dimensioned conformably. With δ0 = E[δi],and δi = δ0 + ui the model can be written as:

yit = x′itβ + z′iγ + w′itδ0 + εit,

where

εit = vit + w′itui (29)

and where ui is assumed to be i.i.d., zero mean random variables with co-variance matrix ∆.The error term vit is assumed to be i.i.d., with zero meanand constant variance σ2

v . The basic model assumes that vit is uncorrelatedwith z, x, and ui. In order to allow for time-varying technical efficiency thestochastic frontier model of Schmidt and Sickles (1984)

yit = α + x′itβ + vit − ui = αi + x′itβ + vit. (30)

can be modified by replacing the αi with, e.g.,

αit = θi1 + θi2t+ θi3t2, (31)

and the model in matrix form becomes

y = Xβ + Zγ +Wδ0 + ε, (32)

ε = Qu+ v. (33)

Details of the estimator can be found in Cornwell et al. (1990). Fixedeffects, random effects, and Hausman-Taylor estimators of the CSS modelwere developed to estimate productivity efficiency that is time varying andallow for consistent estimation under large n and T asymptotics of the timevarying productivity efficiency for each firm. The parameter δi is estimatedby regressing the residuals (yit − x

′itβ) for firm i on wit. This amounts to

regressing the within residual on a constant term, time, and time-squared for

11

Page 12: Econometrics Using R., Handbook of Statistics, C.R. Rao

the specification we introduced for αit above and relative efficiencies can beapproximated (in the linear in logs production function) by

αt = maxi

(αit) i = 1, ..., n (34)

uit = αt − αit. (35)

Parametric MLE case also be used to estimate a random effects timevarying technical efficiency models. Kumbhakar (1990) and Battese andCoelli (1992) present two such models. In the former inefficiency is modeledas

uit = (1 + exp(bt+ ct2))−1τi, (36)

where a and b are parameters to be estimated and where τi’s distribution isassumed to be i.i.d N+(0, σ2

τ ) and vit is i.i.d N (0, σ2v). In the latter specifi-

cation inefficiency is modeled as

uit = ηtτi = {exp[−η(t− T )]}τi, t = 1, . . . , T ; i = 1, . . . , n, (37)

where vit is i.i.d N (0, σ2v) random variables, τi is assumed to be i.i.d. and has

a non-negative truncated distribution N (µ, σ2), and η is a scalar parameter.

3.1.4 Second-and Third-Generation Stochastic Frontier Models

The models we have just discussed provide the basic statistical intuition forSFA. Much has been done to extend and generalize the canonical models wehave discussed and space prohibits us from providing comparable detail onthese second and third generation extensions. However, the R codes that wewill discuss shortly are equipped to deal with a number of these relativelynew modeling scenarios and so we give a brief summary of what these secondand third generation SFA models deliver and the general ideas behind howthey are formulated and specified. We leave it to the reader to seek out theoriginal sources and recent books, handbooks, and survey articles that wehave referenced and that provide more detailed treatments.

Researchers have provided further generalizations of the CSS time-varyingtechnical efficiency model that focus on two main aspects of the model. Thefirst is to allow for a factor-type structure for the cross-section and timevarying technical efficiency term (these are almost all exclusively panel data

12

Page 13: Econometrics Using R., Handbook of Statistics, C.R. Rao

extensions). The second is to decompose the time invariant and time varyingefficiency terms into a portion that is skewed, to represent technical ineffi-ciency, and a portion that is symmetric, the latter somewhat courageouslycalled the “true” effects model. There are four other generalizations thatsome may view as comparable to these but there is simply not enough spaceto address them in any substantive way. We will mention them and pro-vide several references as well as links to R codes that deal with these fouradditional issues, which are: environmental factors, endogeneity, Bayesianmethods for SFA, and nonparametric specifications of the technology and ofthe error structure.

We briefly discuss these sets of generalizations and available R code toimplement them.

Factor Models and SFALee (1991) and Lee and Schmidt (1993) were the first to propose a (one

component) factor model to address time-varying and cross-sectional specificproduction efficiency. Their model is

yit = αt + x′itβ + vit − uit for i = 1, . . . , n; t = 1, . . . , T, (38)

or as

yit = x′itβ + αit + vit, (39)

where αit = αt − uit is the time varying cross-sectional specific technicalefficiency term, which are modeled as s as

αit = ηtδi. (40)

Here ηt, t = 1, ..., T, are the time-varying effects to be estimated and δi, i =1, ..., n, are the firm effects. Lee and Schmidt (1993) provide fixed-effectsand random effects estimators for this one-factor model based one nonlinearregression estimators.

Ahn et al. (2007) generalized this model to allow for multiple factors thatchange over time and specify the production frontier as

yit = δt + x′itβ + vit − uit (41)

= x′itβ + ηit + vit, i = 1, 2, . . . , n, t = 1, 2, . . . , T, (42)

13

whson
강조
whson
텍스트 상자
as?
rsickles
Pencil
Page 14: Econometrics Using R., Handbook of Statistics, C.R. Rao

where, again, vit is the usual disturbance term and uit ≥ 0 is the inefficiencyterm and where ηit ≡ δt − uit is the time-varying and cross-sectional spe-cific technical efficiency term that is expressed as a linear combination of punrestricted components,

ηit = θ1tα1i + θ2tα2i + · · ·+ θptαpi =

p∑j=1

θjtαji. (43)

The model is estimated using generalized methods of moments. Ahn et al.(2013) provide a focused study of consistency properties of their Ahn et al.(2007) model when different sorts of dependency relationships exist betweenthe production efficiency effects and the regressors.

In the Ahn et al. (2007) model the effects are multiplicative. Kneip et al.(2012) (KSS) provided a more general model than Ahn et al. (2007) by al-lowing for a general nonparametric time-varying and cross-section specificproductivity efficiency. The KSS model is:

yit = β0(t) +N∑j=1

βjxitj + ui(t) + vit, i = 1, . . . , n; t = 1, . . . , T. (44)

The ui(t)’s are assumed to be smooth time-varying individual effects thatsatisfy a normalization that

∑i ui(t) = 0 and are a linear combination of

L < T basis functions (common factors) g1, ..., gL:

ui(t) =L∑r=1

θirgr(t). (45)

The term β0(t) is an average function that is eliminated by centering themodel yielding

yit−yt =N∑j=1

βj(xitj−xtj)+ui(t)+vit−vi, i = 1, . . . , n; t = 1, . . . , T (46)

where yt = 1n

∑i yit, xtj = 1

n

∑i xitj and vi = 1

n

∑i vit. As with the CSS

and many other panel stochastic frontier estimators, technical efficiency iscalculated as TEi(t) = exp{ui(t)−maxj=1,...,n(uj(t))}, just as it is calculatedwith the CSS estimator. The estimator used is a three step one that first esti-mates β using penalized least squares and smoothing splines to approximate

14

Page 15: Econometrics Using R., Handbook of Statistics, C.R. Rao

the factors, then second generates estimates of the covariance structure of theui(t)

′s , and the third step estimates the basis functions gr and then updatesestimates of ui by

∑Lr=1θirgr. The KSS estimator and related estimators

such as those introduced by Bai and Ng (2002) and Bai (2009), as well astests for the number of factors, have been programmed in R and are availablenot only in the suite of programs we discuss at the end of this chapter butalso can be found on the website referenced in Bada and Liebl (2014).

True Fixed Effects and SFAGreene (2005a,b) proposed a stochastic panel frontier model in which the

intercept fixed effects were not measures of persistent inefficiencies, as hadbeen assumed by SS, but rather was simply firm specific heterogeneity. Thishas become known as the“true”fixed effects model, although a less ambitiouslabel may be the panel frontier with only transitory inefficiency. The modelis

yit = αi + x′itβ + vit − uit (47)

εit = vit − uit, (48)

or

yit = αi + x′itβ + εit (49)

and thus the source of inefficiency is uit, which is distinguished from the in-tercept terms αi. Greene outlines estimators for both the fixed effects andrandom effects panel stochastic frontier model with only transitory ineffi-ciency based on simulated MLE and technical efficiency estimates are basedon the familiar expression

TEit = exp[−{max(uit)− uit}]. (50)

Colombi et al. (2011), Colombi et al. (2014), and Tsionas and Kumbhakar(2014), provided further generalizations of this model and alternative estima-tors based on Bayesian methods, full information maximum likelihood, quasi-MLE, and method of moments. One important generalization proposed bythese authors specifies the persistent and transitory efficiency terms as sepa-rate skew-normal errors based on the following parameterization ofColombiet al. (2014)

yit = x′itβ + αi + vit − uit − ηi, (51)

15

Page 16: Econometrics Using R., Handbook of Statistics, C.R. Rao

where εit = αi+vit−uit−ηi is the error structure and where uit and ηi are non-negative random variables that capture firm-specific random effects, randomnoise, short-run technical inefficiency, and long-run technical inefficiency.

Environmental factors can be introduced into the SFA paradigm in anatural way by including them in a vector of variables that is assumed toimpact the mean and the variance of production efficiency. For an excellenttreatment of the problem see Wang and Schmidt (2002), Simar and Wilson(2008) and Kim and Schmidt (2008). The marginal effects of the environ-mental factors, based on Battese and Coelli (1995), can be estimated usingthe R package Frontier (http://cran.r-project.org/web/packages/frontier/)developed by Arne Henningsen.

Various dependency structures have been assessed within the SFA paradigm.An excellent source of recent work on this topic can be found in Kumbhakarand Schmidt (2016).

No one has had more of an impact on the introduction of Bayesian meth-ods into the SFA and DEA paradigms than Mark Steel and Mike Tsionas(Griffin and Steel (2007), Tsionas and Papadakis (2010), and Liu et al.(2017)). The WinBUGS package of Steele is Fortran and Matlab-based opensource code that can be modified to run in R. Nonparametric models in SFAhave been studied in a variety of settings by Park et al. (1998), Adams et al.(1999), Park et al. (2003), Park et al. (2007), among others, and the R codesfor the latter models are discussed in more detail in the last sections of thischapter on specific implementation of open source R codes for SFA and DEA.Experimental R codes that can estimate both parametric and non-parametricstochastic frontiers can also be found in the experimental R-Forge packagemaintained by Arne Henningsen.

4 Envelopment Estimators

In this section we will briefly review another very popular approach in themeasurement and empirical estimation of the efficiency of economic systems(firms, industries, etc.), called Data Envelopment Analysis (DEA).

4.1 The Origins of DEA

In a nutshell, DEA is rooted in and coherent with theoretical economic mod-eling using Activity Analysis Models (AAM) and is estimated via the powerful

16

Page 17: Econometrics Using R., Handbook of Statistics, C.R. Rao

linear programming approach. Its name was branded in the seminal work ofCharnes et al. (1978), who refined and generalized the approach of Farrell(1957) to estimate production efficiency, which in turn was influenced byseminal works of Debreu (1951), Koopmans (1951a,b) and Shephard (1953,1970).

4.2 The Basic DEA Model

In his seminal work, Farrell (1957) focused on the constant returns to scalemodel, with multiple inputs and a single output. About two decades later,Charnes et al. (1978) generalized Farrell’s approach to the multi-output casebut started with a very different formulation–a fractional programming prob-lem formulation with the objective being to optimize the ratio of a weightedaggregate of outputs to a weighted aggregate of inputs (i.e., a kind of produc-tivity index). They then transformed this problem into a linear programming(LP) problem and derived its dual, which turned out to be the (generalizedversion of) AAM proposed by Farrell. Specifically, the formulation of Charneset al. (1978) for estimating the efficiency score of a firm or decision-makingunit (DMU) j ∈ (1, ..., n) with an allocation (xj, yj), states

Eji,CCR = max

v1,...,vN ;u1,...,uM

{ ∑Mm=1 umy

jm∑N

l=1 vlxjl

:∑Mm=1 umy

km∑N

l=1 vlxkl

≤ 1, k = 1, ..., n,

um ≥ 0, vl ≥ 0, l = 1, ..., N ;m = 1, ...,M

}(52)

where u′ = (u1, ..., uM) and v′ = (v1, ..., vN) are optimization variables (alsocalled here ‘multipliers’). This DEA formulation is more popular in theoperations research and management science literature and is often referredto as the ‘CCR model’ or the ‘multiplier form of DEA’ under CRS, additivityand free disposability.

Importantly, Charnes et al. (1978) then showed that (52) is equivalent tothe AAM version of the DEA-estimator of the Farrell input oriented technicalefficiency score of a DMU with an allocation (xj, yj) under the assumptionof CRS (also assuming additivity and free disposability of all inputs and alloutputs), formulated as

17

rsickles
Pencil
Page 18: Econometrics Using R., Handbook of Statistics, C.R. Rao

Ei(xj, yj) ≡ min

λ,z1,...,znλ (53)

s.t.n∑k=1

zkykm ≥ yjm,m = 1, ...,M, (54)

n∑k=1

zkxkl ≤ λxjl , l = 1, ..., N, (55)

λ ≥ 0, zk ≥ 0, k = 1, ..., n, (56)

which is an LP problem that can be solved via any standard LP solver.This formulation is more common in the economics literature (largely dueto its connection to works of Debreu (1951) and Koopmans (1951a,b)), andis often referred to as the envelopment form of DEA under CRS, additivityand free disposability. We will use this envelopment formulation to describeother variants of DEA, though it is useful to keep in mind that there isalso a multiplier form that optimizes what can be viewed as a normalizedproductivity index.

The previous formulation looks at minimization of all inputs, while keep-ing outputs fixed, and hence is called the input orientation. Similarly, theDEA-estimator of the Farrell output oriented technical efficiency score ofany (xj, yj) allocation, under the assumptions of CRS, and additivity andfree disposability of all outputs and all inputs is formulated as

Eo(xj, yj) ≡ max

λ,z1,...,znλ (57)

s.t.n∑k=1

zkykm ≥ λyjm,m = 1, ...,M, (58)

n∑k=1

zkxkl ≤ xjl , l = 1, ..., N, (59)

λ ≥ 0, zk ≥ 0, k = 1, ..., n. (60)

Immediately note that for these formulations we haveEi(x

j, yj) = 1/Eo(xj, yj) for any (x, y), as is required theoretically due to

CRS.

18

Page 19: Econometrics Using R., Handbook of Statistics, C.R. Rao

Also note that the reciprocals of these estimated Farrell efficiency mea-sures also give estimates of the input and output oriented Shephard’s distancefunctions (Shephard (1953, 1970)), under the same assumptions on technol-ogy, i.e., CRS, additivity and free disposability.

Sometimes a researcher may not be interested in fixing only the levelsof inputs or outputs, but may want to simultaneously expand outputs andcontract inputs, thus requiring other orientations. The DEA-estimator of ageneral efficiency measure of any (xj, yj) allocation in such a case (also hereunder the assumptions of CRS and additivity and free disposability of alloutputs and all inputs), can be obtained as follows

GEo(xj, yj) ≡ max

λ1,...,λN ,,θ1,...,θM ,z1,...,znf(λ1, ..., λN , , θ1, ..., θM) (61)

s.t.n∑k=1

zkykm ≥ θmyjm,m = 1, ...,M, (62)

n∑k=1

zkxkl ≤ xjl /λl, l = 1, ..., N, (63)

λ ≥ 0, zk ≥ 0, k = 1, ..., n, (64)

where certain restrictions can be imposed on the objective functionf(λ1, ..., λN , , θ1, ..., θM) and its arguments, depending on the interest of theresearcher. For example, restricting f to be additive (but summing only pos-itive arguments) will give the DEA estimate of the general Russell efficiencymeasure and an additional restriction of θ1 = ... = θM = 1 will turn it intothe input oriented Russell efficiency measure (as was originally introducedby Fare and Lovell (1978)). If one instead restricts λ1 = ... = λN = 1then one obtains the output oriented Russell efficiency measure. Mean-while, restricting f(λ1, ..., λN , θ1, ..., θM) to be multiplicative (a geometricmean) will generate the multiplicative-Russell efficiency measure introducedby Fare et al. (2007). Furthermore, if instead one imposes θ1 = ... = θM = θand λ1 = ... = λN = λ then the DEA estimate of the what has been termedthe general hyperbolic efficiency measure is obtained. This latter measuresometimes also appears with the additional restriction that θ = λ, which im-poses the properties of equiproportional expansion (contraction) of all output(inputs). Note that in general, this last formulation is no longer an LP prob-lem and therefore, it is typically more challenging to estimate.

19

Page 20: Econometrics Using R., Handbook of Statistics, C.R. Rao

Another very general measure, as well as a primal characterization oftechnology, is the directional distance function,4, which can be estimated viathe following DEA formulation (here also under the assumptions of CRS andadditivity and free disposability of all outputs and all inputs):

Dd(xj, yj|dx, dy) ≡ max

λ,z1,...,znλ, (65)

s.t.n∑k=1

zkykm ≥ yjm + λdym ,m = 1, ...,M, (66)

n∑k=1

zkxkl ≤ xjl − λdxl , l = 1, ..., N, (67)

λ ≥ 0, zk ≥ 0, k = 1, ..., n. (68)

In a similar fashion, DEA can be used to model and estimate cost, revenueand profit functions and associated efficiency measures. More details on thiscan be found in Sickles and Zelenyuk (2018).

4.3 The Myriad of DEA models

The approach briefly described in the previous section in the context of differ-ent efficiency measures constitutes the canonical forms of the DEA paradigm.Essentially, all the other versions are modifications or extensions of the mod-els we have outlined. In this section we briefly discuss a few of these modi-fications and extensions.

Oftentimes, extensions are obtained by imposing various additional con-straints onto either the envelopment form or the multiplier form of DEA,with an aim to better mimic the particular actual production process understudy. Such additional restrictions should be imposed with care, since theymay (and often do) affect other desirable properties related to previouslyadded constraints, such as CRS, convexity, free disposability, additivity, etc.

Additional constraints also may create computational complications, e.g.,turning the problem from a linear to a non-linear one or a hybrid problem

4The origins of ideas for this function go back to at least Allais (1943), Diewert (1983),and Luenberger (1992), and were later revived and thoroughly developed by Chamberset al. (1996)

20

Page 21: Econometrics Using R., Handbook of Statistics, C.R. Rao

that may require integer-programming problems. This may result in possiblymaking the problem much harder to compute, possibly infeasible, or may giveinferior local optima or degenerate solutions.

4.3.1 Relaxing Constant Returns to Scale and Convexity

The first wave of extensions of the DEA-CRS model mainly focused on re-laxing assumptions such as CRS and convexity. This line of research waspursued by a number of researchers who extended and enriched the DEAparadigm, among them Afriat (1972); Fare et al. (1983); Banker et al. (1984);Deprins et al. (1984) and Petersen (1990); Bogetoft (1996), to mention a few.

Out of the many modifications and extensions of the canonical DEA prob-lem the several that have sustained the test of time and popularity are theDEA-VRS (variable returns to scale) and the DEA-NIRS (non-increasing re-turns to scale) models, which simply amount to adding additional constraintsin the form of

∑nk=1 zk = 1 or

∑nk=1 zk ≤ 1, respectively, to the DEA-CRS

formulations as described above.Meanwhile, the Free Disposal Hull (FDH) approach, can be implemented

via a hybrid of the linear programming and the integer programming prob-lems, which is formulated in exactly the same way as the DEA-VRS program-ming problem with the exception that the constraints “zk ≥ 0, k = 1, ..., n”are replaced with “zk ∈ {0, 1}, k = 1, ..., n”. For example, using the outputoriented Farrell efficiency with an allocation (xj, yj) the FDH formulation isgiven by

Eo(xj, yj) ≡ max

θθ (69)

s.t.n∑k=1

zkykm ≥ θyjm,m = 1, ...,M, (70)

n∑k=1

zkxkl ≤ xjl , l = 1, ..., N, (71)

n∑k=1

zk = 1, (72)

θ ≥ 0, zk ∈ {0, 1}, k = 1, ..., n. (73)

The value of such formulation is that it hints at the relationship betweenFDH and DEA-VRS: indeed, the DEA-VRS estimated technology set is sim-

21

Page 22: Econometrics Using R., Handbook of Statistics, C.R. Rao

ply the convex closure of the FDH-estimated technology set. Thus, the FDHestimator can be viewed as a special case of the data envelopment analy-sis approach since it also envelopes the data but does so without imposingconvexity. For historical reasons these names are kept separate to avoidconfusion. An alternative yet equivalent form of the FDH estimator can begiven via the min-max problem, which is faster to compute (e.g., see Simarand Wilson (2013) for more details).

Finally, stochastic versions of DEA and FDH are also available (e.g.,Simar (2007); Simar and Zelenyuk (2011)).

4.3.2 Modeling with Undesirable Outputs or with Congesting In-puts

Another important stream of DEA literature has focused on estimating tech-nologies with weak disposability of inputs or (and especially) outputs, toaccount for the fact that some outputs are undesirable (‘bad’) and someinputs can cause congestion.

The ideas for such modeling approaches go back to at least Shephard(1974), and then was elaborated on in Fare and Svensson (1980); Fare andGrosskopf (1983); Grosskopf (1986); Tyteca (1996); Chung et al. (1997),which defined the mainstream approach on this matter. More recently thismainstream approach was re-evaluated in several important works, includ-ing Seiford and Zhu (2002), Fare and Grosskopf (2003, 2004, 2009), Før-sund (2009), Podinovski and Kuosmanen (2011), Pham and Zelenyuk (2018,2017).5

While many proposals have been made, the most popular approach inthis context so far continues to be the mainstream one. In this approach, forexample, if one is interested in measuring the radial expansion of the goodoutputs (g) while having no more inputs (x) and no more of bad outputs (b),then the DEA estimate of the good-output-oriented Farrell technical efficiency

5Also see Dakpo et al. (2017) and Sueyoshi et al. (2017) for reviews of this researchstream.

22

Page 23: Econometrics Using R., Handbook of Statistics, C.R. Rao

(under VRS) is given by

TEg(x, g, b) ≡ maxγ,z1,...,zn,δ

γ (74)

x ≥n∑k=1

zkxk (75)

gγ ≤ δn∑k=1

zkgk, γ ≥ 1 (76)

b = δn∑k=1

zkbk, 0 ≤ δ ≤ 1 (77)

n∑k=1

zk = 1, zk ≥ 0, k = 1, ..., n. (78)

For more theoretical and practical (computational) discussions on thistopic see Pham and Zelenyuk (2017) and references therein.

4.3.3 Other Streams of DEA

Another stream of DEA focuses on accounting for the network structure ofproduction technologies whether static or dynamic. This stream originatedin the seminal works of Fare and Grosskopf (1996); Fare et al. (1996) andwas taken further in many other works, e.g., see Kao (2009a,b, 2014) andreferences therein.

Another important stream of DEA literature is on the topic of weight re-strictions in multiplier form of DEA and the classical works here are by Dysonand Thanassoulis (1988); Charnes et al. (1990); Thompson et al. (1990), withmore recent and fundamental contributions including new interpretations (as‘technological trade-offs’) of various weight restrictions in DEA from Podi-novski and Bouzdine-Chameeva (2013), to mention just a few. Also seereviews on this topic by Allen et al. (1997); Podinovski (2015).

Yet another interesting research stream of DEA overlaps with game the-ory, which also has its roots in the seminal work of von Neumann (1945) andmore explicit treatment, for example, in Hao et al. (2000); Nakabayashi andTone (2006); Liang et al. (2008); Lozano (2012).

23

whson
텍스트 상자
Previous equations (eq. 53-73) are aligned at = or > symbols. But here, equations are right-justified.
Page 24: Econometrics Using R., Handbook of Statistics, C.R. Rao

4.4 Statistical analysis of DEA and FDH

Another key research wave that brought DEA and FDH to a totally differentlevel is related to their statistical aspects—this wave was mainly influencedby the seminal works of Leopold Simar and many of his co-authors.

The first breakthrough in this area was made by Banker (1993), where thefirst proof of consistency of the DEA estimator was sketched for the single-output case (in output oriented context), and was pointed out that it belongsto the class of maximum likelihood estimators. This important discovery wasthen substantially enriched by Korostelev et al. (1995a,b) who proved con-vergence of the estimated technology to the true technology for both DEAand FDH estimators, and derived convergence rates of these estimators, clar-ifying that they depend on the dimension of the production model, yet alsohave some optimality properties under certain conditions.

The convergence properties for the multi-input-multi-output case werefirst presented in the seminal work of Kneip et al. (1998). Meanwhile, thediscovery of the limiting distribution of the DEA estimator was done by Gi-jbels et al. (1999), only for the 1-input-1-output case and a decade later,Kneip et al. (2008) derived it for the fully multivariate case for DEA withVRS and also proved consistency of various bootstrap procedures. The lim-iting distribution for the case of DEA with CRS was established by Parket al. (2010) and the limiting distribution of the FDH estimator for the fullymultivariate case was also derived by Park et al. (2000), while consistencyof the bootstrap for FDH estimator was first presented in Jeong and Simar(2006).

This stream also includes the approach of analyzing the DEA (or FDH)estimated efficiency scores. Perhaps the most popular of these is the so-called ‘two-stage DEA’, which involves regression analysis of efficiency scoreson some factors. The state of the art here is the approach proposed by Simarand Wilson (2007), which is based on truncated regression where the inferenceis done with the help of a double bootstrap.6 A non-parametric version ofthis approach (based on non-parametric truncated regression) was proposedby Park et al. (2008). Furthermore, methods to analyze the distributions ofDEA and FDH efficiency scores were explored in Simar and Zelenyuk (2006),while methods to analyze industry efficiency were explored by Simar andZelenyuk (2007). These approaches were applied in various contexts andindustries, e.g., Zelenyuk and Zheka (2006), Demchuk and Zelenyuk (2009),

6Also see Simar and Wilson (2011) for the discussion on caveats and limitations.

24

Page 25: Econometrics Using R., Handbook of Statistics, C.R. Rao

Curi et al. (2015), Chowdhury and Zelenyuk (2016), Du et al. (2018), tomention a few.

A particularly notorious drawback of DEA and FDH—not allowing fornoise and sensitivity to ‘super-efficient’ outliers—was addressed by Simar(2007); Simar and Zelenyuk (2011), who proposed their version of StochasticDEA and Stochastic FDH, which consists of two stages: (i) filter the datafrom the noise using a non-parametric stochastic frontier method7 and then(ii) use DEA or FDH on the filtered data.

More recently, Kneip et al. (2015) derived new central limit theoremsfor the context where DEA or FDH estimates are used in place of the trueefficiency and thus provided the foundation for many useful statistical testsinvolving DEA or FDH estimators, including the two-stage ‘DEA+regression’context. This foundation was then used by Kneip et al. (2016); Daraio et al.(2017b) to develop various statistical tests and by Simar and Zelenyuk (2017)to develop two new central limit theorems for the aggregate efficiency scores(industry efficiency, etc.) of the type described above and more work contin-ues in this area.

5 SFA Efficiency Software in R

Reproducing the R code needed to implement the methods we have discussedin our chapter is not feasible and thus we provide a short tutorial on how touse a suite of estimators that can easily be accessed via the website “Produc-tivity in R”that can be found as https://sites.google.com/site/productivityinr/.We use standard notations in the codes and let n be the number of firms,T the number of time series and nT the total number of observations, i.e.,nT = n× T . The data input files configured with the first column a nT × 1column vector of the dependent variable y, and the next k containing nT × 1vectors of the independent variables contained in x′. The convention used isfor the first T observations to be for the first cross-section, the second T ob-servations for the second cross-section and so on. The default program output(‘results.out’ is the default output filename) contains parameter estimates,standard errors, t-values, average technical efficiency, correlation of effectsand efficiencies, Spearman rank order correlation of effects and efficiencies,

7While originally Simar and Zelenyuk (2011) considered the approach of Kumbhakaret al. (2007) for the first stage, one could use other non-parametric SFA approaches, e.g.a more general version proposed by Park et al. (2015) or Simar et al. (2017).

25

rsickles
Highlight
rsickles
Pencil
Page 26: Econometrics Using R., Handbook of Statistics, C.R. Rao

R-squared and adjusted R-squared.

5.1 Basic Model Setup

The default model is the panel data model we introduced in section 3, whichwe rewrite here:

yit = α + x′itβ − uit + vit, (79)

where the global mean (α) is subtracted out in the regression and where thisdemeaning method is applied to the suite of estimators discussed in SS andCSS as well as for the Battese and Coelli (1992) (BC) estimator. Timetrends can be added as well to account for disembodied technical changethat is available for adoption by all firms and various right-hand-side (rhs)variables that may be correlated with the effects can be identified in the mainR file. Efficiencies can also be averaged over different estimators using modelaveraging weights based on a simple average (AVE=0), AIC weights (AVE=1),or BIC weights (AVE=2). Outliers can be addressed by trimming (this doesnot apply to the BC and the DEA estimators).

5.2 Figures and Tables

Figures and tables are selected by setting values of 1 to print and 0 to skipthe results. There are a number of figures that are already set up to print.For example, when fig1=1 a figure for the average of efficiencies of the time-variant estimators is printed. When fig2=1 one for the efficiencies from thetime-variant estimators is printed. When fig3=1 a figure for the average ofefficiencies from all estimators is printed, while fig4=1 the weighted averageof the efficiencies is printed. tab1=1 prints a table for the average of efficien-cies from the time-variant estimators, tab2=1 prints one for the efficienciesfrom the time-variant estimators are printed and tab3=1 prints a table ofthe individual effects from all estimators. Also, setting firmeff=1 savesthe efficiencies of each individual firm for each estimator utilized.

5.3 Different Estimators

The R codes for the SFA models include the fixed effect, random effects, andthe Hausman-Taylor (HT) version of the stochastic panel frontier models ofSchmidt and Sickles (1984). The FR and the HT global parameters are set

26

Page 27: Econometrics Using R., Handbook of Statistics, C.R. Rao

to determine which model is estimated. Since the HT estimator of the panelstochastic frontier allows for selected regressors to be correlated with the ef-ficiency effects term the global option k1 is set at the number of variablesin X not correlated with the efficiency effects and are loaded into the firstk1 columns for the regressors. These different models are referred to as FIX,RND, and HT. The global options PSS1, PSS2, PSS3 designate the estimatorsfor the Park, Sickles and Simar (1998, 2003, 2007) models. The value of 1prints the results and 0 skips the results. Bandwidth selection is based onleave-one-out least squares cross-validation. The CSS global option allows forthe estimates of the Cornwell et al. (1990) models to be generated. A globaloption of CSS=1 prints results of the Fixed Effects/Within estimator (CSSW)estimator, while CSS=2 generated the GLS random effects estimator. Theefficient IV estimator (analogous to the HT estimator for the SS Model) isengaged with CSS=3. Finally, CSS=4 prints out all four model results. Timeinvariant variables should be placed at the end of X. The parameter zp isset equal to the number of time invariant variables. The Kneip, Sickles andSong (2012) factor-model is called by setting the global parameter KSS=1.

The KSS estimator also finds the number of factors and the code varies thenumber of factors from Lmax to Lmin and finds the first highest number atwhich the dimensionality test is not rejected. Cross-validation is used toestimate the optimal smoothing parameter. gr_st is the starting point ofthe grid search, gr_in is the increment in the grid search, and gr_en is theend point of the grid search. The Battese and Coelli (1992) model is calledby setting the BC option=1. The bounded inefficiency model of Almanidis,Qian, and Sickles (2014) is called by setting the BIE option=1. Differentdistributions for the lower bound on inefficiency can also be specified usingthe bie_dist parameter. For a bie_dist=0 the truncated is specified,while bie_dist=1 uses the truncated half normal, and bie_dist=2 the dou-bly truncated normal distribution. Finally, the Kutlu (2018) endogeneitycorrection for selected regressors (e.g., input levels in the production fron-tier) based on the CSS estimator is also available on this website along withinstructions for its use.

6 DEA Efficiency Software in R

The Jeon and Sickles (2004) model is the directional distance function methodoutlined in the section on DEA estimators. This estimator also allows us to

27

Page 28: Econometrics Using R., Handbook of Statistics, C.R. Rao

address the presence of undesirable outputs. Jeon and Sickles (JS) usedthis DEA estimator to examine the productivity effects of controlling forcarbon dioxide emissions on productivity growth using Mamlquist indexes.The JS estimator uses OECD data while Efficiency software uses UNIODand Bank data. The JS R codes are in a separate Jeon Sickles 2004.zipfile on the website. The default results that are printed by the R codes areproductivity growth, efficiency change, and technology change and confidenceintervals for the growth decompositions are based on the Simar and Wilson(2007) bootstrapping method. Next, we have the Simar and Zelenyuk (2006)model, which implements the Li (1996) test in the context of comparingdistributions of efficiency estimated via DEA and the Simar and Zelenyuk(2007) model that constructs confidence intervals and bias corrections forDEA-estimated aggregate efficiencies of a set of firms. It also provides a testfor the comparison of these group efficiencies.

There are other open source R code platforms other than the one thatwe have focused on in our discussions so far. The Benchmarking packagein R prepared by Peter Bogetoft and Lars Otto is one such program andthe manual and other directions for its use can be found at https://cran.r-project.org/web/packages/Benchmarking/Benchmarking.pdf. This can beused to estimate both DEA and FDH models. There have been many otherprograms developed by Simar and his colleagues in Matlab and these canbe compiled into R using Matlab-to-R converters. One such package, the‘matconv’ package by Siddarta Jairam is a useful conversion tool, althoughsome testing for accuracy is always advised for such automatic translations.8

Many of these Matlab programs can be found athttps://sites.google.com/site/productivityefficiency.9 Finally, Daraio et al.(2017a) provide a survey on a variety of software platforms that can estimateSFA and DEA models, including a number with R coding.

8See https://cran.r-project.org/web/packages/matconv/matconv.pdf for more details.9Arne Henningsen has provided the R conversion to a number of Fortran-based pro-

grams (FRONTIER) developed by Tim Coelli as well as a set of new programs in hisA Package for Stochastic Frontier Analysis (SFA) in R. This platform is available athttps://cran.r-project.org/.

28

whson
강조
whson
텍스트 상자
UNIDO. Sorry for the typo!
rsickles
Pencil
Page 29: Econometrics Using R., Handbook of Statistics, C.R. Rao

7 Summary and Final Remarks

This chapter has discussed a number of statistical and programming tech-niques used to evaluate the production efficiency of economic units, whetherthey be firms, sectors, or countries. We have discussed the methods mostwidely used for these sorts of evaluations and have provided references andurl’s to the most up to date websites that provide the R code to implementthese methods. We trust that the interested reader will find our discussionsand the software helpful for the purposes of practical evaluations of the per-formance of business entities as well as in their academic research. We havealso provided an up to date set of references that productivity and efficiencyresearchers can use for extended and deeper readings on these widely usedand adopted performance benchmarkingu; methods.

29

Page 30: Econometrics Using R., Handbook of Statistics, C.R. Rao

References

Adams, R. M., Berger, A. N., Sickles, R. C., 1999. Semiparametric ap-proaches to stochastic panel frontiers with applications in the bankingindustry. Journal of Business and Economic Statistics 17 (3), 349–358.

Afriat, S. N., 1972. Efficiency estimation of production functions. Interna-tional Economic Review 13 (3), 568–598.

Agrell, P. J., Bogetoft, P., et al., 2017. Regulatory benchmarking: Models,analyses and applications. Data Envelopment Analysis Journal 3 (1–2),49–91.

Ahn, S. C., Lee, Y. H., Schmidt, P., 2007. Stochastic frontier models withmultiple time-varying individual effects. Journal of Productivity Analysis27 (1), 1–12.

Ahn, S. C., Lee, Y. H., Schmidt, P., 2013. Panel data models with multipletime-varying individual effects. Journal of Econometrics 174 (1), 1–14.

Aigner, D., Lovell, C., Schmidt, P., 1977. Formulation and estimation ofstochastic frontier production function models. Journal of Econometrics6 (1), 21–37.

Allais, M., 1943. Traite D’Economie Pure. Vol. 3. Paris, FR: ImprimerieNationale.

Allen, R., Athanassopoulos, A., Dyson, R. G., Thanassoulis, E., 1997.Weights restrictions and value judgements in data envelopment analysis:Evolution, development and future directions. Annals of Operations Re-search 73, 13–34.

Almanidis, P., Qian, J., Sickles, R. C., 2014. Stochastic frontier with boundedinefficiency. In: Sickles, R. C., Horrace, W. C. (Eds.), Festschrift in Honorof Peter Schmidt: Econometric Methods and Applications. New York, NY:Springer, pp. 47–82.

Almanidis, P., Sickles, R. C., 2012. The skewness problem in stochastic fron-tier models: Fact or fiction? In: Van Keilegom, I., Wilson, P. W. (Eds.),Exploring Research Frontiers in Contemporary Statistics and Economet-rics: A Festchrift in Honor of Leopold Simar. New York, NY: Springer, pp.201–227.

30

Page 31: Econometrics Using R., Handbook of Statistics, C.R. Rao

Bada, O., Liebl, D., 2014. The r-package phtt: Panel data analysis withheterogeneous time trends. Journal of Statistical Software 59 (6), 1–33.

Bai, J., 2009. Panel data models with interactive fixed effects. Econometrica77 (4), 1229–1279.

Bai, J., Ng, S., 2002. Determining the number of factors in approximatefactor models. Econometrica 70(1), 191–221.

Banker, R. D., 1993. Maximum likelihood, consistency and data envelopmentanalysis: A statistical foundation. Management Science 39 (10), 1265–1273.

Banker, R. D., Charnes, A., Cooper, W. W., 1984. Some models for es-timating technical and scale inefficiencies in data envelopment analysis.Management Science 30 (9), 1078–1092.

Battese, G., Coelli, T., 1992. Frontier production functions, technical effi-ciency and panel data: With application to paddy farmers in India. Journalof Productivity Analysis 3 (1–2), 153–169.

Battese, G. E., Coelli, T. J., 1995. A model for technical inefficiency effectsin a stochastic frontier production function for panel data. Empirical eco-nomics 20 (2), 325–332.

Bauer, P. W., Hancock, D., 1993. The efficiency of the Federal Reserve inproviding check processing services. Journal of Banking & Finance 17 (2-3),287–311.

Bloom, N., Eifert, B., Mahajan, A., McKenzie, D., Roberts, J., 2013. Doesmanagement matter? Evidence from India. The Quarterly Journal of Eco-nomics 128 (1), 1–51.

Bloom, N., Genakos, C., Sadun, R., Van Reenen, J., 2012. Management prac-tices across firms and countries. The Academy of Management Perspectives26 (1), 12–33.

Bogetoft, P., 1996. DEA on relaxed convexity assumptions. ManagementScience 42 (3), 457–465.

Bogetoft, P., 2013. Performance benchmarking: Measuring and managingperformance. Springer Science & Business Media.

31

Page 32: Econometrics Using R., Handbook of Statistics, C.R. Rao

Chambers, R., Fare, R., Grosskopf, S., 1996. Productivity growth in APECcountries. Pacific Economic Review 1 (3), 181–190.

Charnes, A., Cooper, W., Rhodes, E., 1978. Measuring the efficiency of deci-sion making units. European Journal of Operational Research 2 (6), 429–444.

Charnes, A., Cooper, W. W., Huang, Z. M., Sun, D. B., 1990. Polyhedralcone-ratio DEA models with an illustrative application to large commercialbanks. Journal of econometrics 46 (1-2), 73–91.

Chowdhury, H., Zelenyuk, V., 2016. Performance of hospital services in On-tario: DEA with truncated regression approach. Omega 63, 111–122.

Chung, Y. H., Fare, R., Grosskopf, S., 1997. Productivity and undesirableoutputs: A directional distance function approach. Journal of Environmen-tal Management 51 (3), 229–240.

Colombi, R., Kumbhakar, S. C., Martini, G., Vittadini, G., 2014. Closed-skewnormality in stochastic frontiers with individual effects and long/short-runefficiency. Journal of Productivity Analysis 42 (2), 123–136.

Colombi, R., Martini, G., Vittadini, G., 2011. A stochastic frontier modelwith short-run and long-run inefficiency random effects, university of Berg-amo Department of Economics and Technology Management Working Pa-per No. 012011.

Cornwell, C., Schmidt, P., Sickles, R. C., 1990. Production frontiers withcross-sectional and time-series variation in efficiency levels. Journal ofEconometrics 46 (1), 185–200.

Curi, C., Lozano-Vivas, A., Zelenyuk, V., 2015. Foreign bank diversificationand efficiency prior to and during the financial crisis: Does one businessmodel fit all? Journal of Banking and Finance 61 (S1), S22–S35.

Dakpo, K. H., Jeanneaux, P., Latruffe, L., 2017. Modelling pollution-generating technologies in performance benchmarking: Recent develop-ments, limits and future prospects in the nonparametric framework. Euro-pean Journal of Operational Research 250 (2), 347–359.

32

Page 33: Econometrics Using R., Handbook of Statistics, C.R. Rao

Daraio, C., Kerstens, K. H., Nepomuceno, T. C. C., Sickles, R., 2017a. Pro-ductivity and efficiency analysis software: An exploratory bibliographicalsurvey of the options. Journal of Economic Surveys.

Daraio, C., Simar, L., Wilson, P. W., 2017b. Central limit theorems forconditional efficiency measures and tests of the “separability” condition innonparametric, two-stage models of production. The Econometrics Jour-nal.

Debreu, G., 1951. The coefficient of resource utilization. Econometrica 19 (3),273–292.

Demchuk, P., Zelenyuk, V., 2009. Testing differences in efficiency of regionswithin a country: The case of Ukraine. Journal of Productivity Analysis32 (2), 81–102.

Deprins, D., Simar, L., Tulkens, H., 1984. Measuring labour efficiency inpost offices. In: Marchand, M., Pestieau, P., Tulkens, H. (Eds.), The Per-formance of Public Enterprises: Concepts and Measurement. Amsterdam,NL: Springer, pp. 243–267.

DeYoung, R., 1998. Management quality and x-inefficiency in national banks.Journal of Financial Services Research 13 (1), 5–22.

Diewert, W. E., 1983. The measurement of waste within the production sectorof an open economy. The Scandinavian Journal of Economics 85 (2), 159–179.

Du, K., Worthington, A. C., Zelenyuk, V., 2018. Data envelopment analysis,truncated regression and double-bootstrap for panel data with applicationto Chinese banking. European Journal of Operational Research 265 (2),748 – 764.

Dyson, R. G., Thanassoulis, E., 1988. Reducing weight flexibility in dataenvelopment analysis. Journal of the Operational Research Society 39 (6),563–576.

Fare, R., Grosskopf, S., 1983. Measuring congestion in production. Zeitschriftfur Nationalokonomie / Journal of Economics 43 (3), 257–271.

33

Page 34: Econometrics Using R., Handbook of Statistics, C.R. Rao

Fare, R., Grosskopf, S., 1996. Intertemporal Production Frontiers: WithDynamic DEA. Norwell, MA: Kluwer Academic Publishers.

Fare, R., Grosskopf, S., 2003. Nonparametric productivity analysis with un-desirable outputs: Comment. American Journal of Agricultural Economics85 (4), 1070–1074.

Fare, R., Grosskopf, S., 2004. Modeling undesirable factors in efficiency eval-uation: Comment. European Journal of Operational Research 157 (1),242–245.

Fare, R., Grosskopf, S., 2009. A comment on weak disposability in nonpara-metric production analysis. American Journal of Agricultural Economics91 (2), 535–538.

Fare, R., Grosskopf, S., Logan, J., 1983. The relative efficiency of Illinoiselectric utilities. Resources and Energy 5 (4), 349–367.

Fare, R., Grosskopf, S., Roos, P., 1996. On two definitions of productivity.Economics Letters 53 (3), 269–274.

Fare, R., Grosskopf, S., Zelenyuk, V., 2007. Finding common ground: Effi-ciency indices. In: Fare, R., Grosskopf, S., Primont, D. (Eds.), Aggrega-tion, Efficiency and Measurement. Boston, MA: Springer, pp. 83–95.

Fare, R., Lovell, C. K., 1978. Measuring the technical efficiency of production.Journal of Economic Theory 19 (1), 150–162.

Fare, R., Svensson, L., 1980. Congestion of production factors. Econometrica:Journal of the Econometric Society 48 (7), 1745–1753.

Farrell, M. J., 1957. The measurement of productive efficiency. Jornal of theRoyal Statistical Society. Series A (General) 120 (3), 253–290.

Førsund, F. R., 2009. Good modelling of bad outputs: Pollution and multiple-output production. International Review of Environmental and ResourceEconomics 3 (1), 1–38.

Foster, L., Haltiwanger, J., Syverson, C., 2008. Reallocation, firm turnover,and efficiency: Selection on productivity or profitability? American Eco-nomic Review 98 (1), 394–425.

34

Page 35: Econometrics Using R., Handbook of Statistics, C.R. Rao

Frantz, R. S., 1997. X-Efficiency: Theory, Evidence and Applications, 2ndEdition. Norwell, MA: Kluwer Academic Publishers.

Frantz, R. S., 2007. Empirical evidence on X-efficiency, 1967-2004. In: Frantz,R. (Ed.), Renaissance in Behavioral Economics: Essays in Honour of Har-vey Leibenstein. New York, NY: Routledge, pp. 221–227.

Fu, X. M., Heffernan, S., 2009. The effects of reform on China’s bank struc-ture and performance. Journal of Banking & Finance 33 (1), 39–52.

Gijbels, I., Mammen, E., Park, B. U., Simar, L., 1999. On estimation ofmonotone and concave frontier functions. Journal of the American Statis-tical Association 94 (445), 220–228.

Glaister, K. W., 2014. The contribution of management to economic growth:A review. Prometheus 32 (3), 227–244.

Greene, W. H., 1980a. Maximum likelihood estimation of econometric frontierfunctions. Journal of Econometrics 13 (1), 27–56.

Greene, W. H., 1980b. On the estimation of a flexible frontier productionmodel. Journal of Econometrics 13 (1), 101–115.

Greene, W. H., 2005a. Fixed and random effects in stochastic frontier models.Journal of Productivity Analysis 23 (1), 7–32.

Greene, W. H., 2005b. Reconsidering heterogeneity in panel data estimatorsof the stochastic frontier model. Journal of Econometrics 126 (2), 269–303.

Grifell-Tatje, E., Lovell, C. A. K., 2013. Advances in cost frontier analysis ofthe firm. In: Thomas, C. R., Shughart, W. (Eds.), The Oxford Handbookof Managerial Economics. London, UK: Oxford University Press, pp. 1–18.

Grifell-Tatje, E., Lovell, C. A. K., 2015. Productivity Accounting. New York,NY: Cambridge University Press.

Grifell-Tatje, E., Lovell, C. K., Sickles, R. C., 2018. The Oxford Handbookof Productivity Analysis. Oxford University Press.

Griffin, J. E., Steel, M. F., 2007. Bayesian stochastic frontier analysis usingwinbugs. Journal of Productivity Analysis 27 (3), 163–176.

35

Page 36: Econometrics Using R., Handbook of Statistics, C.R. Rao

Grosskopf, S., 1986. The role of the reference technology in measuring pro-ductive efficiency. The Economic Journal 96 (382), 499–513.

Hall, R. E., Jones, C. I., 1999. Why do some countries produce so muchmore output per worker than others? The Quarterly Journal of Economics114 (1), 83–116.

Hao, G., Wei, Q. L., Yan, H., 2000. A game theoretical model of DEA ef-ficiency. The Journal of the Operational Research Society 51 (11), 1319–1329.

Hsieh, C.-T., Klenow, P. J., 2009. Misallocation and manufacturing TFP inChina and India. The Quarterly Journal of Economics 124 (4), 1403–1448.

Jeon, B. M., Sickles, R. C., 2004. The role of environmental factors in growthaccounting. Journal of Applied Econometrics 19 (5), 567–591.

Jeong, S.-O., Simar, L., 2006. Linearly interpolated FDH efficiency score fornonconvex frontiers. Journal of Multivariate Analysis 97 (10), 2141–2161.

Jiang, C., Yao, S., Zhang, Z., 2009. The effects of governance changes onbank efficiency in China: A stochastic distance function approach. ChinaEconomic Review 20 (4), 717–731.

Jondrow, J., Lovell, C. A. K., Materov, I. S., Schmidt, P., 1982. On theestimation of technical inefficiency in the stochastic frontier productionfunction model. Journal of Econometrics 19 (2–3), 233–238.

Kao, C., 2009a. Efficiency decomposition in network data envelopment analy-sis: A relational model. European Journal of Operational Research 192 (3),949–962.

Kao, C., 2009b. Efficiency measurement for parallel production systems. Eu-ropean Journal of Operational Research 196 (3), 1107–1112.

Kao, C., 2014. Network data envelopment analysis: A review. EuropeanJournal of Operational Research 239 (1), 1–16.

Kim, M., Schmidt, P., 2008. Valid tests of whether technical inefficiencydepends on firm characteristics. Journal of Econometrics 144 (2), 409–427.

36

Page 37: Econometrics Using R., Handbook of Statistics, C.R. Rao

Kneip, A., Park, B. U., Simar, L., 1998. A note on the convergence of non-parametric DEA estimators for production efficiency scores. EconometricTheory 14 (6), 783–793.

Kneip, A., Sickles, R. C., Song, W., 2012. A new panel data treatment forheterogeneity in time trends. Econometric Theory 28 (3), 590–628.

Kneip, A., Simar, L., Wilson, P. W., 2008. Asymptotics and consistent boot-straps for DEA estimators in nonparametric frontier models. EconometricTheory 24 (6), 1663–1697.

Kneip, A., Simar, L., Wilson, P. W., 2015. When bias kills the variance:Central limit theorems for DEA and FDH efficiency scores. EconometricTheory 31 (2), 394–422.

Kneip, A., Simar, L., Wilson, P. W., 2016. Testing hypotheses in nonpara-metric models of production. Journal of Business & Economic Statistics34 (3), 435–456.

Koopmans, T., 1951a. Activity Analysis of Production and Allocation. NewYork, NY: Wiley.

Koopmans, T. C., 1951b. Analysis of production as an efficienct combinationof activities. Activity Analysis of Production and Allocation 13, 33–37.

Korostelev, A., Simar, L., Tsybakov, A. B., 1995a. Efficient estimation ofmonotone boundaries. Annals of Statistics 23 (2), 476–489.

Korostelev, A., Simar, L., Tsybakov, A. B., 1995b. On estimation of mono-tone and convex boundaries. Publications de l’Institut de Statistique desUniversites de Paris 39 (1), 3–18.

Kumbhakar, S., Schmidt, P., 2016. Endogeneity Problems in Econometrics,Special Issue of the Journal of Econometrics. Vol. 190. Amsterdam, Nether-lands: Elsevier.

Kumbhakar, S. C., 1990. Production frontiers, panel data, and time-varyingtechnical inefficiency. Journal of Econometrics 46 (1), 201–211.

Kumbhakar, S. C., Park, B. U., Simar, L., Tsionas, E. G., 2007. Nonparamet-ric Stochastic Frontiers: A Local Maximum Likelihood Approach. Journalof Econometrics 137 (1), 1–27.

37

Page 38: Econometrics Using R., Handbook of Statistics, C.R. Rao

Kutlu, L., 2018. A distribution-free stochastic frontier model with endogenousregressors. Economics Letters 163, 152–154.

Kwan, S. H., 2006. The X-efficiency of commercial banks in Hong Kong.Journal of Banking & Finance 30 (4), 1127–1147.

Lee, Y. H., 1991. Panel data models with multiplicative individual and timeeffects: Applications to compensation and frontier production functions.Ph.D. thesis, Michigan State University, East Lansing, MI, USA.

Lee, Y. H., Schmidt, P., 1993. A production frontier model with flexible tem-poral variation in technical efficiency. In: Fried, H. O., Lovell, C. A. K.,Schmidt, S. S. (Eds.), The Measurement of Productive Efficiency: Tech-niques and Applications. New York, NY: Oxford University Press, pp.237–255.

Leibenstein, H., 1966. Allocative efficiency vs. “X-efficiency”. American Eco-nomic Review 56 (3), 392–415.

Leibenstein, H., 1975. Aspects of the X-efficiency theory of the firm. The BellJournal of Economics 6 (2), 580–606.

Leibenstein, H., 1987. Inside the Firm: The Inefficiencies of Hierarchy. Cam-bridge, MA: Harvard University Press.

Li, Q., 1996. Nonparametric testing of closeness between two unknown dis-tribution functions. Econometric Reviews 15 (3), 261–274.

Liang, L., Wu, J., Cook, W. D., Zhu, J., 2008. The DEA game cross-efficiencymodel and its Nash equilibrium. Operations Research 56 (5), 1278–1288.

Lieberman, M. B., Lau, L. J., Williams, M. D., 1990. Firm-level productivityand management influence: A comparison of US and Japanese automobileproducers. Management Science 36 (10), 1193–1215.

Liu, J., Sickles, R. C., Tsionas, E. G., 2017. Bayesian treatments to paneldata models with time-varying heterogeneity. Econometrics 5 (33), 1–21.

Lozano, S., 2012. Information sharing in DEA: A cooperative game theoryapproach. European Journal of Operational Research 222 (3), 558–565.

38

rsickles
Pencil
Page 39: Econometrics Using R., Handbook of Statistics, C.R. Rao

Luenberger, D. G., 1992. Benefit functions and duality. Journal of Mathe-matical Economics 21 (5), 461–481.

Materov, I. S., 1981. On full identification of the stochastic production fron-tier model. Ekonomika i Matematicheskie Metody 17, 784–788.

Meeusen, W., van den Broeck, J., June 1977. Efficiency estimation fromCobb-Douglas production functions with composed error. InternationalEconomic Review 18 (2), 435–44.

Mester, L. J., 1993. Efficiency in the savings and loan industry. Journal ofBanking & Finance 17 (2–3), 267–286.

Nakabayashi, K., Tone, K., 2006. Egoist’s dilemma: a DEA game. Omega34 (2), 135–148.

Nallari, R., Bayraktar, N., 2010. Micro efficiency and macro growth, theWorld Bank Policy Research Working Paper No. 5267.

Olson, J. A., Schmidt, P., Waldman, D. M., 1980. A Monte Carlo study ofestimators of the stochastic frontier production function. Journal of Econo-metrics 13 (1), 67–82.

Park, B. U., Jeong, S.-O., Simar, L., 2010. Asymptotic distribution of conical-hull estimators of directional edges. Annals of Statistics 38 (3), 1320–1340.

Park, B. U., Sickles, R. C., Simar, L., 1998. Stochastic panel frontiers: Asemiparametric approach. Journal of Econometrics 84 (2), 273–301.

Park, B. U., Sickles, R. C., Simar, L., 2003. Semiparametric-efficient es-timation of AR(1) panel data models. Journal of Econometrics 117 (2),279–309.

Park, B. U., Sickles, R. C., Simar, L., 2007. Semiparametric efficient esti-mation of dynamic panel data models. Journal of Econometrics 136 (1),281–301.

Park, B. U., Simar, L., Weiner, C., 2000. The FDH estimator for productivityefficiency scores: Asymptotic properties. Econometric Theory 16 (6), 855–877.

39

whson
강조
whson
텍스트 상자
delete June
rsickles
Pencil
Page 40: Econometrics Using R., Handbook of Statistics, C.R. Rao

Park, B. U., Simar, L., Zelenyuk, V., 2008. Local likelihood estimation oftruncated regression and its partial derivatives: Theory and application.Journal of Econometrics 146 (1), 185–198.

Park, B. U., Simar, L., Zelenyuk, V., 2015. Categorical data in local maxi-mum likelihood: Theory and applications to productivity analysis. Journalof Productivity Analysis 43 (2), 199–214.

Petersen, N. C., 1990. Data envelopment analysis on a relaxed set of assump-tions. Management Science 36 (3), 305–314.

Pham, M. D., Zelenyuk, V., 2017. Convexity, disposability and returns toscale in production analysis, cEPA Working Paper No. WP04/2017.

Pham, M. D., Zelenyuk, V., 2018. Slack-based directional distance functionin the presence of bad outputs: Theory and application to Vietnamesebanking. Empirical Economics 54 (1), 153–187.

Pitt, M. M., Lee, L.-F., 1981. The measurement and sources of technicalinefficiency in the Indonesian weaving industry. Journal of DevelopmentEconomics 9 (1), 43–64.

Podinovski, V. V., 2015. Dea models with production trade-offs and weightrestrictions. In: Zhu, J. (Ed.), ). Data Envelopment Analysis: A Handbookof Models and Methods. New York, NY: Springer, pp. 105–144.

Podinovski, V. V., Bouzdine-Chameeva, T., 2013. Weight restrictions andfree production in data envelopment analysis. Operations Research 61 (2),426–437.

Podinovski, V. V., Kuosmanen, T., 2011. Modelling weak disposability indata envelopment analysis under relaxed convexity assumptions. EuropeanJournal of Operational Research 211 (3), 577–585.

Qian, J., Sickles, R. C., 2008. Stochastic frontiers with bounded inefficiency,unpublished manuscript, Shanghai Jiao Tong University, Shanghai, CN.

Rezvanian, R., Ariss, R. T., Mehdian, S. M., 2011. Cost efficiency, tech-nological progress and productivity growth of Chinese banking pre-andpost-WTO accession. Applied Financial Economics 21 (7), 437–454.

40

Page 41: Econometrics Using R., Handbook of Statistics, C.R. Rao

Schmidt, P., Sickles, R. C., 1984. Production frontiers and panel data. Jour-nal of Business & Economic Statistics 2 (4), 367–374.

Seiford, L. M., Zhu, J., 2002. Modeling undesirable factors in efficiency eval-uation. European Journal of Operational Research 142 (1), 16–20.

Shephard, R. W., 1953. Cost and Production Functions. Princeton, NJ:Princeton University Press.

Shephard, R. W., 1970. Theory of Cost and Production Functions. Princetonstudies in mathematical economics. Princeton, NJ: Princeton UniversityPress.

Shephard, R. W., 1974. Indirect production functions. In: MathematicalSystems in Economics. Vol. 10. Hain, Meisenheim am Glan.

Sickles, R., Zelenyuk, V., 2018. Measurement of Productivity and Efficiency:Theory and Practice. New York, NY: Cambridge University Press, forth-coming.

Simar, L., 2007. How to improve the performances of DEA/FDH estimatorsin the presence of noise? Journal of Productivity Analysis 28 (3), 183–201.

Simar, L., Van Keilegom, I., Zelenyuk, V., 2017. Nonparametric least squaresmethods for stochastic frontier models. Journal of Productivity Analysis47 (3), 189–204.

Simar, L., Wilson, P. W., 2007. Estimation and inference in two-stage,semi-parametric models of production processes. Journal of Econometrics136 (1), 31–64.

Simar, L., Wilson, P. W., 2008. Statistical inference in nonparametric frontiermodels: Recent developments and perspectives. In: Harold Fried, C. A.K. L., Schmidt, S. (Eds.), The Measurement of Productive Efficiency, 2ndEdition. New York, NY: Oxford University Press, pp. 421–521.

Simar, L., Wilson, P. W., 2011. Two-stage DEA: Caveat emptor. Journal ofProductivity Analysis 36 (2), 205–218.

Simar, L., Wilson, P. W., 2013. Estimation and inference in nonparametricfrontier models: Recent developments and perspectives. Foundations andTrends in Econometrics 5 (3–4), 183–337.

41

Page 42: Econometrics Using R., Handbook of Statistics, C.R. Rao

Simar, L., Zelenyuk, V., 2006. On testing equality of distributions of technicalefficiency scores. Econometric Reviews 25 (4), 497–522.

Simar, L., Zelenyuk, V., 2007. Statistical inference for aggregates of Farrell-type efficiencies. Journal of Applied Econometrics 22 (7), 1367–1394.

Simar, L., Zelenyuk, V., 2011. Stochastic FDH/DEA estimators for frontieranalysis. Journal of Productivity Analysis 36 (1), 1–20.

Simar, L., Zelenyuk, V., 2017. Central limit theorems for aggregate efficiency.Operations ResearchForthcoming.

Stevenson, R. E., 1980. Likelihood functions for generalized stochastic fron-tier estimation. Journal of Econometrics 13 (1), 57–66.

Sueyoshi, T., Yuan, Y., Goto, M., 2017. A literature study for DEA appliedto energy and environment. Energy Economics 62, 104–124.

Thompson, R. G., Langemeier, L. N., Lee, C.-T., Lee, E., Thrall, R. M.,1990. The role of multiplier bounds in efficiency analysis with applicationto Kansas farming. Journal of Econometrics 46 (1-2), 93–108.

Tsionas, E. G., Kumbhakar, S. C., 2014. Firm heterogeneity, persistent andtransient technical inefficiency: A generalized true random-effects model.Journal of Applied Econometrics 29 (1), 110–132.

Tsionas, E. G., Papadakis, E. N., 2010. A bayesian approach to statisticalinference in stochastic dea. Omega 38 (5), 309–314.

Tyteca, D., 1996. On the measurement of the environmental performance offirms—a literature review and a productive efficiency perspective. Journalof Environmental Management 46 (3), 281–308.

von Neumann, J., 1945. A model of general equilibrium. Review of EconomicStudies 13 (1), 1–9.

Wang, H.-J., Schmidt, P., 2002. One-step and two-step estimation of theeffects of exogenous variables on technical efficiency levels. Journal of Pro-ductivity Analysis 18 (2), 129–144.

42

whson
강조
whson
텍스트 상자
Research, Forthcoming
rsickles
Pencil
Page 43: Econometrics Using R., Handbook of Statistics, C.R. Rao

Yao, S., Han, Z., Feng, G., 2008. Ownership reform, foreign competition andefficiency of Chinese commercial banks: A non-parametric approach. TheWorld Economy 31 (10), 1310–1326.

Zelenyuk, V., Zheka, V., 2006. Corporate governance and firm’s efficiency:The case of a transitional country, Ukraine. Journal of Productivity Anal-ysis 25 (1), 143–157.

43