bid forecasting in public procurementkth.diva-portal.org/smash/get/diva2:1354986/fulltext01.pdf ·...

IN THE FIELD OF TECHNOLOGYDEGREE PROJECT INDUSTRIAL ENGINEERING AND MANAGEMENTAND THE MAIN FIELD OF STUDYINDUSTRIAL MANAGEMENT,SECOND CYCLE, 30 CREDITS

, STOCKHOLM SWEDEN 2019

Bid Forecasting in Public Procurement

KARIM STITI

SHIH JUNG YAPE

KTH ROYAL INSTITUTE OF TECHNOLOGYSCHOOL OF INDUSTRIAL ENGINEERING AND MANAGEMENT

This page is intentionally left blank

SE-100 44 STOCKHOLM


by

Karim Stiti Shih Jung Yape

Master of Science Thesis TRITA-ITM-EX 2019:446

KTH Industrial Engineering and Management

Industrial Management

SE-100 44 STOCKHOLM

Budgivningsmodeller i Offentliga Upphandlingar

av

Karim Stiti Shih Jung Yape

Examensarbete TRITA-ITM-EX 2019:446

KTH Industriell teknik och management

Industriell ekonomi och organisation

Keywords: Bidding in public procurement, Count data regression, Economically most advantageous tenders, Lowest price tenders, Machine learning, Multiple linear regression, Non-parametric bootstrap, Position performance coefficient, Sealed-bid auctions, Stochastic dominance, Support vector regression.

Master of Science Thesis

TRITA-ITM-EX 2019:446

Abstract

Public procurement amounts to a significant part of Sweden's GDP. Nevertheless, it is an

overlooked sector characterized by low digitization and inefficient competition where bids are

not submitted based on proper mathematical tools. This Thesis seeks to create a structured

approach to bidding in cleaning services by determining factors affecting the participation and

pricing decision of potential buyers. Furthermore, we assess price prediction by comparing

multiple linear regression models (MLR) to support vector regression (SVR). In line with

previous research in the construction sector, we find significance for several factors such as

project duration, location and type of contract on the participation decision in the cleaning

sector. One notable deviant is that we do not find contract size to have an impact on the pricing

decision. Surprisingly, the performance of MLR are comparable to more advanced SVR

models. Stochastic dominance tests on price performance concludes that experienced bidders

perform better than their inexperienced counterparts and companies place more competitive

bids in lowest price tenders compared to economically most advantageous tenders (EMAT)

indicating that EMAT tenders are regarded as unstructured. However, no significance is found

for larger actors performing better in bidding than smaller companies.

Examiner

Pontus Braunerhjelm Comissioner

Tendium AB

Supervisor

Almas Heshmati Contact person

Farzad Khoshnoud

Approved

2019-06-14


Karim Stiti

Shih Jung Yape

Nyckelord: Budgivningsmodeller I offentliga upphandlingar, Ekonomiskt mest fördelaktiga anbud, Förseglade auktioner, Lägsta-pris anbud, Maskininlärning, Multipel linjär regression.

Examensarbete

TRITA-ITM-EX 2019:446

Godkänt

2019-06-14

Sammanställning

Examinator

Pontus Braunerhjelm

lm

Uppdragsgivare

Tendium AB

Handledare

Almas Heshmati Kontaktperson

Farzad Khoshnoud

Budgivningmodeller i Offentliga Upphandlingar

Karim Stiti

Shih Jung Yape

Offentliga upphandlingar utgör en signifikant del av Sveriges BNP. Trots detta är det en

förbisedd sektor som karakteriseras av låg digitalisering och ineffektiv konkurrens där bud

läggs baserat på intuition snarare än matematiska modeller. Denna avhandling ämnar skapa

ett strukturerat tillvägagångssätt för budgivning inom städsektorn genom att bestämma

faktorer som påverkar deltagande och prissättning. Vidare undersöker vi

prisprediktionsmodeller genom att jämföra multipel linjära regressionsmodeller med en

maskininlärningsmetod benämnd support vector regression. I enlighet med tidigare

forskning i byggindustrin finner vi att flera faktorer som typ av kontrakt, projekttid och

kontraktsplats har en statistisk signifikant påverkan på deltagande i kontrakt i städindustrin.

En anmärkningsvärd skillnad är att kontraktsvärdet inte påverkar prissättning som tidigare

forskning visat i andra områden. För prisprediktionen är det överraskande att den enklare

linjära regressionsmodellen presterar jämlikt till den mer avancerade

maskininlärningsmodellen. Stokastisk dominanstest visar att erfarna företag har en bättre

precision i sin budgivning än mindre erfarna företag. Därtill lägger företag överlag mer

konkurrenskraftiga bud i kontrakt där kvalitetsaspekter tas i beaktning utöver priset. Vilket

kan indikera att budgivare upplever dessa kontrakt som mindre strukturerade. Däremot

finner vi inger signifikant skillnad mellan större och mindre företag i denna bemärkning.

This page is intentionally left blank

Acknowledgements

We would like to express our deep and sincere gratitude to our thesis advisor Pro-fessor Almas Heshmati for his continuous support and guidance within the field ofstatistical analysis. His broad experience and expertise was valuable for us throughdifferent part of our thesis. We could not have imagined a more suitable candi-date in counselling us than Professor Almas. Beside our advisor, we would also liketo thank the people who helped us in Tendium, mainly Farzad Khousnoud, PeterVesterberg and Tim Lachmann. Together with Farzad we developed the idea offorecasting public procurement in the Swedish cleaning industry. Peter and Timprovided insight in data handling and model constructions which was helpful for theend result.

Finally, we gratefully acknowledge the support and continuous love of our parents.Their influence throughout our education is what have driven our passion and thecompletion of this thesis.

Jung Yape and Karim StitiStockholm, June 2019

i

Table of Contents

1 Introduction 11.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2 Purpose . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31.3 Problem Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . 31.4 Sustainability Aspect . . . . . . . . . . . . . . . . . . . . . . . . . . . 41.5 Scope . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41.6 Report Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

2 Previous Research in Sealed Bid Auctions 62.1 Measuring Tender Participation . . . . . . . . . . . . . . . . . . . . . 6

2.1.1 Probability of Winning Tenders . . . . . . . . . . . . . . . . . 62.1.2 Determining Number of Participants in Tenders . . . . . . . . 72.1.3 Factors Influencing the Bid/No Bid Decision . . . . . . . . . . 8

2.2 Price Prediction Models in Sealed Bid Auctions . . . . . . . . . . . . 92.2.1 Predicting Bid Prices Using Statistical Models and Machine

Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102.3 Assessing EMAT tenders . . . . . . . . . . . . . . . . . . . . . . . . . 122.4 Green Public Procurement - Environmental Quality Criteria . . . . . 13

3 Empirical Framework 163.1 Multiple Linear Regression Analysis . . . . . . . . . . . . . . . . . . . 163.2 Model Validation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

3.2.1 Multicollinearity . . . . . . . . . . . . . . . . . . . . . . . . . 173.2.2 Residual Diagnostics . . . . . . . . . . . . . . . . . . . . . . . 183.2.3 Variable Transforms . . . . . . . . . . . . . . . . . . . . . . . 19

3.3 Model Selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203.3.1 Selective Criteria . . . . . . . . . . . . . . . . . . . . . . . . . 203.3.2 K-fold Cross-Validation . . . . . . . . . . . . . . . . . . . . . . 21

3.4 Poisson Regression Models . . . . . . . . . . . . . . . . . . . . . . . . 213.4.1 Negative Binomial Regression . . . . . . . . . . . . . . . . . . 23

3.5 Support Vector Regression . . . . . . . . . . . . . . . . . . . . . . . . 253.6 Probit Regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 263.7 Stochastic Dominance . . . . . . . . . . . . . . . . . . . . . . . . . . 273.8 Confidence Intervals Using Non-Parametric Bootstrap . . . . . . . . . 273.9 Joint Distribution Function with Position Performance . . . . . . . . 293.10 Other Statistical Models . . . . . . . . . . . . . . . . . . . . . . . . . 30

3.10.1 Chi-Square Test . . . . . . . . . . . . . . . . . . . . . . . . . . 30

ii

4 Data and Methodology 314.1 Data Collection and Limitations . . . . . . . . . . . . . . . . . . . . . 314.2 Variable Description . . . . . . . . . . . . . . . . . . . . . . . . . . . 324.3 Competitors Affect On Price . . . . . . . . . . . . . . . . . . . . . . . 344.4 Assessing Tender Participants . . . . . . . . . . . . . . . . . . . . . . 344.5 Predict Price Performance . . . . . . . . . . . . . . . . . . . . . . . . 36

4.5.1 Comparing Multiple Linear Regression and Support VectorRegression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36

4.5.2 Inference for experience, size and tendering method with Stochas-tic Dominance . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

4.6 Assessing Quality Performance . . . . . . . . . . . . . . . . . . . . . . 38

5 Results 405.1 Bid/no Bid Decision . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

5.1.1 Number of Bidder’s Effect on Price . . . . . . . . . . . . . . . 405.1.2 Fitting a Distribution the Participation Data . . . . . . . . . . 415.1.3 Factors Contributing to Participants With Regression . . . . . 455.1.4 Impact of Environmental Criteria on Participation . . . . . . . 515.1.5 Bid/No Bid Decision for Individual Participants . . . . . . . . 51

5.2 Predicting Price . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 535.2.1 Factors Affecting Pricing . . . . . . . . . . . . . . . . . . . . . 535.2.2 Comparing SVR And MLR Models for Price Prediction . . . . 545.2.3 Price Range with Non-parametric Bootstrap . . . . . . . . . . 565.2.4 Stochastic Dominance for Spread . . . . . . . . . . . . . . . . 585.2.5 Price Performance with Spread Variable . . . . . . . . . . . . 60

5.3 Competitor Performance in EMAT tenders . . . . . . . . . . . . . . . 61

6 Analysis of Results 636.1 Number of Competitors . . . . . . . . . . . . . . . . . . . . . . . . . 636.2 Bid/No Bid Logistic Regression . . . . . . . . . . . . . . . . . . . . . 666.3 Predicting Price . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66

6.3.1 Stochastic Dominance and Non-Parametric Bootstrap Analysis 686.4 Quantifying Quality . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69

7 Conclusions 71

8 Suggestions for Future Research 73

Bibliography 74

Appendix 80A. Literature Review . . . . . . . . . . . . . . . . . . . . . . . . . . . 80

iii

B. Bid/No-Bid Probit Regression . . . . . . . . . . . . . . . . . . . . 81C. Diagnostics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82D. Price Prediction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

iv

1 Introduction

In this section public procurement auctions are introduced, the objectives and im-

plications of our thesis are described, the scope explained and finally we detail the

report structure.

1.1 Background

The purchase of goods and services by a public organization from external sources

is termed public procurement. It has a sizeable affect on world trade amounting to

more than 1.3 trillion euro each year. In EU it accounts for around 16 % of GDP

and during 2016 the EU put in new rules intended to open up the public procure-

ment market within Europe. The stated goal was to streamline processes and make

it easier for smaller companies to participate in the market (European Commission

2017). The total value of public procurement contracts in Sweden was estimated

to 683 billion SEK which is a 17 % share of GDP. In 2017, 18525 contracts were

announced and 39 % of those were in the construction sector (Upphandlingsmyn-

digheten 2019).

Public procurement contracts are awarded by Swedish local government in a process

known as sealed bid auctions meaning no bidder knows what the other participants

have bid (Upphandlingsmyndigheten 2019). This is a broadly used auction method

not only in public procurement but also in private procurement, especially in the

construction sector (Brook 2016).

There has been a steady decline from an already low level of the average number

of participants in public procurement auctions in Sweden, in particular the cleaning

industry. Today, there are on average 4.3 bidders per contract. It is the goal of the

Swedish government to increase competitiveness and make public procurement more

efficient (Upphandlingsmyndigheten 2019). The market is plagued by laws that

make the process static and inefficient which subsequently leads to inexperienced

parties winning auctions. Contracts where the winner is determined by combining

an assessment of quality with price have also not been effective (Inkopsradet 2016).

1

The bidding process in cleaning services in particular has been characterized by bids

under the recommended price winning. As a result, serious actors are electing not to

participate (Fastighetsfolket 2017). Part of the improvement process is to evaluate

how artificial intelligence can be used by the government in public procurement

to increase security and improve processing time. In addition, they aim to make

the market more transparent by providing access to more data (Regeringskansliet

2019).

We perform our analysis in this thesis by drawing on both existing literature directly

from cleaning in public procurement and the construction industry. We believe this

paper contributes to the existing literature in several ways. Firstly, it provides an

updated look at factors affecting the bid/no bid and pricing decision of bidders in

public procurement cleaning using factors initially discovered in the construction

industry. Secondly, we find that linear models perform in line to more advanced

support vector regressions. Thirdly, using models that have not been applied to

public procurement before we show that the experience of the bidder is more im-

portant when it comes to performance in tenders than company size. This follows

as more frequent bidders perform better than less frequent bidders and have smaller

confidence intervals in their bids; indicating that with experience comes structure in

bidding. Finally, we show companies tend to bid more evenly in lowest price tenders

compared to EMAT tenders. All together, our results show that one can draw infer-

ence of bidding behaviour with straightforward mathematical models and as more

structured data is made available with time a complete bid forecasting model can

be built.

2

1.2 Purpose

The Swedish Upphandlingsmyndigheten is hoping to streamline and digitize the

public procurement sector in general and the cleaning services in particular. It is a

sector plagued by low competition and inefficiency. Our thesis seeks to understand

and improve the low participation rate in cleaning services’ tenders by contributing

to digitizing the sector. We do this by evaluating pricing models and participa-

tion models that will make the bidding easier and more transparent. In addition,

considering the perceived problems with economically most advantageous tenders

(EMAT), we also intend to study whether there is a difference in how companies

perform in EMAT in relation to lowest price tenders.

1.3 Problem Formulation

Considering the industry inefficiencies, an important question to pose is whether

there is in fact any structure to the bidding behavior of participants or if they act

randomly. This thesis will seek to understand whether models can be developed for

the behavior of competitors within the Swedish cleaning service industry and in that

case what models are best suited. The approach to these problems is to divide them

into two main areas of study: The bid/no bid decision of competitors and predicting

bid prices. To do this we must first determine factors that affect the decision-making

of companies in the sector, then we will explore whether simple linear regression

models or more advanced machine learning methods perform better.

Research Question 1 (RQ1): What factors contribute to companies bidding in

public procurement?

Research Question 2 (RQ2): What factors affect the pricing decision in public

procurement?

3

Research Question 3 (RQ3): Are multiple linear regression or support vector re-

gression models best suited for price prediction in public procurement?

Research Question 4 (RQ4): How does bidder performance differ in EMAT ten-

ders compared to lowest price tenders?

1.4 Sustainability Aspect

This paper will explore sustainability through the public procurement contracts that

are EMAT tenders. We will study these contracts to determine what impact the

quality criteria based on environmental factors has on participation. This results in

the following research question:

Research Question 5 (RQ5): Does participation increase or decrease when en-

vironmental quality criteria are included in EMAT tenders?

1.5 Scope

Our research will only focus on public procurement contracts in the cleaning services

sector in central and the south of Sweden from 2016-2019. Within the cleaning

sector we focused on sanitation and standard cleaning of floors and windows. For

price analysis, we only look at floor care. Some of the companies participating in

these tenders are not Swedish but no distinctions are made between Swedish and

foreign firms. Previous research presents numerous attributes that have an affect

on bidding in other sectors as well as in the cleaning industry. We limit our thesis

to those that are most quantifiable and are accessible to us from our database.

Limitations in the data will be further explored in section 4.1. The study methods

used will mainly consist of regression analysis but some inference will be drawn from

stochastic dominance and non-parametric bootstrapping.

4

1.6 Report Structure

The paper is organized as follows. Section 2.1 starts off by going trough research in

tender participation with section 2.1.3 in particular detailing findings when looking

at factors affecting the bid/no bid decision. Section 2.2 provides an overview of

existing literature pertaining to predicting bid prices and determinants of the pricing

decision. The literature review concludes with section 2.3 describing the difficulty in

judging economically most advantageous tenders and 2.4 that looks specifically at

the impact of environmental quality criteria. All of the mathematical models used

in this thesis are presented next. Section 4 then describes the data and details its

limitations as well as defining key variables. In addition, it is exhaustively explained

how we intend to apply the methods that have been introduced in section 3. We

outline each approach in the order they will be presented in the results section.

Results will be presented in section 5 for fitting tender participation into a known

distribution and the regression models. Price prediction model comparison and price

performance is shown next and we finally present the assessment of EMAT tenders.

Section 6 and 7 analyzes and concludes the results. Finally, suggestions for further

research are presented in section 8.

5

2 Previous Research in Sealed Bid Auctions

Using past research we hope to find good models that can be used for our thesis.

For tender participants there are some existing studies in public procurement for

cleaning services that we can build on. For price prediction and EMAT assessment

the research is more sparse and we seek studies in other sectors such as construc-

tion.

2.1 Measuring Tender Participation

2.1.1 Probability of Winning Tenders

Multiple and logistic regression can be applied in determining probabilities of win-

ning tenders. Data from one specific bidding firm in the Polish construction industry

allowed Malara and Mazurkiewcz (2012) to model the winning probability through

a binary logistic regression model. The binary response variable is partially deter-

mined by qualitative variables such as the type of tender, size of the procurer and

the presence of partner contractors, and partially by quantitative variables such as

number of competitors, lowest price bid and highest price bid.

Results in Malara and Mazurkiewcz (2012) model gave the suppliers either an output

of 0 as in loosing the tender or 1 as in winning the tender. Rounding between 0 and

1 are done accordingly with standard mathematical procedures. Although Malara

and Mazurkiewcz do not forecast any optimal bid range or optimal bid price, their

contribution could still be useful for this study. Their method of determining winning

probabilities can be incorporated in other auction bidding theories that both utilize

quality and price to find and optimal bid. Lastly, it is imperative to note that the

model Malara and Mazurkiewcz propose is restricted to one firm’s probability of

winning, which may not be satisfactory in a large competitive setting with many

actors.

Malara and Mazurkiewcz work is pertinent for this thesis, not due to their procedure

of measuring the probability of winning tenders, but rather for what factors that

6

could contribute firms to participate in tenders. Furthermore, their mathematical

theory could be of significance for this thesis since they apply binary regression

models well aligned with probit regression and the probability of a firm participating

in tenders.

2.1.2 Determining Number of Participants in Tenders

There have been varying results regarding the effect of number of participants on

the tender outcome. An analysis of the so called bid spread, defined in section 4.2,

in auctions did not show any significant effect by either number of competitors or

contract size (Drew et al. 2001). However, determining the number of participants

in a contract may be pertinent due to simple supply demand theory; the more

bidders in a contract, the harder it becomes for firms to profit and the cheaper

it becomes for buyers. Several studies have centered around understanding how

different factors affect the participation rate of contracts in different industries.

Augustin and Walter (2010) found in their study on operator changes that project

duration, i.e. longer contracts, usually increased the participation rate in contracts.

When studying competitive bidding Beck (2011) found that season was statistical

significant when describing the number of participants in tenders. Contracts starting

during summer or at the beginning of the year had lower participation rate than

contracts that started mid autumn or mid spring. Although indications that both

time and seasonal components might affect the amount of participants in contracts,

it is important to be critical. Influencing factors may vary for different industries

depending on the type of project, the risk firms undertake and the amount of capital

firms bind during a certain time period (Agerberg and Agren 2012).

A more explanatory procedure for determining number of participants in contracts

was presented by Vigren (2017). In his study, Vigren used count data regression

models to determine what contract characteristics that affected the number of bids

for public transport bus contracts. The main finding from the study was that most

contract features changed participation in tenders by around 0.1− 0.5 bidders. The

factors he looked at included, project duration, number of other tenders available,

geography, a firm’s current workload and whether the contract was evaluated based

7

on lowest price or economically most advantageous criteria. He compared ordinary

least-square to Poisson Regression, Generalized Poisson Regression, Truncated Pois-

son Regression and Truncated Generalized Poisson. The OLS regression performed

worse than the different Poisson models. In particular he found that project du-

ration increased the number of participants while tenders that are evaluated on

EMAT decrease the number of participants as with the number of other tenders

available to bid, possibly indicating that some firms have limited capacity to bid.

Finally, Vigren found that tenders with combined contracts did not affect the overall

participation.

Over the years the participation rate of contracts ranging from the timber industry

to the construction industry has been found to vary depending on factors such as

the type of project (Drew and Skitmore 2006; Athey et al. 2011), client relationship

(Bageis and Fortune 2009), project location (Azman 2014), project duration (Au-

gustin and Walter 2010) and project size (Al-Arjani 2002; Benjamin 1969; Drew and

Skitmore 2006; Lundberg et al. 2015). To further understand important attributes

that may contribute to the bidding participation rate, it is relevant to study factors’

influence on single firm’s bidding participation.

2.1.3 Factors Influencing the Bid/No Bid Decision

Numerous studies have been made on which factors that are most important when

assessing the bid/no bid decision and overall competitiveness for contracts in dif-

ferent sectors. One study in the construction industry, performed by Cheng et al.

(2010), indicated variables such as project size, type of project, time available for

tender preparation, current workload, expected number of competitors, tendering

method and project duration to be important for a firm’s decision to bid in a con-

tract (remaining factors in Cheng et. al. study are illustrated in table 21). In

comparison to other studies which uses various statistical models they implement

a questionnaire to determine important factors contributing to firms’ bid/no bid

decisions. Cheng et. al. gathered key influencing factors from renowned market

participants, weighed their relative importance within the industry and finally gave

them points that described factors’ significance to contracts. Ballesteros-Perez et

8

al. (2016) took a similar approach as Cheng where they focused on anticipating the

participation of individual bidders. They found that contract size was significant

and thus affected the likelihood of individual bidders to participate in contracts.

Furthermore, bidders that have not previously participated in a auction are impor-

tant to consider in tender forecasting models, since excluding them would limit the

predictive accuracy (Ballesteros-Perez et al. 2016)

Fu (2003) measured in his doctoral thesis the effect of contractor’s bidding expe-

riences on competitiveness in recurrent bidding. His work is pertinent in multi-

attribute tendering theory since he incorporates other important factors to assess

competitive bidding. Well aligned with Cheng et al. (2010) he studied how fac-

tors contributes to bid/no bid contract decision, Fu found that features such as

client relationship, a firm’s previous experience, project size and project location

to be important when determining competitiveness mainly within the construction

industry.

By comparing studies in the construction industry (Cheng et al. 2010; Fu 2003) and

public transport industry (Vigren 2017), there is a further strong indication that

common factors to assess in bid/no bid decision models are project size, project

location and the workload (in some studies called market conditions) of firms.

Conclusively, the number and identity of participants auctions in economics have

been challenging to forecast. There are few models that provide accurate solutions

or predictions, thus further strengthening the fact that the area of multi-attribute

auction theory and assessment of factors contributing to competitive bidding to be

a fairly unexplored field of study. In addition, there are no good models to predict

specific number of key competitors (Ballesteros-Perez et al. 2015).

2.2 Price Prediction Models in Sealed Bid Auctions

In this section we review past research to identify attributes that affect the pricing

decision of a bidder to use in our regression analysis. There have been several

studies in this area and Takano et al. (2018) categorized them into three buckets:

statistical models, multi-criteria utility models and artificial-intelligence (AI) based

9

models. We focus on statistical and AI models by presenting the results of previous

attempts to use machine learning and least-square regression to predict prices in

sealed bid auctions in procurement. In addition to these models we would also argue

the prevalence of models based on game theory. Friedman, a prominent researcher

in bidding theory, neglected these model arguing game theory models were only

functional when the number of bidders are predictable and few Friedman (1956), a

rare occurrence in this types of auctions.

2.2.1 Predicting Bid Prices Using Statistical Models and Machine Learn-

ing

Based on an extensive literature review it was found that factors affecting the bid

decision of companies in the construction sector can be categorized into seven ma-

jor groups; project characteristics, economic Characteristics, bidding characteris-

tics, contract Characteristics, owner characteristics, company characteristics and

opportunity Characteristics. Some of the most prominent attributes within these

categories were project size, investment risks, time for tender preparation, type of

contract, relationship with the owner, current workload and need for work (Polat

et al. 2016). The paper further compared a machine learning method called ar-

tificial neural network with least-square regression. They found that both models

performed equally with good predictive accuracy.

10

In one paper least-square regression was applied to forecast a bidding range. They

applied logistic regression and found a model with a very high model fit (Petrovski et

al. 2015). An advantage of this model is that it can easily take in to account several

factors that impact the bidding decision, both quantitative and qualitative.

Artificial intelligence tools have been used since the 1990s to tackle bidding problems.

Methods include artificial neural networks (Hegazy and Moselhi 1994; Li 1996), case-

based reasoning (Dikmen et al. 2007) and fuzzy set theory (Fayek 1998).

A paper that instead looks at itemized bids is Jung and Kim (2019) that provided a

forecasting model to using a machine learning approach called random forest method

to estimate bidding ranges with the help of random forest variable selection and

regularized linear regression approaches. They validated their model by finding

that the actual winning bid always was in the proposed range. Lastly, they argued

that the predictive power of the suggested model could be improved by using better

datasets.

Petrovski et al. (2015) used Support vector regression with a Radial Basis Function

(RBF) kernel to arrive at a model using two attributes, tender preparation and price

received, and predict prices in the construction sector in Macedonia by only 2.5 %

mean absolute percentage error

Instead of looking at regression models some studies have focused on trying to fit

a distribution model to bids. One study (Ballesteros-Perez and Skitmore 2017) as-

sessed seven different distributions including Uniform, Weibull and different versions

of the log-normal distributions. They found that the 3-parameter log normal gave

the best results while Weibull, Log-Uniform and Uniform performed badly. They

noted that the Weibull giving such poor results was surprising since it is widely used

to model bid variability.

11

2.3 Assessing EMAT tenders

EMAT contracts are not simply awarded to the lowest price bidder. Instead each

company is evaluated by a set of quality criteria. The criteria either results in a price

discount for good scores, price penalty for bad scores or is translated into a score

that combined with a price score results in a final bid score. Both the criteria to be

included and subsequent assessment mechanism is up to each procurement entity

to choose. There are certain guidelines from upphandlingsmyndigheten regarding

appropriate quality scores (Upphandlingsmyndigheten 2019).

Yu et al. (2013) argue that few adequate models exist for EMAT tenders due to the

inherent difficulty in measuring the difference in price that comes from variance in

the quality of service or product. They tackle EMAT tenders by estimating a bidding

range that takes into account quality scores by introducing a price elasticity of

quality (PEQ) model. This measure is reminiscent of the definition of price elasticity

of demand found in economics, with the demand replaced by a quality variable. The

quality is defined by factors such as capabilities, experience and management skills.

Using this model they are able to present a method to estimate a bidding range.

The bidding range is solved with a graphical analysis tool known as geometric graph

analysis (GGA) proposed by Wang et al. (2007). The key finding from their paper

was that both the most competitive and profitable strategies suggest choosing the

same quality level in the bid. Meaning that regardless of bidding strategy there

exists a given optimal quality level so that the quality and pricing aspects can be

kept separate.

Ballesteros-Perez et al. (2015) developed a method to assess the position of a bidder

in EMAT tenders using a position performance coefficient. They found that the both

the Beta distribution and the Kumaraswamy’s distribution fits the position perfor-

mance coefficient. In addition to calculating a distribution of the likely position

a bidder will have in a future auction they model the number of participants in a

bidding process with the Laplace distribution. They did this because the likelihood

that you are placed second when there are three participants will likely be different

than when there are ten bidders. They then arrived at their final result by creating

a joint probability distribution. This distribution gives the probability of a competi-

12

tor placing in any given position. Thereby a bidder can assess the performance of

its competitors in an EMAT tender. The authors raise three drawbacks with their

research with the biggest being that they neglect to account for non-economic ra-

tional bidding and cover pricing, defined as a participant finding it in their interest

to bid in an auction without the intention to win.

2.4 Green Public Procurement - Environmental Quality Cri-

teria

Green public procurement (GPP) can be used as a tool by public organizations to

achieve environmental quality objectives. Overall, there seems to be an increased

usage of environmental criteria (Von Oelreich and Philip 2013). The extent to

which GPP is used however differs between countries in EU because it is voluntary.

In Sweden, environmental criteria were used in 40-60 % of tenders while in EU as a

whole it was less than half that (Renda et al. 2012).

Aldenius and Khan (2017) listed several important factors featured in previous lit-

erature that have an effect on the outcome of GPP. These included strategy and

goals since research has shown that top level staff in government have an impact

on the degree to which GPP is taken into account when setting goals. When GPP

directives were more voluntary than mandatory then factors other than sustainabil-

ity were prioritized in the procurement process. What is lacking in GPP research

is studies detailing how specific regions strategically use public procurement to pro-

mote environmental objectives and what challenges it implies. Another key factor

driving the use of GPP was found to be costs. Studies have shown that procurement

entities perceive the inclusion of environmental criteria as cost ineffective and that

is slows down the process. Moreover, the size of public organizations in different

regions may explain varying success of GPP. Aldenius and Khan (2017) presented

a study conducted in Norway that showed that larger municipalities have imple-

mented GPP criteria to a larger extent than smaller ones. Finally Aldenius and

Khan (2017) described the existence of legal uncertainties regarding the application

of GPP criteria as well as a lack of knowledge of the advantages of GPP and life cycle

costs. In fact, this lack of knowledge and training in environmental criteria are more

13

critical factors to the future success of GPP rather than budgetary considerations

(Testa et al. 2013).

An argument for green public procurement (GPP) is that public sector parties can in-

fluence producers and consumers to reduce their impact on the environment through

their purchasing power. However, Lundberg et al. (2016) assessed the ability of GPP

to achieve environmental objectives and found its potential as an environmental pol-

icy to be limited in terms of how polluting firms choose to adapt to the environmental

requirements posed by the public sector and invest in greener technologies. In fact,

they argued it can be counterproductive. They concluded by stating that GPP must

aim for environmental standard beyond the technology of the procurement firms and

to be designed with clear environmental objectives in mind.

The Swedish public procurement agency Upphandlingsmyndigheten has put forth

sustainability criteria that should be used in public procurement contracts. They

can be divided into four subcategories. It is up to each individual procurement en-

tity to choose which criteria and to what degree to use them. The four categories

are: 1. Qualification criteria, 2. Technical specification, 3. EMAT criteria, 4. Con-

tract criteria. For cleaning services there are specific criteria that a procurement

entity may choose to use. Qualification criteria are used for the bidders to prove that

they works systematically with environmental considerations. ISO 14001 certificates

are therefore often required to participate in contracts. It is an international envi-

ronmental management system standard that aims to decrease the environmental

footprints of companies. The minimum requirements is to work proactively with

regard to negative impact on the environment as well as to meet national laws.

The procurement entity should accept substitute documents showing that a com-

pany meet the requirements in the system standard if it currently does not hold the

certificate (Upphandlingsmyndigheten 2019).

The consequences of including environmental criteria in public procurement auctions

has been considered. Lundberg et al. (2015) presented with a negative binomial re-

gression that environmental criteria, together with other variables such as Tendering

Method and Project Size, does in fact have statistical significance when describing

the number of competitors in tenders. Their argument is that it will decrease the

14

number of tender participants and thereby lower competition. On the other hand

it might lead to increased competition because it incentivizes suppliers that already

deliver sustainable solutions to participate more. Then, if these companies out-

number those that do not focus on sustainability than the overall effect could be

increased participation (Lundberg et al. 2009).

To conclude this section, it is clear from our review that research pertaining to

bidding participation, bid forecasting related directly to the cleaning public pro-

curement sector is relatively limited. Thus justifying our thesis on bidding in public

procurement contracts.

15

3 Empirical Framework

3.1 Multiple Linear Regression Analysis

To model the relationship between a dependent variable and independent variables

a multiple regression model (MLR) can be used. The dependent variable can also

be referred to as the response variable. The extended model has the following

mathematical notation

Yi = β0 + β1x1 + β2x2 + .....+ βkxk + εi, (1)

where, Yi is the dependent variable, β0 is the intercept, βk are the regression coeffi-

cients, β0 is the intercept, xk the independent variables, also referred to as explana-

tory variables, and εi the random error terms. MLR assumes that the relationship

between the variables are linear (Montgomery et al. 2012).

In matrix form the regression model is expressed as

Y = Xβ + ε, (2)

where

Y =

∣∣∣∣∣∣∣∣∣∣Y1

Y2...

Yn

∣∣∣∣∣∣∣∣∣∣, X =

∣∣∣∣∣∣∣∣∣∣1 X11 X12 ... X13

1 X21 X22 ... X23

......

.... . .

...

1 Xn1 Xn2 ... Xnk

∣∣∣∣∣∣∣∣∣∣β =

∣∣∣∣∣∣∣∣∣∣β0

β1...

βk

∣∣∣∣∣∣∣∣∣∣, ε =

∣∣∣∣∣∣∣∣∣∣ε1

ε2...

εn

∣∣∣∣∣∣∣∣∣∣, (3)

where Y is a nx1 vector of observations. X is a nxp matrix of the independent

variables, β is px1 vector of regression coefficients and ε the random errors in an

nx1 vector.

16

Estimation of model parameters is done with the ordinary least-square approach

(OLS). The least-square estimators are obtained by minimizing the sum of squares

of the errors: ε′ε. We obtain the least-squares normal equations

X ′Xβ = X ′Y, β = (X ′X)−1X ′Y, (4)

In a regression model two types of independent variables are used: quantitative

variables and qualitative variables. The first type are continuous variables. Dummy

variables are qualitative variables and take values 1 or 0 to indicate a categorical

effect that can shift the outcome (Montgomery et al. 2012).

3.2 Model Validation

3.2.1 Multicollinearity

The presence of multicollinearity can increase the uncertainty in the model by in-

creasing the standard errors of estimated coefficients. The first step to detect mul-

ticollinearity is by inspecting the correlation matrix of the independent variables.

These are obtained by the unit length scaled values give from

wij =Xij −Xj

s1/2jj

, i = 1, 2, ..., n, j = 1, 2, ..., k (5)

where k is the number of independent variables without the intercept, Xj is the

mean of the independent variables in j th row and sjj =∑n

i=1(Xij − Xj)2. The

correlation matrix is now obtained by multiplying two matrices W of the scaled

values.

W ′W =

∣∣∣∣∣∣∣∣∣∣1 r12 r13 ... r1k

r12 1 r23 ... r2k...

......

. . ....

r1k r2k r3k ... 1

∣∣∣∣∣∣∣∣∣∣(6)

17

An additional multicollineraity diagnostic is the variance inflation factor (VIF).

According to James et al. (2013) and Montgomery et al. (2012) a VIF value

exceeding 5 or 10 is a strong indication of collinearity between the variables. A

benefit of the VIF over the simple cross-correlation is that it is conditional on other

explanatory variables. For the ith independent variables the VIF is

V IFi =1

1−R2i

, i = 1, 2, ..., p (7)

Where R2i is the coefficient of determination that is the result of using the ith

independent variables in a regression against the other independent variables. The

coefficient of determination is further explained below.

3.2.2 Residual Diagnostics

The use of the multi-linear regression model requires some assumptions to hold:

• Approximate linear relationship between the dependent variable and indepen-

dent variables

• The errors are normally distributed by e ∼ N(0, σ2)

• The errors are uncorrelated

The residuals are defined as

ei = Yi − Yi, i = 1, 2, ..., n (8)

where Yi is the ith observation and Yi the fitted value. Plotting the residuals is a

good way to investigate the key assumptions underlined above . In particular one can

look at the studentized residuals to detect outliers or extreme values (Montgomery

et al. 2012). They are defined as

ri =ei√

MSres(1− hii), i = 1, 2, ...n (9)

18

where hii is found in the ith diagonal element of the hat matrix

H = X(X ′X)−1X ′, (10)

MSres is the residual mean square defined as

MSres =SSresn− p

, (11)

where SSres is defined in section 3.3.1

The Studentized residuals may be used in Quantile-Quantile (QQ) plots. to check

if the errors are normally distributed. QQ-plots are sample order statistics plotted

against theoretical quantiles from a standard normal distribution. Non-normality

can be spotted in such a plot (Thode 2002).

3.2.3 Variable Transforms

The Box-Cox method can be used to try to remedy non-normality by transforming

the dependent variable. The power transformation of the ith observation Y λi is

defined as

Y(λ)i =

Y λ−1i

λY λ−1 , λ 6= 0

Y ln(Yi), λ = 0(12)

where Y = ln−1[ 1n

∑ni=1 lnYi] corresponds to the geometric mean of the observations

and Y(λ)i is the transformed dependent variable (Montgomery et al. 2012).

19

3.3 Model Selection

3.3.1 Selective Criteria

Model selection in regression analysis can be done by the all possible regression

method. It fits all possible combinations of the regressors and selected the best

model according to some selective criteria. For k regressors there are 2k possible

combinations.

A common selective criteria is the coefficient of determination, R2. It measures what

amount of the variance in the dependent variable that can be predicted with the

independent variable (Montgomery et al. 2012). R2 is defined as

SSres =n∑i=1

(Yi − Yi)2, (13)

SST =n∑i=1

(Yi − Y )2, (14)

R2 = 1− SSresSST

, (15)

where Y is the dependent variable mean, Yi is the ith observation and Yi is value

estimated by the regression. Finally we also use the Akaike Information Criteria

(AIC) and Bayesian information criterion (BIC) defined as

AIC = −2ln(L) + 2p (16)

In the OLS regression this becomes

AIC = nln(SSresn

) + 2p (17)

20

The BIC is defined as

BIC = ln(n)p− 2ln(L) (18)

where, L is a likelihood function for a specific model. A lower AIC and BIC value

indicates better fit (Montgomery et al. 2012)

For prediction in regression analysis a frequently used criteria is the root-mean

square error (RMSE) defined as

RSME =

√∑ni=1(yi − yt)2

n, (19)

where yi is the predicted value, yi the corresponding observed value and n the number

of observations (Montgomery et al. 2012).

3.3.2 K-fold Cross-Validation

K-fold cross-validation is a rigorous method for prediction model validation in re-

gression analysis. It separates the data into k-subsamples of which the model is

tested on k-1 samples and validated on the remaining sample (Montgomery et al.

2012)

3.4 Poisson Regression Models

The dependent variables used in this paper are count data, meaning they are non-

negative integer values. The probability mass of the distribution for count data is

limited to a non-negative range as opposed to the the normal distribution (Cameron

and Trivedo 2005). Consequently, the standard OLS method might fail.

The Poisson regression model is based on the Poisson distribution with a probability

mass function

21

Pr[Y = y1|xi] =exp(−λi)λyi i

yi!, yi = 0, 1, 2..., (20)

with yi as response variable, independent variables x, parameters λi and first and

second order moments

E[Y ] = V ar[Y ] = λi, (21)

So the expected value of the response variable is:

E[yi|xi] = λi = exp(x′

i)β, (22)

Meaning it is non-linear expressed as an exponential parameterization of Y. It is

estimated with a maximum likelihood technique and log-likelihood function (Green,

2003).

The Poisson model requires equidispersion and it does not hold the model may give

uncertain standard errors (Cameron and Trivedo 2005). Choosing an appropriate

model thus requires investigating the response variable. Underdispersion would

indicate that the standard errors may be overestimate and thus leading to false

insignificant results (Hilbe 2014). A remedy if the data is found to be over-or

underdispersed is to use the Generalized Poisson distribution (Consul, 1989). This

extends the Poisson distribution in the above equation to:

Pr[Y = y1|xi] =

λiexp(−λi−yi)(λi+yiγ)yi−1

yi!, yi = 0, 1, 2...,

0, for y˙i m, when γ < 0(23)

This model introduces an additional parameter dispersion parameter γ lying in the

range max[-1, λi/m] < γ ≤ 1 with a negative value indicating underdispersion and

γ = 0 reduces it to the standard Poisson distribution. The moments are:

E[Y ] =λi

1− γ, (24)

22

V ar[Y ] =λi

(1− γ)3, (25)

To allow for interpretation of the coefficients they are transformed by taking the

exponent. Then they can be interpreted as a percentage change in the number of

counts/bids (Hilbe 2014).

To address the fact that the count data in this thesis (participation in tenders)

does not have any zero counts we extend the empirical model with truncation at

zero.

3.4.1 Negative Binomial Regression

The Negative Binomial regression is a generalized linear regression which in similar-

ity to the Poisson regression can be used for count data. The dependent variable Y

in Negative Binomial regression is a count of the number of times an event occurs

(Zwilling 2013). A convenient parameterization of the Negative Binomial distribu-

tion is given by

p(y) = P (Y = y) =Γ(y + 1/α)

Γ(y + 1)Γ(1/α)

(1

1 + αµ

)1/α(αµ

1 + αµ

)y, (26)

where µ > 0 is the mean of Y and α > 0 is the heterogeneity parameter. The

parameterization is derived as a Poisson-gamma mixture, or as the number of failures

before the (1/α)th success, though 1/α is not required to be an integer (J.M. Hilbe

2011).

According to J.M. Hilbe (2011), the traditional negative binomial regression model,

designated as the NB2 model, is defined as

lnµ = β0 + β1x1 + β2x2 + .....+ βpxp, (27)

where the predictor variables x1i, ....., xpi are given and the regression coefficients

β0, β1, ....., βp are to be estimated using maximum likelihood estimation.

23

Given a random sample of n subjects, observe for i the dependent variable y, and

the predictor variables x1i, ....., xpi. Following vector notation can be made for

β = (β0, β1, ....., βp)T and predictor data can be gathered in to the following ma-

trix X

X =

1 x11 x12 . . . x1p

1 x21 x22 . . . x2p...

......

. . ....

1 xn1 xn2 . . . xnp

Designating the ith row of the matrix X to be xi, and exponentiating equation (26),

the distribution in equation (27) can be written as

p(yi) = P (Y = yi) =Γ(yi + 1/α)

Γ(yi + 1)Γ(1/α)

(1

1 + αexjβ

)1/α(αexjβ

1 + αexjβ

)yi,

where i = 1, 2, ....., n. Maximum likelihood estimation is applied to estimate the

unknown parameters α and β. The likelihood function is defined as

L(α, β) =n∏i=1

p(yi) =n∏i=1

Γ(yi + 1/α)

Γ(yi + 1)Γ(1/α)

(1

1 + αexjβ

)1/α(αexjβ

1 + αexjβ

)yi,

and the log-likelihood function is

lnL(α, β) =n∑i=1

(yilnα + yi(xjβ)− (yi +1

α)ln(1 + αexjβ)

+ lnΓ(yi +1

α)− lnΓ (yi + 1)− lnΓ(

1

α))

The values of α and the regression coefficients β that maximize lnL(α, β) will be

the maximum likelihood.

24

3.5 Support Vector Regression

Initially, Support Vector Machines were developed to solve classification tasks, but

it can also be used to tackle regression problems. The kernel function is used to map

lower dimensional data to higher dimensional data. Radial Basis Function (RBF) is

an often recommended kernel function (Petrovski et al. 2015).

To assess the quality of estimation SVR uses a loss function called ε-insensitive loss

function proposed by Vapnik (Vapnik and Chapelle 1999). It measures the error of

approximation and is defined by

|y− f(x,w)|ε=

0, if |y− f(x,w)|≤ ε,

|y− f(x,w)|−ε otherwise(28)

Where, ε is the insensitive zone, f(x,w) is a vector of the predicted values, y is a

vector of the true values and w is vector of the unknown wights coefficients. The

interpretation of the model is that when difference between the predicted value and

true value is less than ε the error is put to zero and thus no included in the model.

The SVR is defined as minimizing the error given by

R =1

2||w||2+C(

l∑i=1

|y − f(xi, w)|), (29)

Where the hyperparameter C is chosen and its value impacts the value of approxi-

mation error and ||w||

The hyperparameters C and gamma (γ) act like regularization hyperparamters and

are used to mitigate overfitting. If the model is overfitting then γ should be reduced,

and if it is overfitting, it should be increased. The C parameter works the same way

(Aurelien 2017).

25

3.6 Probit Regression

In determining the probability of winning binary logistic regression can be applied

in accordance to Malara and Mazurkiewcz (2012). In their model they define the

binary explanatory variable Y , the quantitative variables Xi, i = 1, 2, 3...., n and the

qualitative variables Zj, j = 1, 2, 3....,m. Both quantitative and qualitative variables

should be collected through historical data from several tenders committed by one

firm.

The binary logistic regression model expresses probabilities in terms of so called odds

instead of the classic method where one divide # of successes through # of trials

(Peng et al. 2002). Contrary to the classical method of calculating probabilities,

Malara et al. calculate the odds as the ratio of # of successes to the # of failures.

Thus, the odds can be defined as,

logit(p) = ln(odds) = ln(p

1− p) = Y = β0 + β1x1 + β2x2 + ...+ βkxk + ε, (30)

p denoted the likelihood of the occurrence of an event so that the probability p ∈[0, 1] and ε is the error term of the regression model. β is the unknown vector of

regression parameters, where β = (β0, β1, ..., βk)T . Equation (30) can also be written

in the form,

P (Y ) =eβ1x1+β2x2+...+βkxk

1 + eβ1x1+β2x2+...+βkxk, (31)

26

3.7 Stochastic Dominance

Assume F (x) and G(x) are continuous cumulative distributions functions of X and

Y. Then stochastic dominance of first order (FSD) is defined as

F (x) ≤ G(x), (32)

To test for first order stochastic dominance the Kolmogorov-Smirnov test is widely

used. The test statistic is defined as (Schmid and Trede 1996)

Dn,m = supx∈R{Gn − Fm}, (33)

where Gn and Fm are Empirical Cumulative Distribution Functions (ECDFs) for

the data with n and m data points. This method has been used before in analysis

of income distribution (Hestmati and Maasoumi 2000).

3.8 Confidence Intervals Using Non-Parametric Bootstrap

Non-parametric bootstrap generates additional data by re-sampling from the orig-

inal dataset with replacement. Bootstrapping is a powerful tool particularly when

assessing prices since it does neither assume distribution model or require any model

as inputs. The purpose of the non-parametric bootstrap in this thesis is to enable

further study of mainly the distribution of unit prices. The non-parametric boot-

strap method applied in this thesis will be in accordance to what is described in

the literature Risk and Portfolio Analysis - Principles and Methods by Hult et al.

(2012).

Consider the observations x1, ....., xn of independent and identically distributed ran-

dom variables X1, ....., Xn. The aim is to estimate some quantity θ = θ(F ) that de-

pends on an unknown empirical cumulative distribution function F of Xk. In the case

of this thesis θ could be the unit prices θ =∫xdF (x) giving θobs = (x1+ .....+xn)/n.

Construct thereafter a confidence interval for θ with confidence level q, where q is

27

usually set to 95%. Since the empirical cumulative density function F is unknown,

one method of constructing a confidence interval is to simulate large samples form

F to approximately compute θ as the empirical estimate (Hult et al. 2012).

More samples are generated by randomly drawing with replacement n times from the

set of observations {x1, ...., xn} to produce {X∗1 , ....., X∗n}. The amount of bootstraps

required for good results usually vary but a general rule of thumb is minimum to or

more than 599 bootstrap iterations (Wilcox 2010). The generated sample points X∗kare assumed to be independent and Fn-distributed (uniformly distributed on the set

of the original observations {x1, ....., xn}). Some X∗k : s will be equal even though

the xk are all different. F ∗n is denoted as the empirical cumulative distribution of

X∗1 , ....., X∗n and θ∗ = θ(F ∗n) for the estimate of θ based on the samples {X∗1 , ....., X∗n}.

Even though {X∗1 , ....., X∗n} is not a sample from F , it has most of the features of

a sample from F as long as n is sufficiently large. In particular, the probability

distribution of θ∗ is likely to be close to the probability distribution of θ. While

the probability distribution of θ is unknown (since F is unknown), the probability

distribution of θ∗ can, with sufficiently large N, be approximated arbitrarily by

repeated re-sampling N times. An approximate confidence interval Iθ,q for θ with

confidence level q using the non-parametric bootstrap method is constructed as

follows.

28

1. For each j in the set {1, ....., N} draw with replacement n times from the

sample {x1, ....., xn} to obtain sample {X(j)1 , ....., X

(j)n } and the corresponding

empirical cumulative distribution function F∗(j)n .

2. Compute the estimates θ∗j = θ(F∗(j)n ) of θ and the residuals R∗j = θobs − θ∗j for

j = 1, ....., N .

3. Compute the confidence interval for the confidence level q

Iθ,q = (θobs +R∗[N(1+q)/2+1,N ], θobs +R∗[N(1−q)/2+1,N ]),

where R∗1,N ≤ ..... ≤ R∗N,N is the ordering of the sample {R∗1, ....., R∗N}

3.9 Joint Distribution Function with Position Performance

As described in section 2.3 we use the model that allows us to calculate the probabil-

ity curve that shows the likely position to be occupied by a given bidder that takes

into account the total number of bidders. The general expression for the position

performance joint probability curve using the kumaraswamy for bidder i is defined

as

Ji PDF (x = j, αi, βi,m, b) =

Nk=+∞∑Nk=jk

kum PDF (x, α, β) ∗ Laplace PDF (x,m, b)

(34)

Where x is the position in a tender, Nk is the number of participants, αi and βi are

distribution parameters for the kumaraswamy distribution, and m, b are parameters

for the laplace distribution that was chosen in the paper.

29

3.10 Other Statistical Models

3.10.1 Chi-Square Test

Pearson chi-square test statistic can be applied in order to compare different distri-

butions and assess the goodness of fit. The test statistic compares observed probabil-

ities with expected probabilities of success and failure at each group of observations

(Montgomery et al. 2012). Define the expected number of successes and expected

number of failures as niπi and ni(1 − πi) respectively. The Pearson chi-square test

statistic can thus be formulated as,

χ2 =n∑i=1

{(yi − niπi)2

niπi+

[(ni − yi)− ni(1− πi)]2

ni(1− niπi)

}=

n∑i=1

(yi − niπi)niπi(1− πi)

(35)

The goodness-of-fit test statistic above is comparable with a χ2-distribution with

n− p degrees of freedom. Large p-values for the test statistic implies that a model

or distribution has a satisfactory fit to the data.

30

4 Data and Methodology

Due to the limited amount of research that has focused solely on the cleaning service

sector we seek to test models employed in the construction sector and see how

well they can be translated. Research in public procurement auctions are often

characterized by limited data and to remedy this we apply other methods in addition

to regression analysis. Computation of models and choice of variables will be inspired

by the studies presented in the literature review. The key focus of this thesis is to

assess tender participation and pricing decisions. We will study two key dependent

variables in each area.

For the bid/no bid decision we choose to look at all bidders collectively through

count data regression as well as compare behavior on an individual basis with probit

regression. For pricing we will assess floor care unit prices and the spread, both will

be defined in section 4.2.

4.1 Data Collection and Limitations

We have two separate datasets. One set consists of 409 sealed bid auctions in pub-

lic procurement from 2016-2019 in Stockholm County, Ostergotland County, Skane

County and Vastra Gotaland County. However, there are several rows with missing

data and after removing them we are left with 278 observations. Each contract

contains information regarding, project duration, project location, procurement en-

tity, tender preparation time, tendering method (EMAT or lowest price), number

of participants and their submitted bids. For the individual company bid/no bid

decision we use the same data as for the collective study.

For the spread variable, not all contracts contained detailed information on bids for

all participants and since this is required to construct the spread dependent variable

we are left with in total 285 points for all 15 most frequent bidders.

31

The other dataset contains unit prices submitted by the 15 most frequent bidders

on floor care. This set amount to 123 points from 2016-2018. These where chosen

so that we could compare large to small companies and frequent to less frequent

bidders with enough data points. An advantage with our data is that it does not

only contain successful bids; i.e., the data base contains offers by tender participants

who were outbid. In sealed-bid auctions the opposite is normally the case and a

typical limitation of the data (Kleijnen and Schaik 2011)

4.2 Variable Description

• Bid Spread: The difference between the lowest and second lowest bid. Used

as a response variable. Will be used as a response variable in section 5.1.1

BS = B1−B2

B1

• Competitiveness: Measuring the spread between a company is bid and the

winning bid. Will be used as a response variable in section 5.5

Ci = B1−BiB1

• Current Workload: We measure workload by the number of simultaneous ten-

ders that are offered on the market when a company makes the decision to

bid within a given time interval. Either five or ten days. We define two con-

tracts to be simultaneous if they have at least ten overlapping days for tender

preparation.

• EMAT: In economically most advantageous tenders there are selective criteria

other than price taken into consideration when ranking bidding proposals.

Subsequently, bidding the lowest price is not a guaranteed winning strategy

(Pla et al. 2014).

• Experience: It is defined as the number of similar tenders a company has par-

ticipated in during the last three years. It will be used to categorize companies

32

in to experienced and inexperienced in the stochastic dominance test. We refer

to experienced bidders as the most frequent bidders (having participated in at

least 30 % of the contracts) and less frequent bidders as inexperienced.

• Floor Care: We demonstrate our unit price analysis with floor care bids in

kr/m2 submitted by the 15 most frequent bidders. Floor care bids are sub-

mitted for different types of floor types ranging from linoleum to solid wood.

• Location North: Used as a dummy variable for unit price regression. It indi-

cates whether a contract was stipulated for a project in the inner city or north

of the inner city in Stockholm County.

• Location Stockholm: A dummy variable that describes whether a contract is

for a project in the inner city of Stockholm County.

• Position Performance Coefficient: Used to model performance in the sealed

bid auctions. Used a response variable in section 5.2.1

Pik = Nk−jik+0.5Nk

• Sealed Bid Auction: In a first-price sealed bid auction all bidders submit one

bid simultaneously unaware of what the others have bid (Vijay 2002).

• Tendering Method: This explanatory variable will be used to indicate whether

a contract is lowest price or EMAT tender.

• Tendering Preparation: This variable is measured in days and indicates how

long time a tenderer has to bid in a certain contract.

• Type of Project: Type of project is dummy variable that details whether a

contract is a sanitation or cleaning services contract.

33

4.3 Competitors Affect On Price

To justify why the number of competitors is paramount in a bidding forecast model

we look at how the number of competitors impact the competitiveness variable

described in section 4.2. We do this test by a simple linear regression with com-

petitiveness as the dependent variable and tender participants as the independent

variable.

4.4 Assessing Tender Participants

This section describes how we will approach RQ 1 and RQ 5. Firstly we detail how

we study competitors aggregated and then individually. In the literature review

some studies in the construction sector found that number of bidder participants

impact the tender price outcome. This mean that a bid forecasting model should

account for the number of participants. Therefore we will study what factors affect

the bidding decision. We will attempt to fit a distribution to the participation data

as well as perform several different regression analysis. Our regression models will

look at all our data combined as well as only the data for Stockholm County to find

deviations. We fit three types of regression models. Firstly the standard ordinary

least-square (OLS) found in section 3.1. Then count data regression models found

in section 3.4. We separate our data into one regression for all counties and one for

Stockholm County only. We compare with the negative binomial regression used

in previous research for the entire dataset. Lastly probit regression explained in

section 3.6. To assess the model performance and show that the OLS is inadequate

we perform a residual, goodness of fit and multicollinearity and all possible regression

analysis. The regression models and explanatory variables employed are

34

Tender Participation Model:

(36)ln(yi) = β0 + βworkxwork,i + βprepxprep,i + βdurxdur,i + βlpxlp,i+ βprojxproj,i + βcountyxcounty,i + βstoxsto,i + εi

where yi is number of participants in contract i. xwork,i is current workload, mea-

sured by the number of contracts that are simultaneously available to bid. xprep,i

measure tender preparation in days. That is the number of days for a bidder to

prepare documents to be submitted before the auction deadline. xdur,i is the project

duration measured in days. xproj,i, xcounty,i, xsto,i and xlp,i are dummy variables.

xproj,i measures the type of project, which is either classical cleaning services or

sensitization. xlp,i is a dummy variable that measure the type of tendering method.

It gives 1 if its a lowest price auction and 0 if EMAT auction. xcounty,i is a location

variable that gives 1 if the contract is in Stockholm county and 0 if the tender is

outside the Stockholm county. Lastly, xsto,i is used to indicate Stockholm city when

only looking at Stockholm County contracts in the second model. Value of 1 in xsto,i

implies that the contract is located in Stockholm.

Environmental Quality Criteria Model:

To specifically check the impact the environmental criteria has on tender participa-

tion we subset all contracts in our dataset that are EMAT. Then we use environ-

mental as a dummy variable on those contracts where some type of environmental

criteria awards points or price discounts. This criteria could be anything regarding

chemicals or certificates. We use the same OLS and count data regressions models

as above. Thereby we hope to understand whether these criteria specifically impact

the decision more than other quality criteria.

(37)ln(yi) = β0 + βenvxenv,i + εi

where yi is as above and xenv is a dummy variable giving 1 for contracts where

environmental criteria are defined in the EMAT tenders.

35

Probit Model:

To compare the aggregated results above to individual bid/no bid decision we look

at the probit regression model. The probit model is implemented to scrutinize

individual bid/no bid decisions and is similar to count data regression models such

as the Poisson regression model. The difference when computing these models are

that we are focusing on the Stockholm county only.

ln(yi) = β0+βworkxwork,i+βprepxprep,i+βdurxdur,i+βlpxlp,i+βprojxproj,i+βlocxloc,i+εi(38)

Where xloc,i is a dummy variable indicating whether a contract is in the inner city of

Stockholm or north of the inner city. The other variables are defined as above. Due

to company data being confidential we label the companies A-E where company A,

B and D are large and C and E small.

4.5 Predict Price Performance

In this section the methods used to answer RQ 2, RQ 3 are presented. We begin

by describing how we will determine what factors affect the pricing decision and

how we will compare prediction models. Then we elaborate on how we will use

non-parametric bootstrap and stochastic dominance to compare price performance

among companies.

4.5.1 Comparing Multiple Linear Regression and Support Vector Re-

gression

To measure price performance we look at floor care unit prices and the spread. Both

measures are defined in section 4.2. For each bidder, the submitted bid sheets consist

of several other unit prices in addition to floor care such as window cleaning. We

choose to only analyze floor care prices because it is sufficient to demonstrate our

models and because of data availability considerations. Furthermore, we look at unit

36

prices and not overall bids because the latter cannot be normalized in absence of

cost data. Since cost data is confidential to each company that they are not willing

to distribute, it is not realistic to create a bidding model containing costs.

The first step is to assess what factors affect the pricing decision with multiple

linear regression to answer RQ 2. Then we compare the predictive performance

of multiple linear regression with support vector regression in accordance with the

previous studies described in section 2.2. The regression model is the following

Unit Price Model:

yi = β0 + βworkxwork,i + βprepxprep,i + βdurxdur,i + βlpxlp,i + βlocxloc,i + βsizexsize,i + εi(39)

Where yi is the unit prices for floor care in kr/m2. xsize,i is the size of the contracts

where the winning bid is used as proxy. To balance our results we take the natural

logarithm of the contract values. The rest of the explanatory variables are as in

previous models. To improve this model we employ the method in section 3.2.3 to

log-transform the dependent variable.

The next step is to use k-fold cross-validation as described in Section 3.3.2 to com-

pare the multiple linear regression model in equation 39 to the performance of sup-

port vector regression model described in section 3.5 with the RSME in equation

19. Since there is limited data on unit prices which we believe will be the case for

the foreseeable future we consolidate the data for large and small companies. Large

companies were selected as those with a revenue of more than 125 million Swedish

crowns (SEK), whereas small companies had under 125 million SEK. We then com-

pare these models to the separated data. However, the Final RSME score will vary

significantly when the data is reduced to less than 70 points so we focus on the mean

RSME score in this case. Summary statistics for company revenue is presented in

Table 1. The numbers are expressed in thousands of SEK (tSEK).

Table 1: Revenue statistics of the 15 companies selected for analysis

Mean SD Min Max Median

Revenue (tSEK) 790395 2256892 14644 8280000 91217

37

The model comparison detailed in the previous paragraph will be repeated for the

spread variable as we have access to more data here. However, for spread we have

enough data points and thus do not have to worry about introducing noise to the

model by consolidating data for large and small companies.

4.5.2 Inference for experience, size and tendering method with Stochas-

tic Dominance

As described by previous authors in section 2.2 itemized bids can suffer from high di-

mensionality, meaning that there are likely many factors that impact price that can-

not be accounted for. We address this concern by also incorporating non-parametric

bootstrap and stochastic dominance in addition to the regression analysis. The first

method is used on the unit prices while the second on the spread because stochastic

dominance normally requires a bigger dataset than we have available for unit prices

while bootstrapping is suitable for smaller samples. These method will be employed

to see if we find significant difference between three key aspects: experience in sim-

ilar tenders, size of the bidder and comparing lowest price to EMAT tenders. With

the experience aspect we are able to explore whether the prices submitted and bid-

ding performance depend on the experience of the bidders as measured by their

frequency of participation in the previous three years. The size of the company is

measured by their revenue during 2018 as above. Analyzing the stochastic domi-

nance of lowest price against EMAT contracts allows us to strengthen our results

from the regression analysis.

4.6 Assessing Quality Performance

RQ 4 will in part be answered by the methods in the previous section. But to

properly evaluate performance we include another method described here. Table

1 illustrates that the number of each type of tendering method is fairly evenly

matched in our dataset. Therefore, a bidding model should take into account how

quality impacts price. Our hypothesis is that companies with good quality scores

are awarded a pricing discount, which would allow them to submit higher prices

38

while remaining an overall competitive bid. It is important to note however that

from our dataset the majority of the EMAT tenders are located in Stockholm, which

implies that they seem to be used to a lesser extent outside the capital.

Table 2: Share of EMAT and lowest price tenders in the dataset

Lowest Price 54 %

EMAT 46 %

We have seen that EMAT tenders are frequently used, what remains to be answered

is how efficient they are. From our dataset we found that in over 70 % of the time,

the lowest price submitted still wins. This indicates that EMAT tenders have not

been used effectively. Upphandlingsmyndigheten acknowledges that there have been

difficulties in constructing effective criteria that are both suitable for the tenders

and can then also be monitored after the contract has been awarded to ensure the

promised quality is actually delivered (Upphandlingsmyndigheten 2019).

Because of the limited data on quality and the low frequency of EMAT tenders

where the quality score has an impact on the bidding we will approach the EMAT

tender trough the use the position performance coefficient described in section 2.3.

We change this model by using the distribution we find in the results section 5.1.2

instead of the laplace distribution. In addition, instead of only looking at EMAT

auctions we will compare EMAT auction to lowest price auction and see whether the

competitiveness of individual bidders differ. To demonstrate the model we compare

the four most frequent bidders from our data. Due to some of the data containing

proprietary information we call label the companies 1-4 as seen in Table 3.

Table 3: Bidding frequency of the four most frequent companies

Company 1 32.5 %

Company 2 32.1 %

Company 3 31.1 %

Company 4 28.2 %

39

5 Results

We present our results in the order they were presented in the Methodology section,

beginning with analyzing tender participation collectively and individually. Then

we look at price prediction for our two dependent variables and finally show how

one can assess performance in EMAT tenders.

5.1 Bid/no Bid Decision

Firstly we present the results for the bid/no bid Decision. We begin the section by

looking at what impact the number of bidders have on tender price. Thereafter we

fit a distribution function to the tender participation data. We finish the section

by studying what factors affect the decision to bid and we compare these to probit

regression looking at individual companies at a micro level.

5.1.1 Number of Bidder’s Effect on Price

Using the bid spread as a proxy for competitiveness as described in the methodol-

ogy section 4.3 our results are presented below. We find that the with a significant

and negative coefficient of -0.0110 that increasing the number of competitors de-

creases the competitiveness spread implying that more participants result in a more

competitive bidding process.

40

Table 4: Multiple Linear Regression for Competitiveness

Estimate

Intercept 0.2341

(0.0425)

Tender Participants -0.0110**

(0.0038)

R2 0.09

Observations 69

Competitiveness as the dependent variable.

*p < 0.05 **p < 0.01

With the knowledge that the number of competitors has an impact on tender price,

we move forward to assess the number of competitors.

5.1.2 Fitting a Distribution the Participation Data

Descriptive Statistics

This section is initiated by displaying descriptive statistics in Table 5. Note that the

mean number of participants is 5.36 with a max of 20. The average number of par-

ticipants for EMAT is only slightly lower than for lowest price auctions. Therefore,

a plausible procedure would be to aggregate them for the first analysis.

Table 5: Descriptive statistics for tender participants contracts

Mean SD Min Max Median

EMAT 5.08 3.89 1 18 4

Lowest Price 5.65 4.23 1 20 5

Participants Total 5.36 4.06 1 20 4

Project Duration 765 339.66 365 4380 730

Tender Preparation 32.61 13.41 7 101 35

N = 409. Project duration and tender preparation are in days.

41

Figure 1: Number of Bidders Per Month

Figure 1 illustrates that for the three year dataset there seems to be a low point

in August for both 2016 and 2017 as well as a downward trend in 2018. This

indicates that during summer (June-August) there is the least participants in public

procurement tenders. However, in May there seems to be the most participants,

suggesting a seasonal effect.

The histogram of the number of participants in Figure 2 seems to suggest a heavy

left tail. This is confirmed by looking at the QQ-plot indicating a right tail that

conforms to a normal distribution but a significantly heavier left tail. Looking at the

histogram, it could indicate for example a weibull distribution or negative binomial

distribution. A Chi-square test demonstrated in Table 6 on these two distribution

does not reject the null-hypothesis that data comes from them. The chi-square

test does reject a Poisson distribution and a Normal distribution. The Log-normal

distribution is rejected at p < 0.001 but not at p < 0.01.

42

Table 6: Chi-Square test for Distribution of Number of Participants

df = 5 Chi-Square P-Value

Weibull Distribution 0.4596

Normal Distribution 0.0003

Poisson Distribution < 0.0001

Log-normal Distribution 0.0055

Negative Binomial Distribution 0.4840

Observations 409

Figure 2: Histogram for number of Ten-der Participants

Figure 3: QQ-Plot of Participation dataversus standard Normal

43

We proceed with testing fit by log-likelihood and AIC, the model that shows the best

fit is the Negative Binomial because it has the highest log-likelihood and smallest

AIC. The parameters we obtain with maximum likelihood estimation is: r = 3.390

and p = 0.3480 as illustrated in Figure 4. If we call number of participants a

stochastic variable P then P ∼ NB(3.390, 0.3480). This result will be used in

section 5.3.

Figure 4: Negative Binomial fitted to participation data

Table 7: Distribution model test for participants

Log-likelihood AIC

Negative Binomial -768.9 1542

Geometric -818.5 1639

Uniform -877.9 1758

Poisson -878.7 1759

Observations 409

44

5.1.3 Factors Contributing to Participants With Regression

As explained in the Methodology section 3.4, count data regression is normally better

fitted to a generalized linear regression model belonging to the Poisson regression

family. Below an OLS regression is compared to Poisson Regression, Truncated

Poisson Regression, Generalized Poisson and a Negative Binomial Regression Model.

Table 8 contains the results for the entire dataset for Stockholm County, Vastra

Gotaland County, Skane County and Ostergotland County. A notable results is

that the standard error for most of the coefficients in the OLS model seem to be

high in comparison to the estimate, thus indicating that the model has a bad fit

which further justifies our count data regression. At the end of this section we

demonstrate a diagnostic analysis of the standard OLS model to investigate among

other things the dependence between the explanatory variables.

45

Table 8: Results for least-square and count data regression models

OLS Poisson T.Poisson G.Poisson N.Bin

Intercept -3.4514 0.1945 0.1609 0.8467 0.1078

(4.5583) (0.4590) (0.4645) (0.0708) (0.7408)

Stockholm County 1.8805** 0.3277*** 0.3350*** 0.2308* 0.3152**

(0.7072) (0.0739) (0.0750) (0.1333) 0.0995

Tender Preparation 0.0411 0.0066** 0.0065** 0.0077* 0.0060*

(0.0219) (0.0020) (0.0020) (0.0031) (0.0033)

Type Of Project 1.3306 0.2325** 0.2378** 0.1386 0.2145*

(0.6849) (0.0715) (0.0726) (0.1093) (0.1126)

Project Duration 0.8799 0.1474** 0.1501** 0.1380 0.1615

(0.6643) (0.0663) (0.0670) (0.1043) (0.1077)

Tendering Method 0.4555 0.0734 0.0746 0.0462 0.0522

(0.5184) (0.0503) (0.0507) (0.0795) (0.0824)

Current Workload (10D) -0.0096 -0.0160 -0.0017 -0.0020 -0.0015

(0.0102) (0.0010) (0.0010) (0.0016) (0.0016)

AIC 1586 1694 1692 1515 1512

Observations 278 278 278 278 278

*** p < 0.001, ** p < 0.01, * p < 0.05

Tender participants as dependent variable.

As seen in Table 8 the best model in terms of significance corresponds to the Trun-

cated Poisson model. Stockholm County is significant meaning that companies bid

more in Stockholm County compared to other parts of Sweden. Companies bid

more when they have longer to prepare documentation indicated by significance for

Tender Preparation. Moreover, Type of Project demonstrates that there are more

bidders on cleaning service contracts compared to sanitation contracts as it has a

positive coefficient of 0.2378 and the dummy gives 1 for cleaning services. Project

46

Duration is significant for the Poisson regression model and the Truncated Poisson

regression model. The significance for these variables are in line with the previous

research we demonstrated in the literature review. However, Tendering Method is

not significant for the total dataset nor is Current Workload. The lack of signifi-

cance for the last two explanatory variables justifies further inquiry. Therefore we

now turn to looking at only Stockholm to see what differences exist within a county.

As explained in the Framework section, an assumption for the Poisson regression

is that the mean and variance should be nearly equal. Lastly, it can be seen that

the AIC is lowest for the Negative Binomial model even though the independent

variables selected had lower significance for that model.

Since the dependent variable Number of Participants has been log-transformed for

the regression as shown in (37) we take the exponential of the attributes to obtain

values in Table 9. Notably, contracts in Stockholm County compared to south of

Sweden and cleaning contracts compared to sanitation in Type of Project have the

biggest net effect on participation.

Table 9: Transformed Independent Variables

T.Possion

Stockholm County 1.3979***

Tender Preparation 1.0006**

Type Of Project 1.2685**

Project Duration 1.1619**

Tendering Method 1.0774

Current Workload (10D) 0.9983

*** p < 0.001, ** p < 0.01, * p < 0.05

47

Table 10: Results for tender participation count data regression in StockholmCounty

OLS Poisson T.Poisson G.Poisson

Intercept -1.9560 0.4294 0.3958 0.92488

(5.4269) (0.5163) (0.5227) (0.0814)

Location Stockholm 1.1168 0.1781** 0.1810** 0.1715*

(0.6048) (0.0550) (0.0554) (0.0904)

Tender Preparation 0.0394 0.0064** 0.0065** 0.0069*

(0.0254) (0.0022) (0.0022) (0.0036)

Type Of Project 1.8102* 0.3152*** 0.3228*** 0.1876

(0.8412) (0.0840) (0.0855) (0.1319)

Project Duration 0.9161 0.1544* 0.1577* 0.1563

(0.7853) (0.0736) (0.0743) (0.1206)

Tendering Method 0.9676 0.1484** 0.1502** 0.1199

(0.6179) (0.0558) (0.0561) (0.0919)

Current Workload (5D) -0.0537 -0.0087** -0.0088** -0.0071

(0.0325) (0.0035) (0.0032) (0.0061)

AIC 1223 1336 1334 1173

Observations 210 210 210 210

*** p < 0.001, ** p < 0.01, * p < 0.05

Tender participants as dependent variable.

Looking at the results for Stockholm County in Table 10 we observe that the signif-

icance holds for the independent variables from Table 8 but that in addition Cur-

rent Workload and Tendering Method have become significant. The transformed

explanatory variables in Table 11. indicate that the Current Workload variable

decreases participation while Type of Project has the largest net effect. However,

the effect is even larger for Stockholm County compared to the previous model, in-

48

creasing participation by 38%. Type of project has nearly the same net effect while

Tender Preparation is significantly smaller compared to the entire dataset.

Table 11: Transformed Independent Variables in Stockholm County

T.Possion

Location Stockholm 1.1984**

Tender Preparation 1.0065**

Type Of Project 1.3809***

Project Duration 1.1708*

Tendering Method 1.1620**

Current Workload (10D) 0.9912**

*** p < 0.001, ** p < 0.01, * p < 0.05

For diagnostics, it is imperative to check for dependence between independent vari-

ables. Beginning with pair-wise correlations, Table 23 in Appendix C. indicates that

the variable Stockholm County (Project Location) is highly correlated with a firm’s

Current Workload. This is anticipated since projects in near Stockholm City usu-

ally have higher workload. As with other sectors, larger firms with higher workload

capacity are oftentimes located in urbanized areas.

Variance Inflation Factors (VIF) are illustrated in Figure 10 in Appendix C. to

further assess multicollinearity between the chosen full model variables. Project

Location variable and Current Workload (10D) seem to have higher VIF than the

other variables, which is well aligned with the results in Table 23 and the implication

of multicollinearity between them. If assessed by the general rule of thumb presented

in section 3.2.1 all variables are considered to have low or no collinearity between

each other.

To show that the OLS regression was inadequate, further analysis was made by

looking at the residuals and QQ-plot. The normal QQ-plot bottom-left of Figure 11

in Appendix C. illustrates that the empirical distribution has heavy left tails but is

close to resembling a normal distribution. One outlier in the right tail of the normal

49

QQ-plot was removed after testing multicollinearity. Furthermore, the residual plot

on top-left is well aligned with the results seen in the normal QQ-plot since the

left tail residuals deviates more than the remaining data from the majority of the

other fitted residuals. Inspecting the residual plot it is seen that the residuals does

indeed not follow a N(0, 1)-distribution, implying that they in fact are under impact

by heteroskedacity. The residuals coinciding with a normal distribution contradicts

the normality assumption for error terms in an OLS model. Variables was not

transformed to achieve homoskedacity since it could discard important information

in models that describes number of participants.

Finally, an all possible regression was performed based on the selection criteria

BIC. The model that described the number of competitors in a contract best was

whether it was located in Stockholm or another County in Sweden. Figure 12 in

Appendix C. illustrates that the multiple regression analysis might not be the best

suitable approach to model the number of participants in a contract. Omitted

variable bias has to be considered when computing a model that describe number

of participants in a contract. Surprisingly, the numbers of competitors increases the

closer Stockholm City the contract is based. The diagnostic results presented in this

section is well aligned with the results displayed in Table 10, which displays that

OLS does not perform well.

50

5.1.4 Impact of Environmental Criteria on Participation

As described in the methodology section 4.4 assessments were made whether the

environmental criteria in EMAT tenders had impact on participation. Table 12

indicates that the dummy variable has no significance for either of the presented

models.

Table 12: Count data regression for environmental criteria as independent variable

OLS Poisson T.Poisson G.Poisson

Intercept 5.3333 1.6907 1.6863 0.9478

(0.3795) (0.0395) (0.0400) (0.1030)

Environmental 1.0877 0.0168 0.1717 0.1928

(1.0264) (0.1039) (0.0994) (0.1630)


*** p < 0.001, ** p < 0.01, * p < 0.05

5.1.5 Bid/No Bid Decision for Individual Participants

We proceed by focusing on each company individually. In the interest of space we

only demonstrate Company A and Company B here since they had the most inter-

pretive results. Company C,D and E can be seen in Appendix B. Probit regression

was performed to assess factors contributing to single large firms’ participation in

auctions. Results in Table 13 show workload being significant for both Company A

and Company B indicating that the workload is important for whether a firm bid in

contract or not. However as seen in appendix Table 22, this result did not hold for

all companies. Furthermore, Type of Project indicates significant contribution for

one of the companies but not the remaining. Overall, it seems that most factors, as

chosen with inspiration from Cheng et al. (2010), were insignificant for our probit

regression model. Majority of the variables had little or no explanatory power.

51

Table 13: Bid/No Bid probit regression for two firms

Company A Company B

Intercept -0.3633 -0.0283

(1.9594) (1.6231)

Stockholm 0.2326 0.0527

(0.2102) (0.1863)

Type of Project 0.2797 0.4585*

(0.2611) (0.2307)

Tender Preparation 0.0111 -0.0104

(0.0085) (0.0080)

Project Duration 0.1526 -0.1631

(0.2779) (0.2360)

Tendering Method -0.1251 -0.0834

(0.2100) (0.1862)

Current Workload (5D) 0.0313* 0.0213*

(0.0124) (0.0106)

’***’ p < 0.001, ’**’ p < 0.01, ’*’ p < 0.05, ’.’ p < 0.1

52

5.2 Predicting Price

This section intend to present the results on price performance. As explained in the

Methodology section we first look at floor care unit prices as the dependent variable

and then compare with the spread dependent variable.

5.2.1 Factors Affecting Pricing

We begin by testing a linear relationship between the unit prices and independent

variables with a multiple linear regression to examine what factors affect the pricing

decision. The independent variables included are demonstrated in section 4.5.1. The

results in Table 14 indicates that Tendering Method is significant with companies

placing lower bids for lowest price compared to EMAT tenders. The Location North

dummy is significant with a positive value of 0.4808 which implies that contracts

in the north receive higher bids. The dummy corresponding to small companies is

significant but at the lowest significance level, with a value of 0.2910 it could indicate

that smaller companies bid higher than larger ones. Because the significance is so

low we further explore the dynamic between large and small companies in sections

5.2.3 and 5.2.4.

53

Table 14: Results for least-square regression with floor care prices as dependentvariable

OLS Model

Intercept 3.2849

(0.8759)

Location North 0.4808**

(0.1719)

Project Duration 0.0002

(0.0003)

Tender Preparation 0.0033

(0.005)

Tendering Method -0.3252**

(0.1524)

Contract Size -0.09105

(0.0708)

Small 0.2910.

(0.1702)

R2 0.22

Observations 123

’***’ p < 0.001, ’**’ p < 0.01, ’*’ p < 0.05, ’.’ p < 0.1

5.2.2 Comparing SVR And MLR Models for Price Prediction

The relative poor performance of the MLR model led us to test whether the problem

lied in the linearity assumption. We tested that hypothesis by comparing to Machine

Learning with Support Vector Regression using the Radial Basis Function as Kernel.

To compare the prediction models we used cross-validation and RSME defined the

Framework. The results are displayed in Table 26. Surprisingly the SVR with RBF

54

kernel did not perform significantly better than the MLR linear model. This can be

seen for both the Final RSME for the validation data and the Mean RSME score

on the training data. The RSME scores obtained for the training data for all our

SVR models can be seen in Appendix D.

Table 15: Cross-Validation comparison of SVR and multiple linear regression models

SVR Model MLR Model

K-Folds 5 5

Grid Search Optimal Parameters

Gamma 0.3 -

C 3 -

Kernel RBF -

Training Data

Mean RSME Score 6.6543 6.7157

Standard deviation 0.3521 0.6824

Validation data

Final RSME 4.3292 4.9314

Observations 123 123

To obtain enough data points the 15 largest companies is aggregated as described in

the methodology section. Therefore we compare these results when separating into

small and large companies to see if this consolidation of data is a viable option. The

results in Table 16 indicate that the mean RSME scores are similar which would

imply that the data can be consolidated. The Final RSME scores differs significantly,

but as explained in section 4.5.1 because of the small dataset the Final RSME scores

will have significant variation when running numerous iterations. Therefore the

mean RSME score is a better comparison.

55

Table 16: Cross-Validation comparison SVR and MLR models categorized by largeand small companies

SVR Large MLR Large SVR Small MLR Small

K-Folds 5 5 5 5


Gamma 5 - 1 -

C 3 - 1 -

Kernel RBF - RBF -

Training Data

Mean RSME Score 6.4281 6.7079 6.4539 6.9948

Standard deviation 1.8631 1.2481 1.9347 1.1287

Validation data

Final RSME 7.607 7.006 5.8849 6.7320


5.2.3 Price Range with Non-parametric Bootstrap

This section intends to present the results for the estimation of confidence intervals

for the floor care unit prices in the data. Applying non-parametric bootstrap for

the most frequent bidder, the obtained results are displayed in Table 17. As the

histogram illustrates and QQ-plot confirms in Figure 5, the unit price data has

a heavier right tail and lighter left tail than a standard normal distribution. This

shape of the distribution was similar for the bid prices made by low frequent bidders,

large companies and small companies.

56

Figure 5: Non-parametric bootstrap for prices including only High Frequent bidder.Left figure illustrates the frequency of prices whereas the right figure is a QQ-plotindicating a heavier right tail and lighter left tail.

Table 17: Confidence interval for floor care unit prices using non-parametric boot-strap

CI 95 % Mean

High frequency bidders: 7.31 - 15.83 10.31

Low frequency bidders: 7.37 - 16.66 10.26

Large Companies 7.36 - 16.14 10.18

Small Companies 7.49 - 15.73 10.23

Table 17 indicates that the confidence interval for high frequency bidders is slightly

more narrow than low frequent bidders. For large players and small players however

the confidence is moderately narrower for smaller companies.

57

5.2.4 Stochastic Dominance for Spread

The confidence interval is not sufficient when determining whether a certain category

dominates another. With this in mind, we turn our interest to study stochastic dom-

inance for price spread. The first step to detect stochastic dominance is to study the

empirical cumulative distribution functions (ECDFs) illustrated in Figure 6, 7 and

8. To begin with, Figure 7 illustrates that high frequent bidders seem to dominate

low frequent bidders. Furthermore, Figure 6 also seems to suggest dominance even

though the curves get very close at the high end. However, for Figure 8 it is seen

that the lines cross several times. Thus, graphically possible to conclude stochastic

dominance. The results from the statistical test are displayed in Table 18. The left

column describes which variables are tested. For example, FSxlfreqoyhfreq looks for

dominance for low frequency bidders over high frequency bidders. Significance can

be identified that lowest price tenders dominates economically most advantageous

tenders. Similarly high frequency bidders, dominate low frequency bidders meaning

experienced bidders perform better measured by bidding accuracy in the spread.

No first order stochastic dominance could be found when comparing large and small

firms.

Table 18: First order stochastic dominance test

D p-value

FSxlpoyemat 0.15213 0.0300*

FSyematoxlp -0.013674 0.9700

FSxlfreqoyhfreq 5.8981e-17 1

FSyhfreqoxlfreq -0.1694 0.0135*

FSxbigoysmall 0.11304 0.1593

FSysmalloxbig -0.0086957 0.9892

’*’ p < 0.05,

58

Figure 6: ECDF for spread. EMAT vs. lowest price tenders.

Figure 7: ECDF for spread. high frequent vs. low frequent bidders.

Figure 8: ECDF for spread. Large vs. Small firms.

59

5.2.5 Price Performance with Spread Variable

The procedures in section 5.2.2 are repeated for the support vector regression and

multiple linear regression by changing the dependent variable from unit prices to

the spread defined in section 4.2. Same pattern can be read that the MLR performs

equally to the SVR. However, the spread variable performs worse in prediction

compared to the unit prices.

Table 19: Cross-Validation Comparison SVR and MLR Models with spread as priceperformance indicator

SVR Big MLR Big SVR Small MLR Small

K-Folds 5 5 5 5


Gamma 1 - 0.3 -

C 10 - 1 -

kernel RBF - RBF -

Training Data

Mean RSME Scores 0.3796 0.3797 0.4238 0.4483

Standard deviation 0.0898 0.1664 0.1663 0.1289

Validation data

Final RSME 0.3474 0.3716 0.2169 0.2632


60

5.3 Competitor Performance in EMAT tenders

To assess the performance of competitors in EMAT tenders we apply the position

performance coefficient as described in section 3.9. This method assesses quality,

circumventing the need to calculate scores. To demonstrate the model and ensure we

have enough data, the four most frequent bidding companies were compared. The

joint probability distributions for each company was separated by tendering method,

either lowest price or EMAT. Results are illustrated in Figure 9. As indicated in

the figure company 2,3 and 4 perform better in LP auctions while it is the opposite

for company 1. Interestingly, for the four demonstrated examples, if a company is

likely to perform better than a competitor in a lowest price tenders than it performs

better in EMAT as well.

Table 20: MLE estimated Kumaraswamy Parameters

(α, β) EMAT (α, β) LP

Company 1 (1.2012, 1.4510) (0.8025, 1.2207)

Company 2 (1.4394, 0.8268) (1.8635, 0.7417)

Company 3 (1.2808, 0.9714) (1.9193, 0.8406)

Company 4 (1.5478, 0.8683) (3.4560, 0.4918)

61

Figure 9: Position performance joint probability curves for company 1-4. The y-axisdescribes the likelihood of ending up in a given position and the x-axis describes thefinal position of the bidder.

62

6 Analysis of Results

Our results have given us some important takeaways. To begin with we found

that the number of bidders have an impact on tender price. This result led us

to conclude that number of competitors must be accounted for in a bidding model.

Our tender participation regression found several significant variables while for price

prediction we surprisingly found equal performance of multiple linear regression

models compared to more advanced support vector regression with RBF kernel.

We found that experienced bidders perform better in bidding, companies bid more

evenly in lowest price than in EMAT tenders but did not find enough evidence that

larger companies perform better than smaller ones. Below we go into more detail of

the results.

6.1 Number of Competitors

As expected the Poisson and Truncated Poisson model performed better than the

MLR model on the count data. The Generalized Poisson did not perform well when

describing the number of competitors in a tender in comparison to Poisson and

Truncated Poisson. Results from the Generalized Poisson model did not exhibit any

notable statistical significant variables except for the dummy Stockholm County and

Tender Preparation. The negative Binomial model gave the best model fit by the

selective criteria, which is in line with Lundberg et al. (2015) that used only this

regression for a study in the cleaning sector. The negative binomial model should

perform well when the equidispersion assumption does not hold, that is when the

mean value does not equal the variance. And as our descriptive statistics indicate,

the assumption does not hold for our data. Thus, the negative binomial ought to

perform better. Which it does measured by the AIC, but it has lower significance for

the explanatory variables. Thus, even though the truncated Poisson is chosen as our

model because it explained the independent variables the best, the negative binomial

had the lowest AIC which is a drawback for our model. The negative binomial still

shows significance for Stockholm County, Tender Preparation and Type of Project

albeit at a lower significance level so these are the significance results we can be

63

most confident in. The fact that project duration loses significance for the Negative

Binomial is not surprising. When looking through the raw data, the project duration

in the cleaning contracts is almost exclusively less than three years with the majority

of the project within two years. Compare this to the results obtained by (Vigren

2017) where project duration was significant for bus transport. Those contracts

ranged up to 7-8 years. So naturally in that instance the bidder is more likely to

take duration into account. The pattern was the same for the Stockholm County

data with the negative binomial lowering the significance for most variables. It is

not surprising that the Truncated Poisson model performed better than the regular

Poisson since the latter does not take zeros into account in the dependent variable.

Lastly, as our diagnostic for the MLR participant model indicated, the standard

assumptions of homoscedasticity in the OLS regression were not fulfilled, implying

that such as model could be discarded when assessing public procurement within

the cleaning sector.

Comparing our count data regression models for mainly Stockholm County with the

full dataset including all counties, Tendering Method and Current Workload became

significant. We expected Current Workload to become significant looking at a spe-

cific counties considering how firms make resource allocation decisions in cleaning.

The main asset of cleaning services companies is the labour force. Therefore, we do

not expect that a firm is willing to move their labour force far distances and thus

any choice regarding distribution of resources will not be on a cross-county line but

rather within each county. Moreover, many of the companies in our dataset act

locally. However, it it more surprising that Tendering Method became significant,

indicating that companies in Stockholm participate more on lowest price tenders

compared to EMAT tenders. This could be interpreted as larger participants in

Stockholm County deeming EMAT contracts to be unreliable. Our review of the

raw data for EMAT tenders indicated that larger participants in EMAT contracts

usually achieve the same scores thus making the EMAT grading system obsolete

since other companies can place sham offerings and still win the contracts. The

incentives for participants to commit to EMAT contracts are thus lower than for

lowest price contracts. Nevertheless, the difference in results for Tendering Method

between the two models may simply be due to the fact that so many of the con-

64

tracts outside Stockholm are just lowest price, contributing to uncertainty in the

significance results.

Tender Preparation was significant for both models and at equal value, implying that

the preparation of documents is an important aspect all around for several counties.

These results are in line with previous research where Tender Preparation has been

featured prominently, both with regard to the participation and pricing decision.

Another interesting, albeit expected, outcome for Stockholm County is that Location

Stockholm gives significance, showing that companies are more inclined to bid on

contracts that are in Stockholm city compared to the rest of the county. This is

not unanticipated since contracts in large cities tend to be more attractive from a

financial perspective. It is reasonable to assume that the bid prices in comparison

to firm costs are higher in large cities than for projects taken in the outskirts.

The Type of Project variable had a big net effect in both datasets, indicating that

regular cleaning contracts to a larger degree increase the number of bidding partic-

ipants compared to sanitation contracts.

Lastly, we used the count data regression models to see if the inclusion of environ-

mental criteria has any affect on participation. The results showed there was no

significance. This could either be because most companies fulfill the environmental

criteria which entail providing certificates. Equally, it could also mean that partici-

pants do not consider the criteria effective. Meaning that there is not enough gover-

nance regarding living up to what has been stipulated in the contract. So they can

commit to fulfilling demands, knowing it is difficult to control they are upholding the

promises. This would be in line with the concerns from Upphandlingsmyndigheten

that we mentioned in section 2.4, that there is an underlying problem in selecting

good criteria that can be efficiently measured and governed.

65

6.2 Bid/No Bid Logistic Regression

Many of the variables that were significant before, become less significant or fully

disappeared when assessing the bid/no bid decision model at an individual level. We

did find some noteworthy results. For example the likelihood of bidding depended

on project location in Stockholm for small companies we looked at but not for larger

ones. For some companies the time allocated to prepare documents had an impact

on the decision. Surprisingly the current workload had significance with a positive

coefficient for one of our companies. That runs contrary to the results from the

Poisson regression and it reveals that current workload is a bad variable to include

in the individual bid/no bid decision. Naturally when there are several contracts,

the odds are that the company in question participated in at least one of them, and

so therefore it shows up as 1 in our data. The current workload measures whether

the overall participation goes down while the probit does not measure volume. We

conclude that current workload should not be included as a variable in the individual

model.

Overall, we conclude that the poor performance of all probit models seem to sug-

gest that looking at individual companies with the probit model will not render

good results and so the bid/no bid decision should be analyzed through aggregated

measures as in the Poisson regression or through another mathematical model.

6.3 Predicting Price

The first analysis we performed was a multiple linear regression on floor care unit

prices. With our given attributes our model fit was poor. Our initial hypothesis was

that the problem lay in the linearity assumption. Therefore, comparing the MLR

model to SVR it was surprising that the former performed comparably to the more

advanced machine learning model. This means we can not reject the idea that the

relationship is linear. It is then plausible to believe that the reason the prediction

is so poor is due to omitted variable bias, as previous studies showed, the unit

prices can suffer from high dimensionality. Previous research in construction found

more than 50 variables that affect the pricing decision of a bidder. Meaning there

66

are numerous attributes that have an impact which is likely the case for cleaning

services as well. The poor model fit for the unit price regression could also be

because the bidding behavior is random. The latter was one of the questions we

sought to answer with this thesis.

What speaks against the randomness argument is that we found significance for

Tendering Method, Location North and Company Size in the regression analysis,

albeit the latter at the lowest significance level. The Tendering Method dummy

indicates that companies bid lower in lowest price tenders compared to EMAT. This

is likely due to the fact that they are hoping that the points awarded in EMAT

tenders will be able to compensate for a higher price. The location dummy being

significant shows that companies bid lower in the south of Stockholm. This might

be due to the behavior of certain companies that primarily act in that region. On

the other hand, it could imply that contracts in the north are more expensive. The

latter is more likely as most large players bid all over Stockholm. The dummy

variable ’Small’ implies that large companies bid lower than smaller ones. The

result that small companies frequently bid higher can be due to the fact that large

companies have economies of scale. However, considering how standardized the

cleaning industry is it is more likely due to the experience that some large companies

have in addition to them having a designated bid manager, which likely bring with

it structure. Because the significance for company size was so weak we wanted to

further explore it with stochastic dominance and non-parametric bootstrapping. We

analyze these results in the next section.

Surprisingly for the unit price regression, contract size had no significance on the

pricing decision. The Contract Size variable was prominent in previous literature for

the construction sector but did not translate to the cleaning services. This could be

explained by the fact that the the Swedish cleaning industry does not bind the same

amount of capital as say the construction industry. Firms tend to approach contracts

with more caution the higher capital they have to tie up to certain projects. In the

cleaning industry, a majority of the tied up capital is the labour force, whereas in

the construction industry there are machinery, subcontractors and labour force to

mention a few.

67

For unit price prediction, to gather enough data for our test we wanted to see if

we could consolidate the data for the 15 largest companies for prediction. This

is a relevant this as we foresee that larger datasets of unit prices will take years

to obtain. Therefore one must have an alternative during this time. So we did

an additional test separating the data noting that it implied two small sample sizes

which will affect the results. Nonetheless, the mean scores did not differ significantly

which could indicate that you can consolidate the data while the final RSME sore

did differ but as explained it is not a trustworthy measure with less than 70 points

because it shows high variation as the sub-sample that ends up as validation data in

the k-fold method will be very different for such a small sample size. As we will see

in the next section, the differences in pricing seem to stem from experience rather

than size which would lend credibility to the possibility of consolidating data for

large players with similar experience.

Regarding differences between the two dependent variables we chose, our results

indicate that spread is not suitable as a dependent variable because the prediction

performed much worse compared to unit prices for both of our models. So our

approach of using the spread to obtain more data points is not a good way to

predict price performance with regression.

6.3.1 Stochastic Dominance and Non-Parametric Bootstrap Analysis

As explained above we wanted to further test the price performance. In particular

to differences between large and small players. From the bootstrapping we obtained

unit price confidence intervals for the categories described in section 4.5.2. Here the

interval was equal for large and small players but was narrower for more frequent

bidders compared to less frequent bidders. Due to data constraints we only used the

spread variable for stochastic dominance test. The results showed that both lowest

price tenders dominated EMAT and high frequent bidders dominated lower frequent

bidders at first order, i.e experience has an effect on performance. These results were

expected. However, it was surprising that large companies did not dominate smaller

ones at first order. This lends credibility to the theory we proposed above that

experience is more important than size in how bidders perform. Because the costs

68

are standardized it is less likely that economies of scale drive differences in pricing

rather than experience. However, as explained in section 4.5.2, to ensure enough

data for comparison we defined smaller companies as those with less than 125 MSEK

in revenue. Thus there are by no means small companies and perhaps significance

would have been found if we could compare to truly small companies such as those

with less than 10 MSEK. However, these tend to bid more sporadically and as a

result it is difficult to obtain sufficient data.

6.4 Quantifying Quality

Our initial goal was to predict the quality scores of each company just as with price.

It quickly became evident that this was not possible due to two reasons: Firstly,

because using EMAT criteria is not mandatory as we described in section 2.4 there

does not seem to be any standardized models for EMAT tenders; the procurement

entities use widely different criteria. This means that for each company there are

only 2-3 observations for the score they received per criteria in our dataset. Secondly

the larger companies tend to be awarded max points anyway. This results in the

lowest price bidder winning oftentimes. In fact, as we demonstrated in section 4.6

this tends to happen in more than 70 % of the cases. We therefore do not believe it

is feasible to assess the quality criteria even if we had data for an additional three

years.

Naturally, there are some instances where the quality score did impact outcome. And

as we found in section 5.2.1 companies tend to bid lower in EMAT tenders, expecting

their quality score to still keep them competitive. So we should still take this into

account. Our solution was to compare LP and EMAT tenders to see differences in

performance and see which ones are likely to be most competitive going forward.

Out of our sample of the four most frequent bidders the performance was consistent.

Meaning the company that was likely to be placed highest in lowest price tenders

was also likely to perform well in EMAT tenders. Using these results one can know

how competitive different competitors will be in an upcoming EMAT tender. So

for example if a hypothetical company G knows of a competitor that is expected

to participate and the model shows they have a high likelihood of placing badly in

69

EMAT tenders and Company G has a high likelihood of placing well then Company

G can expect to place a slightly higher bid while still being competitive. Clearly,

for this model to work perfectly we require a better prediction of each company’s

participation, and as explained in section 6.2 the probit models were inadequate for

explaining each company’s participation.

70

7 Conclusions

In this thesis we sought to understand what factors affected the decision making

of bidders in public procurement auction in the cleaning sector. For the bid/no

bid we found several factors that had an impact which were aligned with previous

research from construction. However, one major departure in our results was the low

significance of contract size on the pricing decision. We also found that in order to

make a good bid forecasting model one must predict both the price and the number

of competitors. For the impact of quality criteria we found that some companies

perform better in EMAT tenders compared to lowest price auctions and applied a

model that can be used to assess the performance of companies in EMAT tenders.

For tender participation it is imperative to use count data regression since ordinary

least-square regression perform poorly.

With regard to price prediction, we started this thesis declaring that our goal was

to examine whether prediction models for this sector was possible and in that case

what models were most suitable, or if the bidding behaviour is random. For the

first question our results were inconclusive. We had a good number of attributes

and the fact that they had such a low predictive power for both of our models

could indicate that there is some randomness in the bidding process. However, our

stochastic dominance test found that more experienced bidders perform better and

our non-parametric bootstrap showed that experienced bidders had a narrow bid

range which could mean there is more consistency in their bidding. So perhaps

the prediction models would improve if the data only consisted of the most frequent

bidders. But for that to work one must have access to a much bigger dataset. So since

we did find some differences between bidders we believe that rather than randomness

explaining the poor predictive performance it is the need for more attributes. One

might conclude that the low predictive power came from the small dataset but

since SVR is normally well equipped for small datasets we argue it is more likely

that what is required is more attributes. The models would presumably improve

with more granular data including factors such as type of floor, number of rooms

and area. However, a problem with this is of course that data like this is not

widely available in a structured way. During the work with this thesis we had the

71

opportunity to study these contracts in detail and it is clear that the sector suffers

from a lack of digitization and standards for submitting bids that companies adhere

to. Subsequently, key information such as the total area of the object is not available

and in cases that it is reported one must collect it individually from each contract.

This is not realistic for thousands of contracts. An essential attribute is the type

of floor that is cleaned. That information is often not available and we found that

prices could be several hundred percent higher for wood floor compared to plastic

floors. So naturally that is an attribute that is of utmost importance for a regression

analysis. Therefore, the key aspect to determine whether prediction is possible in

future studies will be to have access to more granular data

Surprisingly, the support vector regression with RBF kernel did not perform better

than multiple linear regression for both dependent variables. So the more advanced

machine learning models are not always better. Finally, one can draw the conclusion

that unit prices are a better approach for price prediction than the spread.

72

8 Suggestions for Future Research

As we concluded with regards to the SVR model it is quite adequate even with

smaller datasets so the problem of poor model fit is more likely due to missing

attributes. Therefore, with time as more companies become better at reporting their

data, more structurally good prediction models can be made. To exhaustively test

whether artificial intelligence could perform better than multiple linear regression,

other methods such as artificial neural networks should be tested although these are

generally more suitable for larger datasets.

A complete bid forecasting model requires knowledge of participation of individual

bidders. Our results with probit regression gave insufficient results, therefore one

should explore whether the prediction of individual participants is possible with

other models.

A final suggestion is to proceed by comparing the results we have obtained in the

cleaning industry and previous results from the construction on other areas of public

procurement to see if there are signifiant differences between industries in public

procurement.

73

Bibliography

Agerberg, J. and Agren, J. 2012. “Risk Management in the Tendering Process: A

survey of risk management practices within infrastructural construction.” Chalmers

University of Technology. Department of Technology Management and Economics.

Aldenius, M. and Khan, J. 2017. “Strategic use of green public procurement in the

bus sector: Challenges and opportunities.” Journal of Cleaner Production 164:250–

257.

Al-Arjani, A.H. 2002. “Type and size of project influences on number of bidders

for maintenance and operation projects in Saudi Arabia.” International Journal of

Project Management, 20(4):279–287.

Athey, S., Levin, J., and Seira, E. 2011. “Comparing open and sealed bid auctions:

Evidence from timber auctions,” The Quarterly Journal of Economics 126(1):207–

257.

Augustin, K. and Walter, M. 2010. “Operator changes through competitive tender-

ing: Empirical evidence from German local bus transport.” Research in Transporta-

tion Economics, 29(1):36–44.

Aurelien, G. 2017. Hands-On Machine Learning with Scikit-Learn and Tensorflow:

Concepts, Tools, and Techniques to Build Intelligent Systems. O’reilly Media.

Azman, M.A. 2014. “Number of Bidders in Small and Medium Public Construction

Procurement in Malaysia.” Applied Mechanics and Materials 567:595–600.

Bageis, A.S. and Fortune, C. 2009. “Factors affecting the bid/no bid decision in

the Saudi Arabian construction contractors.” Journal of Financial Management of

Property and Construction, 15(2):118–142.

Ballesteros-Perez, P., Campo-Hitscfeld, M., Mora-Melia, and Domınguez, D. 2015.

“Modeling Bidding Competitiveness and Position Performance in Multi-attribute

Construction Auctions.” Operations Research Perspectives, 2:24–35.

74

Ballesteros-Perez, P. and Skitmore, M. 2017. “On the Distribution of bids for Con-

struction Contract Auctions.” Construction Management and Economics, 35(3):106–

121.

Ballesteros-Perez, P., Skitmore, M., Pellicer, E., and Gutierrez-Bahamondes, J. 2016.

“Improving the Estimation of Probability of Bidder Participation in Procurement

Auctions.” International Journal of Project Management, 34(2):158–172.

Beck, A. 2011. “Experiences with competitive tendering of bus services in Germany.”

Transport Reviews, 31(3):313–339.

Benjamin, N.B.H. 1969. “Competitive bidding for building construction contracts.”

Stanford University. PhD dissertation.

Brook, M. 2016. Estimating and Tendering for Construction Work. Routledge 5th

ed.

Cameron, A.C. and Trivedo, P.K. 2005. Microeconometrics: Methods and Applica-

tions. Cambridge University Press.

Cheng, M-Y, Hsiang, C-C, Tsai, H-C, and Do, H-L. 2010. “Bidding Decision Making

for Construction Company Using a Multi-criteria Prospect Model.” Journal of Civil

Engineering and Management, 17(3):424–436.

Dikmen, I., Birgonul, M., and Kemal Gur, A. 2007. “A case-based Decision Support

Tool for Bid Mark-up Estimation of International Construction Projects.” Automa-

tion in Construction, 17(1):30–44.

Drew, D.S., Lo, Hing Po, and Skitmore, R.M. 2001. “The effect of client and type

and size of construction work on a contractor’s bidding strategy.” Elsevier, Building

and Environment 36(3):393–406.

Drew, D.S. and Skitmore, R.M. 2006. “Competitiveness in bidding: a consultant’s

perspective.” Construction Management and Economics, 10(3):227–247.

European Commission. 2017. “Public Procurement.” http://ec.europa.eu/trad

e/policy/accessing-markets/public-procurement/.

75

http://ec.europa.eu/trade/policy/accessing-markets/public-procurement/

http://ec.europa.eu/trade/policy/accessing-markets/public-procurement/

Fastighetsfolket. 2017. “Fler seriosa foretag avstar offentlig upphandling.” https:

//fastighetsfolket.se/2017/11/27/fler-seriosa-foretag-avstar-offentl

ig-upphandling/.

Fayek, A. 1998. “Competitive Bidding Strategy Model and Software System for Bid

Preparation.” Journal of Construction Engineering and Management, 124(1).

Friedman, L. 1956. “A Competitive-Bidding Strategy.” Operations Research, 4(1):104–

112.

Fu, W.K. 2003. “The Effect of Experience in Construction Contract Bidding.” De-

partment of Building and Real Estate. The Hong Kong Polytechnic University. De-

gree of Doctor of Philosophy at The Hong Kong Polytechnic University.

Hegazy, T. and Moselhi, O. 1994. “Analogy-based Solution to Markup Estimation

Problem.” Journal of Computing in Civil Engineering, 8(1):72–87.

Hestmati, A. and Maasoumi, E. 2000. “Stochastic Dominance Amongst Swedish

Income Distributions.” Econometric Reviews 19(3):287–320.

Hilbe, J. 2014. Modelling Count Data. Cambridge Books.

Hilbe, J.M. 2011. Negative Binomial Regression. New York: Cambridge Press 2nd

ed.

Hult, H., Lindskog, F., Hammarlind, O., and Rehn, CJ. 2012. Risk and Portfolio

Analysis: Principles and Methods. Springer-Verlag New York Inc. Springer Series in

Operations Research / Financial Engineering.

Inkopsradet. 2016. “Problemen med viktade utvarderingar.” Inkoprsradet. https:

//inkopsradet.se/expert/problemen-med-viktade-utvarderingar/.

James, G., Witten, D., Hastie, T., and Tibshirani, R. 2013. An Introduction to

Statistical Learning. Springer-Verlag New York Inc 1st ed.

Jung, H. and Kim, J-M. 2019. “Predicting Bid Prices by Using Machine Learning

Methods.” Applied Economics, 51(19):2011–2018.

76

https://fastighetsfolket.se/2017/11/27/fler-seriosa-foretag-avstar-offentlig-upphandling/



https://inkopsradet.se/expert/problemen-med-viktade-utvarderingar/

https://inkopsradet.se/expert/problemen-med-viktade-utvarderingar/

Kleijnen, J. P.C and Schaik, F. D.J van. 2011. “Sealed-bid auction of Nether-

lands mussels: Statistical analysis.” International Journal of Production Economics,

132(1):154–161.

Li, H.. 1996. “Neural Network Models for Intelligent Support of Mark-up Estima-

tion.” Engineering Management, 131(4):391–399.

Lundberg, S., Marklund, P-O., and Brannlund, R. 2009. “Miljohansyn i offentlig

upphandling.” Konkurrensverket.

Lundberg, S., Marklund, P-O., and Stromback, E. 2016. “Is environmental policy

by public procurement effective?” Public Finance Review, 44(4):478–499.

Lundberg, S., Marklund, P-O, Stromback, E., and Sundstrom, D. 2015. “Using pub-

lic procurement to implement environment policy: an empirical analysis.” Umea

University Environmental Economics and Policy Studies, 17(4):487–520.

Malara, M. and Mazurkiewcz, M. 2012. “Modelling the Determinants of Winning

in Public Tendering Procedures Based on the Activity of a Selected Company.”

Operations and Research and Decisions, 22(1):51–62.

Montgomery, D. C., Peack, E. A., and Vining, G. G. 2012. Introduction to Linear

Regression Analysis. Wiley.

Petrovski, A., Petruseva, S., and Zileska, V. P. 2015. “Multiple Linear Regression

Model for Predicting Bidding Price.” Technics Technologies Education Management,

10(3):386–393.

Pla, A., Lopez, B., Murillo, J., and Maudet, N. 2014. “Multi-attribute Auctions With

Different Types of Attributes: Enacting Properties in Multi-attribute Auctions.”

Expert Systems with Applications, 41(10):4829–4843.

Polat, G., Bingol, B., Gurgun, A., and Yel, B. 2016. “Comparison of ANN and

MRA Approaches to Estimate Bid Mark-up Size in Public Construction Projects.”

Elsevier. Procedia Engineering, 164:331–338.

Regeringskansliet. 2019. “Regeringen kraftsamlar kring artificiell intelligens och oppna

data.” Regeringskansliet.

77

Renda, A., Pelkmans, J., Egenhofer, C., Schrefler, L., Luchetta, G., and Selcuki, C.

2012. “The Uptake of Green Public Procurement in the EU27.” European Comission,

DG Environment.

Schmid, F. and Trede, M. 1996. “Testing for First-Order Stochastic Dominance: A

New Distribution-Free Test. A New Distribution-Free Test.” Journal of the Royal

Statistical Society. Series D (The Statistician), 45(3):371–380.

Takano, Y., Ishii, N., Yu, W., and Muraki, M. 2018. “Determining Bid Markup and

Resources Allocated to Cost Estimation in Competitive Bidding.” Automation in

Construction, 85:358–368.

Testa, F., Annunziata, E., Iraldo, F., and Frey, M. 2013. “Drawbacks and oppor-

tunities of green public procurement: an effective tool for sustainable production.”

Journal of Cleaner Production, 112(3):1893–1900.

Thode, H. C. 2002. Testing for Normality. New York: Marcel Dekker 1st ed.

Upphandlingsmyndigheten. 2019. “Statistik om Offentlig Upphandling 2018.” Up-

phandlingsmyndighetens rapport 2018:2.

Vapnik, V. and Chapelle, O. 1999. “Model selection for support vector machines.”

MIT Press Cambridge 132:230–236.

Vigren, A. 2017. “How Many Want to Drive the Bus? Analyzing the Number of

Bids for Public Transport Bus Contracts.” Elsevier. Transport Policy, 72:138–147.

Vijay, K. 2002. Auction Theory. San Diego: Academic.

Von Oelreich, K. and Philip, M. 2013. “Green public Procurement - a Tool for

Achieving National Environmental Quality Objectives.” Swedish Environmental Pro-

tection Agency.

Wang, W.C., Dzeng, R-J., and Lu, Y-H. 2007. “Integration of Simulation-Based Cost

Model and Multi-Criteria Evaluation Model for Bid Price Decisions.” Department

of Civil Engineering, National Chiao Tung University, Taiwan, 22(3):223–235.

Wilcox, R. 2010. “Fundamentals of Modern Statistical Methods: Substantially Im-

proving Power and Accuracy.” Springer.

78

Yu, W, Wang, K-W, and Wand, M-T. 2013. “Pricing Strategy for Best Value Ten-

der.” Journal of Construction Engineering and Management, 139:675–684.

Zwilling, M. L. 2013. “Negative Binomial Regression.” The Mathematica Journal

15.

79

Appendix

A. Literature Review

Factors

1. Reputation of client

2. Relationship with client

3. Project size

4. Nature of project

5. Time available for tender preparation

6. Experience with similar projects

7. Current workload

8. Availability of other projects

9. Expected number of competitors

10. Expected profitability

11. Tendering Method

12. Track record on previous bids

13. Project Duration

14. Market Conditions

15. Confidence in cost structure

Table 21: Factors contributing to the bid/no bid decision by Cheng et al. (2010)

80

B. Bid/No-Bid Probit Regression

Table 22: Bid/No Bid probit regression for three firms

C D E

Intercept -5.0012 -1.9204 -2.3068

(2.2146) (1.6386) (1.8901)

Stockholm -0.2750 0.0173 0.0902

(0.2351) (0.1845) (0.2107)

Type of Project 0.2841 0.4277. 0.5022.

(0.2934) (0.2264) (0.2787)

Tender Preparation -0.0091 -0.0093 -0.0031

(0.0107) (0.0079) (0.0090)

Project Duration 0.5996. 0.1844 0.0803

(0.3200) (0.2373) (0.2722)

Tendering Method -0.4338. -0.0756 -0.1214

(0.2354) (0.1848) (0.2108)

Current Workload (5D) 0.0116 0.0104 0.0176

(0.0131) (0.0103) (0.0121)

’***’ p < 0.001, ’**’ p < 0.01, ’*’ p < 0.05, ’.’ p < 0.1

81

C. Diagnostics

1 2 3 4 5 6

1.Stockholm County 1.00

2.Tender Preparation 0.07 1.00

3.Type Of Project 0.01 0.06 1.00

4.Project Duration -0.17 0.05 -0.06 1.00

5.Tendering Method -0.08 0.03 -0.25 0.07 1.00

6.Current Workload (5D) 0.52 0.20 -0.01 -0.12 0.07 1.00

Table 23: Correlation Matrix For Stockholm and Other County data.

Figure 10: Variance Inflation Factors (VIF). High VIFs usually indicates multi-collinearity between explanatory variables.

82

Figure 11: Illustrates residual plots and QQ-plot with empirical distribution againsta standard normal distribution. This plots refers to the performed OLS on consoli-dated Stockholm and County data.

Figure 12: Illustrates all possible regression models with number of participants asresponse variable. From top to down, model 1 has the best fit, whereas model 6 hasthe worst fit.

83

D. Price Prediction

Big SVR Scores: (8.53562764 6.12892149 6.36627364 4.62509598 6.48479331)

Big MLR Scores: (9.15909356 5.73788296 5.63397006 4.36290633 8.64586943)

Small SVR Scores: (4.75971136 9.49534115 6.49531601 7.43772976 4.0818302)

Small MLR Scores: (6.28460425 8.86306535 7.23570753 7.11378941 5.47702397)

Table 24: Cross-Validation Scores For Big and Small Unit Prices SVR and MLRModels

SVR Scores: (7.26769843 6.43824234 6.77724886 6.24194079 6.54655757)

MLR Scores: (7.853427 6.03135318 6.51085139 6.09506751 7.08792713)

Table 25: Cross-Validation Scores For Full Unit Price SVR and MLR Models

Big SVR Scores: (0.46019892 0.3870526 0.20945988 0.39343138 0.44799723)

Big MLR Scores: (0.48889964 0.39996489 0.21547178 0.38490709 0.40966344)

Small SVR Scores: (0.42581727 0.22424561 0.51743096 0.27046925 0.68121542)

Small MLR Scores: (0.45160859 0.2943744 0.50855643 0.33264519 0.65411871)

Table 26: Cross-Validation Scores Spread For Big and Small SVR and MLR Models

84

bid forecasting in public procurementkth.diva-portal.org/smash/get/diva2:1354986/fulltext01.pdf ·...

Documents