estimation of oil thickness
TRANSCRIPT
-
7/26/2019 Estimation of Oil Thickness
1/41
AEAT-5279 Issue 1
Estimation of oil thickness
A report produced for the Maritime and Coastguard
Agency
Louise Davies
Jenny Corps
Tim LunelKaren Dooley
November 1999
-
7/26/2019 Estimation of Oil Thickness
2/41
AEAT-5279 Issue 1
Estimation of oil thickness
A report produced for Maritime and Coastguard
Agency
Louise Davies
Jenny Corps
Tim LunelKaren Dooley
November 1999
-
7/26/2019 Estimation of Oil Thickness
3/41
- ssue Estimation of Oil Thickness
AEA Technology ii
Title Estimation of oil thickness
Customer Maritime and Coastguard Agency
Customer reference MSA 10/9/111
Confidentiality,
copyright and
reproduction
Copyright AEA Technology plc 1999
All rights reserved.
Enquiries about copyright and reproduction should be addressed to the
Commercial Manager, AEA Technology plc.
File reference EERA 20738001
Report number AEAT-5279
Report status Issue 1
AEA Technology Environment
National Environmental Technology Centre
Culham
Abingdon
Oxfordshire
OX14 3ED
Telephone +44 (0)1235 463117
Facsimile +44 (0)1235 463030
AEA Technology is the trading name of AEA Technology plc
AEA Technology is certificated to BS EN ISO9001:(1994)
Name Signature Date
Author Louise Davies
Reviewed by Tim Lunel
Approved by Jenny Corps
-
7/26/2019 Estimation of Oil Thickness
4/41
- ssue Estimation of Oil Thickness
AEA Technology iii
Executive Summary
This project was undertaken on behalf of the Maritime and Coastguard Agency to devise a method
of measuring the total volume of oil on the sea surface using airborne remote sensing techniques.
This would provide considerable information for those planning a counter pollution response as itwould allow near real time estimates of the change in oil volume over time as a result of natural
processes and response actions. Such a response tool would revolutionise oil spill response by
providing a real time measure of the success of the response technique employed.
Thermal infra-red (IR) is able to detect a different thermal response from the heterogeneous (non-
uniform) thickness of an oil slick spreading on the sea surface. However, the relationship between oil
thickness and thermal response is complex. To overcome the complexities of the inter-relationships,
the National Environmental Technology Centre (NETCEN) developed a method of analysing IR
data from experimental oil slicks in the North Sea using neural network processing (Wood et al,
1997). This work demonstrated that it was possible to use a neural network approach to predict the
oil thickness of the test data set on the basis of the IR response. This initial development project was
followed with a validation exercise carried out by NETCEN (Davies et al, 1998). The 1998 project
concluded that the neural network was unable to classify oil thickness reliably for the validation data
set.
This report details the re-calibration of the neural network to establish whether it has the ability to
predict oil thickness from a wider range of input variables taking account of factors such as solar
heating of the surface oil slick. The neural network was retrained with two data sets: a primary data
set containing only field trial data from 1994, 1995 and 1997 with oil thickness measured by pad
sampling, and a secondary data set containing the primary data set and a Sea Empress data set with
estimated slick thickness. The secondary data set included thermal IR imagery and parameters from
a wider range of environmental conditions that represent both winter and summer.
The re-calibrated neural network was subjected to a verification process through testing with a
selection of data from each of the data sets that had not been used to re-train the network. This
verification process has established that the neural network is internally consistent and is able to
predict oil thickness representative of the data sets used to train the network in 50-75% of cases,
even under a wide range of environmental conditions.
A committee of networks approach was able to correctly assign the thickness of the slick into one of
four ranges of slick thickness in 76% of cases. In the majority of the misclassifications, the thicknesswas assigned to an adjacent thickness range. Whilst these results are promising, it must be noted that
the number of data available for training the network is relatively small compared to the number of
variables in the network and the network is over-fitted. Therefore the networks derived during this
project should not be considered as being sufficiently robust to be relied upon for operational use at
present. In addition it is important to emphasise that there is no guarantee that the data sets being
used to re-train the network have covered the complete range of the parameters (such as the
meteorological conditions and oil type) which will be encountered in the event of a spill. The
-
7/26/2019 Estimation of Oil Thickness
5/41
- ssue Estimation of Oil Thickness
AEA Technology iv
importance of using a training data set which includes the full range of parameter values was
emphasised by the fact that the network trained solely with the primary data set (summer field trial)
was not able to predict the thickness for the Sea Empress imagery taken during the winter.
However, the results are sufficiently encouraging, we believe, to develop a pilot system to deploy as
an experimental system at the next real incident or experimental field trial. This will allow the
technique to continue development at every available opportunity while providing an additional tool
to the MCA in the event of an oil spill.
To develop the current experimental system into a pilot operational system would require the
following relatively minor developments:
Collaboration with Air Atlantique to standardise on image analysis software (for example, using
ER Mapper) and to devise procedures for downloading the thermal IR imagery to a suitable
format for analysis.
Determine suitable techniques for collecting all the variables required by the neural network, i.e.
air and sea temperature, irradiance.
Develop a sorbent pad sampling technique that includes real-time analysis for calibration of the
neural network.
We believe that this pilot system could be developed within one month. If the pilot system is
developed and deployed (potentially using the FLIR system on the SAR helicopters as well as the
MCAs remote sensing planes) at future incidents then the system can be used to give estimates of
volumes, alongside the normal visual estimates, for operational use. The pilot system can also be
evaluated in terms of the practicalities of operating the system and the benefits it provides to MCA in
an oil spill incident.
In addition, the neural network could be continually re-trained with the new sets of field data taken indifferent environmental conditions to maximise the chances of the operational data falling in the range
of environmental parameters used in the training process. This will be particularly useful if pad
sampling is used to provide in-situ calibration of the slick thickness.
-
7/26/2019 Estimation of Oil Thickness
6/41
- ssue Estimation of Oil Thickness
AEA Technology v
Contents
1 Introduction 1
2 Thermal Infra-red 2
2.1 THERMAL IR CAMERA 2
2.2 THERMAL BEHAVIOUR OF SPILLED OIL 2
2.2.1Thin Oil Layers 3
2.2.2Thick oil layers 3
2.2.2Heat exchange and surface temperature effects 4
3 Previous Neural Network Research Programmes 5
3.1 INITIAL DEVELOPMENT OF NEURAL NETWORK SYSTEM 5
3.2 NEURAL NETWORK VALIDATION 8
4 Research Programme 9
5 Data Collation 10
5.1 DATA SELECTION 10
5.2 TRAINING PARAMETERS 11
5.3 PRIMARY DATA SET 135.4 SECONDARY DATA SET 15
6 Neural Network Re-Training 17
6.1 REGRESSION APPROACH 18
6.2 CLASSIFICATION APPROACH 19
6.3 SUMMARY 21
7 Neural Network Verification 22
7.1 PRIMARY DATA SET 227.2 SECONDARY DATA SET 23
7.3 COMMITTEE OF NETWORKS 25
7.4 SUMMARY 28
-
7/26/2019 Estimation of Oil Thickness
7/41
- ssue Estimation of Oil Thickness
AEA Technology vi
8 Conclusions 29
9 Recommendations 30
9 References 33
Appendices
APPENDIX 1 PRIMARY AND SECONDARY DATA SETS
VERIFICATION RESULTS FOR SEA EMPRESS DATA USING
NEURAL NETWORK TRAINED ON PRIMARY DATA
THERMAL IR IMAGE OF 1997 FORTIES BLEND EXPERIMENT
-
7/26/2019 Estimation of Oil Thickness
8/41
- ssue Estimation of Oil Thickness
AEA Technology 1
1 Introduction
The ability to measure the total volume of oil on the sea surface using airborne remote sensing
techniques would provide considerable information for those planning a counter pollution response.This would allow near real time estimates of the change in oil volume over time as a result of natural
processes and response actions. Such figures would also be invaluable in quantifying the success of
the response technique employed. For example, a quantification of the volume of oil remaining on
the sea surface before and after a test run of dispersant spraying would enable the responder to
rapidly assess whether the spilt oil is responding to the dispersant under the given environmental
conditions.
Thermal infra-red (IR) is able to detect a different thermal response from the heterogeneous non-
uniform thickness of an oil slick spreading on the sea surface. However, the relationship between oil
thickness and thermal response is complex. To overcome the complexities of the inter-relationships,
the National Environmental Technology Centre (NETCEN) developed a method of analysing IR
data from experimental oil slicks in the North Sea using neural network processing on behalf of the
Maritime and Coastguard Agency (Wood et al, 1997). The 1997 work demonstrated that it was
possible to use a neural network approach to predict the oil thickness of the test data set on the basis
of the IR response.
This initial development project was followed with a validation exercise carried out by NETCEN and
sponsored by the Maritime and Coastguard Agency (Davies et al, 1998). This exercise consisted of
a set of field experiments to collect the data that was subsequently used to test the validity of the
trained neural network. This project concluded the neural network was unable to classify oil
thickness reliably for the validation data set.
It was also concluded that the failure of the neural network was mainly due to the data being outside
the data range the network was originally trained on for a number of the input variables. The neural
network was also unable to account for the thermal effects of the oil as a result of solar heating of
the surface oil during this set of field trials. The thermal effects in the IR imagery were not observed
in the field data sets used to train the network.
In the report by Davies et al (1998) it was suggested that it might be possible to re-train the neural
network with the new 1997 which included solar heating effects. However it was emphasised in the
conclusions that there was no guarantee that even with this new additional data set that the full rangeof environmental variables would be taken into account.
This report details a study involving the re-calibration of the neural network with all the data sets
obtained during the initial training of the neural network (Wood et al, 1997) and the validation
exercise (Davies et al, 1998). The data used in the re-calibration includes additional core variables
that were considered important for predicting the effect of solar heating on the thermal IR imagery
during the validation exercise. These additional core variables were not previously included as core
variables in the initial training programme. The re-calibrated neural network was subjected to a
-
7/26/2019 Estimation of Oil Thickness
9/41
- ssue Estimation of Oil Thickness
AEA Technology 2
verification process through testing with a selection of data that had not been used to re-train the
network. This verification process has established whether the neural network is internally consistent
and is able to predict oil thickness representative of the data sets used to train the network. This
process does not provide a validation of the network, since the verification data set is a subset of the
training data set.
2 Thermal Infra-red
2.1 THERMAL IR CAMERA
A thermal IR camera, such as the Talytherm camera in the Air Atlantique Cessna 404, detects
thermal IR radiation in the 8-13 m range. The detector measures apparent black body
temperature, which is a function of actual temperature (expressed as C) modified by the emissivity
of the material that is omitting the IR radiation. Emissivity is the property of a material that indicates
the proportion of thermal IR emitted by the material when it is at a particular temperature.
The system detects very small differences in thermal IR emission. For display on a video output, the
signal is amplified (the degree of amplification is expressed as gain) and the signal base-level is
expressed as offset. The gain and offset may be adjusted automatically or set manually. Changing
the offset will alter the response of the camera image to temperature; manually setting the sea to a
mid-grey on the screen establishes the prevailing sea temperature as being in the middle of the
detector range. Increasing the gain causes maximum discrimination in the image. The brightness of a
recorded thermal IR image is therefore a function of the objects temperature, but is an indication of
relative and not actual temperature.
2.2 THERMAL BEHAVIOUR OF SPILLED OIL
A thermal IR image of an oil slick is a representation of apparent temperature differences in the oil
slick and these can give an indication of relative oil layer thickness. An oil surface at the same
temperature as a water surface will have a lower equivalent black body temperature than the water
surface. This will cause the oil to appear cooler in the thermal IR image, although it is at an identical
temperature as the water surface. On the basis of the difference in emissivity values of oil and water,
oil should always appear cooler than water when both are at the same temperature. However, oilslicks tend to appear as two distinct areas in a thermal IR image:
Areas that are apparently cooler than the sea.
Areas that are apparently warmer than the sea.
The apparently cooler areas of the slick are areas of relatively thin oil, from 50 m up to
approximately 250 to 500 m thick and the warmer areas are thicker or emulsified oil, greater than
approximately 250 to 500 m (Goodman, 1994).
-
7/26/2019 Estimation of Oil Thickness
10/41
- ssue Estimation of Oil Thickness
AEA Technology 3
2.2.1 Thin Oil Layers
Thin layers of oil will be in intimate thermal contact with the sea surface. The thermal capacity of a
thin oil film will be very low because there is not much oil per unit area. It is therefore unlikely that the
oil could be at a significantly different temperature from the sea for any length of time before the
temperature equilibrates. Under most circumstances, the thinner layers of oil are likely to be at almost
exactly the same temperature as the sea and appear cooler in the thermal IR image because of the
differences in emissivity between oil and water. There are two factors that may slightly alter the
actual temperature of thin oil films:
In the very early stages of oil weathering, the rapid evaporation of large quantities of volatile
components from the oil layer may cause cooling, leading to the oil film being at a lower
temperature than the sea for a very short period.
Layers of oil of any significant thickness are likely to absorb solar radiation more effectively than
the surrounding sea because they are opaque and dark coloured. This could cause a slight
increase in temperature of the oil film.
Evaporative cooling and solar heating will tend to cancel each other out some time after the oil has
been released onto the sea. If the oil layer is being exposed to solar heating, the rise in temperature
may still be insufficient to cause the oil to appear warmer than the sea in the thermal IR image
because of the effect of emissivity.
2.2.2 Thick oil layers
While the thinner layers of oil are essentially in thermal equilibrium with the sea, the areas of thick oil
or emulsion will not be in thermal equilibrium with the sea and can have a higher temperature. The
higher temperature of the thicker layers of oil will be due to the balance of heat transfer into and out
of the spilled oil layer. The precise temperature rise will depend on several factors:
The temperature rise produced by constant heat input will depend on the thermal capacity of the
oil layer, which is proportional to its volume per unit area (thickness). The actual temperature of
an oil layer is the result of the balance between heat input from solar radiation and heat lost to the
air or sea. Thicker oil layers are therefore capable of sustaining a higher temperature difference
than thinner oil layers.
The thermal capacity of an oil layer depends on its mass per unit area (thickness) and its specific
heat capacity. Oil has a specific heat that is about half that of water. As oil emulsifies its specific
heat will increase in proportion to the water content. Emulsified oil is therefore capable of holding
more heat than non-emulsified oil.
The heat input and output is a result of temperature differences between the oil layer, the sea and
air, and the degree of solar heating. Thick layers of oil or emulsion will absorb heat from the sun
-
7/26/2019 Estimation of Oil Thickness
11/41
- ssue Estimation of Oil Thickness
AEA Technology 4
more effectively than the sea because the oil is opaque or of a dark colour. This will cause a
significant rise in temperature if the rate of heat input is greater than the rate of heat loss to the sea.
2.2.2 Heat exchange and surface temperature effects
Unless the sea and air are at precisely the same temperature there will be heat transfer between
them. If spilled oil is present it will act as a buffer to this heat exchange. The oil will adsorb heat
from the warmer of the two and lose heat to the cooler. The thermal capacity of the oil layer is
minuscule compared to that of the sea and atmosphere and it is inevitable that the temperature of the
oil layer will change. The temperature differentials between air, sea and oil plus the relative thermal
conductivities of the system will determine the rate of change of temperature of the oil layer.
The sea surface temperature will tend to remain almost constant during the period of a day, due to its
massive thermal capacity. However the oil layer temperature will oscillate; the thick layers of oil
warm to quite high temperatures during the course of the day as heat is absorbed from the sun and
cool slowly at night as heat is lost to the sea and air. The oil will be cooler than the sea in the early
morning, but rise to a temperature higher than that of the sea during the day. At nightfall the
temperature of the oil will start to drop, but it will still be higher than that of the sea for some time in
the evening. During the night it will lose heat to the cooler air which is not replaced by solar heating,
and the temperature of the oil will then drop below that of the sea. Therefore typically the greatest
extent of thick oil is detected in imagery recorded around midday to early afternoon when it is the
warmest part of the day.
Similar effects will occur on a shorter time-scale if there is intermittent sunlight or cloud shadow
effects. The oil will heat up rapidly in the sun, but start to cool when the sun is obscured. The major
determinants of the heating and cooling of the oil layers will be the intensity and duration of sunlight
and the relative temperature differences of the air and sea.
The temperature attained by an emulsified oil layer will be proportional to its thickness under the
prevailing conditions; thinner layers of oil will lose heat more rapidly, and therefore have a lower
temperature, than thicker layers. However, this proportionality only holds true for a specific set of
conditions and the actual temperature attained by the oil in thicker layers will depend on the heating
and cooling effects that have occurred previously, rather than those that prevail at the time of
observation.
In a thermally equilibrated system with no heat exchange, surface temperature will be a very good
indicator of bulk temperature as heat will be conducted to the surface from the bulk of the oil.
However, in situations where the temperatures are constantly changing, such as the dynamic heatexchange (heat input and output) which gives rise to temperature differences in spilled oil layers, the
surface temperature might not be a direct indicator of bulk temperature. In these circumstances, the
IR brightness (or relative equivalent black temperature) as measured by a thermal IR camera, will be
an indication of the surface temperature of the oil layer and not of the internal temperature. This can
occur if there are large differences in sea and air temperature. Heat transfer from the inside of the oil
layer to the surface might be slower than heat transfer from the surface of the layer to cold air. The
surface of the oil layer may be chilled by extremely cold air to lower temperatures than that of the
sea, even though the internal temperature of the oil is higher due to previous heat input.
-
7/26/2019 Estimation of Oil Thickness
12/41
- ssue Estimation of Oil Thickness
AEA Technology 5
3 Previous Neural Network Research
Programmes
The National Environmental Technology Centre (NETCEN), have completed two previous research
projects relating the response of thermal IR to oil slick thickness on behalf of the Maritime and
Coastguard Agency. The initial research project developed the neural network approach to
measuring oil slick thickness (Wood et al, 1997) and the following research project validated the
neural network developed in the initial project (Davies et al., 1998).
3.1 INITIAL DEVELOPMENT OF NEURAL NETWORK SYSTEM
Neural networks are computing methods that learn from empirical data rather than being
programmed explicitly. Neural networks are particularly useful in the modelling of complex
relationships where non-linear effects could be involved. Instead of being programmed with a series
of equations, the neural network can be trained to learn a cause-effect relationship. During training,
examples of input and output are fed to the network, which learns the relationships by building up
interconnecting paths in an iterative training process.
A range of object-oriented neural network software exists, some of which are appropriate for the
analysis of IR imagery data. For this type of problem, a neural network may be regarded as a non-
linear regression tool. It predicts output values from a number of inputs. The inputs are weighted in
a similar manner to linear regression, and then passed to a set of non-linear basis functions (each
with a different set of weights). Then a least squares fit is performed on the outputs of the basis
functions.
The initial experimentation (Wood et al., 1997) has trained a Multi-Layer Perceptron (MLP) neural
network architecture (Figure 1). The MLP consists of a number of interconnected layers of
processing units. The first layer (left on the diagram) is called the input layer. Each circular node
represents an input to the network. Each input is a single number. The processing units in the other
layers all perform a similar function, which is to take a weighted sum of the outputs from the
previous layer, add a bias term, and apply an activation functionto the result. In the diagram, the
weights are represented by the connections between the circular nodes.
The data set used to develop the neural network was collected in experimental field trials carried out
in 1994 and 1995 by NETCEN. The inputs to the neural network included the thermal IR response
(brightness of the oil and sea in the imagery), oil type, time of day the data was recorded and various
meteorological parameters. The resulting oil thickness predictions were compared to measured oil
thickness values which were obtained in the field through sorbent pad sampling.
-
7/26/2019 Estimation of Oil Thickness
13/41
- ssue Estimation of Oil Thickness
AEA Technology 6
Figure 1. Schematic Diagram of an MLP
The initial development work carried out by NETCEN (Wood et al., 1997) suggested an optimal
configuration for prediction of oil thickness, which is shown in Figure 2. Bias weights are omitted for
clarity.
Figure 2. Optimal Configuration for Prediction of Oil Thickness
B
Bs
cos (Time)
sin (Time)
Oil T e
Wind Speed
Sea Temp
log( )
brightness of oil B
brightness of sea + oil Bsobrightness of sea Bsoil thickness Note that B B Bso s= .
This optimum configuration was then used to predict oil thickness on a proportion of the field data
set from the 1994 and 1995 experimental field trials. Figure 3 shows the typical performance of the
-
7/26/2019 Estimation of Oil Thickness
14/41
- ssue Estimation of Oil Thickness
AEA Technology 7
network; predicted oil thickness is plotted against the actual oil thickness. The straight line is the line
y=xand corresponds to a perfect prediction. The R2value is 0.846.
Figure 3. Scatter Plot of Predicted Against Actual Thickness for Optimal Configuration
a
-0.5 0 0.5 1 1.5 2 2.5 3 3.5
Actual log( )
-0.5
0
0.5
1
1.5
2
2.5
3
Predicted log( ) R
= 0.846221
Additionally, a classification approach was used to categorise the data into four thickness ranges:
0-100 m
101-500 m
501-1000 m
1001-2500 m.
Table 1 presents typical results from the combined data set obtained in analysing the previous sea
trial data sets (Wood et al. 1997). When all available sea trial data were used to train the network it
was possible to classify correctly 80 % of data into one of four thickness classes for the previous
sea trial data sets.
Table 1. Confusion Matrix for Combined Sea Trial Data Set
Bin 0-100
m
101-500
m
501-1000
m
1001-2500
m
Correct
%
0-100 m
136
3
0
1
97.1%
101-500 m 8 15 3 0 57.7%
501-1000 m 2 1 7 0 70%
1001-2500 m 0 1 0 8 88.9%
-
7/26/2019 Estimation of Oil Thickness
15/41
- ssue Estimation of Oil Thickness
AEA Technology 8
Once the initial neural network system was completed, it was recognised that the validity of this
approach had to be tested against data sets which were totally unrelated to the data sets used to
initially train the network. This led to the validation exercise as described in the following section.
3.2 NEURAL NETWORK VALIDATION
The validation exercise consisted of collecting a new data set though an experimental field trial. This
was carried out in 1997 by NETCEN, a data set was obtained containing thermal IR imagery and
meteorological parameters for input into the neural network trained in the initial project. Oil
thickness was also measured during the field trial using sorbent pad sampling for comparison to the
predicted thickness values. The data sets collated from the 1997 field trial for the validation of the
neural network were processed by the neural network in order to obtain predicted thickness values
(Davies et al., 1998).
As previously stated, the core input variables required by the neural network trained in 1997 were :
Thermal infrared sea and oil brightness Time of day
Wind speed
Sea temperature
Oil type
The 1997 field trial data set consisted of data collected from three oil types; Forties Blend crude oil,
Alaska North Slope crude oil, and IFO-180 heavy fuel oil. The network was originally developed
with data from the 1994 and 1995 field trials obtained from Medium Fuel Oil and Forties Blend
crude oil experiments. The neural network requires specification of oil type so the 1997 Forties data
set was analysed using only the trained network for Forties. The data sets from the Alaska NorthSlope Crude oil slick and IFO-180 were processed by the network with oil type as both MFO and
Forties.
Using these input data, the neural network analysis was unable to accurately predict the measured oil
thickness. For example, the data set from the experimental Forties blend slicks showed that
measured slick thickness data were representative of all four thickness categories. Yet the neural
network analysis categorised all the thickness measurements into just two, non-consecutive,
categories 0-100 m and 501-1000 m. Further, the categorisation was purely a function of time:
before midday all the IR data was categorised into 0-100 m despite a measured thickness of 336-
2625 m; and after midday all the data was categorised into 501-1000 m despite a measuredthickness of 139-2359 m.
The poor performance of the neural network classification was likely to be due to the degree of solar
heating of the surface oil during this set of field trials which was not observed in the field data sets
used to train the network. In addition, in the training data sets the air temperature was above the
seawater temperature in nearly all cases whereas in the validation data set, air temperature was
always below sea temperature thereby reversing the net thermal flux.
-
7/26/2019 Estimation of Oil Thickness
16/41
- ssue Estimation of Oil Thickness
AEA Technology 9
4 Research Programme
The validation exercise demonstrated that the neural network trained in 1997 was incapable of
reliably predicting oil slick thickness. Therefore, a research programme was devised to re-calibrate
the neural network using both the initial training data set and the validation data set. This would
indicate whether the neural network is able to predict oil slick thickness from a more varied data set.
The re-calibrated neural network would then be tested with a subset of the data that was not used
for training as a verification of whether the neural network is internally consistent and therefore has
the ability to predict oil slick thickness.
The research programme was divided into three stages;
1. Data collation
2. Neural network re-training
3. Neural network verification
The data collation stage involved collating all the data sets that had been used in the previous two
neural network research projects. This was followed by a data selection procedure to ensure that
the data used to retrain the neural network was as accurate and robust as possible. This involved re-
examination of the thermal IR imagery and the thickness measurements. Specific meteorological
factors were selected that were considered essential to the prediction of the oil thickness, i.e.
emissivity, irradiance, air and sea temperature. Once the final data set was complete, a proportion of
the data was removed for the verification stage and the remaining data set was used to re-train the
neural network.
The neural network was re-trained using the brightness measurements from the thermal IR imagery,
the thickness measurements and the selected meteorological parameters. The optimum neural
network settings for the data set were devised based on the work undertaken in the initial training of
the neural network. Once the neural network was trained to the best output for the data set, the
verification stage was undertaken.
A spreadsheet was developed to enable the trained network to be tested using the verification data
that was not used for the training. The network was then tested and the results examined todetermine the accuracy of the neural network.
Each of these stages is described in more details in the following three sections.
-
7/26/2019 Estimation of Oil Thickness
17/41
- ssue Estimation of Oil Thickness
AEA Technology 10
5 Data Collation
There are two sub-sections of data that have been used in the neural network research programmes;
the 1994 and 1995 sea trial data, which was used to train the original neural network, and the 1997
sea trial data which was used to validate the trained neural network.These combined data sets consist of a total of 225 data observations from four oil types (Alaska
North Slope crude oil, Forties Blend crude oil, Medium Fuel Oil/Gas Oil blend and IFO-180 Heavy
Fuel Oil). The data sets contain the following parameters for each data observation:
1. Oil slick thickness (as measured by sorbent pad sampling)
2. Thermal IR response (brightness values)
3. Altitude of aircraft
4. Time of day
5. Camera offset and gain6. Oil type
7. Wind speed and direction
8. Air and sea temperature
9. Irradiance
10.Surface current (at 1 metre depth)
All of these parameters were studied in the initial research programme and it was determined that
many of these parameters have no effect on the prediction of oil thickness using the thermal IR
brightness data. The studies undertaken as part of the earlier validation of the neural network
(Davies et al, 1998) highlighted the importance of solar heating on the prediction of oil thicknesstherefore certain parameters were identified as essential to predict the changes in the thermal IR
imagery as a result of heating. This is discussed further in Section 4.2.
These data sets have undergone different levels of data processing and interpretation. The latest data
set, from the 1997 sea trial, is the most accurate and robust as a result of the continuing development
of our data processing and sampling techniques. Therefore it has been vital to the retraining
programme that all the data sets were reassessed to ensure all the data used in the retraining was as
accurate and robust as possible.
5.1 DATA SELECTION
The data selection process was aimed at reducing the likelihood of data being included in the
retraining which represents a false relationship between the oil thickness and IR response.
The first stage of the data selection procedure was to re-calculate the thickness measurements made
in 1994 and 1995 using sorbent pad samples to ensure that the increased thickness of emulsions
was considered.
-
7/26/2019 Estimation of Oil Thickness
18/41
- ssue Estimation of Oil Thickness
AEA Technology 11
The data sets were then subjected to a range of data selection criteria, listed as follows:
1. Thermal IR imagery must have been obtained at an altitude of less than 3000ft (as the resolution
of the image reduces significantly with increasing altitude).
2. Removal of spurious data when:
The brightness value is taken from a poor quality image
The brightness value is not taken in the vicinity of the sampling vessel.
The thickness sampling position is unknown.
3. Data sets of a particular oil type, which because of the nature of the field experiment, over-
represented data of particularly low thickness value were reduced in number to prevent the data
from biasing the training of the neural network toward these very low thickness values.
This data selection process reduced the number of data observations from 225 to 96. The majority
of data removed was that obtained during the 1994 sea trials during the Medium Fuel Oil/Gas Oil
blend continuous release trials, where the data obtained consisted of thickness measurements of less
than 10 m for more than 80 data observations. This level of thickness is difficult to accurately
measure with sorbent pad samples and is at the limit of detection for the thermal IR camera.
Therefore, the relationship between the thermal IR brightness, oil thickness and other variables may
not be represented as accurately as the data observations taken at higher oil thickness levels. So by
including such a large number of samples of less than 10 m, this may have biased the training of the
neural network toward the low oil thickness values, which are of little interest in operational
response. The majority of the other data observations removed were as a result of low resolution
(high altitude) and poor quality thermal IR imagery.
5.2 TRAINING PARAMETERS
The neural network was trained in the initial research programme (Wood et al., 1997) using the
following key parameters:
Brightness of oil
Brightness of oil and sea
Time
Oil type
Wind speed
Sea temperature
During the validation studies in the previous research project, it was observed that there were further
parameters that may be important in predicting oil thickness from thermal IR imagery. It was
observed in the validation data that the extent of solar heating significantly affected the brightness of
the thermal IR imagery, therefore it would be important to include irradiance and air temperature
data as key parameters in the neural network.
-
7/26/2019 Estimation of Oil Thickness
19/41
- ssue Estimation of Oil Thickness
AEA Technology 12
Thermal IR emission is related to both the way a surface emits IR radiation as a function of
temperature and the actual temperature of the surface. This relationship is described by the
emissivity, therefore emissivity is an important parameter when studying thermal IR response. The
emissivity values for the oils were not included in the data sets used to train the original neural
network. Therefore the inclusion of emissivity data as a parameter in the re-training was considered
essential to improve the accuracy of the thickness predictions.
The emissivity value for each of the four oils in the data set was obtained and is detailed as follows.
The directional reflectance of the oil was measured over a wavelength range of 2-25 m. The
thermal IR camera operates over a range of 8-13 m, so the emissivity values over this range were
noted and the average emissivity value taken for each oil type. The emissivity values are shown in
the following table.
Table 2. Emissivity Values for the Oils.
Wavelength
(m)
Alaska North
Slope crude oil
Forties Blend
crude oil
Medium Fuel
oil/Gas oil blend
IFO-180 Heavy
Fuel Oil
8 0.957 0.963 0.960 0.953
9 0.958 0.964 0.960 0.953
10 0.957 0.963 0.960 0.953
11 0.957 0.963 0.960 0.953
12 0.957 0.963 0.959 0.952
13 0.956 0.962 0.958 0.951
Average 0.957 0.963 0.960 0.953
The parameters selected for re-training the neural network are as follows.
1. As used in the original neural network;
Brightness of oil
Brightness of oil and sea
Sea Temperature
2. Additional core variables identified during the validation exercise; Emissivity
Irradiance
Air Temperature
The additional core variables in the current training parameters were identified during the validation
exercise.
-
7/26/2019 Estimation of Oil Thickness
20/41
- ssue Estimation of Oil Thickness
AEA Technology 13
It is important to keep the number of key training parameters to a minimum to reduce over-fitting in
the neural network training. Note that the emissivity values replaced the oil type input variable in
the neural network. Also the time of day and wind speed parameters have been excluded as these
are considered to have a lesser effect on the brightness and the effects are accounted for through the
other parameters (irradiance, sea and air temperature).
The change in the parameters selected for re-training the neural network in comparison to the
previous training are summarised in the following table.
Table 3. Previous and current training parameters
Previous training parameters Current training parameters
Brightness of oil Brightness of oil
Brightness of oil and sea Brightness of oil and seaSea temperature Sea Temperature
Oil type Emissivity
Time of day Irradiance
Wind speed Air Temperature
5.3 PRIMARY DATA SETAfter the data had undergone the data selection procedure and the key training parameters were
selected, the data, labelled the primary data set, was divided into a training data set and a
verification data set. The selection of the verification data set was carried out by taking a
representative sample from each thickness prediction range to:
Obtain a verification data set that was representative of a range of oil thickness with different oil
types, at different temperatures, different brightness ranges and different irradiance levels.
Ensure the input variables for the verification data were within the limits of the training data set.
The following table details the total number of data observations in each of the thickness classification
ranges and the amount of data taken for training and testing the neural network.
Table 4. Number of thickness observations in each prediction range for the primary data set.
Thickness Range TOTAL Training Verification
-
7/26/2019 Estimation of Oil Thickness
21/41
- ssue Estimation of Oil Thickness
AEA Technology 14
0-100 28 23 5
101-500 33 28 5
501-1000 14 11 3
>1001 21 18 3
TOTAL 96 80 16
This table can be divided into each oil type as follows:
Table 5. Number of thickness observations in each prediction range for each oil type.
Thickness
Range
Alaska North Slope crude oil Forties Blend crude oil
Total Training Verification Total Training Verification
0-100 3 3 0 5 4 1
101-500 9 8 1 5 4 1501-1000 3 2 1 5 4 1
>1001 6 5 1 12 10 2
TOTAL 21 18 3 27 22 5
Thickness
Range
IFO-180 Heavy fuel oil Medium Fuel Oil/Gas Oil
Total Training Verification Total Training Verification
0-100 0 0 0 20 16 4
101-500 2 2 0 17 14 3
501-1000 0 0 0 6 5 1>1001 1 1 0 2 2 0
TOTAL 3 3 0 45 37 8
Tables 4 and 5 demonstrate that the primary data set consists of data observations in each of the
thickness ranges for all of the oil types specified (except for IFO-180 heavy fuel oil). There is a
fairly even spread of data observations for each thickness range, thereby reducing the chance of
biasing the training of the neural network to certain thickness ranges during the re-training.
It is important that the limits of the training set are defined clearly if the trained neural network is to beapplied to other data sets. The neural network can not reliably predict outside the limits of the
training data set. The primary training data set consists of the following upper and lower boundaries
for each of the parameters.
Table 6. Primary Training Set Range of Parameters.
-
7/26/2019 Estimation of Oil Thickness
22/41
- ssue Estimation of Oil Thickness
AEA Technology 15
Parameter All Data
Max Min
Thickness 5079 4
Brightness of oil & sea 230 32
Brightness of sea 215 87
Brightness of oil 85 -126
Sea Temperature 18.15 14.8Air Temperature 18 14.12
Irradiance 940 -11.7
Emissivity 0.963 0.953
5.4 SECONDARY DATA SET
The primary data set consists of data from the 1994, 1995 and 1997 sea trials only. These trials
were all carried out during the summer months (July, August and September) when the ambient
temperature was fairly warm (14-18C). As a result, once trained with this data, the neural network
may only be able to predict oil thickness within these summer temperatures. Therefore the neuralnetwork was also trained with a secondary data set containing data collected at colder temperatures
(i.e. during the winter months). This secondary data set allows the robustness of the neural network
to be established.
The secondary data set consists of the primary data set (summer data) and data from the Sea
Empress incident (winter data). The Sea Empress incident occurred in February 1996 and
throughout the incident, thermal IR imagery and a range of meteorological parameters were
recorded. This information was compiled into a data set to train the neural network. However, it
must be noted that this data should not be considered an accurate training set because of the absence
of in-situ measurements of oil thickness and will only be used to determine the robustness of thetrained neural network.
The data required by the neural network trained in 1999 is:
Thermal IR imagery brightness values
Sea temperature
Air temperature
Irradiance
Emissivity
Oil thickness
The thermal IR imagery, sea and air temperature were recorded during the Sea Empress incident.
The irradiance data were obtained from Met Office for the dates required specifically for this
project. The emissivity of the oil spilt during the Sea Empress (Forties Blend crude oil) was obtained
for the primary data set so further emissivity analysis was not required. The oil thickness was not
measured at specific locations during the incident, however visual estimates were made during the
early stages of the incident so can be used as a guide to the range of oil thickness observed during
the incident.
-
7/26/2019 Estimation of Oil Thickness
23/41
- ssue Estimation of Oil Thickness
AEA Technology 16
A data set was created for the neural network from thermal IR imagery taken from three dates in
February 1996 (22, 27 and 29). As no thickness measurements were made during the incident, the
data set was created through taking the brightness level in the thermal IR image from different areas
of the slick that are most likely to have a different oil thickness. The oil thickness at the specific
location where the brightness level was recorded was estimated into one of four categories as
follows:
1. 0-100 m = specific thickness applied to the data observation was 50 m
2. 101-500 m = specific thickness applied to the data observation was 250 m
3. 501-1000 m = specific thickness applied to the data observation was 750 m
4. 1000-2500 m = specific thickness applied to the data observation was 1750 m
These estimates of thickness were based on the relative thickness indicated in the thermal IR imagery
and our knowledge of the behaviour of Forties Blend crude oil. The thickness estimates underwent a
series of checks through independent estimates of the thickness by three experienced staff. This
resulted in more than 60% of the data being classified into the same thickness category. However, it
is important to note that this method of estimating the thickness is by no means accurate and has
been used as a method of allowing the use of the Sea Empress data, because it is invaluable in testing
the neural network at this stage.
The final Sea Empress data set consisted of 84 data observations from the three dates in February.
The data included imagery from thick and thin oil layers, ambient temperatures ranging from 2 to 6C
and a range of irradiance levels. A total of 18 data observations were removed from the data set for
the testing phase of the project. These were representative of the range of the parameters
represented in the whole data set.
The following table details the thickness observations in the Sea Empress data set.
Table 7. Number of observations in each thickness category for the Sea Empress data set.
Thickness Range Forties Blend crude oil (Sea Empress)
Total Training Verification
0-100 7 5 2
101-500 19 15 4
501-1000 31 26 5
>1001 27 20 7
TOTAL 84 66 18
The Sea Empress data set was added to the primary data set to create the secondary data set. This
data set contains a total of 180 data observations of which 146 are for the training and 34 are for the
verification. The following table details the number of data observations for each thickness range for
the secondary data set.
-
7/26/2019 Estimation of Oil Thickness
24/41
- ssue Estimation of Oil Thickness
AEA Technology 17
Table 8. Number of data observations in each thickness category for the secondary data set.
Thickness Range Secondary data set
Total Training Verification
0-100 35 28 7101-500 52 43 9
501-1000 45 37 8
>1001 48 38 10
TOTAL 180 146 34
The secondary data set, including observations of the Forties Blend crude oil spilt at the Sea
Empress incident, significantly extends the range of environmental parameters in comparison to the
primary data set. The following table shows the range of parameters the total of the secondary data
set covers (primary and Sea Empress). The range of parameters for the other oil types are as in the
primary data set detailed in Table 6.
Table 9. Secondary Data Set Range of Parameters for the Secondary Data.
Parameter All Data
Max Min
Thickness 5079 1
Brightness of oil & sea 255 0
Brightness of sea 230 32
Brightness of oil 215 -170
Sea Temperature 18 8Air Temperature 18 2
Irradiance 940 -11.7
Emissivity 0.963 0.953
6 Neural Network Re-Training
The primary and secondary training data sets were used in a Multi-Layer Perceptron (MLP) neural
network architecture, as used in the initial development of the neural network. Our previous work
(Wood et al., 1997) has suggested this is the best system for approaching these types of complex
relationships. This methodology is described in Section 3.1.
A problem anticipated at the start of the neural network training process was the danger of over-
fitting the network - a problem which frequently arises when the number of observations is small
compared to the number of adjustable parameters. The MLP has the property of being a universal
-
7/26/2019 Estimation of Oil Thickness
25/41
- ssue Estimation of Oil Thickness
AEA Technology 18
function approximator, which means that given the right architecture (number of processing units) it
can learn any functional mapping. This is its strength and also its weakness. The ability to learn
mapping means that it can learn to model the noise in the data used to train it. Thus, training
examples can be reproduced well, but the model is influenced by the noise in the measurement data.
The model will not then produce sensible answers when tested on a new data set.
As detailed in Section 5.2, 7 input variables have been selected for the revised training of the
network. The network is forced to define the relationships between each of these variables and each
of the nodes in the inner layers (described in Section 3.1) and between the elements of the hidden
layer and the final output variable. The two sets of data available for training contain just 80 (primary
data set) and 146 (secondary data set) data observations. These small numbers of data were not
expected to be sufficient to avoid the problems of over-fitting.
As discussed previously (Wood et al., 1997) an initial solution is to train the network on only a
specified fraction of the available training data (the training data set) and to retain the remainder of
the training data set for testing the network during the training process (the training test data set).
The ability of the network to reproduce the training data set improves throughout the training
process. However, the parameters used to define the trained network are not those obtained when
the best reproduction of the training data set is achieved, instead they are those obtained when the
best performance against the training test data set is achieved.
Once the best configuration of the neural network was defined for the primary and secondary data
sets, it was then tested against the verification data set. The MLP network allows for two
approaches to be adopted during training i.e. regression and classification. Both of these have been
explored during this project.
6.1 REGRESSION APPROACH
The network was trained using 75% of the available training data from the Primary data set and, as a
separate process, using 75% of the available training data from the Secondary data set. Whilst the
data span the range from 4-5000 m, only 3 out of 146 data are in excess of 2500 m. The
network was, therefore, trained to estimate log (thickness) to prevent errors in the estimation of these
few large values biasing the error analysis.
To determine the optimum number of hidden neurons for the network, 15 training runs (trials) were
carried out with the network configured with 2-10 hidden neurons. The performance of the network
in reproducing the test data set was compared for each of the groups of 15 trials using a T-test. The
results indicated that the network did not perform significantly better with any one number of hiddenneurons - this is again an indication that the problem was over-fitted.
In general, the network was able to provide an acceptable reproduction of the measured thickness in
the training data set. The typical performance of the network is shown in Fig 4 for the network
configured with 7 hidden neurons and trained against the Primary data set. Whilst acceptable, this is
not as good as the performance reported during the initial training of the network (Wood et al.,
1997) which had an r2of 0.85.
-
7/26/2019 Estimation of Oil Thickness
26/41
- ssue Estimation of Oil Thickness
AEA Technology 19
Figure 4. Scatter plot for predicted against actual thickness for the primary data set.
6.2 CLASSIFICATION APPROACH
Instead of treating the prediction of oil thickness as a regression problem, it is also possible to treat it
as a classification problem. Using this approach, the network does not attempt to predict the exact
thickness, but to assign the data into one of a series of bins representing thickness ranges. The
thickness ranges used in this study were selected to be consistent with those used in this previous
study (Wood et al, 1997). Alternative ranges, e.g. based on equal bandwidths, would have been
equally valid.
Training the network on both the Primary and Secondary data sets resulted in the same optimalarchitecture for both data sets. In both cases, 8 hidden neurons gave the optimal performance,
although the numerical values assigned to each of the weights in the network differed. The optimal
architecture is shown below:
Figure 5. Optimal architecture of neural network for both data sets as a classification problem.
Emissivity
Brightness of oil and sea
0 - 100 m
Brightness of sea
101 - 500 m
Brightness of oil
501 - 1000 m
-
7/26/2019 Estimation of Oil Thickness
27/41
- ssue Estimation of Oil Thickness
AEA Technology 20
Sea temperature
1001 - 6000 m
Air temperature
Irradiance
Hidden
Neurons
The bias weights are omitted for clarity on Figure 5.
The results from a typical training run of the network trained on the Primary data set are summarisedin thematrix below. The numerical values relate to the number of data points predicted to lie in each
combination of predicted and measured thickness categories. Diagonal elements indicate a correct
classification, whilst off-diagonal elements indicate an incorrect classification.
Table 10. Classification matrix using the Primary data set
Predicted
Measured 0-100 m 101-500 m 501-1000 m 1001-6000 m % Correct
0-100 m 17 6 0 0 74%
101-500 m 6 20 1 1 71%
501-1000 m 1 2 6 2 55%1001-6000 m 2 1 0 15 83%
73%
When trained with data from the Primary data set, the network is able to correctly classify over 70%
of the available training data.
The corresponding matrix for the network trained with the Secondary data set is summarised below.
Table 11. Classification matrix for the Secondary data set
PredictedMeasured 0-100 m 101-500 m 501-1000 m 1001-6000 m % Correct
0-100 m 17 7 1 3 61%
101-500 m 2 36 2 3 84%
501-1000 m 3 3 31 0 84%
1001-6000 m 1 1 4 32 84%
79%
-
7/26/2019 Estimation of Oil Thickness
28/41
- ssue Estimation of Oil Thickness
AEA Technology 21
Using data from a wider range of environmental conditions, the network is able to correctly classify
almost 80% of the available training data.
6.3 SUMMARY
In summary, the neural network was trained with two data sets; the primary and secondary data sets.
The primary data set contained reasonably accurate in-situ thickness observations. The secondary
data set consisted of the primary data set and data from the Sea Empress incident which consisted of
estimates of the in-situ thickness by three members of the NETCEN team.
The optimum training approach was the classification method. The neural network was trained with
each of the data sets and the test errors of the training are as follows:
Correct Classification (%)
Primary Data Set 73
Secondary Data Set 79
This indicates the training using the secondary data set was able to produce a higher percentage of
correct classifications than for the primary data set.
-
7/26/2019 Estimation of Oil Thickness
29/41
- ssue Estimation of Oil Thickness
AEA Technology 22
7 Neural Network Verification
This phase consists of testing the trained neural network with a selection of data that was not used to
re-train the network. This exercise can not be used as a true validation of the network since the
verification data set is a subset of the data being used to train the network. This verification processis used to establish whether the neural network is internally consistent and is able to predict oil
thickness representative of the data sets used to train the network.
A spreadsheet was developed to run the neural network model produced in the training phase. The
results of the verification are detailed as follows.
7.1 PRIMARY DATA SET
The following table details the results of the verification of the neural network trained as a
classification problem with the primary data set. The table includes the measured oil thickness values
and those predicted by the trained neural network (the correctly classified thickness values are
shaded in solid grey).
Table 12. Test data and thickness predictions for the primary data set.
Oil type Oil
Brightness
Sea
Temp
(C)
Air
Temp
(C)
Irradiance Measured
Thickness
(m)
Predicted
Thickness
(m)
Alaska North Slope 13 18.01 14.12 742 222 0-100
Alaska North Slope -72 18.05 14.58 184 525 1001-6000
Alaska North Slope 52 18.00 15.00 24 1556 1001-6000
Forties Blend 25 17.68 16.83 410 7 1001-6000
Forties Blend 37 18.10 17.02 375 310 101-500
Forties Blend 27 18.10 16.08 86 954 101-500
Forties Blend 10 14.99 16.93 270 1531 1001-6000
Forties Blend 19 14.80 16.88 20 3316 1001-6000
MFO/GO 15 14.99 17.02 5 7 0-100
MFO/GO 35 16.44 17.75 359 21 101-500MFO/GO 15 16.83 17.71 412 23 0-100
MFO/GO 8 15.29 16.68 120 29 0-100
MFO/GO 12 15.24 16.97 260 104 0-100
MFO/GO 30 16.73 17.71 451 204 101-500
MFO/GO 25 14.99 17.17 840 435 0-100
MFO/GO 10 15.04 17.17 930 537 101-500
-
7/26/2019 Estimation of Oil Thickness
30/41
- ssue Estimation of Oil Thickness
AEA Technology 23
The results of the verification are summarised in the matrix below.
Table 13. Classification matrix for the verification of the network trained on the Primary data set
Predicted
Measured 0-100 m 101-500 m 501-1000 m 1001-6000 m % correct0-100 m 3 1 0 1 60%
101-500 m 3 2 0 0 40%
501-1000 m 0 2 0 1 0%
1001-6000 m 0 0 0 3 100%
50%
The network correctly classifies 50% of the verification data into the appropriate thickness bin.
However the verification data set contains a number of datum for which the measured thickness lies
on the borderline of two bins (e.g. 104 m has been classified as 0-100 m). These are indicated inTable 12 by shading in grey diagonal lines. If these values are also included, the network can be
regarded as correctly classifying 63% of the values into the correct thickness bin.
On examination of the correct and incorrect thickness predictions (Table 13), the neural network
does not appear to be predicting thickness consistently incorrectly for specific oil types, thickness
categories, thermal IR brightness or temperature. There is no systematic bias to the 37-50%
incorrect classification.
It was observed in the validation project (Davies et al, 1998) that once the data was outside the
training range of the neural network, the network could not predict oil thickness reliably. As an
additional verification, the neural network trained with the primary data set was used to predict theoil thickness using the data from the Sea Empress incident. The primary network is trained with data
collected during summer conditions and the Sea Empress data was collected during the winter.
Therefore the Sea Empress data set will not fall in the range of the primary training data set. The
result of this verification is that the neural network classifies the oil thickness for all the Sea Empress
data observations into the 1001-6000 m thickness range despite the estimated thickness ranging
from 50-1750 m. This demonstrates that the neural network cannot reliably predict oil thickness
outside the range of the training data set. The results of this verification are shown in more detail in
Appendix 1.
7.2 SECONDARY DATA SET
The following table details the results of the verification of the neural network trained as a
classification problem with the secondary data set. The correct thickness classifications are
indicated with a solid grey shaded area.
-
7/26/2019 Estimation of Oil Thickness
31/41
- ssue Estimation of Oil Thickness
AEA Technology 24
Table 14. Test data and thickness predictions for the secondary data set.
Oil type Oil
Brightness
Sea
Temp
(C)
Air
Temp
(C)
Irradiance Measured
Thickness
(m)
Predicted
Thickness (m)
Alaska North Slope 13 18.01 14.12 742 222 0-100Alaska North Slope -72 18.05 14.58 184 525 501-1000
Alaska North Slope 52 18.00 15.00 24 1556 1001-6000
Forties Blend 25 17.68 16.83 410 7 101-500
Forties Blend 37 18.10 17.02 375 310 101-500
Forties Blend 27 18.10 16.08 86 954 1001-6000
Forties Blend 10 14.99 16.93 270 1531 501-1000
Forties Blend 19 14.80 16.88 20 3316 1001-6000
MFO/GO 15 14.99 17.02 5 7 0-100
MFO/GO 35 16.44 17.75 359 21 101-500
MFO/GO 15 16.83 17.71 412 23 101-500MFO/GO 8 15.29 16.68 120 29 0-100
MFO/GO 12 15.24 16.97 260 104 0-100
MFO/GO 30 16.73 17.71 451 204 0-100
MFO/GO 25 14.99 17.17 840 435 0-100
MFO/GO 10 15.04 17.17 930 537 101-500
Forties Blend 15 7.75 2.14 127 50 101-500
Forties Blend 24 7.95 7.00 251 50 101-500
Forties Blend 32 7.65 2.14 400 250 501-1000
Forties Blend 38 7.75 2.14 127 250 101-500
Forties Blend 39 7.80 6.69 75 250 101-500
Forties Blend 58 7.95 7.00 251 250 501-1000
Forties Blend 54 7.65 2.14 400 750 501-1000
Forties Blend 44 7.65 2.14 400 750 501-1000
Forties Blend 59 7.75 2.14 127 750 501-1000
Forties Blend 88 7.80 6.69 75 750 501-1000
Forties Blend 60 7.95 7.00 251 750 501-1000
Forties Blend 71 7.65 2.14 400 1750 1001-6000
Forties Blend 59 7.65 2.14 400 1750 1001-6000
Forties Blend 83 7.65 2.14 400 1750 1001-6000
Forties Blend -40 7.75 2.14 127 1750 1001-6000
Forties Blend 98 7.80 6.69 75 1750 501-1000
Forties Blend -62 7.95 7.00 251 1750 501-1000
Forties Blend -51 7.95 7.00 251 1750 1001-6000
The verification results are summarised in the following matrix.
-
7/26/2019 Estimation of Oil Thickness
32/41
- ssue Estimation of Oil Thickness
AEA Technology 25
Table 15. Classification matrix for the network trained on the Secondary data set
Predicted
Measured 0-100 m 101-500 m 501-1000 m 1001-6000 m % correct
0-100 m 2 5 0 0 29%
101-500 m 4 3 2 0 33%
501-1000 m 0 1 6 1 75%1001-6000 m 0 0 3 7 70%
53%
Overall, the network is able to correctly classify 53% of the data into the appropriate thickness bin.
As with the primary data set, the verification data contains a number of datum for which the
measured thickness lies on the borderline of two bins (e.g. 104 m has been classified as 0-100m).
These are indicated in Table 14 by shading in grey diagonal lines. If these values are also included,
the network can be regarded as correctly classifying 59% of the values into the correct thickness bin.
In some situations, the network returns a high probability that the thickness lies in one particular bin,indicating a high confidence that the result lies in that bin. In other cases, a more even distribution of
probabilities may be returned, indicating a lower confidence in the prediction of the network. If the
analysis also takes account of the predictions where the probability that the thickness lying in the
correct bin is only slightly lower than the probability that the thickness lies in an alternative bin, then
the results improve and the network comes close to correctly classifying almost 70% of the data.
The overall percentage error in the verification results for the secondary data set is very similar to that
for the primary data set. In the secondary data set, the verification data classified incorrectly
accounted for 16 of the 34 data observations whereas in the primary data set 8 of the 16 data
observations were incorrectly classified. As discussed previously there is no apparent bias to theincorrect predictions, i.e. they are not oil type, thickness or specific variable dependent.
It is worthy of note that the majority of the data set incorrectly classified by the network trained into
the primary data set were also incorrectly classified by the network trained with secondary data set.
This implies that the network is consistently failing to learn the relationship between certain variables
as a result of the lack of training data. It is also possible that the input variables used in the training
do not adequately represent the relationship between the input variables and the thickness, therefore
it may be that another input variable is required to predict the thickness in some instances.
7.3 COMMITTEE OF NETWORKS
-
7/26/2019 Estimation of Oil Thickness
33/41
- ssue Estimation of Oil Thickness
AEA Technology 26
To further address the problem of over-fitting the data, another approach was adopted in which a
number of networks were trained using the Secondary data set. To make a prediction, the outputs
from these trained networks were combined, summed and normalised. As each network has a
slightly different training set (because only 75% of the available training data is used for training, as
described in Section 6.1) each one will over-fit the data in a different way and hence when averaged
the effects of over-fitting should be averaged out. This approach (termedforming a committee of
networks) is a well-established technique and was also used in the earlier studies (Wood et al.,
1997).
The neural network was re-trained using the committee of networks approach with the secondary
data set. The resultant committee underwent testing with the verification data set and the results are
detailed in the Table 16.
-
7/26/2019 Estimation of Oil Thickness
34/41
- ssue Estimation of Oil Thickness
AEA Technology 27
Table 16. Test data and thickness predictions for the secondary data set using a committee of
networks.
Oil type Oil
Brightness
Sea
Temp
(C)
Air
Temp
(C)
Irradiance Measured
Thickness
(m)
Predicted
Thickness (m)
Alaska North Slope 13 18.01 14.12 742 222 101-500
Alaska North Slope -72 18.05 14.58 184 525 501-1000
Alaska North Slope 52 18.00 15.00 24 1556 1001-6000
Forties Blend 25 17.68 16.83 410 7 101-500
Forties Blend 37 18.10 17.02 375 310 101-500
Forties Blend 27 18.10 16.08 86 954 1001-6000
Forties Blend 10 14.99 16.93 270 1531 0-100
Forties Blend 19 14.80 16.88 20 3316 1001-6000
MFO/GO 15 14.99 17.02 5 7 0-100
MFO/GO 35 16.44 17.75 359 21 500-1000MFO/GO 15 16.83 17.71 412 23 0-100
MFO/GO 8 15.29 16.68 120 29 0-100
MFO/GO 12 15.24 16.97 260 104 0-100
MFO/GO 30 16.73 17.71 451 204 0-100
MFO/GO 25 14.99 17.17 840 435 101-500
MFO/GO 10 15.04 17.17 930 537 101-500
Forties Blend 15 7.75 2.14 127 50 0-100
Forties Blend 24 7.95 7.00 251 50 0-100
Forties Blend 32 7.65 2.14 400 250 101-500
Forties Blend 38 7.75 2.14 127 250 101-500
Forties Blend 39 7.80 6.69 75 250 101-500
Forties Blend 58 7.95 7.00 251 250 101-500
Forties Blend 54 7.65 2.14 400 750 501-1000
Forties Blend 44 7.65 2.14 400 750 501-1000
Forties Blend 59 7.75 2.14 127 750 501-1000
Forties Blend 88 7.80 6.69 75 750 501-1000
Forties Blend 60 7.95 7.00 251 750 501-1000
Forties Blend 71 7.65 2.14 400 1750 1001-6000
Forties Blend 59 7.65 2.14 400 1750 1001-6000
Forties Blend 83 7.65 2.14 400 1750 1001-6000
Forties Blend -40 7.75 2.14 127 1750 1001-6000
Forties Blend 98 7.80 6.69 75 1750 501-1000
Forties Blend -62 7.95 7.00 251 1750 1001-6000
Forties Blend -51 7.95 7.00 251 1750 1001-6000
-
7/26/2019 Estimation of Oil Thickness
35/41
- ssue Estimation of Oil Thickness
AEA Technology 28
The data are summarised into the following matrix.
Table 17. Classification matrix for the committee of networks trained on the Secondary data set
Predicted
Measured 0-100 m 101-500 m 501-1000 m 1001-6000 m % correct
0-100 m 5 1 1 0 71%
101-500 m 2 7 0 0 78%
501-1000 m 0 1 6 1 75%
1001-6000 m 1 0 1 8 80%
76%
The network is able to correctly classify 76% of the data into the appropriate thickness bin.
As previously, the verification data contains a number of datum for which the measured thickness lieson the borderline of two bins (e.g. 104 m has been classified as 0-100 m). These are indicated in
Table 16 by shading in grey diagonal lines. If these values are also included, the network can be
regarded as correctly classifying 82% of the values into the correct thickness bin.
The committee of networks approach should decrease the noise in the network training and thereby
decrease the error in the predictions. On comparison of the results of the verification using the single
classification training (Tables 14 and 15) and using the committee of networks training (Tables 16
and 17), the percentage error in the predictions has decreased as expected. There was a significant
improvement in the number of correct predictions for the lower thickness categories (0-100 m and
101-500 m) and a slight improvement in the high thickness category (1001-6000 m). This is atotal of 26 correct predictions of the 34 data observations in the verification data set.
The incorrect predictions are the same data observations incorrectly predicted in the verification of
the single classification training. This supports the theory that the network is consistently failing to
learn the relationship between certain variables and the oil thickness.
7.4 SUMMARY
The verification tests of three neural networks trained resulted in the following verification results:
-
7/26/2019 Estimation of Oil Thickness
36/41
- ssue Estimation of Oil Thickness
AEA Technology 29
Table 18. Summary of the verification tests for the neural networks
Network Training
Type
Data Set % Correct % Correct within
10% error
Single Classification Primary 50 63
Single Classification Secondary 53 59Committee of
networks classification
Secondary 76 82
The best verification results were obtained from the committee of networks classification with the
secondary data set which was shown to be capable of correctly classifying the thickness of the
verification data in over 76% of cases. A 10% error is included in the % correct to allow for
occasions where the actual thickness is close to the boundary of the thickness categories, for
example an actual thickness of 104m is classified as 0-100m. This raises the percentage correctly
predicted for all the training types, the highest % correct remains the committee of networks at 82%.
It was observed with each of the trained neural networks that the same verification data observations
were being incorrectly classified for the majority of cases. Some of the data observations had similar
input variables, so the network classified them into the same thickness category despite sometimes
having actual thickness values from different thickness categories.
This implies that the network is consistently failing to learn the relationship between certain input
variables and the thickness. This could be due to the size of the training data set, if more data was
available for training the network would have a better chance of fully learning the relationship
between the input variables and the thickness. It is also possible that the input variables used in the
training do not adequately represent the relationship between the input variables and the thickness,
therefore another input variable may be required to predict the thickness in some cases.
8 Conclusions
The re-calibration suggests that the re-trained neural network is internally consistent in predicting oil
thickness based on the input data used. This analysis includes data sets consisting of a wider range
of environmental variables (sea and air temperature) and thermal IR imagery recorded in both winter
and summer that has not been previously included in the training.
The errors identified in the predictions by the network show no clear pattern with oil type, brightness
or thickness categories. This suggests the network does not have a systematic bias. The 1999 re-
trained network demonstrates the ability to account for solar heating which was not the case for the
1997 trained network used for the validation project (Davies et al., 1998). This is evident through
correct predictions from data taken throughout the day (i.e. at different irradiance levels and ambient
temperatures) and from different oil thickness levels.
-
7/26/2019 Estimation of Oil Thickness
37/41
- ssue Estimation of Oil Thickness
AEA Technology 30
However it is important to emphasise that there is no guarantee that the data sets that have been used
to retrain the network will cover the complete range of the parameters which will be encountered in
the event of a spill, such as the meteorological conditions and oil type. In addition, some of the data
used are estimated rather than measured. It is still not clear whether the size of the training data set
required to produce a robust system makes this neural network technique impracticable in the long
term.
Using the committee of networks approach it was possible to obtain the correct classification in 76%
of cases. For the majority of the remaining cases, the misclassification was only by one thickness
category. However, the verification studies carried out in this project indicate that there is still a
significant error in the thickness predictions. This may be improved with:
A larger training data set
Or/and
More input variables
This additional data could be obtained during actual incidents and experimental field trials in the
future. In this way, the neural network can be continually re-trained and updated to take account of
additional validated data taken in different environmental conditions. This would maximise the
chances of the operational data falling in the range of environmental parameters used in the training
process.
9 Recommendations
The ultimate aim of this research is to develop a technique for MCA to measure oil slick thickness.
Having knowledge of the thickness of the slick allows calculation of the volume of oil on the sea
surface in an oil spill incident. This enhances the operational response to a spill by allowing
assessments to be made on the efficiency of a response technique and ensures that response
equipment is used to its maximum effectiveness.
The re-training of the neural network has shown that this technique is partially successful in measuring
oil thickness. However, the networks derived during this project can not be considered as beingsufficiently robust to be reliable during operational use at present. In order to continue the
development of this technique and to provide some benefit of the research to MCA at this stage, we
recommend a pilot system is developed which can be deployed as an experimental system at the
next spill or experimental field trial.
On deployment, the pilot system can provide MCA with an additional tool for the operational
response. The practicalities of the system and its usefulness to the MCA can be evaluated. This
includes the data analysis and collection facilities and procedures. The accuracy of the neural
-
7/26/2019 Estimation of Oil Thickness
38/41
- ssue Estimation of Oil Thickness
AEA Technology 31
network can be studied further and re-training undertaken as and when additional data sets are
collected to continue development of this technique.
To develop the current experimental system into a pilot operational system would require the
following relatively minor developments:
Collaboration with Air Atlantique to standardise on image analysis software (for example, using
ER Mapper) and to devise procedures for downloading the thermal IR imagery to a suitable
format for analysis.
Determine suitable techniques for collecting all the variables required by the neural network, i.e.
air and sea temperature, irradiance.
Develop a sorbent pad sampling technique that includes real-time analysis for calibration of the
neural network.
If the pilot system is developed and deployed (potentially using the FLIR system on the SAR
helicopters as well as the MCAs remote sensing planes) at future incidents then the system can be
used to give estimates of volumes, alongside the normal visual estimates, for operational use.
As an illustration of how the volume could be calculated from the oil thickness predictions from the
neural network, a thermal IR image was taken from the Forties Blend experiment in the 1997
experimental sea trials. This image (shown in Appendix 1) was divided into four brightness bands
and the surface area of the oil slick in each brightness band determined. The thickness relating to a
single brightness value in each brightness band was determined using the trained committee of
networks. For each of the brightness bands, the neural network predicted the thickness in the range
of 1001-6000 m. This is not unreasonable because the release of the Forties Blend was completed
only 10 minutes before the thermal IR was recorded, therefore the oil is likely to be at a fairly
uniform thickness. To calculate the volume of oil, a single thickness value was assigned to the
thickness range. The value for the 1001-6000 m range was taken to be 1750 m based on thespread of the training data in the thickness range. Table 18 details the surface area and predicted
volume using the neural network.
Table 19. Volume prediction using the committee of networks.
Neural Network
prediction
IR
Brightness
Range
Area
(m2)
Thickness
(m)
Volume
(m3)
0-126 155 1750 0.3
126-183 6035 1750 10.6
183-208 10324 1750 18.0
208-255 11116 1750 19.5
TOTAL 27630 TOTAL 48.4
-
7/26/2019 Estimation of Oil Thickness
39/41
- ssue Estimation of Oil Thickness
AEA Technology 32
The actual volume of the oil slick when the image was taken is estimated at 44-51 m3. The neural
network volume estimate is 48 m3, which is within this estimated range.
-
7/26/2019 Estimation of Oil Thickness
40/41
- ssue Estimation of Oil Thickness
AEA Technology 33
9 References
Davies L., Dooley K., Lunel T and Lewis A., Field validation of neural network analysis to measure
oil slick thickness, AEAT-3474, AEA Technology, Culham, Oxfordshire, UK, 1998.
Goodman R., Overview and future trends in oil spill remote sensing, Spill Science and Technology
Bulletin, Vol 1, No 1, pp11-21, 1994.
Wood P, Strachan I, Davies L and Lunel T., Determination of oil thickness by neural network
analysis, AEAT-1151, AEA Technology, Culham, Oxfordshire, UK, 1997.
-
7/26/2019 Estimation of Oil Thickness
41/41
Appendix 1
Data Sets
CONTENTS
Table 1 Primary Data Set
Table 2 Secondary Data Set
Table 3 Verification results using Sea Empress data on the neural network
trained with the primary data set
Figure 1 Thermal IR image of Forties Blend experiment from 1997 seatrial