alternative optimization techniques
TRANSCRIPT
Alternative Optimization Alternative Optimization TechniquesTechniques
Kirk Cameron, Ph.D.MacStat Consulting, [email protected]
2
Ways to Skin a CatWays to Skin a Cat
• Optimization techniques for LTM have advanced over last several years– Improved mathematics– Increased complexity– Combination of statistical & physical data
• Highlight some alternate methods– Genetic algorithms– Kalman-type filtering, updating
3
A Question of TradeoffsA Question of Tradeoffs
• Computational horsepower– More analysis time, computer power often required– More complex models, greater reliance on advanced
numerical methods• More complex data needs
– “Garbage in, garbage out” principle– More kinds and quantities of data typically needed– Needed information not always available
4
It’s an Empirical ThingIt’s an Empirical Thing
• No optimization possible without data– Moving from simpler to more complex
techniques exposes a dilemma– Advanced approaches may look ‘slick,’ but
without extra data, results may be no better• Determine which data inputs are crucial
– Forward thinking: how can this data be collected in the future?
5
MultiobjectiveMultiobjective Groundwater Groundwater Monitoring DesignMonitoring Design
• Patrick Reed and Venkat Devireddy– Department of Civil and Environmental Engineering – The Pennsylvania State University
• Contact info: [email protected]
6
Genetic Algorithm BasicsGenetic Algorithm Basics
• Genetic algorithms search among all possible monitoring plans– Create an initial “population” of sampling plans– Sampling plans are represented as a string binary digits
• 1 – ith well sampled• 0 – ith well not sampled
– Judge each sampling plan based on how well it satisfies monitoring objectives
• Plans that satisfy design objectives are “highly fit”• Have higher likelihood of mating and passing traits
– Sampling plans undergo Darwinian “natural selection” until the best set of plans are evolved
0 1 00
7
– Selection– Measure of the fitness of string– Fitness is rated in terms of the objective functions– Population members compete to mate & pass traits
– Mating
– Mutation
How do How do GAsGAs work?work?
Parents
1 1 1 1
0 1 00 0
0 0
11
11
1
Crossover occurs with user specified
probability, Pc
1 1 1 1 1 1 1 0
Bit change occurs with user specified
probability, Pm
Children
8
Multiobjective EvolutionMultiobjective Evolution
• The Nondominated Sorted Genetic Algorithm-II:– Population classified into fronts using non-dominated
sorting– Better ranking = Higher likelihood of passing traits– Entire tradeoffs evolved in single run
Cost
Estim
atio
n Er
ror
Cost
Estim
atio
n Er
ror
Rank = 1
Rank = 2
Rank = 3
Rank = 4
9
Case Study: Enumerated Case Study: Enumerated TradeoffTradeoff
• Two-Objectives:– Minimize Cost– Minimize Relative Mapping
Error (termed SREE)
• 220 possible designs• Evaluated using a
nonlinear least squares interpolation
• Best published result– Reed et al. (2004) WRR– 38000 evaluations
10
Case Study: Efficiency of New Case Study: Efficiency of New ToolsTools
• Results for 50 trials (using interactive archiving)– Reduced solution time from hours to seconds– New tool is up to 90% more efficient
2540074400Max. no. of design evaluations
799239889Avg. no. of design evaluations
140016800Min. no. of design evaluations
ε-NSGA2 (New Tool)
Reed et al. 2003
11
Case Study: Typical ResultCase Study: Typical Result
12
Adaptive Environmental Monitoring Adaptive Environmental Monitoring System (AEMS)System (AEMS)
– Under development at RiverGlass Inc., Champaign, IL
• Software development company launched by the University of Illinois
• Project lead: Barbara Minsker, PhD [email protected]
– Beta testing of AEMS expected to begin in late Summer 2005
• Completed functionality available now on a project basis – contact Dr. Minsker for more information
13
AEMS CapabilitiesAEMS Capabilities– Build data-driven trend models based on historical
data (geostatistical or analytical models)– Assess new data in real time to
• Identify significant deviations from previous trends (spatial ortemporal), providing automated alerts
• Identify locations/times where additional data would be most beneficial to reducing risks
– Identify temporal and spatial redundancies in existing monitoring regimes
14
Redundancy Analyses with AEMSRedundancy Analyses with AEMS
• Method considers– Tradeoffs among multiple objectives (e.g.,
cost and error) with mathematical optimization– Uncertainty in identifying robust designs– Temporal and spatial redundancy
simultaneously• Applied at 2 BP sites and 1 DOE site
– 22-36% redundancy found in well sampling plans
15
0
1
2
3
4
5
6
7
8
9
10
24 26 28 30 32 34 36
Number of Wells
Max
imum
Err
or (p
pb)
AEMS Identifies Monitoring AEMS Identifies Monitoring TradeoffsTradeoffs
•Each diamond is an optimal monitoring plan for a given level of sampling
•Shows tradeoffs between number of wells sampled ($) and interpolation errors
22% reduction in costsBenzene MCL
16
Optimization of Large Scale Optimization of Large Scale Subsurface Environmental ImpactsSubsurface Environmental Impacts
• Larry M. Deschaine, PE– Engineering Physicist– SAIC & Chalmers University of Technology
• Contact info: [email protected]
17
Information Content Fused Information Content Fused ApproachApproach
• Integrated algorithm(s) consist of:– Simulation models based on physics– Data models based on sampling– Uncertainty handled through (geo)-statistics
• Information content fusion (Data & Physics):– Signal processing (i.e. Kalman Filters, etc.)– Genetic Programming
• Optimal System Estimate– Optimal estimate of “system” for locating plume at
given time, or time-space correlated estimates of long term monitoring programs
18
Fundamentally Different From Fundamentally Different From EstimationEstimation
• Current methods typically gather data, calibrate model and use model for predictions– Models break down as physics becomes complex,
data sparse or input parameters not well known• This method fuses the information content via
signal processing / machine learning algorithms:– Integrated data/physics model provide optimal
estimates based on knowledge gained from both the physical simulator and the data, updated as new information is obtained
19
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
0.5
0.55
0.6
0.65
0.7
0.75
0.8
0.85
0.050.10.150.20.250.30.350.40.450.50.550.60.650.70.750.80.850.9
Determining optimal well locations & their benefit to Determining optimal well locations & their benefit to understanding plume location understanding plume location –– when to stop adding when to stop adding
wells, how to justifywells, how to justify
No wells [Max uncertainty]
Existing MW system [93.4% reduction]
Add One Well [98% reduction]
Add 2nd Well [98.5% reduction]
5
10
15
20
25
30
35
40
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
1.8
2
2.2
2.4
2.6
2.8
20
LTM: Extend the reduction of uncertainty to include value of LTM: Extend the reduction of uncertainty to include value of historic samples in space and time historic samples in space and time --> Results in less > Results in less
samples needed to understand long term system behaviorsamples needed to understand long term system behavior
O p tim a l L o n g T e rm M o n ito rin g E x a m p le
0 .0 0
2 0 .0 0
4 0 .0 0
6 0 .0 0
8 0 .0 0
1 0 0 .0 0
1 2 0 .0 0
0 2 4 6 8 1 0 1 2 1 4 1 6
T im e (y e a rs )
Nor
mal
ized
Con
cent
ratio
n
E stim a teU p p e r C o nf L im itL o w e r C o nf L im it
Samples are collected when the space-time correlated uncertainty exceeds preset limits
Excessive Uncertainty
21
OPDATEOPDATE
• David Dougherty, PE– Subterranean Research, Inc.
• Contact info: [email protected]
22
Fusion of Data, SimulationFusion of Data, Simulation
• Site-specific, consulting approach– Strong emphasis on decision-trees
• Simulation models first used to both:– Help plan remedy phases during RI, FS and
design LTM program• One goal of LTM design should be to improve simulation
model for anticipated future remedy/risk assessments
– Make predictions about future subsurface conditions
23
Use of Model in LTM Planning: Placement of Use of Model in LTM Planning: Placement of MWsMWs Accounts for Different Phases of RemedyAccounts for Different Phases of Remedy
Distance (ft)500 1000 1500 2000 2500 3000
-50
050
100
150
MW-90B
SV-4SV-3
1
2
3
4
5
6
7
8
9
Model Layer
1
2
3
4
5
6
7
8
9
Model Layer
b) Optimized Pumping Condition
C-1 (Proposed)
MW-202B
MW-37
MW-39
Distance (ft)500 1000 1500 2000 2500
-50
0
Approximate Location of T-25 Perimiter
Approximate Location of Lake CochituateApproximate Location of Lake Cochituate
Distance (ft)
Ele
vatio
n(m
sl)
500 1000 1500 2000 2500 3000
-50
050
100
150
MW-90B
SV-4SV-3
1
2
3
4
5
6
7
8
9
Model Layer
Elev
atio
n (ft
msl
)
a) No-Pumping Condition
C-1 (Proposed)
MW-202B
MW-37
MW-39
Distance (ft)500 1000 1500 2000 2500
-50
0
Approximate Location of T-25 Perimiter Approximate Location
of Lake Cochituate
Distance (ft)
Ele
vatio
n(m
sl)
500 1000 1500 2000 2500 3000
-50
050
100
150
MW-90B
SV-4SV-3
1
2
3
4
5
6
7
8
9
Model Layer
1
2
3
4
5
6
7
8
9
Model Layer
Elev
atio
n (ft
msl
)
a) No-Pumping Condition
C-1 (Proposed)
MW-202B
MW-37
MW-39
Distance (ft)500 1000 1500 2000 2500
-50
0
Approximate Location of T-25 Perimiter Approximate Location
of Lake CochituateApproximate Location of Lake Cochituate
40201086420
TravelTime (yr)
24
Retrospective AnalysisRetrospective Analysis
• Retrospective hydrologic data assimilation– Applied to observed data to:
• 1) add branches to LTM decision trees, and• 2) improve quality of forecasts
• Measured data used to update and optimize (“opdate”) simulation models
25
Using Data AssimilationUsing Data Assimilation
Larger AdjustmentsEarly Time Late Time
Smaller AdjustmentsUse Observation Data to Update Models and UncertaintyAssessments
Predictive Model Forecast
Adjust c(x,y,z) based on measurements
26
““OpdateOpdate””--inging“Snapshot in time” (July, 1999), prior to and after updating the predicted concentration plumes with monitoring data.
27
Retrospective Data Retrospective Data AssimilationAssimilation——OPDATE™OPDATE™
Linea
r Kri
ged
Bes
t Est
imat
e Lo
g K
rige
d
15 Events 8 Events 4 Events
Quarterly Semi-Annual Annual
Sce
nar
ios
Updating Frequency
Sensitivity to Historical Sampling Frequency
When and Where DoErrors Occur?
28
SummarySummary
• ‘Next Generation’ methods offer great promise… however– Many sites currently lack sufficiently detailed
data or right kind of data– More expertise required to correctly set-up,
implement• Power, adaptability of genetic algorithms,
Kalman filter techniques cannot be denied– Especially true as computer horsepower gets
faster & cheaper
29
Summary (cont.)Summary (cont.)
• Each method being developed for real-world applications– Savannah River, BP, DOE, AF facilities– Some “scalability” possible, especially with genetic
algorithm approaches• Key benefits
– Formal incorporation of additional kinds of data– Enumeration of entire cost tradeoff curve, sort of