those who don't learn from history tom bascom, president greenfield technologies
TRANSCRIPT
Those Who Don't Learn From History
Tom Bascom, President
Greenfield Technologies
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
“Those who do not learn from history are doomed to repeat it.”
George Santayana (1863-1952)
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Overview
Types – Of monitoring activities What – To monitor How – Methods for gathering and
presenting monitoring data Why – Benefits of historical data
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Types of Monitoring Activities
Baselining Benchmarking Interactive troubleshooting Capacity management Resource Optimization
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Baselining
Allows you to quantify changes in performance
Apply to: Frequently executed tasks Important tasks
Time, Activity, Costs, Revenue, Resources
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Benchmarking
Benchmarking is much like baselining but it generally seeks to find the limits of a configuration. SpecINT TPC ATM ReadProbe 4glProbe
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Interactive Troubleshooting
Confirm configuration (trust, but verify) What is happening NOW?
You can “prove” anything with a single data point
How is NOW different from the baseline?
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Capacity Management
Filesystem space Database extent utilization
Memory consumption
Peak CPU utilization Network utilization IO throughput
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Resource Optimization
Finding unbalanced resources Disk “hot spots” Wasted memory
-spin Buffers (-B, -mmax …)
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
What Metrics To Monitor
DB metrics OS metrics Application metrics Business metrics
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
DB Metrics
Logical IO CRUD Global and per user Table and index stats
Physical IO Latch waits and timeouts Record locks and transactions Connections and servers Extent and area utilization
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
OS Metrics
CPU utilization -- %usr, %sys, %wio Disk
Free space Operations Queues and service times
Memory – budget, usage & leaks Network – bandwidth, latency Tables and limits – nfiles, nproc,
semaphores
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Application/Business Metrics
Orders, applications taken Shipped orders, closed loans Items shipped, invoices printed,
document packages prepared Items in inventory, users online Turn time, fallout ratio
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Business Metrics
Expenses Revenue Margin Profit
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
How To Gather Data
Screen Scraping PROMON Scripts
VSTs ProTop ProMonitor
Low Level APIs Fathom
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
PROMON04/29/05 Activity: Summary 10:12 (10:36:05)
Event Total Per Sec |Event Total Per SecCommits 354 35.4 |DB Reads 1724 172.4Undos 0 0.0 |DB Writes 50 5.0Record Reads 15091 1509.1 |BI Reads 0 0.0Record Updates 64 6.4 |BI Writes 12 1.2Record Creates 183 18.3 |AI Writes 7 0.7Record Deletes 71 7.1 |Checkpoints 0 0.0Record Locks 3776 377.6 |Flushed at chkpt 0 0.0Record Waits 0 0.0
Rec Lock Waits 0 % BI Buf Waits 0 % AI Buf Waits 0 %Writes by APW 100 % Writes by BIW 17 % Writes by AIW 71 %DB Size: 26 GB BI Size: 249 MB AI Size: 87 MBEmpty blocks:1268766 Free blocks: 84945 RM chain: 805939Buffer Hits 96 % Active trans: 0
121 Servers, 513 Users (204 Local, 309 Remote, 10 Batch), 4 Apws
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Screen Scrapingpromon $DBNAME > $TMP/mon.$TM << - "EOF" 2> /dev/nullR&D17p43p2ppp21p3xEOF
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Screen Scraping
ls -1 sample.??.?? | while read FILE do
grep -i "^logical read" $FILE | cut -c46-56 >> $TMP/lr grep -i "^o/s read" $FILE | cut -c46-56 >> $TMP/osr grep -i "^hit ratio" $FILE | cut -c11-14 >> $TMP/hr grep -i "^commit " $FILE | cut -c46-56 >> $TMP/trx grep -i "^latch " $FILE | cut -c46-56 >> $TMP/lto
done
paste $TMP/lr $TMP/osr $TMP/hr $TMP/trx $TMP/lto
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
VSTs -- ProTop16:06:50 ProTop xvi -- Progress Database Monitor 05/01/05Sample sports2000 [/data/s2k/sports2000] RateHit Ratio: 63:1 148:1 Commits: 2 3 Sessions: 18 Miss% : 1.590% 0.676% Latch Waits: 232 234 Local: 17 Hit% : 98.410% 99.324% Tot/Mod Bufs: 170 25 Remote: 0Log Reads: 4465 10498 Evict Bufs: 908 206 Batch: 16 OS Reads: 71 71 Lock Table: 8192 27 Server: 0Rec Reads: 1141 1062 LkHWM|OldTrx: 67 00:03 Other: 1 Log/Rec: 3.9132 9.8851 Old/Curr BI: 1 1 TRX: 6Area Full: 1 99.12% After Image: Disabled Blocked: 0 Resource Waits Id Resource Locks Waits Lock%--- -------------------- ---------- ---------- ------- 10 DB Buf S Lock 10497 0 100.00% 6 Record Get 1061 0 100.00% 7 DB Buf Read 71 0 100.00% 2 Record Lock 28 0 100.00% 11 DB Buf X Lock 14 0 100.00% 19 TXE Share Lock 14 0 100.00%
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Low Level APIs -- Fathom™
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Low Level APIs -- Fathom
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Metrics Database
Multiple targets Generic metrics Individualized metric properties Arbitrary grouping of metrics Flexible reporting Graphical output
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Why? The Benefits of History
Spend your money wisely! Be the first to know when
something is wrong! Better yet – know before it happens
… and prevent it!
Don’t be Doomed!
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Why?
Spend your money wisely! Be the first to know when
something is wrong! Better yet – know before it happens
… and prevent it!
Don’t be Doomed!
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Effect of ChangesEMC
8k Blocks
conv89
Storage areas
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Why?
Spend your money wisely! Be the first to know when
something is wrong! Better yet – know before it happens
… and prevent it!
Don’t be Doomed!
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Where’s the Problem?04/29/05 Activity: Summary 10:12 (10 sec)
Event Total Per Sec |Event Total Per SecCommits 354 35.4 |DB Reads 1724 172.4Undos 0 0.0 |DB Writes 50 5.0Record Reads 15091 1509.1 |BI Reads 0 0.0Record Updates 64 6.4 |BI Writes 12 1.2Record Creates 183 18.3 |AI Writes 7 0.7Record Deletes 71 7.1 |Checkpoints 0 0.0Record Locks 3776 377.6 |Flushed at chkpt 0 0.0Record Waits 0 0.0
Rec Lock Waits 0 % BI Buf Waits 0 % AI Buf Waits 0 %Writes by APW 100 % Writes by BIW 17 % Writes by AIW 71 %DB Size: 26 GB BI Size: 249 MB AI Size: 87 MBEmpty blocks:1268766 Free blocks: 84945 RM chain: 805939Buffer Hits 96 % Active trans: 0
121 Servers, 513 Users (204 Local, 309 Remote, 10 Batch), 4 Apws
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
TRX Rate
Normal
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
TRX Rate
Elevated
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Jump in Background TRX Rate
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Why?
Spend your money wisely! Be the first to know when
something is wrong! Better yet – know before it happens
… and prevent it!
Don’t be Doomed!
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Business Surge
Rates D
rop
!
Ag
ain!
Hirin
g lag
s…
5x init vo
lum
e
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Statistical “Rules of Thumb”
A Metric is “In Control” if it is within 3 standard deviations of the baseline.
Metrics that are between the 2nd &3rd standard deviation should be viewed as warnings.
4 consecutive samples trending away from the mean are a warning.
4 out of 6 on one side of the mean is a warning.
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Summary
Obtain a baseline Consistently gather comprehensive data
Business Application Database Operating System
Review, analyze, and publish
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
“Insanity is doing the same thing over and over again and expecting different results.”
Albert Einstein(1879 - 1955)
Comp-06: Those Who Don't Learn From History Are Doomed To Repeat It
Questions?
?Resources:
http://www.greenfieldtech.com/exchange05.shtml