best practices for optimizing blackboard learn steve feldman, [email protected]

Best Practices for Optimizing Blackboard Learn

Steve Feldman,[email protected]

mailto:[email protected]

What We’ll Cover

• A deployment approach for the ages.

• How to make use of the new sizing guide.

• Optimizing the platform components.

Flexible and Scalable Application Deployment


• An ideal deployment will contain…

– Availability at every edge of the application environment• Strategy: Physical distribution of load-balanced systems• Strategy: Minimum DB recovery, not necessarily 0 downtime

– Consumption of every possible machine resource• Strategy: Virtualization provisioning

– Techniques for improving user experience• Strategy: Techniques and tools for achieving page-level SLAs

– Large addressable memory spaces• Strategy: 64-bit and large OS process space allocations


• An ideal deployment will contain…

– Minimum Storage Recovery Time• Strategy: Enterprise storage with Snapshot capabilities

– Advanced monitoring for operations and planning• Strategy: Measurement tools and analytics

– Automation…Automation…Automation• Strategy: Investment in repeatable, reliable automated

processes.

Deployment: Availability

• VLEs are different beasts today then in the past.

– Communities are bigger– Sessions last longer– Content is richer– Key point: Adoption is greater and users expect their sites up 24 x

7 x 365

• Architecture is designed for many parallel instances of the product scaled in a horizontal fashion.

– Distributed physical deployments– Virtualization is a key element

• Database failover more important than horizontal database scalability.

– Emphasis on vertical database scalability

Deployment: Resource Utilization

• Moore’s law is in full effect

– CPUs are getting faster with more cores– Memory is in abundance and cheap– Storage is grossly abundant

• Massive systems can be obtained at low cost, but cannot be saturated in stand-alone configurations.

• Virtualization offers the opportunity…

– Deploy with availability in mind– Saturate system resources

Deployment: Improving Page Responsiveness

• Gzip…Gzip…Gzip…

– All of our supported browsers handle gzip?– Reduces payload

• Improves lower latency connections like Cable, DSL and Dial-up– Minor overhead on the application layer (~2% to ~5%)

• Have the option to perform at the load-balancer layer– Most Bb deployments do not enable Gzip at all

• Even when enabled, some proxies and software packages mess-up the Accept Encoding Header

• Optimize your images

– Page size really does matter– Reduce the size without reducing the quality

Deployment: Large Address Space

• As of Blackboard Learn™ Release 9.1 all supported/certified configurations include a 64-bit option.

• Pushing more processing to client and DB over the last few releases, but major memory management technique is to use more application caches.

– Memory stays persistent longer– Less wasteful from a creation/destruction perspective, but

puts greater demands on larger spaces.

• Most of our application testing focused on 4GB and 8GB JVM deployments on 6GB and 10GB OS spaces.

– Limited testing at 16GB and 32GB

Deployment: Storage MTTR

• Reference architecture pushes for “diskless” boots in which ISCSI or NFS partition resides on an enterprise storage system.

• Both OS/VM partition and data partition served up from remote storage deployment designed for performance and scalability.

– Make your hardware work from a CPU, Memory and Network perspective…save the Disk for the experts.

• Consider scenarios for reducing “Mean Time to Recovery or Repair”

– Snapshot technology offering minutes for recovery

Deployment: Advanced Monitoring

• Measurement is the secret sauce for successful deployments.

– Most reliable and scalable deployments measure beyond the server infrastructure

• Different types of measurements

– System/Environmental measurements– Business measurements– Synthetic measurements

• Collecting is only part of the prize

– Need to analyze the data to drive business decisions from the data.

Deployment: Automation

• Goal of moving to 100% unattended and fully automated deployment.

• Reduce MTTR and prevent disasters

• Automation requires intimacy…intimacy requires knowledge

• Use automation for

– Configuration Management and Deployment– Maintenance – Repeatable tasks– Adaptive tuning– Minimize possibility of human error

• http://dev2ops.org/storage/downloads/FullyAutomatedProvisioning_Whitepaper.pdf

http://dev2ops.org/storage/downloads/FullyAutomatedProvisioning_Whitepaper.pdf

http://dev2ops.org/storage/downloads/FullyAutomatedProvisioning_Whitepaper.pdf

Sizing the Application: To Hyper-Thread or Not

• Applies to Intel deployments only

– “..delivers thread-level parallelism on each processor resulting in more efficient use of processor resources—higher processing throughput—and improved performance on multi-threaded software.” –Intel Corporation

• Greatly improved in series 5500+ processor

• Provides double worker thread capacity

• If it’s not turned on, stop what you are doing and enable it ASAP!

Moving Away from Clusters

• Tomcat clusters were introduced back in Blackboard Learn 7.X prior to the transcendence of server virtualization.

– Only supported 32-bit configurations at the time, but systems were being shipped with 8GB, 16GB and 32+GB of RAM.

– Needed a way to take advantage of memory, but were limited to a 1.7GB address space.

– Recommending “distributed” deployment approaches as well.• Still applies, but can be achieved differently.

• Clustering has its advantages, but also has its penalties.

– Failover not as ideal as one would desire.

• Best approach is to scale up with 64-bit spaces and distributed JVMs across both virtual and physical configs

Sizing Using P.A.Rs

• PAR = Performance Archetype Ratios

– Methodology for sizing based on units of work that can be applied to “unit of configuration”

• PARs assume a world of linear units

– Add units of configuration to meet growing demands of unit of work.

• PARs based on (4) key resources: CPU, Memory, Disk and I/O and application interfaces (threads and connections).

• Used for making capacity decisions for sizing both virtual and physical components.

Optimizing the Web Server

• The web server in the Blackboard Learn configuration is nothing more than a gateway to the application container.

– When clusters were more relevent, the web server acted as a pseudo load-balancer.

• Not many opportunities for optimization other than

– KeepAlives– Interfaces– Compression

• It can become a bottleneck if not properly optimized

– Better to have high ceilings from an interface perspective

Optimizing the JVM

• Java hotspot offers standard –X and non-standard –XX options for performance and behavior.

– -X options are always guaranteed across releases and patches of Java.

– -XX options must be used with caution as they are subject to change with any release of Java.

• -XX options should be tested and measured using the production safe arguments.

• Read the release notes of Java for “performance” updates

– http://java.sun.com/javase/6/webnotes/ReleaseNotes.html

http://java.sun.com/javase/6/webnotes/ReleaseNotes.html

Optimizing the JVM

• Cross-platform recommendation for using Concurrent Mark Sweep Collector

– Best optimized for 64-bit address– Combine –XX:+UseConcMarkSweepGCwith –XX:

+UseParNewGC

• Manually size New Space using –XX:NewSize and –XX:MaxNewSize options (1/4 to 1/3 total heap).

– Consider Survivor Space ratios 4 or lower.

• Be careful about sharing –XX non-standard options across customers.

– If you don’t understand what the option does and it’s not recommended by Blackboard, best choice is to not use it.

Optimizing the Database: SQL Server

• # of data files makes no difference on SQL Server for Data and Transaction

• Allow the data/transaction files to grow as big as they want within reason.

– What’s reason: 64GB– http://msdn.microsoft.com/en-us/library/ms143432(sql.90).aspx

• TempDB is completely different story

– # of files = # of DB Threads– Set first X files to a uniform size, set last file to same size with

auto-extension ON– Determine size need over time

• Separate volume for paging file

http://msdn.microsoft.com/en-us/library/ms143432(sql.90).aspx

Optimizing the Database: SQL Server

• Be aware of MDOP: Max Degree of Parallelism

– Setting to unlimited can have a negative affect on query performance unintentionally.

• AWE can and does work on 64-bit systems

• Configure READ_COMMITTED_SNAPSHOT

• Two nuggets of information:

– Learn How to Use SQL DMVs– Study SQL Server Wait Events and Tuning

Optimizing the Database: Oracle

• Balance I/Os across multiple data files (~2 to 8GB per file).

• REDO is critical to performance a session/query level.

– Be aware of how much REDO is being used over time.– NOLOGGING will disable, be we rarely use NOLOGGING

• TEMP is very complex and used for managing transient data.

– One TEMP file is adequate– If latency exists on TEMP, consider introducing TEMP file groups

• SGA is important, but PGA can be your best friend or your worst enemy with high concurrency.

Optimizing the Database: Oracle

• Oracle DBO can be your friend

– Must understand optimizer behavior– Willingness to read Cost Execution Plans

• Using Wait Events and Cost Execution Plans for tuning initiatives

– Wait events are at a system, session and query level

• Importance of Statistics and Histograms

– CBO is just guessing without properly set statistics and histograms.

– CBO is dependent on your data.

Please provide feedback for this session by [email protected].

The subject of the email should be title of this session:

[INSERT TITLE HERE]

mailto:[email protected]

best practices for optimizing blackboard learn steve feldman, [email protected]

Documents

gb slide

recovery slide

quality slide

ideal deployment

automated deployment

deployment approach

remote storage deployment

planning strategy