symantec i3 - inquire & cluster

Click here to load reader

Download Symantec I3 - Inquire & Cluster

Post on 04-Jul-2015




1 download

Embed Size (px)


Performance Management form synthetic transactions, Web, J2EE, OS and Cluster HW



2. The Web Applications Environment

  • Multi-tier applications that can include :
    • Client (Web browser)
    • Web Server
    • J2EE Application Server
    • Interfaces to external systems
    • Database
  • Worldwide, around-the-clock user access
  • Usage patterns are not controlled
  • Management requires the expertise of multiple IT teams

3. Architecture Web Clients Web Clients Web Servers J2EE Server Oracle Databases Legacy System External System 4. Symantec i 3

  • Symantec i 3is anintegratedsolution that is
  • end-user focused .
  • Symantec i 3provides a built-inmethodology that enables you to effectively manage theAvailabilityandPerformance
  • of business applications.

5. Why an Integrated Solution?

  • With a multi-tier architecture, IT must:
  • Align IT to business priorities
    • Monitor the availability and performance of business processes
    • Monitor business processes 24x7from multiple geographies
    • Constantly update the business processes by adjusting to real usage patterns initiated by real users
  • Easily & rapidly detect problems, & analyze in context
  • Correlate application flow activities among technologies
  • Pinpoint & analyze their causes
  • Implement proactive tuning of your performance and availability


  • Enterprise web application running on a 4-tier architecture
  • International flower ordering service
    • The system is critical to sales & support activities
    • The worldwide data center is located in NY
    • The central orders database is updated at 10:00pm NY time
    • Sales peak around local holidays, e.g. Mothers Day on May 9 in Italy & USA

9. 10. 11. 12. Status

  • Conclusions symptoms detected
  • A problem was identified in thecheck_my_orderbusiness transaction from USA & Italy
  • A synthetic transaction identified the problem
  • The problem occurred after database update, at10:05pm NY time (4:05am the next day in Italy)
  • Next Steps find the source
  • Identify the problematic page component

13. 14. 15. Status

  • Conclusions source found
  • Java class/verify/orders/country/orders_list.jspis the problematic component
  • Next Steps focus on the reason
  • Identify whether this problem also occurs with real transactions

16. AVAILABILITY & PERFORMANCE 17. 18. 19. 20. Status

  • Interim Conclusions focus on the reason
  • We have contrasted synthetic transactions vs. reality
  • The problem only occurs with synthetic transactions, due to timing (10:05pm NY / 4:05am Italy)
  • Real users will be affected once the next day starts -Mothers Day in Italy, USA & some other countries
  • Next Steps
  • Investigate server side to pinpoint the problematic tier

21. 22. 23. Status

  • Interim Conclusions focus on the reason
  • The J2EE tier is dominating resource consumption
  • The problematic method isverify_orders_country_orders_list._jspService
  • Next Steps
  • Further investigation is needed in the J2EE tier

24. AVAILABILITY & PERFORMANCE 25. 26. 27. Status

  • Interim Conclusions focus on the reason
  • Smartune ranked JDBC Access Major Time as the main problem
  • Next Steps
  • Seek more details and advice

28. 29. 30. Status

  • Conclusions
  • The problem is not the database tier, but how frequently the database is accessed
  • Due to the unusually high number of db queries, time spent in the JSP is too long
  • Italy and USA behavior differs from other countries due to the Mothers Day high increase in orders
  • Next Steps address the problem & verify the solution
  • Instead of querying each order item individually, a new single query that returns all items should be implemented

31. 32. Case Study Summary

  • We have successfully completed the methodology cycle:
  • Detect
    • An SLA breach problem was detected by a synthetic transaction in Italy and USA, before the problem was experienced by real users
  • Find
    • A specific JSP is responsible for the slow response time, but only in specific countries
  • Focus
    • The unusually long order list in Italy & USA caused an exceptional number of connections, resulting in lengthy JDBC access time
  • Improve
    • Restructure queries and results pages
  • Verify
    • The SLA problem has disappeared without affecting real users

33. Synthetic Transactions Versus Reality

  • Synthetic transactions indicate availability
  • They should systematically reflect a dynamic real-world situation
    • Identify application usage changes relating to both existing and new applications
    • Reflect new delivered functionality
    • Reflect trends regarding usage from new locations

34. 35. 36. 37. Synthetic Transactions Versus Reality

  • Summary
  • Systematically analyzed changes in usage pattern
  • Identified new used functionality, change in usage patterns, and heavily used locations
  • Based on the findings, the user can now adjust the synthetic transactions to reflect the actual real-world situation


  • Enterprise web application running on a 4-tier architecture
  • Symantec Cluster Server (VCS) is activated on two machines running Oracle
  • Each machine switches to the other upon failure

40. 41. 42. 43. Status

  • Interim Conclusions
  • Upon Oracle tier failure, VCS activated failover
  • Next Steps
  • Follow the Alerts advice and use the VRTS Cluster Manager to investigate further

44. AVAILABILITY & PERFORMANCE 45. 46. 47. 48. Status

  • Conclusions
  • Oracle ran on sys1
  • DiskGroup-oradg failed on sys1, hence VCS switched Oracle to sys2
  • Despite a major problem, the application remained available
  • But, to ensure availability over the long-term, the problematic resource must be repaired

49. 50. Proactive & Preventive VCS Actions

  • Implement a systematic review of resource consumption overtime to forecast utilization
  • Identify up coming over-consumption
  • Establish VCS failover policies and assign system resources, accordingly

51. 52. 53. 54. Proactive & Preventive VCS Actions

  • Summary
  • Discovered peak of CPU consumption on server SrvProdD at the end of each month
  • In the future, this increase will exceed the overall CPU capacities
  • Based on the findings, the user can now establish an appropriate VCS failover system


View more