power point for data mining
TRANSCRIPT
Data Mining & OLAP
What is Data Mining? Data Mining is the set of activities
used to find new, hidden, or unexpected patterns in data.
What is OLAP? On-Line Analytical Processing
a category of software technology that enables analysts and executives to gain insight to data through fast, consistent, interactive access to a wide variety of possible views of information that has been transformed from raw data to reflect the real dimensionality of the enterprise as understood by the user.
OLAP Functionalities dynamic multi-dimensional analysis of consolidated
data supporting end user analytical and navigational activities including:
Calculations and modeling applied across dimensions, through hierarchies and/or across members
Trend analysis over sequential time periods Slicing subsets for on-screen viewing Drill down to deeper levels of consolidation Reach-through to underlying detail data Rotation to new dimensional comparisons in the
viewing area
2 Approaches to conduct the analysis Multidimensional OLAP (MOLAP)
Hypercube Relational OLAP (ROLAP)
In ROLAP, multidimensional database server is replaced with a large relational database server
Internal data
External data
Data Transformation
services
Mapping
measures and
dimensions
Transaction database
Data warehouse
Multidimensional cube
End User OLAP Interface
OLTP
OLTP
Components of OLAP
Infrastructure of Data Warehouses & OLAP Systems
Hypercube data representations make it convenient to query data along any dimension
Sales Performance from Various Markets
CountryCountry
Drill Down Operation of OLAP Cube
Country > RegionCountry > Region
Drill Down Operation of OLAP Cube
Country > Region> CityCountry > Region> City
Workflow Monitoring
CompanyCompanyCustomerCustomer
CustomerCustomer Sales & Sales & MarketingMarketing
ManufacturingManufacturingPMCPMC ShipperShipper
AccountingAccountingWarehouseWarehouse
purchase orderpurchase order
order requestorder requestapprovalapproval
order requestorder request
job job orderorder
delivery notedelivery note
shipping ordershipping order
invoiceinvoice
paymentpayment
purchase purchase confirmationconfirmation
Schematic Diagram of Business Flow
Sample Workflow for Electronic Procurement - Participating Organizations
SupplierSupplierSupplierSupplierBuyerBuyerBuyerBuyerUser
InvoiceApprover
POApprover
CommerceFinance Supplier Reviewer Shipper
Purchase Request
PO RequestApproval
PO ApprovalPurchaseOrder
Configuration
ReviewPurchase Confirmation and ETA
Shipping OrderInvoice
Invoice Request Approval
Invoice ApprovalPayment
App
Shipment Verification
Management And Monitoring
Process
FK1
PK
ScheduleIDPortN am e
PortID
Port
FK1
PK
ScheduleIDM essageN am e
M essageID
M essage
FK1FK2
PK
ScheduleN am eC ontext_optionalA ttributeswhere_optionalM oduleIDG roupIDSta te
ScheduleID
Schedule
PK
M oduleN am e
M oduleID
M oduleA m o du le is a co llec tion o f
scheudu les - in nospe c ified o rde r
m essage s and po rts a redesc ribed in lis ts tha t a re
pa rt o f the sched u le
hea de r.
FK1
PK
m sgportScheduleID
W hereID
W hereTable
PK
G roupN am e
GroupID
GroupTable
SQL SERVERSQL SERVER
HandleApproval
Query
ReceiveApproval Status
Update
Approve
Email UserChangeStatus
Call ValidateSchedule
M onitoring Application
M onitoring Application
BiztalkBiztalk
CustomCustom
Orchestrating Business Activities
BizTalkBizTalk Orchestration EngineOrchestration Engine
COM Components
WebWebServiceService
(Internal)(Internal)
WebWebServiceService
(External)(External)
MSMQ
Exchange Workflows
SQL ServerSQL ServerScriptScriptFilesFiles
BizTalkBizTalk
MessagingMessagingServicesServices
Internal Apps
Business OrchestrationBusinessBusinessProcess Process
FlowFlow
ImplementationImplementation
BizTalk Server- An Integration Server
MS BizTalk Server
Scan-based Trading
Inventory Management
BOM Module
PO Module
CO Module
Other Modules
Other Legacy Systems
Customers
Suppliers
ECTools
BizTalk Server- An Automation Server
MS BizTalk Server
Scan-based Trading
Inventory Management System
Customer
Accounting System
Begin
Receive Inventory Record
Issue Delivery Note
Update Inventory Record
Credit or COD
CustomerIssue InvoiceCOD
Credit customers’ account
Receive Payment
AccountAccounting System
EndPre-defined Business Rule could be added for process automation
Support various types of protocol for messaging
Data format conversion for different formats ECTools
Questions for Discussion Determine the potential OLAP
applications in business operation?
Suggested Answer:
Marketing and sales analysisDatabase marketingBudgetingFinancial reportingManagement reportingProfitability analysisQuality analysis
Questions for Discussion MOLAP is good for handling what kind
of data?
Suggested Answer:
MOLAP is good at handling summarized data, it is not particularly well-suited to handle large amount of detailed data
Questions for Discussion ROLAP is suitable for handling what
kind of data?
Suggested Answer:
ROLAP architectures are especially well-suited to those situations where dynamic access to combinations of summarized and detailed data is more important than the performance gains offered by MOLAP approach using only summarized or pre-consolidated data.
Questions for Discussion Limitations and Challenges to Data
Mining
Suggested Answer:
Identification of missing informationOriginal data set contains the necessary elements for effective mining cannot be detected yetData noise and missing valuesLarge databases and high dimensionality