Anwendersoftware aaAnwendungssoftware
ss
Data Warehousing and Analytics: Introduction to Assignments
Holger Schwarz, Universität Stuttgart
Winter Term 2014/2015
Assignment 1
• Data Mining: Discuss options for applying data mining techniques to the data of a given scenario in order to provide relevant information that supports management decision making.
• OLAP: Provide several SQL statements that return data answering typical OLAP-type information requests. For these statements, you may use the OLAP features of SQL that you learned in the lecture.
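As an illustration of an OLAP-type request, the following minimal sketch computes revenue per region and product together with subtotals and a grand total. The table `sales`, its columns, and the sample rows are hypothetical and not part of the assignment scenario. SQLite is used only so that the example runs anywhere; because SQLite has no `GROUP BY ROLLUP`, the rollup is emulated with `UNION ALL`. In DB2 the same result could be requested with `GROUP BY ROLLUP(region, product)`.

```python
import sqlite3

# Hypothetical sales facts: (region, product, revenue).
SALES = [
    ("North", "Laptop", 1200.0),
    ("North", "Phone", 800.0),
    ("South", "Laptop", 500.0),
    ("South", "Phone", 700.0),
]

# Emulates GROUP BY ROLLUP(region, product): detail rows, per-region
# subtotals, and a grand total. NULL marks a rolled-up column.
ROLLUP_QUERY = """
SELECT region, product, SUM(revenue) FROM sales GROUP BY region, product
UNION ALL
SELECT region, NULL, SUM(revenue) FROM sales GROUP BY region
UNION ALL
SELECT NULL, NULL, SUM(revenue) FROM sales
"""


def rollup_revenue(rows):
    """Load the rows into an in-memory table and return the rollup result."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE sales (region TEXT, product TEXT, revenue REAL)"
        )
        conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
        return conn.execute(ROLLUP_QUERY).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for row in rollup_revenue(SALES):
        print(row)
```

The row order of a `UNION ALL` result is not guaranteed, so add an `ORDER BY` clause if the report needs a fixed order.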
Rules for Assignment 1
• Work on this assignment in teams of four students.
• Prepare a result document (PDF) with your solutions for all tasks.
• Send this document to [email protected] no later than October 31, 10:00 am. Your email must also include your name, team number, matriculation number, and study programme.
• Contact Holger Schwarz if you have any further questions.
Assignment 2: Goals
• Hands-on experience with tools supporting data warehouse processes
• Focus on analytics
• Set of prepared and guided exercises (a document describes what to do)
• Additional tasks extending the prepared exercises
• Each team will have access to a virtual machine with IBM InfoSphere Warehouse and Cognos installed.
• You will run through a tutorial that explains some basic tasks of using these tools:
– Explore data and data models
– Prepare, deploy, and visualize data cubes
– Create mining flows
– Prepare a customer segmentation
– Prepare a revenue forecast
• Extend the reports and data mining models of the tutorial.
Rules for Assignment 2
• You participate in this hands-on lab in teams of four students.
• Each team gets its own virtual machine for this exercise.
• Each team has to complete all tasks by December 1. Please send a PDF to [email protected] covering the following:
– Names, study programmes, and matriculation numbers of all team members
– For each exercise, the team members who are ready to present the results
– Specific difficulties with the hands-on lab. Are there any aspects where the description does not match what you found on the server?
• Each team has to make an appointment with Holger Schwarz to present the results (15 to 30 minutes per team).
• Each group member has to participate actively in this result presentation. You will be asked to present selected results.
• Result presentations are scheduled for the end of November and December. Appointments will be organized by email/Doodle.
InfoSphere Warehouse Architecture
Figure: architecture diagram with the components DB2, Cognos BI, Admin Console (control and monitor flows and cubes on the production system; manages deployment; workload management), Design Studio (design, develop, and optimize warehouse and analytics), and Deployment.
Figure: lab steps mapped onto the architecture.
• Exercise 1: explore data model, deploy data model, import data, run report
• Exercise 2: dimensional data modeling, import cube model, deploy cube model
• Exercise 3: prepare customer segmentation, deploy segmentation model, display segmentation
• Exercise 5: create mining flow, deploy and run mining flow, display mining results (forecast)
Lab Overview
• Exercise 1: Using the InfoSphere Warehouse Model Pack for Customer Insight to deploy a basic analytical schema with appropriate Cognos reports
– Import the physical data model
– Work with Data Architect tooling (part of Design Studio)
– Import predefined Cognos reports
Lab Overview
• Exercise 2: Multidimensional modeling with InfoSphere Warehouse and OLAP analysis using Cognos
– Data Architect
– SQL Warehousing
– Cubing Services
– Cognos Reporting
Lab Overview
• Exercise 3: Descriptively prepare your data and perform customer segmentation using InfoSphere Warehouse
– Descriptive Data Preparation
– Data Mining Solution Plan for Customer Segmentation
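To make the idea of customer segmentation concrete, here is a minimal clustering sketch. It uses k-means, which is a swapped-in technique chosen for brevity; the mining flow in the tutorial may use a different clustering algorithm. The customer features (age, yearly spend in thousands), the sample values, and the initial centroids are assumptions for illustration only.

```python
from math import dist


def k_means(points, centroids, iterations=10):
    """Assign each point to its nearest centroid, then recompute centroids."""
    centroids = [tuple(c) for c in centroids]
    assignment = []
    for _ in range(iterations):
        # Assignment step: index of the closest centroid for every point.
        assignment = [
            min(range(len(centroids)), key=lambda k: dist(p, centroids[k]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = tuple(
                    sum(values) / len(members) for values in zip(*members)
                )
    return assignment, centroids


# Hypothetical customers: (age, yearly spend in thousands).
CUSTOMERS = [(22, 1.0), (25, 1.5), (24, 1.2), (58, 9.0), (61, 8.5), (60, 9.5)]

if __name__ == "__main__":
    segments, centers = k_means(CUSTOMERS, [CUSTOMERS[0], CUSTOMERS[3]])
    print(segments)
    print(centers)
```

The features are left unscaled here. With real data, normalizing them first keeps a single attribute with a large value range from dominating the distance.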
Lab Overview
• Exercise 5: Revenue Forecasting with InfoSphere Warehouse
– Data Mining Preparation
– Time Series Forecasting
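As a rough illustration of time series forecasting, the sketch below produces a one-step-ahead forecast with simple exponential smoothing. This is an assumed stand-in and not necessarily the method used by the mining flow in the lab; the function name, the smoothing factor `alpha`, and the revenue figures are hypothetical.

```python
def exponential_smoothing_forecast(series, alpha=0.5):
    """Return a one-step-ahead forecast via simple exponential smoothing."""
    if not series:
        raise ValueError("series must not be empty")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    level = series[0]
    for value in series[1:]:
        # Blend the newest observation with the previous smoothed level.
        level = alpha * value + (1 - alpha) * level
    return level


if __name__ == "__main__":
    # Hypothetical monthly revenue figures.
    monthly_revenue = [100.0, 110.0, 120.0]
    print(exponential_smoothing_forecast(monthly_revenue, alpha=0.5))  # 112.5
```

A larger `alpha` weights recent observations more heavily; with `alpha=1.0` the forecast is simply the last observed value.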
Documents
• assignment2_v01.pdf (available in approx. 10 days)
– Describes details on how to access your VM
– Each group has its private VM with all the necessary software installed.
– To access your server …
  - You may use any computer/laptop connected to the network of the faculty of computer science (Universitätsstraße 38).
  - See the PDF for details on the requirements
• HandsOnLab.pdf
– Describes five exercises
– Each team has to complete exercises 1, 2, 3, 5
How to organize the teams?
• Go to: http://goo.gl/vp14sR
• Download the file template.txt
• Fill in the data for your team of (exactly!) three students
• Teams of two students might be combined (depending on the overall number of participants)
• Upload the file as Lastname1Lastname2Lastname3.txt
• Data must be available on Monday, October 6, 2014, 10 am
• Change requests are possible until Thursday, October 9, 2014, 8 am
Template.txt
Team member 1:
last name:
first name:
study programme:
email:
Team member 2:
last name:
first name:
study programme:
email:
Team member 3:
last name:
first name:
study programme:
email:
…
Comments:
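The naming rule and the template fields can be sketched in a few lines. This is only an illustration: the helper names, the sample team, and the exact line layout of the rendered text are assumptions, and the downloaded template.txt remains the authoritative format.

```python
# Field labels as they appear in the template.
FIELDS = ("last name", "first name", "study programme", "email")


def team_file_name(last_names):
    """Concatenate the members' last names and append the .txt extension."""
    cleaned = [name.strip() for name in last_names]
    if not cleaned or any(not name for name in cleaned):
        raise ValueError("every team member needs a non-empty last name")
    return "".join(cleaned) + ".txt"


def render_template(members, comments=""):
    """Render one block of labelled fields per team member plus comments."""
    lines = []
    for number, member in enumerate(members, start=1):
        lines.append(f"Team member {number}:")
        lines.extend(f"{field}: {member.get(field, '')}" for field in FIELDS)
    lines.append(f"Comments: {comments}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical team of three students.
    team = [
        {"last name": "Meier", "first name": "Anna",
         "study programme": "INFOTECH", "email": "anna@example.org"},
        {"last name": "Schmidt", "first name": "Ben",
         "study programme": "INFOTECH", "email": "ben@example.org"},
        {"last name": "Weber", "first name": "Cem",
         "study programme": "INFOTECH", "email": "cem@example.org"},
    ]
    print(team_file_name([m["last name"] for m in team]))
    print(render_template(team))
```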