supporting decision making

43
Supporting Decision Making

Upload: erling

Post on 23-Mar-2016

56 views

Category:

Documents


0 download

DESCRIPTION

Supporting Decision Making. A Framework for IS Management. Introduction (2). Most computer systems support decision making because all software programs involve automating decision steps that people would take - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Supporting Decision Making

Supporting Decision Making

Page 2: Supporting Decision Making

A Framework for IS Management

Page 3: Supporting Decision Making

Introduction (2) Most computer systems support decision making

because all software programs involve automating decision steps that people would take

Decision making is a process that involves a variety of activities, most of which handle information

A wide variety of computer-based tools and approaches can be used to confront the problem at hand and work through its solution

Page 4: Supporting Decision Making

Introduction (3) Computer technologies that support decision

making Decision support system (DSSs) Data mining Executive information systems (EISs) Expert systems (ESs) Agent-based modeling

Multidisciplinary foundations for DS technologies Database research, artificial intelligence, statistical

inference, human-computer interaction, simulation methods, software engineering etc.

Page 5: Supporting Decision Making

Case Example---A Problem-Solving Scenario Using an EIS to discover a sales shortfall in one

region

Investigate several possible causes Economic conditions Competitive analysis Written sales reports A data mining analysis

Result: no clear problems revealed

Page 6: Supporting Decision Making

Decision Support Systems---History Two contributing areas of research in 1950s-

1960s Organizational decision making in CMU Interactive computer systems in MIT

Middle 1970s: single user and model-oriented DSS

Middle and late 1980s: EIS, GDSS, ODSS 1990s: Data warehousing and OLAP Late 1990s-2000s

Data mining Web-based analytical applications

Page 7: Supporting Decision Making

What is a DSS? A DSS aims to use IT to relieve humans of some

decision making or help us make more informed decisions Systems that support, not replace, managers in their

decision-making activities

DSSs are defined as: Computer-based systems That help decision makers Confront ill-structured problems Through direct interaction With data and analysis models

Page 8: Supporting Decision Making

DSS Architecture (1)

Page 9: Supporting Decision Making

DSS Architecture (2)

The Dialog Component Linking the user to the system

The Data Component Data sources --- use all the important data sources

within and outside the organization in the form of summarized data (DW & DM)

The Model Component Models provide the analysis capabilities for a DSS

Using a mathematical representation of the problem, algorithmic processes are employed to generate information to support decision making

Page 10: Supporting Decision Making

A Taxonomy of DSS

Using the mode of assistance as the criterion A model-driven DSS A communication-driven DSS A data-driven DSS or data-oriented DSS A document-driven DSS A knowledge-driven DSS

Page 11: Supporting Decision Making

Executive Information System (1) The emphasis of EIS is on graphical displays

and easy-to-use user interfaces

EIS can be viewed as a DSS that: Provides access to summary performance data Uses graphics to display and visualize the data in

an easy-to-use fashion, and Has a minimum of analysis for modeling beyond

the capability to "drill down" in summary data to examine components

Page 12: Supporting Decision Making

Executive Information System (2) EISs aim to provide both internal and external

information relevant to meeting the strategic goals of the organization Gauge company performance Scan the environment

EIS and data warehousing technologies are converging in the marketplace

The term EIS has lost popularity in favor of Business Intelligence

Page 13: Supporting Decision Making

Data Mining: Motivations The explosive growth of data: from TB to PB

Data collection and data availability Automated data collection tools, database systems, Web,

computerized society Major sources of abundant data

Business: Web, e-commerce, transactions, stocks, … Science: remote sensing, bioinformatics, … Society and everyone: news, digital cameras, YouTube

We are drowning in data, but starving for knowledge! “Necessity is the mother of invention”—Data mining—

Automated analysis of massive data sets

Page 14: Supporting Decision Making

What Is Data Mining?

Data mining (knowledge discovery from data) Extraction of interesting patterns or knowledge from huge amount

of data Alternative names

Knowledge discovery (mining) in databases (KDD), knowledge extraction, data/pattern analysis, data archeology, data dredging, information harvesting, business intelligence, etc.

Watch out: Is everything “data mining”? Simple search and query processing (Deductive) expert systems

Page 15: Supporting Decision Making

Knowledge Discovery (KDD) Process Data mining—core of

knowledge discovery process

Data Cleaning

Data Integration

Databases

Data Warehouse

Task-relevant Data

Selection

Data Mining

Pattern Evaluation

Page 16: Supporting Decision Making

Architecture: A Typical Data Mining System

data cleaning, integration, and selection

Database or Data Warehouse Server

Data Mining Engine

Pattern Evaluation

Graphical User Interface

Knowledge-Base

DatabaseData

WarehouseWorld-WideWeb

Other InfoRepositories

Page 17: Supporting Decision Making

Data Mining: Confluence of Multiple Disciplines

Data Mining

Database Technology Statistics

MachineLearning

PatternRecognition

Algorithm

OtherDisciplines

Visualization

Page 18: Supporting Decision Making

Why Not Traditional Data Analysis? Tremendous amount of data

Algorithms must be highly scalable to handle TB of data High-dimensionality of data

Micro-array may have tens of thousands of dimensions High complexity of data

Data streams and sensor data Time-series data, temporal data, sequence data Structure data, graphs, social networks and multi-linked data Heterogeneous databases and legacy databases Spatial, spatiotemporal, multimedia, text and Web data Software programs, scientific simulations

New and sophisticated applications

Page 19: Supporting Decision Making

Multi-Dimensional View of Data Mining (1) Data to be mined

Relational, data warehouse, transactional, stream, object-oriented/relational, active, spatial, time-series, text, multi-media, heterogeneous, legacy, WWW

Knowledge to be mined Characterization, discrimination, association,

classification, clustering, trend/deviation, outlier analysis, etc.

Multiple/integrated functions and mining at multiple levels

Page 20: Supporting Decision Making

Multi-Dimensional View of Data Mining (2) Techniques utilized

Database-oriented, data warehouse (OLAP), machine learning, statistics, visualization, etc.

Applications adapted Retail, telecommunication, banking, fraud

analysis, bio-data mining, stock market analysis, text mining, Web mining, etc.

Page 21: Supporting Decision Making

Data Mining Functionalities (1) Multidimensional concept description:

characterization and discrimination Generalize, summarize, and contrast data characteristics,

e.g., dry VS. wet regions Frequent patterns, association, correlation vs.

causality Diaper Beer [0.5%, 75%]

Classification and prediction Construct models (functions) that describe and distinguish

classes or concepts for future prediction E.g., classify countries based on (climate), or classify cars based

on (gas mileage) Predict some unknown or missing numerical values

Page 22: Supporting Decision Making

Data Mining Functionalities (2) Cluster analysis

Class label is unknown: Group data to form new classes, e.g., cluster houses to find distribution patterns

Maximizing intra-class similarity & minimizing interclass similarity

Outlier analysis Outlier: Data object that does not comply with the general

behavior of the data Noise or exception? Useful in fraud detection, rare events

analysis Trend and evolution analysis

Trend and deviation: e.g., regression analysis Periodicity analysis

Page 23: Supporting Decision Making

Major Issues in Data Mining (1) Mining methodology

Mining different kinds of knowledge from diverse data types, e.g., bio, stream, Web

Performance: efficiency, effectiveness, and scalability Pattern evaluation: the interestingness problem Incorporation of background knowledge Handling noise and incomplete data Parallel, distributed and incremental mining methods Integration of the discovered knowledge with existing

one: knowledge fusion

Page 24: Supporting Decision Making

Major Issues in Data Mining (2) User interaction

Data mining query languages and ad-hoc mining Expression and visualization of data mining

results Interactive mining of knowledge at multiple levels

of abstraction Applications and social impacts

Domain-specific data mining & invisible data mining

Protection of data security, integrity, and privacy

Page 25: Supporting Decision Making

Artificial Intelligence (1)

AI is a group of technologies that attempts to mimic our senses and emulate certain aspects of human behavior such as reasoning and communication 1956, a conference in Dartmouth College

John McCarthy, Marvin Minsky, Allen Newell and Herbert Simon ( MIT, CMU and Stanford)

1965, H. A. Simon: "machines will be capable, within twenty years, of doing any work a man can do"

1967, Marvin Minsky: "Within a generation ... the problem of creating 'artificial intelligence' will substantially be solved"

Heavily funded by DARPA

Page 26: Supporting Decision Making

Artificial Intelligence (2)

They had failed to recognize the difficulty of some of the problems they faced: The lack of raw computing power The intractable combinatorial explosion of their algorithms, The difficulty of representing commonsense knowledge and

doing commonsense reasoning, The incredible difficulty of perception and motion The failings of logic

First AI Winter In 1974, DARPA cut off all undirected, exploratory

research in AI

Page 27: Supporting Decision Making

Artificial Intelligence (3)

In the early 80s, the field was revived by the commercial success of expert systems By 1985 the market for AI had reached more than

a billion dollars. Minsky and others warned the community that

enthusiasm for AI had spiraled out of control and that disappointment was sure to follow

Second AI Winter The collapse of the Lisp Machine market in 1987

Page 28: Supporting Decision Making

Artificial Intelligence (4)

In the 90s AI achieved its greatest successes Artificial intelligence was adopted throughout

the technology industry, providing the heavy lifting for Data mining Logistics Medical diagnosis …

Page 29: Supporting Decision Making

Expert System

An expert system is an automated type of analysis or problem-solving model that deals with a problem the way an "expert" does The process involves consulting a base of

knowledge or expertise to reason out an answer based on the characteristics of the problem

Page 30: Supporting Decision Making

Architecture of an ES

InferenceEngine

KnowledgeBase

User

Interface

Description of a problem

Advice and explanation

User

Page 31: Supporting Decision Making

Knowledge Representation In AI, the primary aim of knowledge

representation is to store knowledge so that programs can process it and achieve the verisimilitude of human intelligence The representation theory has its origin in cognitive

science

Knowledge can be represented in a number of ways Case-based reasoning Artificial neural networks Stored as rules

Page 32: Supporting Decision Making

Case-based Reasoning (1)

Case-based reasoning The process of solving new problems based on

the solutions of similar past problems A case consists of a problem, its solution, and,

typically, annotations about how the solution was derived

Page 33: Supporting Decision Making

Case-based Reasoning (2)

Case-based reasoning as a four-step process Retrieve: given a target problem, retrieve cases

from memory that are relevant to solving it Reuse: map the solution from the previous case

to the target problem Revise: test the new solution, if necessary, revise

it. Retain: After the solution has been successfully

adapted to the target problem, store the resulting experience as a new case in memory

Page 34: Supporting Decision Making

Supervised vs. Unsupervised Learning Supervised learning

Supervision: The training data (observations, measurements, etc.) are accompanied by labels indicating the class of the observations

New data is classified based on the training set

Unsupervised learning The class labels of training data is unknown Given a set of measurements, observations, etc. with

the aim of establishing the existence of classes or clusters in the data

Page 35: Supporting Decision Making

Artificial Neural Network (1) An interconnected group of artificial neurons

Using a mathematical or computational model for information processing based on a connectionistic approach to computation.

An adaptive system that changes its structure based on external or internal information that flows through the network.

ANNs can be used to model complex relationships between inputs and outputs or to find patterns in data Non-linear statistical data modeling or decision making

tools

Page 36: Supporting Decision Making

Artificial Neural Network (2)

Training set:(1) high salary, owns a house, has a dog, [profitable customer](2) less than 3 years on job, prior bankruptcy, owns a dog, [deadbeat]......

Page 37: Supporting Decision Making

Rule-based Systems (1)

Knowledge stored as rules The most commonly used form of rules is the if-

then statement e.g. IF some condition THEN some action

A rule-based inference model: decision tree Each internal node (non-leaf node) denotes a test

on an attribute Each branch represents an outcome of the test Each leaf node holds a class label

Page 38: Supporting Decision Making

Rule-based Systems (2)

age income student credit_rating buys_computer<=30 high no fair no<=30 high no excellent no31…40 high no fair yes>40 medium no fair yes>40 low yes fair yes>40 low yes excellent no31…40 low yes excellent yes<=30 medium no fair no<=30 low yes fair yes>40 medium yes fair yes<=30 medium yes excellent yes31…40 medium no excellent yes31…40 high yes fair yes>40 medium no excellent no

Training dataset for decision tree buys_computer

Page 39: Supporting Decision Making

Rule-based Systems (3)

age?

overcast

student? credit rating?

<=30 >40

no yes yes

yes

31..40

no

fairexcellentyesno

Decision tree buys_computer

Page 40: Supporting Decision Making

Agent-based Modeling

Simulate the behavior that emerges from the decisions of a large number of distinct individuals Computer generated agents, each making

decisions typical of the decisions an individual would make in the real world

Trying to understand the mysteries of why businesses, markets, consumers, and other complex systems behave as they do

Page 41: Supporting Decision Making

Toward the Real-Time Enterprise The essence of the phrase real-time

enterprise is that organizations can know how they are doing at the moment Digitization and automation of some crucial

enterprise activities traditionally completed by people Esp. information analysis

Better sense-and-response

Page 42: Supporting Decision Making

Real-time Reporting

Real-time reporting is occurring on a whole host of fronts including: Enterprise nervous systems

A network that connects people, applications and devices To coordinate company operations

Straight-through processing To reduce distortion in supply chains

Real-time CRM To automate decision making relating to customers, and

Communicating objects To gain real-time data about the physical world E.g. radio frequency identification device (RFID)

Page 43: Supporting Decision Making

The Dark Side of Real Time

Object-to-object communication could compromise privacy Knowing the exact location of a company truck

every minute of the day is an invasion the driver's privacy

In the era of speed, a situation can become very bad very fast E.g. "circuit breaker" to stop deep dives in NYSE