data mining in knowledge management

31
Data Mining in Knowledge Data Mining in Knowledge Management Management Fakulti Sains Komputer & Fakulti Sains Komputer & Teknologi Maklumat Teknologi Maklumat Fatimah Sidi Fatimah Sidi 19/06/2002 19/06/2002

Upload: tommy96

Post on 02-Jul-2015

1.050 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Data Mining in Knowledge Management

Data Mining in Knowledge Data Mining in Knowledge ManagementManagement

Fakulti Sains Komputer & Fakulti Sains Komputer & Teknologi MaklumatTeknologi Maklumat

Fatimah SidiFatimah Sidi

19/06/200219/06/2002

Page 2: Data Mining in Knowledge Management

2

Definitions of KM

■ Address business Address business problems particular problems particular to businessto business– creates and deliver innovative products or creates and deliver innovative products or

services; services; – managing and enhancing relationships with managing and enhancing relationships with

existing and and new customers, partners, existing and and new customers, partners, and suppliers; or and suppliers; or

– administering and improving work practices administering and improving work practices and processes. (Tiwana, 2000)and processes. (Tiwana, 2000)

Page 3: Data Mining in Knowledge Management

3

Definitions of KM

■ A system produces knowledgeA system produces knowledge – gathers information gathers information – compares conceptual formulations compares conceptual formulations

describing and evaluating its experience, describing and evaluating its experience, with its goals, objectives, expectations or with its goals, objectives, expectations or past formulations of descriptions, or past formulations of descriptions, or evaluations by comparison with reference evaluations by comparison with reference to validation criteria (Firestone, 1998)to validation criteria (Firestone, 1998)

Page 4: Data Mining in Knowledge Management

4

Definitions of KM

■ A system maintains knowledge by A system maintains knowledge by continues to evaluate its knowledge continues to evaluate its knowledge base against new information by base against new information by subjecting the knowledge base to subjecting the knowledge base to continuous testing against its validation continuous testing against its validation criteria.criteria.

Page 5: Data Mining in Knowledge Management

5

Definitions of KM

■ requires a knowledge base to begin requires a knowledge base to begin operation where it enhances its own operation where it enhances its own knowledge base with the passage of knowledge base with the passage of time because it is a self-correcting time because it is a self-correcting system, and subjects its knowledge system, and subjects its knowledge base to testing against experience.base to testing against experience.

Page 6: Data Mining in Knowledge Management

6

Definitions of KM

■ ““re-badging” of earlier information and re-badging” of earlier information and data management methodsdata management methods

■ Like any system of thgought that has Like any system of thgought that has value, both old and new and its value, both old and new and its combined new ideas with ideas that combined new ideas with ideas that “everyone has know all along” (Prusak, “everyone has know all along” (Prusak, 2001)2001)

Page 7: Data Mining in Knowledge Management

7

Definitions of KM

■ ConclusionConclusion– Knowledge Management is Knowledge Management is

providing the growth of knowledge providing the growth of knowledge and also a new ways to channel and also a new ways to channel raw data into meaningful raw data into meaningful information which in turn can information which in turn can become knowledgebecome knowledge

Page 8: Data Mining in Knowledge Management

8

Difference Between Data, Information & Knowledge

■ DataData– facts, numbers, or text facts, numbers, or text – operational or transactional dataoperational or transactional data– non operational datanon operational data– metadata - data about the data metadata - data about the data

Page 9: Data Mining in Knowledge Management

9

Difference Between Data, Information & Knowledge

■ InformationInformation– Collection of data is not information Collection of data is not information

unless exist relation between the dataunless exist relation between the data– Patterns, associations or relationships Patterns, associations or relationships

among data provide informationamong data provide information

Page 10: Data Mining in Knowledge Management

10

Difference Between Data, Information & Knowledge

■ KnowledgeKnowledge– Information converted to knowledge Information converted to knowledge

about historical patterns and future about historical patterns and future trendstrends

– Subset of information Subset of information extracted, filtered or formatted in a very extracted, filtered or formatted in a very

special wayspecial way Subjected to and passed tests of validationSubjected to and passed tests of validation

Page 11: Data Mining in Knowledge Management

11

Difference Between Data, Information & Knowledge

■ KnowledgeKnowledge

Common sense knowledge is Common sense knowledge is information that has been validated by information that has been validated by common sense experiencecommon sense experience

Page 12: Data Mining in Knowledge Management

12

Difference Between Data, Information & Knowledge

■ KnowledgeKnowledge

Scientific knowledge is information Scientific knowledge is information (hypotheses and theories) validated (hypotheses and theories) validated by rules and tests applied to it by by rules and tests applied to it by some scientific communitysome scientific community

Page 13: Data Mining in Knowledge Management

13

Difference Between Data, Information & Knowledge

■ KnowledgeKnowledge

Organizational knowledge is Organizational knowledge is information validated by rules and information validated by rules and tests of the organization seeking tests of the organization seeking knowledge that improves knowledge that improves organizational performanceorganizational performance

Page 14: Data Mining in Knowledge Management

14

Difference Between Data, Information & Knowledge

■ KnowledgeKnowledge

leads to Wisdom arises when one leads to Wisdom arises when one understands the foundational understands the foundational principles responsible for the patterns principles responsible for the patterns representing knowledge.representing knowledge.

Page 15: Data Mining in Knowledge Management

15

Difference Between Data, Information & Knowledge

data understanding

information

Understandingrelations

knowledge

Understandingpatterns

wisdom

Understandingprinciples

Contextindependece

( Gene Bellinger)( Gene Bellinger)

Page 16: Data Mining in Knowledge Management

16

Components KM technology framework (Tiwana, 2000)

Knowledge Management

Workflow

Data Mining

Project Management

Document Management

Groupware

Decision Support System

Page 17: Data Mining in Knowledge Management

17

Components KM technology framework (Tiwana, 2000)

■ Key Functions :-Key Functions :-– Knowledge FlowKnowledge Flow– Information mappingInformation mapping– Information sourcesInformation sources– Information and knowledge exchangeInformation and knowledge exchange– Intelligent agent and network miningIntelligent agent and network mining

Finding knowledgeFinding knowledge

Page 18: Data Mining in Knowledge Management

18

Data mining in KM

■ mechanism to appropriately cluster mechanism to appropriately cluster search results in different pre-specified search results in different pre-specified content categories as specified in the content categories as specified in the knowledge map. knowledge map.

■ Drill down into a relevant category Drill down into a relevant category without having to learn the subtleties of without having to learn the subtleties of complex query languages and syntaxescomplex query languages and syntaxes

Page 19: Data Mining in Knowledge Management

19

Definitions of DM

■ Sometimes called data or knowledge Sometimes called data or knowledge discovery discovery

■ Process Process of analyzing data from different of analyzing data from different perspectives and summarizing it into perspectives and summarizing it into useful information anduseful information and

■ Finding correlations or patterns among Finding correlations or patterns among dozens of fields in large relational dozens of fields in large relational databases. databases.

Page 20: Data Mining in Knowledge Management

20

Definitions of DM

■ (Holsheimer and Siebes, 1994) (Holsheimer and Siebes, 1994) – searching for relationships and global searching for relationships and global

patterns that exist in large databases, but patterns that exist in large databases, but are “hidden” among the vast amounts of are “hidden” among the vast amounts of data. data.

Page 21: Data Mining in Knowledge Management

21

Definitions of DM

■ (Miller and Rohberg, 1996) (Miller and Rohberg, 1996) – tool that identifies and characterize tool that identifies and characterize

interrelationships among multivariable interrelationships among multivariable dimensions without requiring a human to dimensions without requiring a human to ask specific questions. ask specific questions.

– looks for trends and patterns looks for trends and patterns – finds relationships and make prediction.finds relationships and make prediction.

Page 22: Data Mining in Knowledge Management

22

Definitions of DM

■ (Han and Kamber, 2001)(Han and Kamber, 2001)– extracting or “mining” knowledge from extracting or “mining” knowledge from

large amounts of data. large amounts of data. – essential step in the process of knowledge essential step in the process of knowledge

discovery in databases, consists of an discovery in databases, consists of an iterative sequence of the following steps:iterative sequence of the following steps:

■ Data cleaningData cleaning

Page 23: Data Mining in Knowledge Management

23

Definitions of DM

– Data integrationData integration– Data selectionData selection– Data transformationData transformation– Data miningData mining– Pattern evaluationPattern evaluation– Knowledge presentationKnowledge presentation

Page 24: Data Mining in Knowledge Management

24

How does DM work?

■ Large scale information evolved Large scale information evolved transaction and analylitical systems transaction and analylitical systems separatelyseparately

■ DM provides link between the twoDM provides link between the two– Analyzes relationships and pattern in Analyzes relationships and pattern in

stored transaction data based on open stored transaction data based on open queries.queries.

Page 25: Data Mining in Knowledge Management

25

How does DM work?

■ Several types of analytical software Several types of analytical software available available – StatisticalStatistical– Machine learning andMachine learning and– Neutral networksNeutral networks

■ DM functionalities used to specify kind DM functionalities used to specify kind of pattern found in data mining task :of pattern found in data mining task :

Page 26: Data Mining in Knowledge Management

26

Classification of DM

■ Summarization Summarization (Holsheimer and Siebes, 1994)(Holsheimer and Siebes, 1994) ■ Association RulesAssociation Rules■ ClassificationClassification■ ClusteringClustering■ PredictionPrediction■ Sequential PatternsSequential Patterns■ Similarity SearchSimilarity Search

Page 27: Data Mining in Knowledge Management

27

Classification of DM

■ Similarity SearchSimilarity Search (Algawal & Swami, 1993)(Algawal & Swami, 1993) ■ Outlier Anlysis Outlier Anlysis (Han & Kamber, 2001)(Han & Kamber, 2001) ■ Evolution AnalysisEvolution Analysis

Page 28: Data Mining in Knowledge Management

28

Major element in DM

■ Extract, transform and load Extract, transform and load transactional data to DWtransactional data to DW

■ Store and manage the data Store and manage the data ■ Provide data access to business Provide data access to business

analysts and information technology analysts and information technology professionalsprofessionals

■ Analyze the dataAnalyze the data■ Present the dataPresent the data

Page 29: Data Mining in Knowledge Management

29

Levels of Analysis

■ Artificial neural networks: Non-linear Artificial neural networks: Non-linear predictive modelspredictive models

■ Genetic algorithmsGenetic algorithms■ Decision treesDecision trees■ Nearest neighbor methodNearest neighbor method■ Rule inductionRule induction■ Data visualizationData visualization

Page 30: Data Mining in Knowledge Management

30

Objectives of the study

■ To study the effective method of mining To study the effective method of mining the knowledge in data miningthe knowledge in data mining

■ To develop and implement the methods To develop and implement the methods in mining the knowledgein mining the knowledge

■ To test and measure its performance To test and measure its performance retrieving the knowledgeretrieving the knowledge

Page 31: Data Mining in Knowledge Management

31

Thank You