data analysis
DESCRIPTION
This is on Data analysisTRANSCRIPT
Presented By:
Shankar Kumud
Kushwaha Sikander
Data Analysis is
Data analysis is a process of:Gathering;Modeling; and Transforming of data.
With the goal of highlighting useful Information; Suggesting conclusions; and Supporting decision making.
04/11/23 XIDAS, Jabalpur 2
04/11/23 XIDAS, Jabalpur 3
04/11/23 XIDAS, Jabalpur 4
04/11/23 XIDAS, Jabalpur 5
04/11/23 XIDAS, Jabalpur 6
04/11/23 XIDAS, Jabalpur 7
04/11/23 XIDAS, Jabalpur 8
04/11/23 XIDAS, Jabalpur 9
04/11/23 XIDAS, Jabalpur 10
04/11/23 XIDAS, Jabalpur 11
Thank You
04/11/23 XIDAS, Jabalpur 12
Major Data Analysis Techniques Correlation Analysis; Regression Analysis; Factor Analysis; Cluster Analysis; Correspondence Analysis (Brand
Mapping); Conjoint Analysis; CHAID Analysis; Discriminant /Logistic Regression Analysis; Multidimensional Scaling; and Structural Equation Modeling.
04/11/23 XIDAS, Jabalpur 13
CORRELATION ANALYSIS Correlation analysis, expressed by correlation
coefficients, measures the degree of linear relationship between two variables.
Feature of Correlation coefficient:Between + and – 1;The sign of the correlation coefficient (+, -) defines
the direction of the relationship, +tive or –tive;A positive correlation coefficient means that as the
value of one variable increases, the value of the other variable also increases; as one decreases the other decreases; and
A negative correlation coefficient indicates that as one variable increases, the other decreases, and vice-versa.
04/11/23 XIDAS, Jabalpur 14
Cont.. The absolute value of the correlation
coefficient measures the strength of the relationship.
A correlation coefficient of r=0.50 indicates a stronger degree of linear relationship than one of r=0.40.
Correlation coefficient of zero (r=0.0) indicates the absence of a linear relationship.
Correlation coefficients of r=+1.0 and r=-1.0 indicate a perfect linear relationship.
04/11/23 XIDAS, Jabalpur 15
Diagrammatic presentation “r”
04/11/23 XIDAS, Jabalpur 16
R=0.5
R=1
R= -0.5
Regression analysis Regression analysis measures the:
strength of a relationship between a variable (e.g. overall customer satisfaction)
one or more explaining variables (e.g. satisfaction with product quality and price).
Correlation provides a single numeric summary of a relation (called the correlation coefficient), while regression analysis results in a "prediction" equation.
The regression equation describes the relation between the variables. If the relationship is strong (expressed by the Rsquare value), it can be used to predict values of one variable given the other variables have known values.
04/11/23 XIDAS, Jabalpur 17
Cont..
For example: how will the overall satisfaction score change if satisfaction with product quality goes up from 6 to 7.
04/11/23 XIDAS, Jabalpur 18
Factor Analysis Factor analysis aims to
describe a large number of variables or questions by only using a reduced set of underlying variables, called factors.
It explains a pattern of similarity between observed variables. Questions which belong to one factor are highly correlated with each other.
Factor Analysis
Exploratory Confirmatory
04/11/23 XIDAS, Jabalpur 19
Types of Factor Analysis
Use of Factor Analysis Factor analysis is often used in customer satisfaction
studies to identify underlying service dimensions, and in profiling studies to determine core attitudes.
For example, as part of a national survey on political opinions, respondents may answer three separate questions regarding environmental policy, reflecting issues at the local, regional and national level.
Factor analysis can be used to establish whether the three measures do, in fact, measure the same thing.
It is can also prove to be useful when a lengthy questionnaire needs to be shortened, but still retain key questions.
Factor analysis will indicate which questions can be omitted without losing too much information.
04/11/23 XIDAS, Jabalpur 20
CLUSTER ANALYSISCluster analysis is an exploratory tool designed to reveal natural groupings within a large group of observations. Cluster analysis segments the survey sample, i.e. respondents or companies, into a small number of groups.
04/11/23 XIDAS, Jabalpur 21
BRAND MAPPING (CORRESPONDENCE ANALYSIS) Correspondence analysis is a technique
which: Allows rows and columns of a data matrix, E.g. average satisfaction scores for several
products, to be displayed as points in a two dimensional space or map. It reduces a complicated set of data to a graphical display which is immediately and easily interpretable. Brand maps are based on correspondence analysis.
Brand maps are often used to illustrate customers' images of the market by placing products and attributes together on a map. This allows close interpretation of company perceptions with a variety of product and service attributes simultaneously.
04/11/23 XIDAS, Jabalpur 22
04/11/23 XIDAS, Jabalpur 23
CONJOINT ANALYSIS Conjoint analysis is a technique for measuring
respondent preferences about the attributes of a product or service.
It is the ideal tool for new/improved product development.
The conjoint analysis task asks the respondents to make choices in the same fashion as consumers normally do, by trading off features one against the other, either by ranking or choosing one of several product combinations.
E.g. a task could be: do you prefer a "flight that is cramped, costs £250 and has one stop" or a "flight that is spacious, costs £500 and is direct"?
04/11/23 XIDAS, Jabalpur 24
Example of C A
04/11/23 25XIDAS, Jabalpur
Importance Of Printer Features, Plus Simulator
Cont..
04/11/23 XIDAS, Jabalpur 26
CHAID ANALYSIS CHAID (Chi Squared Automatic
Interaction Detection) is used to build: a predictive model, based on a
classification system. The analysis subdivides the sample into
a series of subgroups that :○ 1) share similar characteristics towards a
specific response variable and that ○ 2) maximises our ability to predict the
values of the response variable.
04/11/23 XIDAS, Jabalpur 27
04/11/23 28XIDAS, Jabalpur
The output is a tree of which the branches are the predictor variables that split the sample in discriminating groups.
DISCRIMINANT/LOGISTIC REGRESSION ANALYSIS Discriminant and logistic
regression analysis are statistical techniques that point out the differences between two or more groups based on several characteristics (most often rating scales when Discriminant analysis, while logistic regression can handle any type of variable)
Is often used : to determine which
customers are likely to buy a company's product
to decide whether a bank should offer a loan to a new company or
to identify patients which may be at high risk for medical problems
04/11/23 XIDAS, Jabalpur 29
Diagrammatic Presentation
04/11/23 XIDAS, Jabalpur 30
MULTIDIMENSIONAL SCALING Multidimensional scaling (MDS) can be
considered to be an alternative to factor analysis.
In general, the goal of the analysis is to detect meaningful underlying dimensions that allow the researcher to explain observed similarities or dissimilarities between the investigated objects. In factor analysis, the similarities between objects (e.g. variables) are expressed in the correlation matrix.
With MDS one may analyse any kind of similarity or dissimilarity matrix, in addition to correlation matrices.
04/11/23 XIDAS, Jabalpur 31
MDS methods are applicable to a wide variety of research designs.
04/11/23 XIDAS, Jabalpur 32
STRUCTURAL EQUATION MODELING Structural Equation Modeling (SEM) is a
very general, very powerful multivariate analysis technique that includes a number of other traditional analysis methods as special cases.
It effectively includes a whole range of standard multivariate analysis methods, such as regression, factor analysis and analysis of variance.
A structural equation model can exist with several regression and factor analysis models, which are estimated simultaneously.
04/11/23 XIDAS, Jabalpur 33
E.g. of a CRM model using survey
04/11/23 XIDAS, Jabalpur 34
Thank You
04/11/23 XIDAS, Jabalpur 35