a lesson 1 introduction to statistics & spss
TRANSCRIPT
-
8/12/2019 A Lesson 1 Introduction to Statistics & SPSS
1/8
Statistical Analysis using SPSS Lesson 1
Lesson 1
Introduction to Statistics
What is statistics?
Why is statistics needed?
Population and sample
Variable
Measurement
Data
Introduction to SPSS Windows
Basic steps in data analysis using SPSS
1
-
8/12/2019 A Lesson 1 Introduction to Statistics & SPSS
2/8
-
8/12/2019 A Lesson 1 Introduction to Statistics & SPSS
3/8
Statistical Analysis using SPSS Lesson 1
Information tested and retested with different samples and at different time points
becomes facts, if the data can consistently support it. Finally, facts become knowledge
when they are used in the successful completion of the decision process. The whole
sequence is known as data-driven decision-making process. Figure below shows the level
of statistical methods or procedures needed for a study depends on the desired level of
improvement in decision making.
Level of Knowledge
statistical
methods
Facts
Information
Data Level of improvement in decision making
Data-driven decision-making process
Population and Sample
Population: A set of things or objects in which we have an interest at the particular time.
Examples: Workers at a factory, students in a college, in-patients at a hospital.
Sample: A subset of the population
Examples: A group of workers at the factory, a selection of students from the college.
Several sampling methods can be used to obtain a sample from the population. They can
be classified under two broad categories: probability sampling and non-probability
sampling. In probability sampling, we require a sample frame a list of all the items or
objects in the population. Within probability sampling there are a few types of sampling
procedures, the basic one is the simple random sample. All analyses in SPSS assume that
the data are collected using this simple random sampling procedure.
Variable
A variable can be defined as a characteristic of things or objects that take different values
in different items that are tested. The opposite of a variable is a constant.
For example, the weight of newborn babies varies from one to another. So, the weight of
newborn babies is considered to be a variable. The gender of the babies also differs from
one to another. So, gender is a variable too. The whole field of statistics revolves around
this term variable and the hundreds of statistical tools we have are concerned withdescribing these variables and finding associations between them.
3
-
8/12/2019 A Lesson 1 Introduction to Statistics & SPSS
4/8
Statistical Analysis using SPSS Lesson 1
There are two types of variables:
Qualitative variable: This is a phrase used to describe characteristics that cannot
be measured or counted, but merely categorized like race, sex, colour, exam
grades and blood group.
Quantitative variable: This is a phrase used to describe measurable characteristics
like height, weight, age and exam marks and counts like number of passes,
number students and number of accidents.
Measurement
Measurement is the assignment of numbers to represent a characteristic. It is useful to
clarify what is being measured and what it measures. For example, the clinical
thermometer measures the body temperature. But what does the body temperature
measure or indicate? Perhaps, the body temperature is an indicator of the presence ofbacterial or viral infection.
The units of measurements are equally important for computations and inference
purposes. For example, consider an increase in body temperature. An increase of 30
Fahrenheit may not be a cause for concern, but an increase of 30 Celsius may be critical.
Concepts and Indicators
A concept is what we are hoping to capture and indicators are what we use to capture it.
Say, a doctor wants to establish the health status of a group of workers. The health status
is a concept. Since health status varies from person to person, it can be considered to be a
variable too. Health status is a concept and it is not directly measurable it is an
unobserved measure, often called a latent variable. Then, how do we measure the
unobserved? First, we need to identify some reliable indicators of health status. In
healthcare, variables like weight, blood pressure, cholesterol, blood sugar levels are often
used as some indicators of health status. These are measurable variables and their units of
measurement are different too. If a persons blood pressure is always high, he is said to be
of poor health. Blood pressure is also known to have high levels of association with
cholesterol level, blood sugar level and weight. A person who has values in the normal
range, for all of these measures, is said to be healthy.
Data
Data can be considered as the raw material of statistics. The information gathered, facts
tested and ultimately the knowledge gained, depends heavily on the quality of data
collected. Therefore, considerable importance must be paid to the data collection stage.
Data can be obtained either from primary or secondary sources. Data compiled from
sources like records, journals and archives are called secondary data. While data collected
primarily through designed experiments or surveys, by the researcher are called primarydata.
4
-
8/12/2019 A Lesson 1 Introduction to Statistics & SPSS
5/8
Statistical Analysis using SPSS Lesson 1
Types of data
1.
Qualitative data can be classified further into nominal dataand ordinal data.
Nominal data are categorical characteristics that you can name.
Examples: Gender: Male or female based on physical traits.
Blood group: A, B, AB or O based on allele types.
Of course, it is not true that group A is better than group B.
They are just names given based on particular characteristics.
Ordinal data are categorical characteristics that you can name and rank as well.
Examples: Socio-economic status: Low, middle or high.
Exam grades: A, B, C, D or E based on level of achievement.
Of course, grade A is better than grade B and so on.
2. Quantitative data can be classified into discrete dataand continuous data.
Discrete data are numerical characteristics that are countable (whole numbers).
Examples: Number of males and number of females
Number of patients waiting for surgery
Number of students sitting for an exam
Continuous data are numerical characteristics that are measurable.
Examples: Marks obtain by students
Body mass index (BMI) of patients
Time taken by athletes to complete a road race
Since continuous data are measureable, they can be measured in decimals.
It is very important to understand the different types of data so that they can be described
and presented in an appropriate manner. For example, it does not make sense to find the
average for a group of males and females. In this case the information is best stated in the
form of percentages. Variables like weight and height are best described using average
and percentiles. For visual data presentations, bar charts should be used for qualitativedata and histograms should be used for quantitative data. The underlying distributions
also differ for different data types. In making inferences, the choice of statistical tests
depends on the type of data.
5
-
8/12/2019 A Lesson 1 Introduction to Statistics & SPSS
6/8
Statistical Analysis using SPSS Lesson 1
Introduction to SPSS Windows
Statistical Packages for Social Sciences (SPSS) for Windows provides a powerful
statistical analysis and data management system in a graphical environment, using
descriptive menus and simple dialog boxes to do most of the work for you. Most tasks
can be accomplished simply by pointing and clicking the mouse.
In addition to the simple point-and-click interface for statistical analysis, SPSS for
Windows provides:
Data Editor. A versatile spreadsheet-like system for defining, entering, editing,
displaying data.
Viewer. The Viewer makes it easy to browse your results, selectively show and hide
output, change the display order results, and move presentation-quality tables and charts
between SPSS and other applications.
Multidimensional pivot tables. Results come alive with multidimensional pivot tables.
Explore your tables by rearranging rows, columns, and layers. Uncover important
findings that can get lost in standard reports. Compare groups easily by splitting your
table so that only one group is displayed at a time.
High-resolution graphics. High-resolution, full-color pie charts, bar charts, histograms,
scatterplots, 3-D graphics, and more are included as standard features in SPSS.
Database access. Retrieve information from databases by using the Database Wizard
instead of complicated SQL queries.
Data transformations. Transformation features help get your data ready for analysis. You
can easily subset data, combine categories, add, aggregate, merge, split, and transpose
files, and more.
Electronic distribution.Send e-mail reports to others with the click of a button, or export
tables and charts in HTML format for Internet and Intranet distribution.
Online Help.Detailed tutorials provide a comprehensive overview; context-sensitive Help
topics in dialog boxes guide you through specific tasks; pop-up definitions in pivot table
results explain statistical terms; the Statistics Coach helps you find the procedures thatyou need; and Case Studies provide hands-on examples of how to use statistical
procedures and interpret the results.
Command language.Although most tasks can be accomplished with simple point-and-
click gestures, SPSS also provides a powerful command language that allows you to save
and automate many common tasks. The command language also provides some
functionality not found in the menus and dialog boxes.
6
-
8/12/2019 A Lesson 1 Introduction to Statistics & SPSS
7/8
Statistical Analysis using SPSS Lesson 1
SPSS Windows
SPSS for Windows provides a powerful statistical analysis and data management system
in a graphical environment, using descriptive menus and simple dialog boxes to do most
of the work for you. Simply pointing and clicking the mouse can accomplish most tasks.
SPSS for Windows provides:
SPSS Data Editor. A versatile spread-sheet-like system for defining, entering, editing,
and displaying data.
SPSS Viewer. The new Output Navigator makes it easy to browse your results,
selectively show and hide output, change the display order results, and move
presentation-quality tables and charts between SPSS and other applications.
SPSS Chart Editor.Helps you edit charts. You can change the pattern, color, style, and
label of the graphs. You can also modify the axis, rotate or swap the axis.
SPSS Syntax Editor. This can be used to save, view, modify and rewrite the syntax.
Help. Comprehensive overview of SPSS basics is also available in the online tutorial
under the Help menu. The meanings of the statistical terms can also be obtained by
double-clickingon the terms themselves.
7
-
8/12/2019 A Lesson 1 Introduction to Statistics & SPSS
8/8
Statistical Analysis using SPSS Lesson 1
Basic steps in Statistical Data Analysis Using SPSS
The four basic steps in data analysis in SPSS is summarized as below.
Step 1
Bring
your data into
SPSS
Get your data into SPSS Data Editor
This can be done either by;
directly entering the data in the Data Editor.
open a previously saved SPSS file.
read a spreadsheet/text data file.
Step 2
Select
a procedure
from the menu
Select a procedure from the men.
This depends on the objective of the study.
Graphprocedure to create a chart.
Analyzeprocedure to perform statistical analysis.
Step 3
Select
variable(s)
for the analysis
Select a variable
Make sure the procedure is appropriate for the
variable.
all the variables in data file are displayed in a
Dialog Box.
just highlight and click the variable(s) into the
respective dialog boxes.
Step 4
Run & Examine
the results
Run the procedure by clicking OK
Results are displayed in the OUTPUT VIEWER.
it can be a chart,
it can be descriptive statistics,
it can be inferential statistics,
Based on the output, draw conclusions accordingly.
The four steps in data analysis
8