applied data analysis in criminal justice
DESCRIPTION
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE. CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez. Perspective. Research Techniques Accessing, Examining and Saving Data Univariate Analysis – Descriptive Statistics Constructing (Manipulating) Variables Association – Bivariate Analysis - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/1.jpg)
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE
CJ 525 MONMOUTH UNIVERSITY
Juan P. Rodriguez
![Page 2: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/2.jpg)
Perspective Research Techniques Accessing, Examining and Saving Data Univariate Analysis – Descriptive Statistics Constructing (Manipulating) Variables Association – Bivariate Analysis Association – Multivariate Analysis Comparing Group Means – Bivariate Multivariate Analysis - Regression
![Page 3: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/3.jpg)
Lecture 7
Multivariate AnalysisWith Linear Regression
![Page 4: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/4.jpg)
Lectures 5 and 6 examined methods for testing relationships between 2 variables: bivariate analysis
Many projects, however, require testing the association of multiple independent variables with a dependent variable: multivariate analysis
Multivariate analysis is performed after the researchers understand the characteristics of individual variables (univariate) and the relationships between any 2 variables (bivariate)
![Page 5: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/5.jpg)
Reasons for Multivariate Analysis
Social behavior is usually associated with many factors and can not be explained by the association with just one variable. By including more than one variable in the statistical model, the researcher can create a more accurate model to predict or explain social behavior
![Page 6: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/6.jpg)
Reasons for Multivariate Analysis
Multivariate analysis can account for the influence of spurious factors by introducing control variables
![Page 7: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/7.jpg)
Linear Regression
Used when the increase in an independent variable is associated with a consistent and constant change in the dependent variable.
The dependent variable should be numeric and conform to a normal distribution
![Page 8: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/8.jpg)
LR: Bivariate Example Using the States data, we will study the
relationship between poverty and teen births.
![Page 9: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/9.jpg)
LR: A Bivariate example
The graph indicates that teenage births seem to increase with poverty rate.
Using Linear Regression, we will create an equation that can be used to illustrate this tendency Load the States dataset
![Page 10: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/10.jpg)
LR: A Bivariate example
![Page 11: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/11.jpg)
LR: A Bivariate example
![Page 12: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/12.jpg)
LR: A Bivariate example
![Page 13: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/13.jpg)
LR: A Bivariate example
The R2 measures the usefulness of the model: A value of 1 indicates that 100% of the variation in the
dependent variable is explained by variations in the independent variable
A value of 0.455 indicates that 45.5% of the variation in the teenage birth rate from state to state can be explained by variations in poverty rates. The remaining 54.5% can be explained by other factors not included in the model
![Page 14: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/14.jpg)
LR: A Bivariate example
The ANOVA measured if the model fitted the data:
The results indicated that the variation explained by the regression model was about 41 times larger than that explained by other factors.
The P value lower than 0.001 indicated that the chances of this being due to random chance were very small, i.e. the model used fitted the data
![Page 15: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/15.jpg)
LR: A Bivariate example
B, (slope) is the size of the difference in the dependent variable corresponding to a change of one unit in the independent variable
The value of 2.735 in this model indicates that for every 1% change in poverty rate there is a predicted increase in the teen birth rate of nearly 3 births (2.735)
The significance score of 0.000 indicates that there is a significant association between teen birth rate and poverty
![Page 16: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/16.jpg)
LR: A Bivariate example
The constant (intercept) is the predicted value of the dependent variable when the independent variable is zero.
In this case, the constant indicates that there would be 15 teen births per 1000 teenage women even if there were no poor people in a state
![Page 17: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/17.jpg)
Making Predictions
The linear regression equation is:Y’ = a + bX
Y’ is the predicted value of the dependent variable
a is the constant b is the slope X is the value of the independent
variable
![Page 18: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/18.jpg)
Making Predictions In our case, the regression equation is:
Y’ = 15.16 + 2.735X If we wanted to predict the teenage birth
rate for a poverty rate of 20%: Y’ = 15.16 + 2.735 x 20 = 69.86
Predictions should be limited to the available range of values of the independent variable (in our case between 1% and 22%)
![Page 19: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/19.jpg)
Graphing Bivariate Regression lines
![Page 20: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/20.jpg)
Graphing Bivariate Regression lines
![Page 21: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/21.jpg)
Graphing Bivariate Regression lines
![Page 22: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/22.jpg)
Graphing Bivariate Regression lines
![Page 23: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/23.jpg)
Graphing Bivariate Regression lines
![Page 24: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/24.jpg)
Graphing Bivariate Regression lines
![Page 25: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/25.jpg)
Multiple Linear Regression Regression model includes more
than one independent variable We’ll look at some factors affecting
teenage birth rate: Poverty (PVS500) Expenditures per pupil (SCS141) Unemployment rate (EMS171) Amount of welfare a family gets
(PVS526)
![Page 26: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/26.jpg)
Multiple Linear Regression
![Page 27: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/27.jpg)
Multiple Linear Regression
![Page 28: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/28.jpg)
Multiple Linear Regression
![Page 29: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/29.jpg)
MLR: Coefficients Looking at the significance tests
for the coefficients, only 2 are significant: States with higher poverty rates
have higher teenage birth rates (1.506 per 10000 women) for every 1% raise in poverty rates.
States that give more welfare aid had lower teen birth rates (-0.0379) for every $1 given as welfare aid.
![Page 30: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/30.jpg)
MLR: R - Squared
MLR uses the Adjusted R2 instead of the R2 to account for only those variables that contribute significantly to the model
The AR2 in this case, 0.594, indicates that the model accounts for 59.4% of the variation in the teenage birth rate
![Page 31: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/31.jpg)
MLR: R - Squared
The ANOVA indicates that the variables considered account for about 19 times of the variation due to other causes. The P<0.001 indicates that the model is a good fit to the data.
![Page 32: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/32.jpg)
Multiple Regression Equation
The equation is:Y’ = 41.874 + 1.506X1 - 0.0009X2 + 2.515X3 -
0.037X4
X1 : Poverty Rate in 1998 – PVS500 X2 : Expenditures per pupil – SCS141 X3 : Unemployment rate – EMS171 X4 : Amount of welfare received – PVS526
![Page 33: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/33.jpg)
Graphing the Multiple Regression
The multiple regression equation is:
Y’ = a + b1X1 + b2X2 + b3X3 + b4X4
Y’ is the predicted value of the dependent variable
a is the constant bi is the slope for variable i Xi is the value of the independent variable
i
![Page 34: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/34.jpg)
Graphing the Multiple Regression Dependent variable is plotted against
one independent variable at a time The other variables are held constant, at
any value, but usually at their mean value We will graph the association between
welfare benefits and teenage birth rates holding poverty rates, school expenditures and unemployment rates at their mean values This requires computing TEENPRE, the
predicted value of teen birth rate according to the equation
![Page 35: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/35.jpg)
Graphing the Multiple Regression
Transform Compute
Target Variable: TEENPRE Numeric Expression: 41.874 +
(1.506*12.73) + (-0.0009*6341.98) + (2.515*4.16) + (-0.037*PVS526)
Type and Label Label: Predicted Teenage Birth Rate Continue
OK
![Page 36: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/36.jpg)
Graphing the Multiple Regression
![Page 37: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/37.jpg)
Graphing the Multiple Regression
![Page 38: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/38.jpg)
Graphing the Multiple Regression
![Page 39: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/39.jpg)
Graphing the Multiple Regression
![Page 40: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/40.jpg)
Graphing the Multiple Regression
![Page 41: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/41.jpg)
Linear Regression Concerns
Linear Relationships A numerical dependent variable Normality of residuals
The residuals should follow a normal distribution with a mean of 0
Check is this is the case by saving and plotting the residuals when doing the MLR
![Page 42: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/42.jpg)
Normality of Residuals
![Page 43: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/43.jpg)
Normality of Residuals
![Page 44: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/44.jpg)
Normality of Residuals
![Page 45: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/45.jpg)
Normality of Residuals
![Page 46: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/46.jpg)
Normality of Residuals
![Page 47: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/47.jpg)
Normality of Residuals
![Page 48: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/48.jpg)
Normality of Residuals
![Page 49: APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE](https://reader035.vdocuments.net/reader035/viewer/2022070407/56814366550346895dafe181/html5/thumbnails/49.jpg)
Normality of Residuals