
Crime Forecasting Using Boosted Ensemble Classifiers Chung-Hsien Yu

Crime Forecasting Using Boosted Ensemble Classifiers

Department of Computer Science University of Massachusetts Boston

2012 GRADUATE STUDENTS SYMPOSIUM

Presented by: Chung-Hsien Yu

Advisor: Prof. Wei Ding

Abstract

• Retaining spatiotemporal knowledge by applying multi-clustering to monthly aggregated crime data.

• Training baseline learners on the clusters obtained from clustering.

• Adapting a greedy algorithm to find a rule-based ensemble classifier during each boosting round.

• Pruning the ensemble classifier to prevent it from overfitting.

• Constructing a strong hypothesis based on the ensemble classifiers obtained from each round.

Original Data

Residential Burglary

911 Calls

Arrest

Foreclosure

Street Robbery

Aggregated Data

Monthly Data3

Monthly Clusters (k=3)

Monthly Clusters (k=4)

Flow Chart

Algorithm (Part I)

Algorithm (Part II)

Confidence Value

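From AdaBoost (Schapire & Singer 1998) we have

$$Z_t = \sum_i D_t(i) \exp(-\alpha_t y_i h_t(x_i))$$

Let $w(i) = D_t(i)$ and $C_R = \alpha_t h_t(x_i)$, and ignore the boosting round $t$:

$$Z = \sum_i w(i) \exp(-C_R y_i)$$

$C_R$ is defined as the confidence value for the rule $R$, and $C_R = 0$ if $x_i \notin R$.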

Objective Function

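Therefore,

$$Z = W_0 + W_+ \exp(-C_R) + W_- \exp(C_R)$$

where

$$W_0 = \sum_{\{i \mid x_i \notin R\}} w(i)$$

$$W_+ = \sum_{\{i \mid x_i \in R \text{ and } y_i = 1\}} w(i)$$

$$W_- = \sum_{\{i \mid x_i \in R \text{ and } y_i = -1\}} w(i)$$

$$W_0 + W_+ + W_- = 1$$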
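The weight partition can be illustrated with a minimal Python sketch. It assumes labels in {+1, -1}, weights normalized to sum to 1, and a boolean `covered` list marking whether each sample falls inside the rule R. All function and variable names here are hypothetical and chosen for illustration only. The sketch checks numerically that summing w(i) exp(-C_R y_i) over all samples (with a confidence of 0 outside R) agrees with W_0 + W_+ exp(-C_R) + W_- exp(C_R).

```python
import math

def partition_weights(weights, labels, covered):
    """Return (W0, W_plus, W_minus) for one rule R.

    covered[i] is True when sample i falls inside R (assumed representation).
    """
    w0 = sum(w for w, c in zip(weights, covered) if not c)
    w_plus = sum(w for w, y, c in zip(weights, labels, covered) if c and y == 1)
    w_minus = sum(w for w, y, c in zip(weights, labels, covered) if c and y == -1)
    return w0, w_plus, w_minus

def z_direct(weights, labels, covered, c_r):
    """Z as a sum over samples; the confidence is 0 for samples outside R."""
    return sum(
        w * math.exp(-(c_r if c else 0.0) * y)
        for w, y, c in zip(weights, labels, covered)
    )

def z_partitioned(w0, w_plus, w_minus, c_r):
    """Z written in terms of the three weight totals."""
    return w0 + w_plus * math.exp(-c_r) + w_minus * math.exp(c_r)

# Toy data (made up for illustration).
weights = [0.1, 0.2, 0.3, 0.4]
labels = [1, -1, 1, -1]
covered = [True, True, True, False]

w0, w_plus, w_minus = partition_weights(weights, labels, covered)
print(w0, w_plus, w_minus)
print(z_direct(weights, labels, covered, 0.5))
print(z_partitioned(w0, w_plus, w_minus, 0.5))
```

Both forms of Z should print the same value up to floating-point rounding, which is why the later steps only need the three totals.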

Minimum Z Value

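$$\frac{dZ}{dC_R} = -W_+ \exp(-C_R) + W_- \exp(C_R) = 0$$

$$\rightarrow W_- \exp(C_R) = W_+ \exp(-C_R)$$

$$\rightarrow \ln(W_- \exp(C_R)) = \ln(W_+ \exp(-C_R))$$

$$\rightarrow \ln(W_-) + C_R = \ln(W_+) - C_R$$

$$\rightarrow 2C_R = \ln(W_+) - \ln(W_-)$$

$$\rightarrow C_R = \frac{1}{2} \ln\left(\frac{W_+}{W_-}\right)$$

$Z$ has the minimum value when $C_R = \frac{1}{2} \ln\left(\frac{W_+}{W_-}\right)$, because

$$\frac{d^2 Z}{dC_R^2} = W_+ \exp(-C_R) + W_- \exp(C_R) > 0$$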
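As a quick numerical check of this derivation, the sketch below computes C_R = (1/2) ln(W_+ / W_-) and compares Z at that point with Z at nearby values. It assumes Z = W_0 + W_+ exp(-C_R) + W_- exp(C_R), the form whose derivative is set to zero above. The function names are hypothetical, and no smoothing is applied, so both W_+ and W_- must be positive.

```python
import math

def optimal_confidence(w_plus, w_minus):
    """C_R = 0.5 * ln(W+ / W-). Requires both totals to be positive."""
    if w_plus <= 0 or w_minus <= 0:
        raise ValueError("W+ and W- must be positive (no smoothing in this sketch)")
    return 0.5 * math.log(w_plus / w_minus)

def z_value(w0, w_plus, w_minus, c_r):
    """Z = W0 + W+ exp(-C_R) + W- exp(C_R)."""
    return w0 + w_plus * math.exp(-c_r) + w_minus * math.exp(c_r)

# Toy weight totals (made up for illustration); they sum to 1.
w0, w_plus, w_minus = 0.4, 0.4, 0.2

c_star = optimal_confidence(w_plus, w_minus)
z_star = z_value(w0, w_plus, w_minus, c_star)

# Z should be larger on either side of c_star.
print(c_star, z_star)
print(z_value(w0, w_plus, w_minus, c_star - 0.1))
print(z_value(w0, w_plus, w_minus, c_star + 0.1))
```

A rule that covers only positive (or only negative) samples makes the ratio undefined; a practical implementation would need some form of smoothing, which is outside what the slides state.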

BuildChain Function

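$$W_0 + W_+ + W_- = 1$$

Substituting $C_R = \frac{1}{2} \ln\left(\frac{W_+}{W_-}\right)$ into $Z$ gives

$$Z = W_0 + 2\sqrt{W_+ W_-} = 1 - \left(\sqrt{W_+} - \sqrt{W_-}\right)^2$$

Repeatedly add a classifier to R until $\left|\sqrt{W_+} - \sqrt{W_-}\right|$ is maximized. This minimizes $Z$ as well.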
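One way to picture the greedy growth step is the Python sketch below. It is not the presenter's BuildChain implementation. It assumes that a chain covers a sample only when every classifier in the chain returns True for it, and that the quantity being maximized is |sqrt(W_+) - sqrt(W_-)|, the score used in SLIPPER-style rule learners. The classifiers `high_a` and `high_b`, the data, and all function names are hypothetical.

```python
import math

def score(weights, labels, covered):
    """|sqrt(W+) - sqrt(W-)| over the samples covered by the chain."""
    w_plus = sum(w for w, y, c in zip(weights, labels, covered) if c and y == 1)
    w_minus = sum(w for w, y, c in zip(weights, labels, covered) if c and y == -1)
    return abs(math.sqrt(w_plus) - math.sqrt(w_minus))

def build_chain(samples, labels, weights, classifiers):
    """Greedily append the classifier that most improves the score.

    ASSUMPTION: a chain covers x only if all of its classifiers return True.
    Stops when no remaining classifier improves the score.
    """
    chain = []
    covered = [True] * len(samples)  # the empty chain covers every sample
    best = score(weights, labels, covered)
    remaining = list(classifiers)
    while remaining:
        candidates = []
        for clf in remaining:
            new_cov = [c and clf(x) for c, x in zip(covered, samples)]
            candidates.append((score(weights, labels, new_cov), clf, new_cov))
        top_score, top_clf, top_cov = max(candidates, key=lambda t: t[0])
        if top_score <= best:
            break
        chain.append(top_clf)
        remaining.remove(top_clf)
        covered, best = top_cov, top_score
    return chain, best

# Hypothetical baseline classifiers over two made-up features.
def high_a(x):
    return x[0] >= 2

def high_b(x):
    return x[1] >= 1

samples = [(3, 1), (3, 1), (3, 0), (0, 1), (0, 0)]
labels = [1, 1, -1, -1, -1]
weights = [0.2] * 5

chain, best = build_chain(samples, labels, weights, [high_a, high_b])
print([clf.__name__ for clf in chain], best)
```

In this toy run both classifiers are added, because each addition removes negative samples from the covered set and so raises the score.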

PruneChain Function

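Loss Function:

$$\acute{Z} = W_0 + W_+ \exp(-\hat{C}_R) + W_- \exp(\hat{C}_R)$$

Minimize $\acute{Z}$ by removing the last classifier from R.

$\hat{C}_R$ is obtained from GrowSet.

$W_0$, $W_+$, and $W_-$ are obtained from applying R to PruneSet.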
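The pruning step can be sketched as follows. This is an illustrative reading, not the presenter's PruneChain code. It assumes the loss has the form W_0 + W_+ exp(-C) + W_- exp(C), with the weight totals measured on PruneSet and the confidence C computed from GrowSet as (1/2) ln(W_+ / W_-). The small constant `EPS` is an added assumption to avoid dividing by zero, and the data, classifiers, and function names are hypothetical.

```python
import math

EPS = 1e-9  # ASSUMPTION: smoothing term to avoid log(0); not from the slides

def covers(chain, x):
    """ASSUMPTION: a chain covers x only if all of its classifiers return True."""
    return all(clf(x) for clf in chain)

def weight_split(chain, data):
    """Return (W0, W+, W-) for a chain on a list of (x, y, w) triples."""
    w0 = w_plus = w_minus = 0.0
    for x, y, w in data:
        if not covers(chain, x):
            w0 += w
        elif y == 1:
            w_plus += w
        else:
            w_minus += w
    return w0, w_plus, w_minus

def prune_loss(chain, grow_set, prune_set):
    """Loss on PruneSet using the confidence estimated on GrowSet."""
    _, g_plus, g_minus = weight_split(chain, grow_set)
    c_hat = 0.5 * math.log((g_plus + EPS) / (g_minus + EPS))
    w0, w_plus, w_minus = weight_split(chain, prune_set)
    return w0 + w_plus * math.exp(-c_hat) + w_minus * math.exp(c_hat)

def prune_chain(chain, grow_set, prune_set):
    """Drop the last classifier while doing so lowers the loss."""
    chain = list(chain)
    while len(chain) > 1:
        shorter = chain[:-1]
        if prune_loss(shorter, grow_set, prune_set) < prune_loss(chain, grow_set, prune_set):
            chain = shorter
        else:
            break
    return chain

def high_a(x):
    return x[0] >= 2

def high_b(x):
    return x[1] >= 1

grow_set = [((3, 1), 1, 0.25), ((3, 1), 1, 0.25), ((3, 0), -1, 0.25), ((0, 0), -1, 0.25)]
prune_set = [((3, 1), 1, 0.25), ((3, 1), -1, 0.25), ((3, 0), 1, 0.25), ((0, 1), -1, 0.25)]

pruned = prune_chain([high_a, high_b], grow_set, prune_set)
print([clf.__name__ for clf in pruned])
```

In this toy case the full chain looks perfect on GrowSet, so its confidence is very large, and a single covered negative in PruneSet makes its loss far worse than that of the shorter chain. The last classifier is therefore removed.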

Update Weights

Calculate $\hat{C}_R$ with ensemble classifier R on the entire data set.
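The update formula itself did not survive in this transcript. The sketch below therefore assumes the standard confidence-rated AdaBoost update from Schapire & Singer: each weight is multiplied by exp(-y_i C), where C is the chain's confidence for covered samples and 0 otherwise, and the result is renormalized. The function name and data are hypothetical.

```python
import math

def update_weights(weights, labels, covered, c_r):
    """Reweight samples after one boosting round (assumed standard update).

    Samples outside the chain keep their relative weight because their
    confidence is 0; covered samples shrink when classified correctly
    and grow when misclassified.
    """
    unnormalized = [
        w * math.exp(-y * (c_r if c else 0.0))
        for w, y, c in zip(weights, labels, covered)
    ]
    z = sum(unnormalized)  # normalizer so the new weights sum to 1
    return [u / z for u in unnormalized]

# Toy data (made up for illustration).
weights = [0.25, 0.25, 0.25, 0.25]
labels = [1, -1, 1, -1]
covered = [True, True, False, False]

new_weights = update_weights(weights, labels, covered, 0.5)
print(new_weights)
```

With a positive confidence, the covered positive sample loses weight and the covered negative sample gains weight, so the next round concentrates on the mistakes.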

Strong Hypothesis

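At the end of boosting, there are $T$ chains, $R_1, \dots, R_T$, and the strong hypothesis is

$$H(x) = \mathrm{sign}\left(\sum_{t=1}^{T} \hat{C}_{R_t}\right)$$

where $\hat{C}_{R_t} = 0$ if $x \notin R_t$.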
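A minimal sketch of how such a strong hypothesis could be evaluated is shown below. It assumes the final prediction is the sign of the summed confidences of all chains that cover x, with chains that do not cover x contributing 0. Treating a total of exactly 0 as the negative class is an added assumption, and the chains, confidences, and names are hypothetical.

```python
def strong_hypothesis(chains, x):
    """Predict +1 or -1 from a list of (classifiers, confidence) pairs.

    ASSUMPTION: a chain covers x only if all of its classifiers return True;
    a total of exactly 0 is mapped to -1.
    """
    total = sum(
        c_hat
        for classifiers, c_hat in chains
        if all(clf(x) for clf in classifiers)
    )
    return 1 if total > 0 else -1

def high_a(x):
    return x[0] >= 2

def high_b(x):
    return x[1] >= 1

# Three made-up chains with made-up confidence values.
chains = [([high_a], 0.8), ([high_b], -0.3), ([high_a, high_b], 0.4)]

print(strong_hypothesis(chains, (3, 1)))
print(strong_hypothesis(chains, (0, 1)))
print(strong_hypothesis(chains, (0, 0)))
```

Prediction only needs one pass over the chains and a sum, which fits the summary's remark that the strong hypothesis is easy to calculate.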

SUMMARY

1. Grid cells with similar crime counts that are clustered together are also geographically close to each other on the map. In addition, the clusters separate high-crime-rate areas from low-crime-rate areas.

2. The original data set is randomly divided into two subsets in each round. The greedy weak-learning algorithm uses confidence-rated evaluation to “chain” the baseline classifiers on one subset and then “trims” the chain on the other subset.

3. The strong hypothesis is easy to calculate.

THANK YOU!!
