Consumer Behavior Prediction using Parametric and Nonparametric Methods Elena Eneva CALD Masters Presentation 19 August 2002 Advisors: Alan Montgomery, Rich Caruana, Christos Faloutsos


Page 1: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Consumer Behavior Prediction using Parametric and Nonparametric Methods

Elena Eneva

CALD Masters Presentation

19 August 2002

Advisors: Alan Montgomery, Rich Caruana, Christos Faloutsos

Page 2: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Outline

Introduction

Data

Economics Overview

Baseline Models

New Hybrid Models

Results

Conclusions and Future Work

Page 3: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Background

Retail chains are aiming to customize prices in individual stores

Pricing strategies should adapt to the neighborhood demand

Stores can increase operating profit margins by 33% to 83%

Page 4: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Price Elasticity

consumer’s response to price change

$E = \frac{\%\Delta Q}{\%\Delta P}$

Scale: inelastic ($|E| < 1$) to elastic ($|E| > 1$)

Q is quantity purchased

P is price of product
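As a small worked example of the elasticity ratio on this slide, the sketch below computes E from two price/quantity observations. The function name and the numbers are hypothetical and are not taken from the presentation's data.

```python
def price_elasticity(q_old, q_new, p_old, p_new):
    """Percent change in quantity divided by percent change in price."""
    pct_q = (q_new - q_old) / q_old
    pct_p = (p_new - p_old) / p_old
    return pct_q / pct_p

# Hypothetical numbers: a 10% price increase cuts weekly sales by 20%.
e = price_elasticity(q_old=1000, q_new=800, p_old=2.00, p_new=2.20)

# |E| > 1 means demand is elastic, |E| < 1 means it is inelastic.
label = "elastic" if abs(e) > 1 else "inelastic"
print(round(e, 3), label)
```

Here the quantity response is twice the size of the price change, so demand counts as elastic.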

Page 5: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Data Example

[Figure: quantity vs. price; y-axis quantity, 0 to 100000; x-axis price, 0.02 to 0.06]

Page 6: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Data Example – Log Space

[Figure: ln(quant) vs. ln(price); y-axis ln(quant), 2.75 to 5.25; x-axis ln(price), -1.58 to -1.28]

Page 7: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Assumptions

Independence
– Substitutes: fresh fruit, other juices
– Other stores

Stationarity
– Change over time
– Holidays

Page 8: Consumer Behavior Prediction using Parametric and Nonparametric Methods

“The” Model

Category

Inputs: Price of Product 1, Price of Product 2, Price of Product 3, . . ., Price of Product N

Predictor: “I know your customers”

Outputs: Quantity bought of Product 1, Quantity bought of Product 2, Quantity bought of Product 3, . . ., Quantity bought of Product N

Need to multiply this across many stores, many categories.

$\ln(q) = f(\ln(p)) + \varepsilon, \quad \varepsilon \sim N(0, \sigma^2)$

Diagram labels: convert to ln space (inputs), convert to original space (outputs)

Page 9: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Converting to Original Space

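$\ln(q) = f(\ln(p)) + \varepsilon, \quad \varepsilon \sim N(0, \sigma^2)$

$\widehat{\ln(q)} = f(\ln(p))$

$\ln(q) \mid f(\ln(p)) \sim N(\mu, \sigma^2)$

$\hat{q} = e^{\widehat{\ln(q)} + \frac{1}{2}\hat{\sigma}^2}$

$E[q] = e^{\mu + \frac{1}{2}\sigma^2}$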
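The variance term in the back-transform matters because the mean of a log-normal variable is exp(mu + sigma^2 / 2), not exp(mu). The sketch below checks this by simulation. The values of `mu` and `sigma` and the helper name `to_original_space` are made up for illustration.

```python
import math
import random

def to_original_space(ln_q_hat, sigma2_hat):
    """Mean-corrected back-transform: exp(ln_q_hat + sigma2_hat / 2)."""
    return math.exp(ln_q_hat + 0.5 * sigma2_hat)

# Simulate ln(q) ~ N(mu, sigma^2) with assumed parameters.
random.seed(0)
mu, sigma = 4.0, 0.5
samples = [math.exp(random.gauss(mu, sigma)) for _ in range(200_000)]
empirical_mean = sum(samples) / len(samples)

naive = math.exp(mu)                           # drops the variance term
corrected = to_original_space(mu, sigma ** 2)  # exp(mu + sigma^2 / 2)
print(empirical_mean, naive, corrected)
```

The naive exp(mu) underestimates the average quantity, while the corrected value tracks the simulated mean closely.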

Page 10: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Existing Methods

Traditionally – using parametric models (linear regression)

Recently – using non-parametric models (neural networks)

Page 11: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Our Goal

Advantage of LR: known functional form (linear in log space), extrapolation ability

Advantage of NN: flexibility, accuracy

[Figure: accuracy (y-axis) vs. robustness (x-axis), locating LR, NN, and the new models]

Take Advantage: use the known functional form to bias the NN

Build hybrid models from the baseline models

Page 12: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Datasets

weekly store-level cash register data at the product level

Chilled Orange Juice category

2 years

12 products

10 random stores selected

Page 13: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Evaluation Measure

Root Mean Squared Error (RMS): the average deviation between the predicted quantity and the true quantity

$\text{RMS error} = \sqrt{\frac{1}{N} \sum_{i=1}^{N} (q_i - \hat{q}_i)^2}$
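The RMS error defined on this slide can be computed in a few lines. The function name and the three example weeks below are hypothetical.

```python
import math

def rms_error(q_true, q_pred):
    """Square root of the mean squared difference between true and predicted quantities."""
    if len(q_true) != len(q_pred) or not q_true:
        raise ValueError("need two non-empty sequences of equal length")
    squared = [(t - p) ** 2 for t, p in zip(q_true, q_pred)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical weekly quantities for one product.
q_true = [100.0, 200.0, 300.0]
q_pred = [110.0, 190.0, 330.0]
error = rms_error(q_true, q_pred)
print(error)
```

Because the errors are squared before averaging, the one 30-unit miss dominates the two 10-unit misses.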

Page 14: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Models

Baselines
– Linear Regression
– Neural Networks

Hybrids
– Smart Prior
– MultiTask Learning
– Jumping Connections
– Frozen Jumping Connections

Page 15: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Baselines

Linear Regression

Neural Networks

Page 16: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Linear Regression

$\ln(q) = a + \sum_{i=1}^{K} b_i \ln(p_i) + \varepsilon, \quad \varepsilon \sim N(0, \sigma^2)$

q is the quantity demanded

$p_i$ is the price for the ith product

K products overall

The coefficients a and $b_i$ are determined by the condition that the sum of the squared residuals is as small as possible.
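A minimal sketch of fitting the log-linear model on this slide by ordinary least squares, using NumPy. The data are synthetic, and the coefficients `true_a` and `true_b` are made-up values chosen only to check that the fit recovers them. In a log-log model, each coefficient b_i can be read as the elasticity of the product's quantity with respect to price i.

```python
import numpy as np

def fit_log_linear(prices, quantities):
    """Least-squares fit of ln(q) = a + sum_i b_i * ln(p_i).

    prices: array of shape (n_weeks, K); quantities: array of shape (n_weeks,).
    Returns (a, b, sigma2_hat), where sigma2_hat is the residual variance.
    """
    X = np.column_stack([np.ones(len(prices)), np.log(prices)])
    y = np.log(quantities)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    sigma2_hat = resid @ resid / (len(y) - X.shape[1])
    return coef[0], coef[1:], sigma2_hat

# Synthetic data: 104 weeks, 3 products, assumed coefficients.
rng = np.random.default_rng(0)
prices = rng.uniform(1.5, 3.5, size=(104, 3))
true_a, true_b = 8.0, np.array([-2.0, 0.5, 0.3])
ln_q = true_a + np.log(prices) @ true_b + rng.normal(0.0, 0.1, size=104)

a, b, sigma2_hat = fit_log_linear(prices, np.exp(ln_q))
print(a, b, sigma2_hat)
```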

Page 17: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Linear Regression

Page 18: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Results RMS

[Figure: RMS error by model; y-axis 0 to 12000; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 19: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Neural Networks

generic nonlinear function approximators

a collection of basic units (neurons), computing a (non)linear function of their input

backpropagation

Page 20: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Neural Networks

1 hidden layer, 100 units, sigmoid activation function
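The architecture on this slide (one hidden layer, 100 sigmoid units) can be sketched in NumPy as below. The linear output unit, learning rate, epoch count, weight initialisation, and synthetic data are all assumptions made for this sketch; the slides do not state the actual training setup.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_nn(X, y, hidden=100, lr=0.02, epochs=2000, seed=0):
    """One hidden sigmoid layer and a linear output unit, fit by
    full-batch gradient descent on squared error (backpropagation)."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0.0, 0.5, size=(X.shape[1], hidden))  # input -> hidden
    b1 = rng.normal(0.0, 0.5, size=hidden)
    w2 = rng.normal(0.0, 0.1, size=hidden)                # hidden -> output
    b2 = 0.0
    n = len(y)
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)                # hidden activations
        err = H @ w2 + b2 - y                   # output error per sample
        dH = np.outer(err, w2) * H * (1.0 - H)  # error propagated to the hidden layer
        w2 -= lr * (H.T @ err) / n
        b2 -= lr * err.mean()
        W1 -= lr * (X.T @ dH) / n
        b1 -= lr * dH.mean(axis=0)
    return W1, b1, w2, b2

def predict_nn(params, X):
    W1, b1, w2, b2 = params
    return sigmoid(X @ W1 + b1) @ w2 + b2

# Synthetic stand-in for (ln price, ln quantity) pairs.
rng = np.random.default_rng(1)
X = rng.uniform(-1.5, 1.5, size=(200, 1))
y = -1.5 * X[:, 0] + 0.5 * X[:, 0] ** 2 + rng.normal(0.0, 0.1, size=200)

params = train_nn(X, y)
rms_nn = float(np.sqrt(np.mean((predict_nn(params, X) - y) ** 2)))
rms_mean = float(np.sqrt(np.mean((y.mean() - y) ** 2)))
print(rms_nn, rms_mean)
```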

Page 21: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Results RMS

[Figure: RMS error by model; y-axis 0 to 12000; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 22: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Hybrids

Smart Prior

MultiTask Learning

Jumping Connections

Frozen Jumping Connections

Page 23: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Smart Prior

Idea: start the NN at a “good” set of weights, help it start from a “smart” prior.

Take this prior from the known “linearity”

NN first trained on synthetic data generated by the LR model

NN then trained on the real data
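The two-stage Smart Prior procedure described above can be sketched as follows: fit the LR model, sample synthetic points from it, pretrain the network on those points, then continue training on the real data. Everything concrete here (the data, the sampling range of the synthetic inputs, the network size and the learning rate) is an assumption of the sketch and not the presentation's actual configuration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_weights(n_inputs, hidden, rng):
    return {"W1": rng.normal(0.0, 0.5, size=(n_inputs, hidden)),
            "b1": rng.normal(0.0, 0.5, size=hidden),
            "w2": rng.normal(0.0, 0.1, size=hidden),
            "b2": 0.0}

def predict(w, X):
    return sigmoid(X @ w["W1"] + w["b1"]) @ w["w2"] + w["b2"]

def train(w, X, y, lr=0.02, epochs=2000):
    """Continue gradient descent from the weights in w; returns new weights."""
    W1, b1, w2, b2 = w["W1"].copy(), w["b1"].copy(), w["w2"].copy(), w["b2"]
    n = len(y)
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)
        err = H @ w2 + b2 - y
        dH = np.outer(err, w2) * H * (1.0 - H)
        w2 -= lr * (H.T @ err) / n
        b2 -= lr * err.mean()
        W1 -= lr * (X.T @ dH) / n
        b1 -= lr * dH.mean(axis=0)
    return {"W1": W1, "b1": b1, "w2": w2, "b2": b2}

def rms(a, b):
    return float(np.sqrt(np.mean((a - b) ** 2)))

rng = np.random.default_rng(0)
# Made-up "real" data: two log-price inputs, mildly nonlinear log-quantity.
X_real = rng.uniform(-1.0, 1.0, size=(60, 2))
y_real = (1.0 - 2.0 * X_real[:, 0] + 0.5 * X_real[:, 1]
          + 0.3 * np.sin(3.0 * X_real[:, 0]) + rng.normal(0.0, 0.05, size=60))

# 1. Fit the LR baseline on the real data.
A = np.column_stack([np.ones(len(X_real)), X_real])
coef, *_ = np.linalg.lstsq(A, y_real, rcond=None)
lr_pred = A @ coef

# 2. Generate synthetic data from the fitted LR model.
X_syn = rng.uniform(-1.0, 1.0, size=(500, 2))
y_syn = coef[0] + X_syn @ coef[1:]

# 3. Pretrain on the synthetic data: this is the "smart" prior.
prior = train(init_weights(2, 100, rng), X_syn, y_syn)

# 4. Continue training on the real data, starting from the prior.
model = train(prior, X_real, y_real)

print(rms(predict(prior, X_real), lr_pred))  # how closely the prior mimics LR
print(rms(predict(prior, X_real), y_real), rms(predict(model, X_real), y_real))
```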

Page 24: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Smart Prior

Page 25: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Results RMS

[Figure: RMS error by model; y-axis 0 to 12000; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 26: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Multitask Learning

Idea: learning an additional related task in parallel, using a shared representation

Adding the output of the LR model (built over the same inputs) as an extra output to the NN

Make the net share its hidden nodes between both tasks

Custom halting function

Custom RMS function
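One way to read the MultiTask setup above is a network with two outputs that share a single hidden layer: one output for the real target and one for the LR model's prediction. The sketch below follows that reading. Reporting the error on the main output only is a guess at what the custom RMS function does, since the slides give no details. The data and hyperparameters are made up.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_mtl(X, y_main, y_extra, hidden=100, lr=0.02, epochs=2000, seed=0):
    """Two-output network: column 0 predicts the real target, column 1
    predicts the LR model's output. Both outputs share one hidden layer."""
    rng = np.random.default_rng(seed)
    Y = np.column_stack([y_main, y_extra])
    W1 = rng.normal(0.0, 0.5, size=(X.shape[1], hidden))
    b1 = rng.normal(0.0, 0.5, size=hidden)
    W2 = rng.normal(0.0, 0.1, size=(hidden, 2))
    b2 = np.zeros(2)
    n = len(X)
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)
        err = H @ W2 + b2 - Y                # shape (n, 2): one column per task
        dH = (err @ W2.T) * H * (1.0 - H)    # both tasks update the shared units
        W2 -= lr * (H.T @ err) / n
        b2 -= lr * err.mean(axis=0)
        W1 -= lr * (X.T @ dH) / n
        b1 -= lr * dH.mean(axis=0)
    return W1, b1, W2, b2

def predict_main(params, X):
    W1, b1, W2, b2 = params
    return (sigmoid(X @ W1 + b1) @ W2 + b2)[:, 0]

def main_task_rms(params, X, y_main):
    """Error on the main task only; the extra LR output is ignored here."""
    return float(np.sqrt(np.mean((predict_main(params, X) - y_main) ** 2)))

# Synthetic data and an LR fit over the same inputs to supply the extra target.
rng = np.random.default_rng(1)
X = rng.uniform(-1.5, 1.5, size=(200, 1))
y = -1.5 * X[:, 0] + 0.5 * X[:, 0] ** 2 + rng.normal(0.0, 0.1, size=200)
A = np.column_stack([np.ones(len(X)), X])
coef, *_ = np.linalg.lstsq(A, y, rcond=None)
y_lr = A @ coef

params = train_mtl(X, y, y_lr)
rms_main = main_task_rms(params, X, y)
print(rms_main)
```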

Page 27: Consumer Behavior Prediction using Parametric and Nonparametric Methods

MultiTask Learning

Page 28: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Results RMS

[Figure: RMS error by model; y-axis 0 to 12000; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 29: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Jumping Connections

Idea: fusing LR and NN

change architecture

add connections which “jump” over the hidden layer

Gives the effect of simulating a LR and NN all together
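A sketch of the jumping-connection architecture: the output is the sum of the usual hidden-layer path and a direct linear path from the inputs, so the network contains a linear regression and a neural network side by side. Starting the direct weights `c` at zero, along with the hyperparameters and data, is a choice made for this sketch and is not taken from the slides.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_jc(X, y, hidden=100, lr=0.02, epochs=2000, seed=0):
    """Network whose output is hidden path + direct (jumping) linear path."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0.0, 0.5, size=(X.shape[1], hidden))
    b1 = rng.normal(0.0, 0.5, size=hidden)
    w2 = rng.normal(0.0, 0.1, size=hidden)
    c = np.zeros(X.shape[1])   # jumping weights: input -> output, skipping the hidden layer
    b2 = 0.0
    n = len(y)
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)
        err = H @ w2 + X @ c + b2 - y
        dH = np.outer(err, w2) * H * (1.0 - H)
        w2 -= lr * (H.T @ err) / n
        c -= lr * (X.T @ err) / n          # the linear path is trained too
        b2 -= lr * err.mean()
        W1 -= lr * (X.T @ dH) / n
        b1 -= lr * dH.mean(axis=0)
    return {"W1": W1, "b1": b1, "w2": w2, "c": c, "b2": b2}

def predict_jc(p, X):
    return sigmoid(X @ p["W1"] + p["b1"]) @ p["w2"] + X @ p["c"] + p["b2"]

# Synthetic stand-in data.
rng = np.random.default_rng(1)
X = rng.uniform(-1.5, 1.5, size=(200, 1))
y = -1.5 * X[:, 0] + 0.5 * X[:, 0] ** 2 + rng.normal(0.0, 0.1, size=200)

p = train_jc(X, y)
rms_jc = float(np.sqrt(np.mean((predict_jc(p, X) - y) ** 2)))
print(rms_jc, p["c"])
```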

Page 30: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Jumping Connections

Page 31: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Results RMS

[Figure: RMS error by model; y-axis 0 to 12000; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 32: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Frozen Jumping Connections

Idea: you have the linearity, now use it!

same architecture as Jumping Connections, plus really emphasizing the linearity

freeze the weights of the jumping layer, so the network can’t “forget” about the linearity
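A sketch of the frozen variant, under several assumptions the slides do not confirm: the jumping weights are set to the fitted LR coefficients and never updated, the hidden-to-output weights start at zero so that the network initially reproduces the LR model exactly, and the output bias stays trainable. Data and hyperparameters are made up.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_lr(X, y):
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef[0], coef[1:]

def train_fjc(X, y, a, b, hidden=100, lr=0.02, epochs=2000, seed=0):
    """Jumping-connection net with jumping weights fixed to the LR
    coefficients b; only the hidden path and output bias are trained."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0.0, 0.5, size=(X.shape[1], hidden))
    b1 = rng.normal(0.0, 0.5, size=hidden)
    w2 = np.zeros(hidden)   # hidden path starts silent: net output equals LR output
    b2 = float(a)
    c = b.copy()            # frozen jumping weights: no update in the loop below
    n = len(y)
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)
        err = H @ w2 + X @ c + b2 - y
        dH = np.outer(err, w2) * H * (1.0 - H)
        w2 -= lr * (H.T @ err) / n
        b2 -= lr * err.mean()
        W1 -= lr * (X.T @ dH) / n
        b1 -= lr * dH.mean(axis=0)
    return {"W1": W1, "b1": b1, "w2": w2, "c": c, "b2": b2}

def predict_fjc(p, X):
    return sigmoid(X @ p["W1"] + p["b1"]) @ p["w2"] + X @ p["c"] + p["b2"]

def rms(u, v):
    return float(np.sqrt(np.mean((u - v) ** 2)))

# Synthetic stand-in data with a curved component that LR cannot fit.
rng = np.random.default_rng(1)
X = rng.uniform(-1.5, 1.5, size=(200, 1))
y = -1.5 * X[:, 0] + 0.5 * X[:, 0] ** 2 + rng.normal(0.0, 0.1, size=200)

a, b = fit_lr(X, y)
p = train_fjc(X, y, a, b)
rms_lr = rms(a + X @ b, y)
rms_fjc = rms(predict_fjc(p, X), y)
print(rms_lr, rms_fjc)
```

With this setup the hidden layer only has to model what the linear part leaves unexplained, and the linear part itself cannot drift during training.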

Page 33: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Frozen Jumping Connections

Page 34: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Frozen Jumping Connections

Page 35: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Frozen Jumping Connections

Page 36: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Results RMS

[Figure: RMS error by model; y-axis 0 to 12000; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 37: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Models

Baselines
– Linear Regression
– Neural Networks

Hybrids
– Smart Prior
– MultiTask Learning
– Jumping Connections
– Frozen Jumping Connections

Combinations
– Voting
– Weighted Average

Page 38: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Combining Models

Idea: Ensemble Learning

Committee Voting – equal weights for each model’s prediction

Weighted Average – optimal weights determined by a linear regression model

2 baseline and 3 hybrid models (Smart Prior, MultiTask Learning, Frozen Jumping Connections)

Page 39: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Committee Voting

Average the predictions of the models
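Committee voting as described here is an equal-weight average across models. A minimal sketch, with a hypothetical function name and made-up predictions for two weeks:

```python
def committee_vote(predictions):
    """Equal-weight average of each model's predictions.

    predictions: dict mapping model name -> list of predicted quantities,
    all lists covering the same weeks in the same order.
    """
    columns = list(predictions.values())
    n_models = len(columns)
    return [sum(week) / n_models for week in zip(*columns)]

# Made-up predictions from three models for two weeks.
preds = {
    "LR": [100.0, 220.0],
    "NN": [120.0, 200.0],
    "SmPr": [110.0, 210.0],
}
vote = committee_vote(preds)
print(vote)
```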

Page 40: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Results RMS

[Figure: RMS error by model; y-axis 0 to 12000; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 41: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Weighted Average – Model Regression

Linear regression on baselines and hybrid models to determine vote weights
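A sketch of determining vote weights by regressing the true quantities on the models' predictions. Including an intercept is an assumption, and the three "models" below are simulated as the truth plus noise of different sizes. In practice the weights would usually be fit on data held out from model training, which this toy example skips.

```python
import numpy as np

def fit_vote_weights(model_preds, q_true):
    """Least-squares weights (intercept first) for combining model predictions.

    model_preds: array of shape (n, m), one column per model.
    """
    A = np.column_stack([np.ones(len(q_true)), model_preds])
    coef, *_ = np.linalg.lstsq(A, q_true, rcond=None)
    return coef

def weighted_average(coef, model_preds):
    return coef[0] + model_preds @ coef[1:]

def rms(u, v):
    return float(np.sqrt(np.mean((u - v) ** 2)))

# Simulated truth and three models of differing accuracy.
rng = np.random.default_rng(0)
q_true = rng.uniform(1000.0, 5000.0, size=100)
model_preds = np.column_stack([
    q_true + rng.normal(0.0, 600.0, size=100),  # noisiest model
    q_true + rng.normal(0.0, 300.0, size=100),
    q_true + rng.normal(0.0, 150.0, size=100),  # most accurate model
])

coef = fit_vote_weights(model_preds, q_true)
rms_wav = rms(weighted_average(coef, model_preds), q_true)
rms_vote = rms(model_preds.mean(axis=1), q_true)  # equal-weight committee
print(coef, rms_wav, rms_vote)
```

On the data used to fit the weights, the weighted average can never do worse than equal weights, because equal weights are one of the combinations the regression considers.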

Page 42: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Results RMS

[Figure: RMS error by model; y-axis 0 to 12000; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 43: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Normalized RMS Error

Compare model performance across stores

Stores of different sizes, ages, locations, etc.

Need to normalize

Compare to baselines

Take the error of the LR benchmark as unit error
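Normalizing by the LR benchmark amounts to dividing every model's RMS error in a store by the LR error for that store. A minimal sketch with made-up error values for one store:

```python
def normalize_by_lr(rms_by_model):
    """Divide each model's RMS error by the LR error, so LR becomes 1.0."""
    lr_error = rms_by_model["LR"]
    return {name: err / lr_error for name, err in rms_by_model.items()}

# Made-up RMS errors for a single store (not results from the presentation).
store_rms = {"LR": 8000.0, "NN": 7200.0, "Vote": 6800.0}
normalized = normalize_by_lr(store_rms)
print(normalized)
```

Values below 1.0 then mean a model beats the LR benchmark in that store, regardless of the store's sales volume.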

Page 44: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Normalized RMS Error

[Figure: normalized RMS error by model; y-axis 0.75 to 1.10; x-axis LR, NN, SmPr, MTL, JC, FJC, Vote, WAV]

Page 45: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Conclusions

Clearly improved models for customer choice prediction

Will allow stores to price the products more strategically and optimize profits

Maintain better inventories

Understand product interaction

Page 46: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Future Work Ideas

analyze Weighted Average model

compare extrapolation ability of new models

use other domain knowledge
– shrinkage model – a “super” store model with data pooled across all stores

Page 47: Consumer Behavior Prediction using Parametric and Nonparametric Methods

Acknowledgements

I would like to thank my advisors

and

my CALDling friends and colleagues

Page 48: Consumer Behavior Prediction using Parametric and Nonparametric Methods

The Most Important Slide

for this presentation and the paper:

www.cs.cmu.edu/~eneva/research.htm

