1 data mining at work krithi ramamritham. 2 dynamics of web data dynamically created web pages --...

15
1 Data Mining at work Krithi Ramamritham

Upload: laurel-griselda-goodman

Post on 19-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

1

Data Miningat work

Krithi Ramamritham

Page 2: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

2

Dynamics of Web Data Dynamically created Web Pages

-- using scripting languages

Ad Component

Headline Component

Headline Component

Headline Component

Headline Component

Personalized Component

Navig

ati

on C

om

ponent

Page 3: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

3

1. What to deliver?

Page content may be based on • queries on dynamically changing data

– e.g., sports scores, stock prices, environment

• type of access device• time and location of access/user

Existing sites may contain new information

New sites (URLs) may come into being

Page 4: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

4

2. How to deliver?

Data sources

Proxies/caches

End-hosts

servers

sensors

wired host

mobile host

Netw

ork

Netw

ork

Page 5: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

5

Keep Data Up-to-date

• Update Mumbai temperature every 2 degrees

• The proxy obtains data from the source(s)

• Maintains | | UU((tt) - ) - SS((tt) | <= ) | <= 22

SourceS(t)

Proxy / DBP(t)

UserU(t)

Page 6: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

6

When to poll the source?

After a specific interval

Server Proxy UserPull

Based on temporal data mining – time series analysis – and prediction of when change will exceed 2 degrees

Page 7: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

7

Where to do the work?

• Diverse client devices– Differ in hardware, software,

network connectivity,

form factor

• Web content needs to be tailored for each client type

Each response depends not only on the requested URL but also on the capabilities of the client

Page 8: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

8

Transcoding

Conversion of one data version to another–Decreasing Image Quality (JPEG quality level) and size

- “convert” utility in Linux–Summarizing text

transcode =>

Info extraction/retrieval/

classification

Page 9: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

9

Who should transcode?

1. Download desired version from server

2. Transcode higher version locally

• Factors influencing decision– Transcoding Complexity– Proxy-server network connection – Load on proxy

(Multiple Linear) Regression Predict based on a (linear) model of overheads

Page 10: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

10

What is new on the Web?

How is the monsoon progressing?

Time series analysis:Change prediction, pattern mining

Page 11: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

‘Bhav Puchiye’

www.broadmoor.com

Interface for Bhav Puchiye

Page 12: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

Inverted Pyramid Interfaces

Inverted pyramid approach

Conclusion

Findings

Discussions

Conclusion

Discussions

Findings

Background & related Information

Background & related Information

Page 13: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

Bhav Poochiye

Pricing Module developed

for selected commodities

for selected markets

for selected areas

DEMO

Page 14: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

14

Building Usage Profiles

Estimate access probabilities based on:

• Current user/community navigational patterns over site contents

(in the form of click streams)

• Historical user/community access patterns over site contents

(in the form of association rules)

Cluster needs based onlocation, income/age of user, time-of-day

Page 15: 1 Data Mining at work Krithi Ramamritham. 2 Dynamics of Web Data Dynamically created Web Pages -- using scripting languages Ad Component Headline Component

15

Data Mining

From datato information

to knowledge

to money!