new visual analytics and data mining in...

9
1 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 1 Visual Analytics and Data Mining Visual Analytics and Data Mining in S in S-T- applications applications Gennady Andrienko & Natalia Andrienko Fraunhofer Institute AIS Sankt Augustin Germany http://www.ais.fraunhofer.de/and Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 2 A View on S A View on S- T Data Mining T Data Mining Method(s) Input data Output data May have many parameters; May be computationally intensive Need to be interpreted; To be used for directing further analysis Complex and multidimensional; May contain errors Data Complexities : 1) Space, 2) Time, 3) Multiple attributes & dimensions 4) Outliers, discontinuities

Upload: others

Post on 11-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

1

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 1

Visual Analytics and Data Mining Visual Analytics and Data Mining in Sin S--TT--applicationsapplications

Gennady Andrienko & Natalia AndrienkoFraunhofer Institute AISSankt AugustinGermanyhttp://www.ais.fraunhofer.de/and

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 2

A View on SA View on S--T Data MiningT Data Mining

Method(s)

Input data

Output data

May have many parameters;May be computationally intensive

Need to be interpreted;To be used for directing further analysis

Complex and multidimensional;May contain errors Data

Complexities:

1) Space, 2) Time, 3) Multiple attributes & dimensions4) Outliers, discontinuities

Page 2: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

2

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 3

ComplexitiesComplexities• Number of attributes• Length of time series• Number of spatial objects• High dimensionality • Abrupt temporal changes• Great variability of values

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 4

Complexities: example 1Complexities: example 1

Page 3: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

3

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 5

Complexities: example 2Complexities: example 2

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 6

Aggregation method 1: by intervalsAggregation method 1: by intervals

Page 4: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

4

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 7

Aggregation method 2: by Aggregation method 2: by quantilesquantiles

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 8

Attend to particulars: high variationAttend to particulars: high variation

1. Aggregate time graph by quantiles2. Save counts3. Visualise e.g. on a scatter plot4. Select items with high variation

Page 5: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

5

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 9

Attend to particulars: high fluctuationAttend to particulars: high fluctuation

• Select items with maximal number of jumps between quantiles

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 10

Attend to particulars: stable extremesAttend to particulars: stable extremes

• Select items being always in the topmost 10%

Page 6: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

6

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 11

Attend to particulars: extreme changesAttend to particulars: extreme changes

1. Transform the time graph to show changes

2. Select extreme changes in a specific year (here 2003)

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 12

Visual Analytics in Data MiningVisual Analytics in Data Mining

Data preview

visualisation, display

manipulation, etc.

Method selection

Data preparation

data manipulation

Method application

Result exploration

and interpretation

visualisation, etc.

Page 7: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

7

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 13

Requirements to Visual AnalyticsRequirements to Visual Analytics• Space- and Time-awareness• Work with complex multidimensional data• Support for uncertain and missing data• Scalability• Support and encouraging of several

complementary views on the same data• Dynamic linking and coordination of several data

displays• From the overall view to particulars of interest• From idea generation to hypothesis testing using

statistical methods, followed by reporting

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 14

Potentially useful tools for MSTDPotentially useful tools for MSTDInformation visualisation tools, for example, HCE & TimeSearcher from HCIL, Univ. MarylandGeovisualisation tools, for example GeoVistaStudio (Penn State Univ.) and Descartes/CommonGIS (Fraunhofer Institute AIS)Graphical statistics tools, for example, Manet & Mondrian (Augsburg Univ.)

Usually such systems are research prototypes that implement innovative ideas, but provide restricted functionality and limited user support

Page 8: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

8

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 15

Still open issues (for all tools!)Still open issues (for all tools!)Work with qualitative (non-numeric) dataWork with fuzzy, uncertain, and missing dataContinue scalability effortsSupport in processing and management of findings: recording, structuring, browsing, searching, checking, combining, interpreting…Help in visual communication of derived data, constructed knowledge, and recommended decisionsAdaptability to user, data, tasks, and hardwareEmbedding intelligence into software for helping users and avoiding cognitive overload

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 16

EDA: from Practice to Practical TheoryEDA: from Practice to Practical Theory

DataTasksToolsPrinciples

to appear ≈ end 2005

Page 9: New Visual Analytics and Data Mining in S-T-applicationsgeoanalytics.net/and/slides/2005-10-02-VisualAnalytic... · 2007. 11. 6. · 8 Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005

9

Mining Spatio-Temporal Data @ PKDD, Porto, 2.10.2005 17

Visual Analytics & Data MiningVisual Analytics & Data Mining1. Do they need each other?2. How to benefit from combining two

scientific disciplines and related technologies?

3. How to develop each of two scientific disciplines for achieving a synergy?