integrating predictive models with interactive visualization · with interactive visualization jian...

65
Integrating Predictive Models with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University of Waterloo www.jeffjianzhao.com | [email protected]

Upload: others

Post on 04-Aug-2020

21 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Integrating Predictive Models with Interactive Visualization

Jian Zhao, Ph.D., Assistant ProfessorCheriton School of Computer Science

University of Waterloowww.jeffjianzhao.com | [email protected]

Page 2: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Short bio

2015 2019

20162009

Assistant Professor@ U Waterloo

Researcher@ FXPAL, Palo Alto

Researcher@ Autodesk, Toronto

Ph.D.@ U Toronto

Page 3: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Data Machines Humans

All continuously growing fast!

Page 4: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

I investigate advanced visualizations (vis) that promote the interplay among data, machines (models), and humans (users) in real-world data science applications.

Page 5: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Bella, Data Scientist

“My input data looks similar, but my classifier performs quite different… Why?”

Page 6: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Matejka et al, Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing, CHI’17

Page 7: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Bella, Data Scientist

“I’m building a neural network classifier. I tried many ways, but it doesn’t work… Why?”

Black box

Page 8: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Tensor Flow Playground, http://playground.tensorflow.org/

Page 9: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Bella, Data Scientist

“I finally got some good results, but my boss couldn’t understand them...”

Page 10: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 11: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Visualization is critical in data analysis workflow

Make sense of data

Make sense of models

Make sense of results

Data exploration

Model explanation

Results communication

Page 12: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Top machine learning and data science methods used at work

http://businessoverbroadway.com/top-machine-learning-and-data-science-methods-used-at-work

Page 13: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Creating effective visualizations is hardProblem/domain specific

No easy one-size-fits-all solution

Technical skillsMatplotlib, D3.js, ggplot2, …

Sense of designHuge design space

Page 14: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

PredictionRecommendation

TablesNetworks

Text & Images…

Data analystsGeneral users

VIS

Make sense of data Make sense of models Make sense of results

Page 15: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Make sense of data Make sense of models Make sense of results

MOOCex

Comprehend missing link prediction in bipartite networks

Leverage video recommendations in online learning

Explore complex data with visualization recommendations

MissBiNChartSeer

Page 16: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Make sense of data

Page 17: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Exploring large information space

???

Page 18: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Challenges Continuously making decision in a large parameter space

Which data variables to explore? What kind of charts to use?

Lacking a holistic view of the analysis spaceHow is the current status?Where am I?

Page 19: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Exploring large information space with recommendation

Page 20: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

J. Zhao, M. Fan, M. Feng, ChartSeer: Interactive Steering Exploratory Visual Analysis with Machine Intelligence, TVCG

ChartSeer

Page 21: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 22: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

System architecture

Page 23: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Chart summarization

Chart clusters

Variables used

Chart glyphs

Analysis space

Page 24: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Controlled user studyBetween-subjects design

24 participants (13 females and 11 males)

Interface conditionsChartSeer v.s. Baseline

DatasetUS college statistics (18 variables)

TasksSummarization taskExploration task

Page 25: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Results of user behaviors Participants added more charts but updated less charts using ChartSeer

ChartSeer led to a broader range of data variables and visual encodings

ChartSeer encouraged more focused exploration of data variables

ChartSeer allowed for data exploration from more heterogenous visual perspectives

ChartSeer Baseline

Page 26: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Questionnaire results

ChartSeer Baseline

Page 27: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Make sense of models

Page 28: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 29: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

“Missing” links in bipartite networks

customer

product

E

21

3

5

4

C

A

D

B

???

Page 30: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Missing link prediction

C – 5: 0.974D – 2: 0.965E – 1: 0.873B – 3: 0.852…

E

21

3

5

4

C

A

D

B

Black box

Page 31: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Analysts’ questions

What are the missing links?

Whyis a link missing?

How does a missing link impact?

Page 32: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

MissBiN

A missing link prediction algorithm

An interactive visualization

A comparative analysis approach

What are the missing links?

Whyis a link missing?

How does a missing link impact?

J. Zhao, M. Sun, F. Chen, P, Chiu, MissBiN: Visual Analysis of Missing Links in Bipartite Networks, VIS’19J. Zhao, M. Sun, F. Chen, P, Chiu, Understanding Missing Links in Bipartite Networks with MissBiN, TVCG

Page 33: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Addressing the questions with MissBiN

A missing link prediction algorithm

An interactive visualization

A comparative analysis approach

What are the missing links?

Whyis a link missing?

How does a missing link impact?

Page 34: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

1. Predict the missing links with standard methods (e.g., common neighbors [Chang12])

2. Discover all maximal bicliques, complete subgraphs, of the network (e.g., using MBEA [Zhang14])

3. Re-rank the missing links based on the overlap of bicliques

Prediction of missing links

Page 35: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

In step3, for each pair of bicliques, …

Area(M1)

Area(M2 + M3 + M4 + M5)M4

M5

M1M2

M3

Xi

Xj

Yj

Yi

Page 36: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Re-ranking predicted missing links

!′! = $! % !!

Weights computed in step3, based on bicliques information

Scores computed in step1, based on standard methods

Page 37: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Test on 3 datasetsPerson-place network from Atlantic Storm corpus [Hughes05]User-conversation network from Slack group communication

Compare with 5 base methods Jaccard coefficient (JA)common neighbors (CN)Adamic-Adar coefficient (AA)preferential attachment (PA)random walk (RW)

Evaluation of missing link prediction

Page 38: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Link prediction resultsMostly, PA has the largest performance gainSecondly, CN performs well

Jaccard coefficient (JA), common neighbors (CN), Adamic-Adar coefficient (AA), preferential attachment (PA), random walk (RW)

Orig

inal

met

hod

Our

met

hod

Perfo

rman

ce g

ain

Page 39: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Addressing the questions with MissBiN

A missing link prediction algorithm

An interactive visualization

A comparative analysis approach

What are the missing links?

Whyis a link missing?

How does a missing link impact?

Page 40: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 41: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 42: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 43: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 44: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 45: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Interview studyA management school professor on exploring organizational communication networksA computer scientist on investigating relationships of crimes and locations in Washington DC

Case study The Sign of the Crescent [Hughes03]

41 fictional intelligence reportsExtracted person-location network

49 persons and 104 locations, with 328 links Analysis task

Identify suspicious persons and activities from the reports

Evaluation of MissBiN

Page 46: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Make sense of results

Page 47: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Exploring large information space with recommendation

Page 48: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Current interfaces: ranked lists

Page 49: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Linear ranked list is not enough

Semantic map significantly improves users’ comprehension capability compared to a ranked list [Peltonen 2017]

Orienteering helps understand and trust the answers using both prior and contextual information [Teevan 2004]

Support stepping behavior by clustering the information or suggesting query refinements [Teevan 2004]

Page 50: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Mike, the confusedWant to solve an optimization problem in his workJust watched #19 – choosing stepsize and convergence criteria

Recommendations: 1. Sparse models selection2. Dirichlet distribution3. Gradient descent intuition4. Hill climbing5. …

Page 51: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

J. Zhao, C. Bhatt, M. Cooper, D. Shamma, Flexible Learning with Semantic Visual Exploration and Sequence-Based Recommendation of MOOC Videos, CHI’18

MOOCex

Page 52: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University
Page 53: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Current course

Current videoNeighboring videos (learning context)

Recommendation Topics & keywords(sub-region)

Projection based on semantics and

context

Page 54: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Zhao et al, Flexible Learning with Semantic Visual Exploration and Sequence-Based Recommendation of MOOC Videos, CHI’18

Page 55: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

System architecture

Page 56: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Recommendation engine Content-based recommendation

Based on TF-IDF

Sequence-based re-rankingTopic similarity score (TS)Global sequence score (GS)Local sequence score (LS)

Sub-sequence aggregationGreedy search down the ranked list

Dataset~4000 videos, ~350 hours running time, from Coursera, EdX, and Udacity

Page 57: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Visualization generationMultidimensional scaling (MDS) in feature space

Rotate to comply with left-right browsing flowTune positions to avoid overlapMerge consecutive videos

Hierarchical clusteringContext-based region divisionVoronoi tessellation

Topical keywords extractionForce-directed placement

Page 58: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Scenario I: “I missed anything?”

Mike

Confused about this lecture.Wants to check if missed

anything.

Page 59: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Scenario II: “I want to know more.”

Lisa

Already knows about this.Wants to extend her horizon.

Page 60: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Used by MOOC instructorsSemi-structured interviews with two university instructors

“I normally don’t look at what others teach, but the tool provides the awareness of related lectures, so I could borrow some materials to enhance my lecture, and avoid unnecessary duplication.” “If you see one lecture is here [on the Exploration Canvas], then you go very far for the second lecture, and back here again for the third lecture, you should really think about reordering the content presented in the videos.”

Page 61: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

One more thing…

Page 62: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Thank all my collaborators!

Available on https://www.jeffjianzhao.com/webapp/EgoLines/egolines.html

Page 63: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Another thing…

Page 64: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Welcome to apply to Waterloo HCI

http://hci.cs.uwaterloo.ca/

Page 65: Integrating Predictive Models with Interactive Visualization · with Interactive Visualization Jian Zhao, Ph.D., Assistant Professor Cheriton School of Computer Science University

Integrating Predictive Models with Interactive Visualization

Jian Zhao, Ph.D., Assistant ProfessorCheriton School of Computer ScienceUniversity of [email protected]