msc thesis presentation

31
Querying for Model- Querying for Model- Driven Spreadsheets Driven Spreadsheets Rui Pereira Prof. João Saraiva(Supervisor) Prof. Orlando Belo (Co-Supervisor) University of Minho University of Minho Department of Informatics Department of Informatics

Upload: rui-pereira

Post on 25-May-2015

91 views

Category:

Presentations & Public Speaking


0 download

DESCRIPTION

Querying for model-driven spreadsheets

TRANSCRIPT

Page 1: Msc Thesis Presentation

Querying for Model-Querying for Model-Driven SpreadsheetsDriven Spreadsheets

Rui PereiraProf. João Saraiva(Supervisor)

Prof. Orlando Belo (Co-Supervisor)

University of MinhoUniversity of MinhoDepartment of InformaticsDepartment of Informatics

Page 2: Msc Thesis Presentation

Problem: Querying Spreadsheets

2

Page 3: Msc Thesis Presentation

Problem: Querying Spreadsheets

3

Spreadsheets are generally used to store

information

Much like database systems

Have no way to query what data they contain

Attempts have been made to replicate

traditional querying systems for spreadsheets

MS-Query

Google QUERY Function

Page 4: Msc Thesis Presentation

Problem: Querying Spreadsheets

4

Highly restrict user’s spreadsheet data

Requires data to be in a tabular format

Requires header to be present (in first row)

Column letters ≠ Column labels

Counter intuitive queries

Data evolution not supported

Page 5: Msc Thesis Presentation

Example5

Page 6: Msc Thesis Presentation

Example: Running Data6

Model-Driven spreadsheet representing: orders, clients, and products

Page 7: Msc Thesis Presentation

Example: Running Data7

Spreadsheet data instance

Page 8: Msc Thesis Presentation

Example: Running Query8

Query: How much have we profited from each client?

Page 9: Msc Thesis Presentation

Solution: Google QUERY Function

9

Page 10: Msc Thesis Presentation

Solution: Google QUERY Function

10

Denormalization

Page 11: Msc Thesis Presentation

Solution: Google QUERY Function

11

Google Query:

Page 12: Msc Thesis Presentation

Solution: Google QUERY Function

12

Denormalization

Page 13: Msc Thesis Presentation

Solution: Google QUERY Function

13

Google Query:

Page 14: Msc Thesis Presentation

Solution: Envisioned Approach

14

Page 15: Msc Thesis Presentation

Solution: Envisioned Approach

15

Proposed Query: SELECT Client.*, sum(TotalPrice)FROM Client JOIN OrderGROUP BY Client.*LABEL sum(TotalPrice) ‘Profits’

Page 16: Msc Thesis Presentation

Solution: Envisioned Approach

16

Query: How much have we profited from each client?

Page 17: Msc Thesis Presentation

Solution: Envisioned Approach

17

Model Inference

Page 18: Msc Thesis Presentation

Solution: Envisioned Approach

18

Model Inference

Spreadsheet data instance

Page 19: Msc Thesis Presentation

QuerySheet: Architecture19

Page 20: Msc Thesis Presentation

QuerySheet: Architecture20

SELECT Pilots.*, Sum(Hours)WHERE Destination = ‘LIS’GROUP BY Pilots.*LABEL Sum(Hours) ‘Total Hours’

=query(A1:F53; “SELECT B, SUM(F) WHERE D = ‘LIS’ GROUP BY B LABEL SUM(F) ‘Total Hours’”

conforms to

conforms to

denormalization

model inference

execution

translation

Page 21: Msc Thesis Presentation

Preliminary Study21

Page 22: Msc Thesis Presentation

Preliminary Study22

Min

ut

es

QuerySheet was found to be approx. ~70% faster than Google’s QUERY Function

Page 23: Msc Thesis Presentation

Preliminary Study23

After each question, students were asked to

choose which system they preferred in

regards to being:

Intuitive

Faster to write

Easier to write

Understandable

Almost all prefered QuerySheet on every aspect

Page 24: Msc Thesis Presentation

Contributions24

Page 25: Msc Thesis Presentation

Contributions25

Querying Model-Driven Spreadsheets, Jácome Cunha, João Fernandes, Jorge

Mendes, Rui Pereira, and João Saraiva. In proceedings of the IEEE Symposium on

Visual Languages and Human-Centric Computing (VL/HCC 2013), San Jose, CA,

USA. IEEE Computer Society, September 2013.

QuerySheet: A Bidirectional Query Environment for Model-Driven

Spreadsheets, Orlando Belo, Jácome Cunha, João Fernandes, Jorge Mendes, Rui

Pereira, and João Saraiva. In proceedings of the IEEE Symposium on Visual

Languages and Human-Centric Computing (VL/HCC 2013), San Jose, CA, USA. IEEE

Computer Society, September 2013.

Spreadsheet Querying, Rui Pereira. In proceedings of the 5th International School

on Domain Specific Languages (DSL’2013), Cluj-Napoca, Romania, 2013. (to

appear)

Graphical Querying of Model-Driven Spreadsheet, Jácome Cunha, João Paulo

Fernandes, Rui Pereira, and João Saraiva. In proceedings of the 16th International

Conference on Human-Computer Interaction (HCI2014), Heraklion, Crete, Greece,

June 2014. (submitted extended abstract)

QuerySheet Software Prototype. (Available at ssaapp.di.uminho.pt)

Page 26: Msc Thesis Presentation

Conclusion26

Page 27: Msc Thesis Presentation

Conclusion27

Spreadsheet querying is more:

Humanized

Understandable

Robust

Productive

More functionality

Automatically deal with problems caused by

querying denormalized data

Page 28: Msc Thesis Presentation

Future Work28

Page 29: Msc Thesis Presentation

Future Work29

Incremental Denormalization

Graphical Querying Interface

Query/Model/Instance Bidirectionality

Evaluation

Page 30: Msc Thesis Presentation

30

Page 31: Msc Thesis Presentation

Querying for Model-Querying for Model-Driven SpreadsheetsDriven Spreadsheets

Rui PereiraProf. João Saraiva(Supervisor)

Prof. Orlando Belo (Co-Supervisor)

University of MinhoUniversity of MinhoDepartment of InformaticsDepartment of Informatics