astrodas : sharing assertions across astronomy catalogues through distributed annotation

30
AstroDAS: Sharing Assertions across Astronomy Catalogues through Distributed Annotation Rajendra Bose, Robert G. Mann, Diego Prina- Ricotti Digital Curation Centre 4 May 2006 International Provenance and Annotation Workshop (IPAW’06)

Upload: isolde

Post on 24-Jan-2016

23 views

Category:

Documents


0 download

DESCRIPTION

AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation. Rajendra Bose, Robert G. Mann, Diego Prina-Ricotti. Digital Curation Centre. 4 May 2006 International Provenance and Annotation Workshop (IPAW’06). Outline. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

AstroDAS: Sharing Assertionsacross Astronomy Cataloguesthrough Distributed Annotation

Rajendra Bose, Robert G. Mann, Diego Prina-Ricotti

Digital Curation Centre

4 May 2006

International Provenance and Annotation Workshop (IPAW’06)

Page 2: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Outline

1. Astronomy catalogues and existing OpenSkyQuery system

2. Custom cross-matching algorithms: AstroDAS

3. How AstroDAS compares to other annotation systems

Page 3: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

SDSS (Visual)

TWOMASS (Infrared)

Page 4: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

SDSS (Visual)

TWOMASS (Infrared)

Page 5: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

AstroDAS: Astronomy Distributed Annotation SystemExample astronomy catalogue schema

Page 6: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

SDSS:Sky node

TWOMASS:Sky node

USNOB:Sky node

:OpenSky Query client

OpenSkyQuery Portal

OpenSkyQuery

SELECTs.objid, t.objid, u.objid,s.ra, s.dec, s.type,t.ra, t.dec,u.ra, u.decFROMSDSS:photoprimary s, TWOMASS:photoprimary t, USNOB:photoprimary uWHEREXMATCH(s,t,u)<3.5 ANDRegion(’CircleJ2000 200 -1 0.02’) ANDs.type=3 AND

ADQL query

SDSS: Sloan Digital Sky Survey

TWOMASS: the Two Micron All Sky Survey

USNOB: U.S. Naval Observatory USNO-B1.0 catalogue

X-Match cross-matching algorithm built into OpenSkyQuery

based on spatial proximity; user specifies parameter sigmawhich encodes tolerance of match

AstroDAS: Astronomy Distributed Annotation SystemExisting OpenSkyQuery system for astronomy catalogue access

National Virtual Observatory. (2006). Open SkyQuery Help: The XMatch Algorithm <http://openskyquery.net/Sky/SkySite/help/algo.asp>

Page 7: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

AstroDAS: Astronomy Distributed Annotation SystemExisting OpenSkyQuery system for astronomy catalogue access

http://openskyquery.net/

Page 8: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

SDSS:Sky node

TWOMASS:Sky node

USNOB:Sky node

:OpenSky Query client

OpenSkyQuery Portal

OpenSkyQuery

SELECTs.objid, t.objid, u.objid,s.ra, s.dec, s.type,t.ra, t.dec,u.ra, u.decFROMSDSS:photoprimary s, TWOMASS:photoprimary t, USNOB:photoprimary uWHEREXMATCH(s,t,u)<3.5 ANDRegion(’CircleJ2000 200 -1 0.02’) ANDs.type=3 AND

ADQL query

AstroDAS: Astronomy Distributed Annotation System

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Existing OpenSkyQuery system provides X-Match results based on proximity

Page 9: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

SDSS:Sky node

TWOMASS:Sky node

USNOB:Sky node

:OpenSky Query client

OpenSkyQuery Portal

OpenSkyQuery

SELECTs.objid, t.objid, u.objid,s.ra, s.dec, s.type,t.ra, t.dec,u.ra, u.decFROMSDSS:photoprimary s, TWOMASS:photoprimary t, USNOB:photoprimary uWHEREXMATCH(s,t,u)<3.5 ANDRegion(’CircleJ2000 200 -1 0.02’) ANDs.type=3 AND

ADQL query

AstroDAS: Astronomy Distributed Annotation System

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

But X-Match results based on proximity not always adequate

Catalogue1

Catalogue2

Page 10: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

SDSS:Sky node

TWOMASS:Sky node

USNOB:Sky node

:OpenSky Query client

OpenSkyQuery Portal

OpenSkyQuery

SELECTs.objid, t.objid, u.objid,s.ra, s.dec, s.type,t.ra, t.dec,u.ra, u.decFROMSDSS:photoprimary s, TWOMASS:photoprimary t, USNOB:photoprimary uWHEREXMATCH(s,t,u)<3.5 ANDRegion(’CircleJ2000 200 -1 0.02’) ANDs.type=3 AND

ADQL query

AstroDAS: Astronomy Distributed Annotation System

UEdinburgh:<SDSS, 58778470> ↔ <TWOMASS, 35773478><SDSS, 58778470> ↔ <USNOB, 13317280><SDSS, 58777039> ↔ <USNOB, 13317184>

URome:<SDSS, 58778583> ↔ <TWOMASS, 35777512>

So group produces its own cross-match results

Page 11: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

URome:AstroDAS

Server

SDSS:Sky node

UEdinburgh:AstroDAS

Server

TWOMASS:Sky node

USNOB:Sky node

:OpenSky Query client

:AstroDAS client

AstroDAS Portal

OpenSkyQuery Portal

AstroDAS

OpenSkyQuery

AstroDAS: Astronomy Distributed Annotation System

UEdinburgh:<SDSS, 58778470> ↔ <TWOMASS, 35773478><SDSS, 58778470> ↔ <USNOB, 13317280><SDSS, 58777039> ↔ <USNOB, 13317184>

URome:<SDSS, 58778583> ↔ <TWOMASS, 35777512>

SELECTs.objid, t.objid, u.objid,s.ra, s.dec, s.type,t.ra, t.dec,u.ra, u.decFROMSDSS:photoprimary s, TWOMASS:photoprimary t, USNOB:photoprimary uWHEREXMATCH(s,t,u)<3.5 ANDRegion(’CircleJ2000 200 -1 0.02’) ANDs.type=3 AND

ADQL query

Storing annotations to map database objects

Page 12: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

AstroDAS: Astronomy Distributed Annotation SystemStoring annotations to map database objects

db_object db_object

SDSS_58778470 TWOMASS_35773478

annote1 author annote_sourceSAME OBJECT

(algorithm1) researcher1

(algorithm1) researcher1

… … … … ……

id

1

2

3

SDSS_58778470 USNOB_13317280

UEdinburgh:<SDSS, 58778470> ↔ <TWOMASS, 35773478><SDSS, 58778470> ↔ <USNOB, 13317280><SDSS, 58777039> ↔ <USNOB, 13317184>

URome:<SDSS, 58778583> ↔ <TWOMASS, 35777512>

SAME OBJECT

NOT SAME OBJECT

(algorithm2) researcher2SDSS_58778470 TWOMASS_35773478

Page 13: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

URome:AstroDAS

Server

SDSS:Sky node

UEdinburgh:AstroDAS

Server

TWOMASS:Sky node

USNOB:Sky node

:OpenSky Query client

:AstroDAS client

AstroDAS Portal

OpenSkyQuery Portal

null3577751258778583

13317184null58777039

133172803577347858778470

USNOBTWOMASSSDSSAstroDAS

OpenSkyQuery

mapping table created dynamically from annotations

12

3

4

AstroDAS: Astronomy Distributed Annotation SystemQuerying annotations on astronomy catalogues

SELECTs.objid, s.ra, s.dec, s.type,t.objid, t.ra, t.decu.objid, u.ra, u.decFROMSDSS:photoprimary s, TWOMASS:photoprimary t, USNOB:photoprimary uAS:UEdinburgh e, AS:URome rWHERERegion(’CircleJ2000 200 -1 0.02’) ANDs.type=3 ANDe.author=’algorithm1’ ANDr.author=’algorithm2’

DSQL query

Page 14: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

URome:AstroDAS

Server

SDSS:Sky node

UEdinburgh:AstroDAS

Server

TWOMASS:Sky node

USNOB:Sky node

:OpenSky Query client

:AstroDAS client

AstroDAS Portal

OpenSkyQuery Portal

null3577751258778583

13317184null58777039

133172803577347858778470

USNOBTWOMASSSDSSAstroDAS

OpenSkyQuery

mapping table created dynamically from annotations

12

3

4

AstroDAS: Astronomy Distributed Annotation SystemCreating a mapping table from stored annotations: inference

SELECTs.objid, s.ra, s.dec, s.type,t.objid, t.ra, t.decu.objid, u.ra, u.decFROMSDSS:photoprimary s, TWOMASS:photoprimary t, USNOB:photoprimary uAS:UEdinburgh e, AS:URome rWHERERegion(’CircleJ2000 200 -1 0.02’) ANDs.type=3 ANDe.author=’algorithm1’ ANDr.author=’algorithm2’

DSQL query

UEdinburgh:<SDSS, 58778470> ↔ <TWOMASS, 35773478><SDSS, 58778470> ↔ <USNOB, 13317280><SDSS, 58777039> ↔ <USNOB, 13317184>

URome:<SDSS, 58778583> ↔ <TWOMASS, 35777512>

Page 15: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Outline

1. Astronomy catalogues and existing OpenSkyQuery system

2. Custom cross-matching algorithms: AstroDAS

3. How AstroDAS compares to other annotation systems

Page 16: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

http://www.wormbase.org/db/das/elegans/features?segment=Ihttp:// <server> /das/<source>/<command>?<args>

BioDAS: Biology Distributed Annotation System (Dowell 2001)Example 1: Genome annotation and BioDAS

Dowell, R., Jokerst, R., Day, A., Eddy, S., & Stein, L. (2001). The Distributed Annotation System. BMC Bioinformatics, 2(7).

Page 17: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Ensembl system which includes BioDAS functionalityExample 1: Genome annotation and BioDAS

http://www.ensembl.org

Page 18: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Ensembl system which includes BioDAS functionalityExample 1: Genome annotation and BioDAS

http://www.ensembl.org

Page 19: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation
Page 20: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Outline

1. Astronomy catalogues and existing OpenSkyQuery system

2. Custom cross-matching algorithms: AstroDAS

3. How AstroDAS compares to other annotation systems

Page 21: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

AstroDAS: Sharing Assertionsacross Astronomy Cataloguesthrough Distributed Annotation

Rajendra Bose, Robert G. Mann, Diego Prina-Ricotti

Digital Curation Centre

4 May 2006

International Provenance and Annotation Workshop (IPAW’06)

Page 22: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation
Page 23: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Annotation of the Malaria Mosquito Anopheles gambiae genome sequenceExample 1: Genome annotation and BioDAS

The Genome Sequence of the Malaria Mosquito Anopheles gambiae, Robert A. Holt, et al., Science 4 October 2002: Vol. 298. no. 5591, pp. 129 - 149 DOI: 10.1126/science.1076181; http://www.sciencemag.org/cgi/data/298/5591/129/DC2/1

Page 24: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Annotation of the Malaria Mosquito Anopheles gambiae genome sequenceExample 1: Genome annotation and BioDAS

The Genome Sequence of the Malaria Mosquito Anopheles gambiae, Robert A. Holt, et al., Science 4 October 2002: Vol. 298. no. 5591, pp. 129 - 149 DOI: 10.1126/science.1076181; http://www.sciencemag.org/cgi/data/298/5591/129/DC2/1

Page 25: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Annotation of the Malaria Mosquito Anopheles gambiae genome sequenceExample 1: Genome annotation and BioDAS

The Genome Sequence of the Malaria Mosquito Anopheles gambiae, Robert A. Holt, et al., Science 4 October 2002: Vol. 298. no. 5591, pp. 129 - 149 DOI: 10.1126/science.1076181; http://www.sciencemag.org/cgi/data/298/5591/129/DC2/1

Page 26: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Lauer, Kim P., Llorente, Isabel, Blair, Eric, Seto, Jason, Krasnov, Vladimir, Purkayastha, Anjan, Ditty, Susan E., Hadfield, Ted L., Buck, Charles, Tibbetts, Clark, Seto, DonaldNatural variation among human adenoviruses: genome sequence and annotation of human adenovirus serotype 1J Gen Virol 2004 85: 2615-2625http://intl-vir.sgmjournals.org/cgi/citmgr?gca=vir;85/9/2615

Example of genome annotation from the biological literatureExample 1: Genome annotation and BioDAS

Page 27: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Human Brain Project (HBP) image annotation (Gertz 2002, 2003)Example 2: Medical image annotation

http://neuroscience.ucdavis.edu/hbp/viewer.html

Page 28: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Human Brain Project (HBP) image annotation (Gertz 2002, 2003)Example 2: Medical image annotation

Gertz, M., Sattler, K.-U., Gorin, F., Hogarth, M., & Stone, J. (2002). Annotating Scientific Images: A Concept-based Approach. Proceedings of the 14th International Conference on Scientific and Statistical Database Management (SSDBM 2002), Edinburgh, Scotland. IEEE Computer Society.Gertz, M., & Sattler, K. U. (2003). Integrating scientific data through external, concept-based annotations. In Efficiency and Effectiveness of Xml Tools and Techniques and Data Integration over the Web (Vol. 2590, pp. 220-240).

Page 29: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

Edinburgh Mouse Atlas Project (EMAP) (Baldock 1999)Example 2: Medical image annotation

Baldock, R. A., Dubreuil, C., Hill, W., & Davidson, D. (1999). The Edinburgh Mouse Atlas: Basic Structure and Informatics. In S. I. Letovsky (Ed.), Bioinformatics: Databases and Systems (pp. 129-140). Kluwer Academic Publishers. (See http://genex.hgu.mrc.ac.uk/)

Page 30: AstroDAS : Sharing Assertions across Astronomy Catalogues through Distributed Annotation

AstroDAS: Astronomy Distributed Annotation SystemStoring annotations to map database objects

db_object db_object

SDSS_112233 TWOMASS_445566

annote1 annote2 annote_sourceSAME OBJECT

(algorithm1) GROUP1

NOT SAME OBJECT

(algorithm2) GROUP2

NOT SAME OBJECT

(algorithm1) GROUP1

… … … … ……

id

1

2

3

SDSS_112233

SDSS_112233 TWOMASS_778899

TWOMASS_445566