cafe variome: connecting diagnostic networks, disease consortia and diverse third parties - raymond...

Openly share the ‘existence’ rather than the ‘substance’ of the data…thereafter variably manage data access

Connecting Diagnostic Networks

• Need to enable disease consortia to identify patients with similar phenotypes or to identify patients harbouring the same variant(s)

• Currently not possible due to difficulties of data sharing between labs, or with central repositories

• Cafe Variome can solve this...Simple to install and can be deployed either– on a server at one or more of a network of labs– or, hosted by the Cafe Variome team

The Cafe Variome Solution

• Allows 'open discovery' of the existence (rather than actual substance) of relevant data

• Thereby, enables networks of labs to easily query for the existence of patients or variants, without necessarily revealing additional underlying data, thus overcoming issues of patient confidentiality & data ‘ownership'

• Currently being extended to support more sophisticated omics/NGS data handling and deep phenotype data

Cafe Variome Features

• Cafe Variome is not a database but is a searchable 'menu'

• The platform enables data owners/submitters to specify and update lists of who can search for records of interest (using various search parameters)

• Results can be returned to users:- as open data- as links to data at source- by computationally facilitating data-access requests

- Allows users to check whether the same variants(s) /patients (with related phenotypes) have previously been seen by other laboratories

Networks of labs exchanging data

Optional wider

discovery

Clinical Community

Research Community

CENTRAL

optional

- Supports multiple installs & federated searches(data remains at source)

Data Sharing Models (facilitated & controlled access)

Open Access

Core info for each record is shown & made available for download

Restricted Access

Core or full record details are provided per record, if:• User is pre-approved by

group-access permissions set by data owner

• Access is approved after facilitated email request to the data owner

Open Discovery – Reporting Existence of Patients/Variants in Sources

Linked Access

No data, only link to the data source is reported

Source DBresource

Access then control managed by source db

Record Discovery “Menu”

Google-likesearch queriesAND/OR, fuzzy,boosting, etc.

A count of hits in each data source is returned and grouped by the sharing policy

Cafe Variome Variant Report

Data Sharing Granularity

Data owners can control access to variants from individual record level to entire data sets

Administrator’s Interface

Create Custom Groups of Labs

Assign Groups to Variant Sources

Users belonging to groups have pre-approved access to particular variant and patient data

• Make data import as flexible as possible• Allows users to generate import templates

– Excel or tab-delimited– Specify which data fields– Populate with their data– Import into CV

Bulk Data Import Templates

Phenotype Developments

• Allow the phenotypic consequences of genetic variants to be described using public ontologies– Many terms from many ontologies can be associated

with one variant or patient• Also, allow the phenotypic consequences of genetic

variants to be described using a local vocabulary or list

Enable hierarchical viewing and querying of the phenotype ontology data

Built on standards

• Cafe Variome is based on open-source software• HVP Recommended System Status (RSS):

– HGVS nomenclature (RSS001)– Mutalyzer (RSS002)– LOVD (RSS003)– VarioML (RSS004: under review)– Locus Reference Genomic (RSS005)– VariO (RSS006: under review)

• Submitted to HVP for RSS review: May 2014

Summary

• CV is very flexible in terms of the content that it can hold– gross disease/phenotype name or single variant– or, detailed phenotype and thousands of variants– (whole exome/genome scan, in next release)

• Each data source decides what data fields are included– which of these are made discoverable & by whom– which fields are shared if discovery searches hit a record– deeper data sharing may be permitted to particular users

• The API (computer-computer interface) is straightforward, and so other data systems can easily be modified to 'talk to' Cafe Variome installations

• We can host a Cafe Variome for you, or you can run it locally:– one Cafe Variome for the whole project– one per site and federate these to act as a private network– in all cases any number of different users can be given tailored

access rights for discovery and data sharing

• It is simple to populate the system– from various starting formats (we can help you with this)– this can be done automatically and at your preferred interval, if

you have data in other databases

• Key point — it is flexible, and designed to let the data find the data, without compromising patient privacy or researcher/clinician control and ownership of the data

Acknowledgements

• Anthony J Brookes• Owen Lancaster ([email protected])• Tim Beck• Raymond Dalgleish• The research leading to these results has

received funding from the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement number 200754 — the GEN2PHEN project

cafe variome: connecting diagnostic networks, disease consortia and diverse third parties - raymond...

Science