Cafe Variome: connecting diagnostic networks, disease consortia and diverse third parties - raymond...
Post on 10-May-2015
Openly share the ‘existence’ rather than the ‘substance’ of the data…thereafter variably manage data access
Connecting Diagnostic Networks
• Need to enable disease consortia to identify patients with similar phenotypes or to identify patients harbouring the same variant(s)
• Currently not possible due to difficulties of data sharing between labs, or with central repositories
• Cafe Variome can solve this. It is simple to install and can be deployed either:
– on a server at one or more of a network of labs
– or hosted by the Cafe Variome team
The Cafe Variome Solution
• Allows 'open discovery' of the existence (rather than actual substance) of relevant data
• This enables networks of labs to easily query for the existence of patients or variants without necessarily revealing additional underlying data, thus overcoming issues of patient confidentiality & data ‘ownership’
• Currently being extended to support more sophisticated omics/NGS data handling and deep phenotype data
Cafe Variome Features
• Cafe Variome is not a database but is a searchable 'menu'
• The platform enables data owners/submitters to specify and update lists of who can search for records of interest (using various search parameters)
• Results can be returned to users:
– as open data
– as links to data at source
– by computationally facilitating data-access requests
• Allows users to check whether the same variant(s) or patients (with related phenotypes) have previously been seen by other laboratories
[Diagram: networks of labs exchanging data; optional wider discovery by the Clinical Community and Research Community; optional CENTRAL node]
• Supports multiple installs & federated searches (data remains at source)
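A minimal sketch of the federated idea, assuming each installation exposes only a count for a query. The function names, lab names, and variant strings are hypothetical and are not part of Cafe Variome itself. Records stay inside each installation; only the counts travel.

```python
from typing import Callable, Dict, List


def make_install(records: List[dict]) -> Callable[[str], int]:
    """Hypothetical stand-in for one installation holding its own records."""
    def count_matches(variant: str) -> int:
        # Only a number leaves the installation, never the records.
        return sum(1 for r in records if r["variant"] == variant)
    return count_matches


def federated_count(installs: Dict[str, Callable[[str], int]],
                    variant: str) -> Dict[str, int]:
    """Ask every installation for its hit count and collect the answers."""
    return {name: query(variant) for name, query in installs.items()}


# Illustrative data only.
installs = {
    "lab_a": make_install([{"variant": "c.76A>T"}, {"variant": "c.100G>C"}]),
    "lab_b": make_install([{"variant": "c.76A>T"}]),
    "lab_c": make_install([]),
}
hits = federated_count(installs, "c.76A>T")
print(hits)
```

A real deployment would replace each callable with a network request to a separate install; the aggregation step would look much the same.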
Data Sharing Models (facilitated & controlled access)
Open Access
Core info for each record is shown & made available for download
Restricted Access
Core or full record details are provided per record, if:
• User is pre-approved by group-access permissions set by data owner
• Access is approved after facilitated email request to the data owner
Open Discovery – Reporting Existence of Patients/Variants in Sources
Linked Access
No data is reported, only a link to the data source
Source DB resource
Access control is then managed by the source DB
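The three sharing models can be sketched as a single decision function. This is an illustration under assumptions: the `Policy` names, the record layout, and the response fields are invented for the example and do not reflect Cafe Variome's actual data model. In every case the existence of the record is reported; what differs is how much else comes back.

```python
from enum import Enum


class Policy(Enum):
    # Hypothetical labels for the three sharing models.
    OPEN = "open"
    RESTRICTED = "restricted"
    LINKED = "linked"


def respond(record: dict, user_groups: set) -> dict:
    """Decide what a discovery hit reveals, based on the record's policy."""
    policy = record["policy"]
    if policy is Policy.OPEN:
        # Open access: core info is shown directly.
        return {"exists": True, "core": record["core"]}
    if policy is Policy.LINKED:
        # Linked access: no data, only a pointer to the source database.
        return {"exists": True, "link": record["source_url"]}
    # Restricted access: pre-approved groups see the data,
    # everyone else is directed to request access from the data owner.
    if user_groups & record["approved_groups"]:
        return {"exists": True, "core": record["core"]}
    return {"exists": True, "action": "request_access"}


# Illustrative records only.
open_rec = {"policy": Policy.OPEN, "core": {"gene": "GENE1"}}
linked_rec = {"policy": Policy.LINKED, "source_url": "https://example.org/record/1"}
restricted_rec = {
    "policy": Policy.RESTRICTED,
    "core": {"gene": "GENE2"},
    "approved_groups": {"network_x"},
}

print(respond(restricted_rec, {"network_y"}))
```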
Record Discovery “Menu”
Google-like search queries: AND/OR, fuzzy, boosting, etc.
A count of hits in each data source is returned and grouped by the sharing policy
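The grouping of hit counts by data source and sharing policy could look roughly like the following. The record fields and policy labels are assumptions made for the example, and simple equality on a gene name stands in for the richer query syntax.

```python
from collections import Counter
from typing import Dict, List


def discovery_menu(records: List[dict], gene: str) -> Dict[str, Dict[str, int]]:
    """Count matching records per source, grouped by sharing policy."""
    counts = Counter(
        (r["source"], r["policy"]) for r in records if r["gene"] == gene
    )
    menu: Dict[str, Dict[str, int]] = {}
    for (source, policy), n in counts.items():
        menu.setdefault(source, {})[policy] = n
    return menu


# Illustrative records only.
records = [
    {"source": "lab_a", "policy": "open", "gene": "GENE1"},
    {"source": "lab_a", "policy": "restricted", "gene": "GENE1"},
    {"source": "lab_a", "policy": "restricted", "gene": "GENE1"},
    {"source": "lab_b", "policy": "linked", "gene": "GENE1"},
    {"source": "lab_b", "policy": "open", "gene": "GENE2"},
]
menu = discovery_menu(records, "GENE1")
print(menu)
```

The menu carries counts only, so a user sees where matching records exist and under which policy before any record content is shown.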
Cafe Variome Variant Report
Data Sharing Granularity
Data owners can control access to variants from individual record level to entire data sets
Administrator’s Interface
Create Custom Groups of Labs
Assign Groups to Variant Sources
Users belonging to groups have pre-approved access to particular variant and patient data
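A rough sketch of how group membership might translate into pre-approved access. The group names, user names, and source names are invented for illustration; the administrator's interface itself is not shown in this transcript.

```python
from typing import Dict, Set

# Hypothetical custom groups of labs/users created by an administrator.
group_members: Dict[str, Set[str]] = {
    "cardiac_network": {"alice", "bob"},
    "rare_disease_labs": {"carol"},
}

# Hypothetical assignment of groups to variant sources.
source_groups: Dict[str, Set[str]] = {
    "lab_a_variants": {"cardiac_network"},
    "lab_b_variants": {"cardiac_network", "rare_disease_labs"},
}


def has_preapproved_access(user: str, source: str) -> bool:
    """True if the user belongs to any group assigned to the source."""
    return any(
        user in group_members.get(group, set())
        for group in source_groups.get(source, set())
    )


print(has_preapproved_access("carol", "lab_b_variants"))
```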
Bulk Data Import Templates
• Make data import as flexible as possible
• Allows users to generate import templates
– Excel or tab-delimited
– Specify which data fields
– Populate with their data
– Import into CV
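The tab-delimited route can be illustrated with Python's standard `csv` module. The field names and the sample row are assumptions, and the real template layout may differ; the sketch only shows the generate, populate, read-back cycle.

```python
import csv
import io
from typing import Dict, List


def make_template(fields: List[str]) -> str:
    """Build a tab-delimited template containing only the header row."""
    buf = io.StringIO()
    csv.writer(buf, delimiter="\t", lineterminator="\n").writerow(fields)
    return buf.getvalue()


def read_filled_template(text: str) -> List[Dict[str, str]]:
    """Parse a populated template back into one dict per record."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


# Hypothetical field choice; the data owner specifies which fields to include.
template = make_template(["gene", "hgvs", "phenotype"])

# Simulate a user filling in one row (illustrative values only).
filled = template + "GENE1\tc.76A>T\texample phenotype\n"
rows = read_filled_template(filled)
print(rows)
```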
Phenotype Developments
• Allow the phenotypic consequences of genetic variants to be described using public ontologies
– Many terms from many ontologies can be associated with one variant or patient
• Also, allow the phenotypic consequences of genetic variants to be described using a local vocabulary or list
Enable hierarchical viewing and querying of the phenotype ontology data
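Hierarchical querying can be sketched as expanding a query term to all of its descendants before matching. The toy hierarchy, term names, and patient annotations below are made up for the example and are not drawn from any real ontology.

```python
from typing import Dict, List, Set

# Toy parent -> children hierarchy (hypothetical terms).
children: Dict[str, List[str]] = {
    "abnormal_heart": ["septal_defect", "cardiomyopathy"],
    "septal_defect": ["atrial_septal_defect", "ventricular_septal_defect"],
}

# Illustrative patient annotations.
patients: Dict[str, Set[str]] = {
    "p1": {"atrial_septal_defect"},
    "p2": {"cardiomyopathy"},
    "p3": {"short_stature"},
}


def descendants(term: str) -> Set[str]:
    """Return the term itself plus every term beneath it in the hierarchy."""
    found = {term}
    stack = [term]
    while stack:
        for child in children.get(stack.pop(), []):
            if child not in found:
                found.add(child)
                stack.append(child)
    return found


def patients_with(term: str) -> List[str]:
    """Patients annotated with the term or any more specific term."""
    wanted = descendants(term)
    return sorted(p for p, terms in patients.items() if terms & wanted)


print(patients_with("abnormal_heart"))
```

Querying a broad term therefore also finds patients annotated with a more specific one, which is the practical benefit of hierarchy-aware search.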
Built on standards
• Cafe Variome is based on open-source software
• HVP Recommended System Status (RSS):
– HGVS nomenclature (RSS001)
– Mutalyzer (RSS002)
– LOVD (RSS003)
– VarioML (RSS004: under review)
– Locus Reference Genomic (RSS005)
– VariO (RSS006: under review)
• Submitted to HVP for RSS review: May 2014
Summary
• CV is very flexible in terms of the content that it can hold
– gross disease/phenotype name or single variant
– or, detailed phenotype and thousands of variants
– (whole exome/genome scan, in next release)
• Each data source decides what data fields are included
– which of these are made discoverable & by whom
– which fields are shared if discovery searches hit a record
– deeper data sharing may be permitted to particular users
• The API (computer-computer interface) is straightforward, and so other data systems can easily be modified to 'talk to' Cafe Variome installations
• We can host a Cafe Variome for you, or you can run it locally:
– one Cafe Variome for the whole project
– one per site and federate these to act as a private network
– in all cases any number of different users can be given tailored access rights for discovery and data sharing
• It is simple to populate the system
– from various starting formats (we can help you with this)
– this can be done automatically and at your preferred interval, if you have data in other databases
• Key point: it is flexible, and designed to let the data find the data, without compromising patient privacy or researcher/clinician control and ownership of the data
Acknowledgements
• Anthony J Brookes
• Owen Lancaster (ol8@le.ac.uk)
• Tim Beck
• Raymond Dalgleish
• The research leading to these results has received funding from the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement number 200754 (the GEN2PHEN project)