automated task launching to streamline setup, task...

1
RESEARCH POSTER PRESENTATION DESIGN © 2015 www.PosterPresentations.com The Ludmer Centre for Neuroinformatics & Mental Health (ludmercentre.ca) has created a (epi)genetics working group composed of three member institutions. INTRODUCTION OBJECTIVES & METHODS RESULTS The LORIS Genomic Browser[8] enables viewing, filtering, and linking of summary genetic data (CNV, SNP, GWAS). Using the MAVAN study[7], a subset of subject SNPs from PLINK files have been loaded into LORIS. An uploader and genomic viewer were created so researchers can view aggregated CPG beta value distributions visually aligned with their SNP data, gene features and ENCODE H3K4me1 Histone mark data for a given genomic range. Each displayed element can be clicked to access external reference databases (dbSNP, 1000 Genomes), or provide more information, such as methylation level boxplot distributions of beta values grouped by sex and/or genotypes for a specific CpG and a given SNP. The design of complex workflows was also a significant part of the methodology, enabling requisite tool building and porting at various stages of development (Figure 3). Within LORIS, a prototype Dataset Builder creates new datasets by joining filtered genomic data with phenotypic data and/or imaging files queried from the Data Querying Tool (DQT)[6]. These new datasets are processed by CBRAIN statistical pipelines. Several prototype HPC workflows have been constructed and mounted in CBRAIN for the MAVAN project, such as methylation450KPipeline[10] and imputePrepSanger[5] running in a Docker container[4] along with weighted correlation network analysis (WGCNA) and principal component analysis (PCA) pipelines. CONCLUSION The goals of this working group are to facilitate brain research discovery by: Reducing human error with processing automation and seamless linking of multimodal data. Creating format definitions between components of this workflow will improve the integrative impact of these cross-modal tools. Deploying analysis pipelines on HPCs, focusing investigators on research instead of data handling. Optimizing performance, flexibility, and scalability using NoSQL and structured schema databases. Creating hooks between CBRAIN and LORIS for automated task launching to streamline setup, task creation, provenance capture, and re-insertion. REFERENCES [1] Das, S. (2011), 'LORIS: A web-based data management system for multi-center studies', Frontiers in Neuroinformatics, 5:37. [2] Das, S. (2015), 'The MNI data-sharing and processing ecosystem', NeuroImage 124:1188–1195. [3] Das, S. (2016), 'Cyberinfrastructure for Open Science at the Montreal Neurological Institute', Frontiers in Neuroinformatics, DOI: 10.3389/fninf.2016.00053 [4] Glatard, T. (2015). 'Boutiques: an application-sharing system based on Linux containers', Frontiers in Neuroscience, DOI: 10.3389/conf.fnins.2015.91.00012 [5] Forest, M. (2016), Docker repository, https://hub.docker.com/r/eauforest/imputeprepsanger [6] MacFarlane, D., OHBM 2014 [7] O’Donnell, K. A., et al., 2014. 'The maternal adversity, vulnerability and neurodevelopment project: theory and methodology', Canadian Journal of Psychiatry, 2014 Sep;59(9):497-508. [8] Rogers, C., OHBM 2015 [9] Sherif, T., et al., 2014. CBRAIN: A web-based, distributed computing platform for collaborative neuroimaging research. Frontiers in Neuroinformatics, DOI: 10.3389/fninf.2014.00054 [10] Statistical Genetics at the Lady Davis Institute, (2016), GitHub repository, https://github.com /GreenwoodLab/methylation450KPipeline ACKNOWLEDGEMENTS Derek Lo (diagrams), Pierre Rioux (CBRAIN support), Alex Ilea (React.js), Jacob Penny (React.js) The group’s goal is to 1) integrate heterogeneous datasets (behavioral/clinical, imaging, and genomic) within the LORIS[1][3] (loris.ca), 2) streamline analysis, 3) integrate results and 4) facilitate structured and versioned sharing for optimized reproducibility using high performance computing (HPC) on CBRAIN[9]. Biospecimens are collected during a participant's visit. DNA is extracted, undergoes quality control, and shipped for genotyping and methylation assessment using a bead chip array. Raw data are transferred to CBRAIN where a normalization pipeline can be launched, returning results back into the LORIS database along with metadata. 1 Montreal Neurological Institute, McGill University, Montreal, Quebec, 2 Lady Davis Institute for Medical Research, Montreal, Quebec, 3 McGill Centre for Integrative Neuroscience, Montreal, Quebec, 4 Douglas Hospital Research Center, Montreal, Quebec Xavier Lecours Boucher 1 , Samir Das 1 , Christine Rogers 1 , David MacFarlane 1 , Dahlia Snaiderman 1 , Kathleen Klein Oros 2 , Natacha Beck 3 , Marie Forest 2 , Lawrence Chen 2 , Nicolas Brossard 4 , Eva Unternaehrer 4 , Kieran O'Donnell 4 , Helene Gaudreau 4 , Michael Meaney 4 , Celia Greenwood 2 , Alan Evans 1 Integrating multimodal databases into genomics analysis workflows Figure 4 : LORIS web interface showing files inked between CBRAIN and LORIS. Figure #2 : LORIS web interface showing genomic data (Box plot methylation beta-values , SNP and genes aligned on the genome ) in the visualization tool (beta). Figure #3: Methylation data processing as it cycle between storage in LORIS’s database and dockerized processing pipelines hosted on CBRAIN. The outputs are re-inserted into LORIS where it can be visualization and cross modale querying occurs to create CBRAIN input datasets. In CBRAIN, a second pipeline can be launch to add derived variables back in LORIS. Statistical Analysis Tools Neuroinformatics Infrastructure Biological Questions Figure #1 : Neuroinformatics conglomerate http://mcin.ca

Upload: others

Post on 18-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: automated task launching to streamline setup, task ...mcin.ca/wp-content/uploads/2017/08/Xavier-Lecours-Boucher-HBM2… · using NoSQL and structured schema databases. • Creating

QUICK START (cont.)

How to change the template color themeYou can easily change the color theme of your poster by going to the DESIGN menu, click on COLORS, and choose the color theme of your choice. You can also create your own color theme.

You can also manually change the color of your background by going to VIEW > SLIDE MASTER. After you finish working on the master be sure to go to VIEW > NORMAL to continue working on your poster.

How to add TextThe template comes with a number of pre-formatted placeholders for headers and text blocks. You can add more blocks by copying and pasting the existing ones or by adding a text box from the HOME menu.

Text sizeAdjust the size of your text based on how much content you have to present. The default template text offers a good starting point. Follow the conference requirements.

How to add TablesTo add a table from scratch go to the INSERT menu and click on TABLE. A drop-down box will help you select rows and columns.

You can also copy and a paste a table from Word or another PowerPoint document. A pasted table may need to be re-formatted by RIGHT-CLICK > FORMAT SHAPE, TEXT BOX, Margins.

Graphs / ChartsYou can simply copy and paste charts and graphs from Excel or Word. Some reformatting may be required depending on how the original document has been created.

How to change the column configurationRIGHT-CLICK on the poster background and select LAYOUT to see the column options available for this template. The poster columns can also be customized on the Master. VIEW > MASTER.

How to remove the info barsIf you are working in PowerPoint for Windows and have finished your poster, save as PDF and the bars will not be included. You can also delete them by going to VIEW > MASTER. On the Mac adjust the Page-Setup to match the Page-Setup in PowerPoint before you create a PDF. You can also delete them from the Slide Master.

Save your workSave your template as a PowerPoint document. For printing, save as PowerPoint or “Print-quality” PDF.

Print your posterWhen you are ready to have your poster printed go online to PosterPresentations.com and click on the “Order Your Poster” button. Choose the poster type the best suits your needs and submit your order. If you submit a PowerPoint document you will be receiving a PDF proof for your approval prior to printing. If your order is placed and paid for before noon, Pacific, Monday through Friday, your order will ship out that same day. Next day, Second day, Third day, and Free Ground services are offered. Go to PosterPresentations.com for more information.

Student discounts are available on our Facebook page.Go to PosterPresentations.com and click on the FB icon.

© 2015 PosterPresentations.com 2117 Fourth Street , Unit C Berkeley CA 94710

[email protected]

(—THIS SIDEBAR DOES NOT PRINT—)DESIGN GUIDE

This PowerPoint 2007 template produces a 36”x48” presentation poster. You can use it to create your research poster and save valuable time placing titles, subtitles, text, and graphics.

We provide a series of online tutorials that will guide you through the poster design process and answer your poster production questions. To view our template tutorials, go online to PosterPresentations.com and click on HELP DESK.

When you are ready to print your poster, go online to PosterPresentations.com

Need assistance? Call us at 1.510.649.3001

QUICK START

Zoom in and outAs you work on your poster zoom in and out to the level that is more

comfortable to you. Go to VIEW > ZOOM.

Title, Authors, and AffiliationsStart designing your poster by adding the title, the names of the authors, and the affiliated institutions. You can type or paste text into the provided boxes. The template will automatically adjust the size of your text to fit the title box. You can manually override this feature and change the size of your text.

TIP: The font size of your title should be bigger than your name(s) and institution name(s).

Adding Logos / SealsMost often, logos are added on each side of the title. You can insert a logo by dragging and dropping it from your desktop, copy and paste or by going to INSERT > PICTURES. Logos taken from web sites are likely to be low quality when printed. Zoom it at 100% to see what the logo will look like on the final poster and make any necessary adjustments.

TIP: See if your school’s logo is available on our free poster templates page.

Photographs / GraphicsYou can add images by dragging and dropping from your desktop, copy and paste, or by going to INSERT > PICTURES. Resize images proportionally by holding down the SHIFT key and dragging one of the corner handles. For a professional-looking poster, do not distort your images by enlarging them disproportionally.

Image Quality CheckZoom in and look at your images at 100% magnification. If they look good they will print well.

ORIGINAL DISTORTEDCorner handles

Go

od

pri

nti

ng

qu

alit

y

Bad

pri

nti

ng

qu

alit

y

RESEARCH POSTER PRESENTATION DESIGN © 2015

www.PosterPresentations.com

The Ludmer Centre for Neuroinformatics & Mental Health (ludmercentre.ca) has created a (epi)genetics working group composed of three member institutions.

INTRODUCTION

OBJECTIVES & METHODS

RESULTS

The LORIS Genomic Browser[8] enables viewing, filtering, and linking of summary genetic data (CNV, SNP, GWAS). Using the MAVAN study[7], a subset of subject SNPs from PLINK files have been loaded into LORIS. An uploader and genomic viewer were created so researchers can view aggregated CPG beta value distributions visually aligned with their SNP data, gene features and ENCODE H3K4me1 Histone mark data for a given genomic range. Each displayed element can be clicked to access external reference databases (dbSNP, 1000 Genomes), or provide more information, such as methylation level boxplot distributions of beta values grouped by sex and/or genotypes for a specific CpG and a given SNP. The design of complex workflows was also a significant part of the methodology, enabling requisite tool building and porting at various stages of development (Figure 3).

Within LORIS, a prototype Dataset Builder creates new datasets by joining filtered genomic data with phenotypic data and/or imaging files queried from the Data Querying Tool (DQT)[6]. These new datasets are processed by CBRAIN statistical pipelines. Several prototype HPC workflows have been constructed and mounted in CBRAIN for the MAVAN project, such as methylation450KPipeline[10] and imputePrepSanger[5] running in a Docker container[4] along with weighted correlation network analysis (WGCNA) and principal component analysis (PCA) pipelines.

CONCLUSION

The goals of this working group are to facilitate brain research discovery by:

• Reducing human error with processing automation and seamless linking of multimodal data.

• Creating format definitions between components of this workflow will improve the integrative impact of these cross-modal tools.

• Deploying analysis pipelines on HPCs, focusing investigators on research instead of data handling.

• Optimizing performance, flexibility, and scalability using NoSQL and structured schema databases.

• Creating hooks between CBRAIN and LORIS for automated task launching to streamline setup, task creation, provenance capture, and re-insertion.

REFERENCES

[1] Das, S. (2011), 'LORIS: A web-based data management system for multi-center studies', Frontiers in Neuroinformatics, 5:37.

[2] Das, S. (2015), 'The MNI data-sharing and processing ecosystem', NeuroImage 124:1188–1195.

[3] Das, S. (2016), 'Cyberinfrastructure for Open Science at the Montreal Neurological Institute', Frontiers in Neuroinformatics, DOI: 10.3389/fninf.2016.00053

[4] Glatard, T. (2015). 'Boutiques: an application-sharing system based on Linux containers', Frontiers in Neuroscience, DOI: 10.3389/conf.fnins.2015.91.00012

[5] Forest, M. (2016), Docker repository, https://hub.docker.com/r/eauforest/imputeprepsanger

[6] MacFarlane, D., OHBM 2014

[7] O’Donnell, K. A., et al., 2014. 'The maternal adversity, vulnerability and neurodevelopment project: theory and methodology', Canadian Journal of Psychiatry, 2014 Sep;59(9):497-508.

[8] Rogers, C., OHBM 2015

[9] Sherif, T., et al., 2014. CBRAIN: A web-based, distributed computing platform for collaborative neuroimaging research. Frontiers in Neuroinformatics, DOI: 10.3389/fninf.2014.00054

[10] Statistical Genetics at the Lady Davis Institute, (2016), GitHub repository, https://github.com /GreenwoodLab/methylation450KPipeline

ACKNOWLEDGEMENTS

Derek Lo (diagrams), Pierre Rioux (CBRAIN support), Alex Ilea (React.js), Jacob Penny (React.js)

The group’s goal is to 1) integrate heterogeneous datasets (behavioral/clinical, imaging, and genomic) within the LORIS[1][3] (loris.ca), 2) streamline analysis, 3) integrate results and 4) facilitate structured and versioned sharing for optimized reproducibility using high performance computing (HPC) on CBRAIN[9].

Biospecimens are collected during a participant's visit. DNA is extracted, undergoes quality control, and shipped for genotyping and methylation assessment using a bead chip array. Raw data are transferred to CBRAIN where a normalization pipeline can be launched, returning results back into the LORIS database along with metadata.

1Montreal Neurological Institute, McGill University, Montreal, Quebec, 2Lady Davis Institute for Medical Research, Montreal, Quebec, 3McGill Centre for Integrative Neuroscience, Montreal, Quebec, 4Douglas Hospital Research Center, Montreal, Quebec

Xavier Lecours Boucher1, Samir Das1, Christine Rogers1, David MacFarlane1, Dahlia Snaiderman1, Kathleen Klein Oros2, Natacha Beck3, Marie Forest2, Lawrence Chen2, Nicolas Brossard4, Eva Unternaehrer4, Kieran O'Donnell4, Helene Gaudreau4, Michael Meaney4, Celia Greenwood2, Alan Evans1

Integrating multimodal databases into genomics analysis workflows

Figure 4: LORIS web interface showing files inked between CBRAIN and LORIS.

Figure #2: LORIS web interface showing genomic data (Box plot methylation beta-values , SNP and genes aligned on the genome ) in the visualization tool (beta).

Figure #3: Methylation data processing as it cycle between storage in LORIS’s database and dockerized processing pipelines hosted on CBRAIN. The outputs are re-inserted into LORIS where it can be visualization and cross modale querying occurs to create CBRAIN input datasets. In CBRAIN, a second pipeline can be launch to add derived variables back in LORIS.

Statistical Analysis Tools

Neuroinformatics InfrastructureBiological

Questions

Figure #1: Neuroinformatics conglomerate

http://mcin.ca