

Plant IT: An open web platform for advanced plant phenotyping from imaging data.

Chris Cotter 1,2 - Megan Flory 1,2 - Maria Pedrosa 1 - Alexander Bucksch 1,3,4

1 Plant Biology, 2 Lamar Dodd School of Art, 3 Institute of Bioinformatics, 4 Warnell School of Forestry and Natural Resources

Researchers Software Developers

$ cookiecutter GitHub.com/[...]/cookiecutter_PlantIT.git
repo_name [PlantIT_Workflow]: example_workflow
name [Example Workflow]:
app_name [example_workflow]:
class_name [ExampleWorkflow]:
description []: An example workflow
github_username []: cotter
your_email []: [email protected]
your_name []: Chris

Adding a workflow:
1) Generate a pre-configured workflow template using cookiecutter (Figure 1)
2) Add workflow code to Singularity container
3) Register Singularity container with Singularity Hub
4) Register workflow with Plant IT

Simple APIs for integrating analysis code.

Workflows run inside Singularity containers.

Fully documented and open source.

Access to high performance computing resources.

Templates for integrating code into Plant IT.

Any input/output data type.

A cookiecutter template is provided to aid developers in integrating their analysis workflow. The template performs most of the configuration necessary to include a workflow in Plant IT.
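
The prompts in Figure 1 suggest that the template derives the default values for name, app_name, and class_name from the repo_name answer. The sketch below shows one way such a derivation could work. The function derive_defaults and its rules are assumptions made for illustration and are not taken from the Plant IT template; cookiecutter templates normally express defaults like these in their cookiecutter.json file.

```python
from typing import Dict


def derive_defaults(repo_name: str) -> Dict[str, str]:
    """Derive template defaults from a repository name (hypothetical rules)."""
    # Treat hyphens and underscores as word separators and drop empty pieces.
    words = [w for w in repo_name.replace("-", "_").split("_") if w]
    return {
        "repo_name": repo_name,
        # Human-readable title, e.g. "Example Workflow".
        "name": " ".join(w.capitalize() for w in words),
        # Python package name, e.g. "example_workflow".
        "app_name": "_".join(w.lower() for w in words),
        # Class name in CamelCase, e.g. "ExampleWorkflow".
        "class_name": "".join(w.capitalize() for w in words),
    }


if __name__ == "__main__":
    for key, value in derive_defaults("example_workflow").items():
        print(f"{key}: {value}")
```

With the repo_name from Figure 1 (example_workflow), this reproduces the three defaults shown in brackets in the figure.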

Associate metadata with experiments.

Access to high performance computing resources.

Free to use.

Share and collaborate on data with other users.

Community building via GitHub integration and Slack channels.

Make data sets public after publication, complete with DOI.

[Figure: system diagram. Labels: High Performance Cluster, HPC Plant IT Framework, Singularity Container, Plant IT Sample API, Analysis code, get_sample API (iRODS), Django, Plant IT, iRODS, CyVerse Data Store, Workflow, RESTful (HTTP), SSH.]

System interactions within Plant IT. Data storage, transfer, and system-to-system communications are handled by Plant IT. Module systems will allow integration of other researchers' clusters and data storage systems without affecting workflow developers.
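
Because Plant IT handles storage and transfer, a workflow developer's code would mainly need to process the samples it is handed. The sketch below illustrates that division of labor under stated assumptions: Sample, run_workflow, and file_size_over_threshold are hypothetical names, not the Plant IT Sample API, and temporary files stand in for samples that the platform would have fetched from the data store.

```python
import tempfile
from dataclasses import dataclass
from pathlib import Path
from typing import Any, Callable, Dict, Iterable


@dataclass(frozen=True)
class Sample:
    """Hypothetical stand-in for one sample delivered to the container."""
    name: str
    path: Path


def run_workflow(
    samples: Iterable[Sample],
    analyze: Callable[[Path, Dict[str, Any]], Dict[str, Any]],
    params: Dict[str, Any],
) -> Dict[str, Any]:
    """Apply the developer's analysis function to every sample."""
    return {sample.name: analyze(sample.path, params) for sample in samples}


def file_size_over_threshold(path: Path, params: Dict[str, Any]) -> Dict[str, Any]:
    """Toy analysis: report the file size and whether it exceeds a threshold."""
    size = path.stat().st_size
    return {"bytes": size, "large": size > params["threshold"]}


def demo() -> Dict[str, Any]:
    """Run the toy analysis on two placeholder files."""
    with tempfile.TemporaryDirectory() as tmp:
        samples = []
        for name, payload in [("root_01", b"x" * 10), ("root_02", b"x" * 200)]:
            path = Path(tmp) / f"{name}.dat"
            path.write_bytes(payload)  # placeholder data, not a real image
            samples.append(Sample(name, path))
        return run_workflow(samples, file_size_over_threshold, {"threshold": 100})


if __name__ == "__main__":
    print(demo())
```

The analysis function only sees a local path and a parameter dictionary, so swapping the storage backend or the cluster would not require changing it.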

Chris Cotter, Ph.D.
University of Georgia, Athens, GA
www.sciencesundries.com
github.com/cottersci
[email protected]

I spend most of my time performing data wrangling and interpretation using machine learning (with special interest in deep learning and generative methods) and developing ways to share complex-data-analysis tools with broad audiences.

We are looking for beta testers

Analysis Steps

Manage collections of experimental data

Set required workflow parameters

Choose a workflow to apply to the data set

Track progress of the analysis

Download results
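
The analysis steps listed here can be read as a small job lifecycle: required parameters must be set before a run starts, and progress is tracked per sample until results are ready. The sketch below models that lifecycle in plain Python. The Job class, its fields, and the status names are illustrative assumptions and do not describe Plant IT's internal data model.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Tuple


class Status(Enum):
    CREATED = "created"
    RUNNING = "running"
    COMPLETED = "completed"


@dataclass
class Job:
    """Hypothetical record of one workflow run over a data collection."""
    collection: List[str]        # names of the samples in the collection
    workflow: str                # name of the chosen workflow
    required: Tuple[str, ...]    # parameter names the workflow requires
    params: Dict[str, object] = field(default_factory=dict)
    status: Status = Status.CREATED
    finished: int = 0

    def missing_params(self) -> List[str]:
        """Return required parameter names that have not been set."""
        return [key for key in self.required if key not in self.params]

    def start(self) -> None:
        """Start the job, refusing to run with missing parameters."""
        missing = self.missing_params()
        if missing:
            raise ValueError(f"missing required parameters: {missing}")
        # An empty collection has nothing to process, so it completes at once.
        self.status = Status.RUNNING if self.collection else Status.COMPLETED

    def sample_finished(self) -> None:
        """Record one finished sample; complete the job after the last one."""
        if self.status is not Status.RUNNING:
            raise RuntimeError("job is not running")
        self.finished += 1
        if self.finished == len(self.collection):
            self.status = Status.COMPLETED

    def progress(self) -> float:
        """Fraction of samples processed so far, between 0.0 and 1.0."""
        if not self.collection:
            return 1.0
        return self.finished / len(self.collection)
```

A caller would create a Job, fill in params, call start(), and poll progress() until the status is COMPLETED, at which point results could be offered for download.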

Upload data via the web interface or directly to CyVerse's iRODS data system (Cyberduck, iCommands, WebDAV, FUSE, etc.).

Keep data private until publication.

Do you have:
An analysis workflow you want to share with the community?
Many data sets and an analysis pipeline that requires HPC resources?

Contact us to help with Plant IT beta testing:
GitHub.com/Computational-Plant-Science
[email protected]