Bioinformatics on GCP, AWS or Azure
This Repo contains my own 'study notes' as I learn genomic-scale cloud bioinformatics. It includes descriptions of common tools, platforms and summaries of my work with clients. I update this Repo frequently. It is organized via the folder structure shown below.
In addition to this Repo, I have a number of other Repos with cloud bioinformatics information. Also, I've included two of my favorite link aggregator resources here for additional learning.
learn-cloud
Repo - https://github.com/lynnlangit/learning-cloud
gcp-for-bioinformatics
open source course - https://github.com/lynnlangit/gcp-for-bioinformatics
aws-for-bioinformatics
open source course - https://github.com/lynnlangit/aws-for-bioinformatics
learn-wdl
open source course - https://github.com/openwdl/learn-wdl
The Data Lake (or Data Mesh [Lake of Lakes]) pattern is key for implementing bioinformatics workloads effectively on any public cloud. Shown below is a simple conceptual explanation of this key concept.
Teri is the impetus for my movement into the world of genomic research. She was diagnosed with breast cancer in 2016. She survived, but suffered a long course of intense and painful treatment due in part to the lack of availability of personalized treatment options at the time of her diagnosis.