Ace bioinformatician and data scientist looking for role in data science, life sciences or statistics.
Experience
University of Colorado Anschutz
Aug 2018 – Present
Aurora, CO
PhD Student/Research Assistant
In addition to full time education, my position at University of Colorado Anschutz includes a full time Research Assistant position under the UCHealth BioBank. My responsibilities are ad hoc statistical analysis around genetic association studies. Primarily, I work with unifying phenotype and covariate data around the electronic health records in the BioBank. This involves pulling data using Google Analytics BigQuery and SQL frameworks, data quality control, cleaning and preparation using primarily R. I deliver high quality phenotype reports to other scientists performing GWAS, and collaborate further with them for what statistical models to use, i.e. Least Squares Regression, Maximum Likelihood Estimation, etc. Finally, I support data access requests and grant writing in technical descriptions of methods and results to advance the funding and public data research accessibility for investigators looking to use our resources.
Weill Cornell Institute for Precision Medicine
Jun 2016 – Jun 2018
New York, NY
Bioinformatics Analyst
Expanded the analytical platform in the Institute for Precision Medicine (IPM). Developed new software tools and built new pipelines for novel bioinformatics analyses in the context of clinically relevant results for cancer patients. Focused on expanding existing pipelines and production of novel ones with regard to efficiency, robustness, reproducibility and interpretability. Developed tools for visualization of results in an intuitive format. Finally, integrated diverse data to look for biologically relevant connections that might not otherwise be apparent. In all this work focused on the goal of using big data for precision medicine breakthroughs in patients' lives.
Rockefeller University
Mar 2015 – Jun 2016
New York, NY
Bioinformatics Analyst
Developed and executed bioinformatics pipelines for processing next generation sequence data from whole exome and whole genome samples. Communicated regularly with next generation sequencing centers for coherence on data delivery, formatting and pipeline procedure. Performed well documented variant calling and annotation methods in addition to experimental RNAseq, de novo variant calling and somatic variant calling on specific projects. Responsible for SQL database maintenance of over 7500 individuals and associated calls for more than 2 million variants. Maintained Gleeson lab internal web server and reporting system for variants of potential causal interest. After curating data for postdoctoral scholars, hosted regular meetings addressing informatics concerns, planning for future experiments, and solving issues as they arose. As Lead Bioinformatics Specialist trained and supervised daily workflow of UCSD branch bioinformatics colleagues as well rotating institutional scholars, students, and interns. Additionally provided consultation to SimulConsult clinical diagnostic software providing training data and helping formalize software goals and functionality.