Lee Falin

Scientist & Storyteller

Curricula Vitae

ORCID: 0000-0002-7776-2519
Google Scholar Profile

Education

  • PhD Genetics, Bioinformatics, and Computational Biology Virginia Tech: 2011

  • Bachelor of Science in Computer Science University of Illinois Springfield: 2005

Academic Appointments

  • Adjunct Faculty of Computer Science University of the Cumberlands: 2023 – Present

  • Faculty of Computer Science BYU-Idaho: 2020 – 2021

  • Assistant Professor of Computer Science Southern Virginia University: 2017 – 2018

  • Visiting Faculty of Computer Science BYU-Idaho: 2015 – 2017

  • CS/EE Adjunct BYU-Idaho: 2014 – 2015

  • Bioinformatician EMBL–European Bioinformatics Institute: 2012 – 2014

  • Bioinformatics Research Assistant, Tyler Lab Virginia Bioinformatics Institute: 2006 – 2011

University Courses Designed & Taught

  • Machine Learning
  • Software Design and Development
  • Procedural Programming in C++
  • Object-Oriented Programming in C++
  • Data Structures & Algorithms in C++
  • Introductory Programming in Python
  • Introductory Programming in Java
  • Database Systems with PostgreSQL
  • Theory of Computation
  • Unity Game Design
  • Client-Side Web Development
  • Server-Side Web Development
  • Essential of Gamification
  • Games for Learning & Simulation

University Citizenship Assignments

  • Digital Literacy Committee – SVU
  • Course design council – BYU-Idaho
  • AI Faculty Mentor – BYU-Idaho

Publications / Conference Presentations

Languages

  • English: Native Proficiency
  • Brazilian Portuguese: C1 Proficiency

Selected Industry Experience

Lead Data Scientist at Featurespace

Aug 2022 – Mar 2023

  • Led a team of data scientists developing and deploying machine learning models for financial fraud detection by multinational banks and credit unions.

  • Worked with in-house AWS Cloud Architects to develop least-privilege IAM role architectures and RBAC policy frameworks in AWS to enable secure data sharing with remote clients via cross-account roles.

  • Developed SOP guidelines to ensure adherence to GDPR and other multinational data security and privacy regulations.

  • Increased project scheduling capacity by over 50% through cross training an underutilized team.

  • Redesigned data scientist onboarding and training materials to reduce onboard time from 5 to 3 months.

  • Created project management procedures to enforce data provenance and streamline transferring projects between divisions.

Principal Software Developer / Owner at CrewPlannr

Aug 2019 – Jul 2020

  • Developed an ML-powered, automatic shift scheduling SaaS product using React, NodeJS, and Firebase.

  • Built Bash and Python scripts to automate deployment to Heroku.

  • Integrated with Stripe payment APIs for subscription-based payments.

  • Wrote technical documentation, user onboarding guides, and other customer support materials.

Data Science Consultant at International TechneGroup Inc.

May 2017 – Jul 2019

  • Used the CRISP-DM process model to develop custom heuristic algorithms and statistical methods to help clients solve data analysis, data visualization, and other machine learning problems, using a mix of custom algorithms and off-the-shelf libraries, including pandas, NumPy, matplotlib, scikit-learn, TensorFlow, and keras.

  • Used Python to create a custom data processing and machine learning pipeline for a CAD software company to allow for enhanced metadata recognition and textual analysis.

  • Wrote process reports and technical whitepapers for stakeholders of mixed technical backgrounds.

Bioinformatician at the European Bioinformatics Institute

Jul 2012 – April 2014

  • Worked with international collaborators to help add over 30,000 genomic datasets to a bioinformatics SaaS research tool accessed by tens of thousands of users.

  • Developed custom map-reduce algorithms in Perl and Python to efficiently process proteomics and metabolomics data using Platform LSF pipelines.

  • Developed REST APIs used by thousands of researchers, accessing data from multiple datastores, including MySQL, Oracle, Neo4J, and MongoDB.

  • Developed ML algorithms and data pipelines for genomic data processing, protein function prediction, and metabolomic network inference.

  • Wrote training and project updated presentations and whitepapers, contributed to internal documentation and team publications.

Bioinformatics Research Assistant at Virginia Bioinformatics Institute, Virginia Tech

Jan 2006 – May 2011

  • Reduced microarray analysis costs by implementing novel statistical methods for quantifying uncertainty in sample data using C and the GNU Scientific Library.

  • Developed an oomycete genome browser using Perl visualization and analysis scripts.

  • Developed microarray data processing pipelines in Python, R, Perl, and C.

  • Developed training materials, taught ad-hoc and formal undergraduate classes and training cohorts.