Machine Learning Engineer

Terray Therapeutics

Remote / Monrovia, CA, US
  • Job Type: Full-Time
  • Function: Data Science
  • Industry: Life Sciences
  • Post Date: 04/03/2024
  • Website: www.terraytherapeutics.com
  • Company Address: 129 N. Hill, Suite 103, Pasadena, California , 91106, US

About Terray Therapeutics

Terray Therapeutics is a biotechnology company headquartered in Pasadena, California. Terray utilizes a novel screening and optimization platform (tArray) to develop treatments for historically intractable causes of human disease.

Job Description

Terray Therapeutics is a venture-backed biotechnology company led by pioneers and long-time leaders in artificial intelligence, synthetic chemistry, automation, and nanotechnology. We’re generating chemical data purpose-built to propel drug discovery into the information age — and we’re doing it on a larger scale and faster than has ever before been possible. Our closed loop system generates precise chemical datasets at unrivaled scale that work seamlessly with AI to systematically map biochemical interactions between small molecules and causes of disease. Iterative cycles of virtual molecular design and experimentation power AI and machine learning models, which in turn guide the next cycle of design. With a chemistry engine that measures billions of interactions daily and becomes increasingly precise with every cycle, we can answer an unprecedented array of questions — deriving insights that enable us to predictably create drugs for patients in need.

Position Summary

Terray is currently seeking a motivated, creative, and experienced machine learning engineer. As an integral member of our Computational and Data Sciences (CDS) team, the candidate will be responsible for developing and deploying state-of-the-art machine learning models trained on up to billions of small molecule affinity/activity data points in order to accelerate drug discovery efforts internally and for our partners. Researchers in this position have a unique opportunity to test models with very high certainty, and build precise models facilitated by a scale of data unavailable elsewhere. 

The core responsibilities of this position are:

  • Train, polish and extend our suite of 2D and 3D conditioned drug generative models in our proprietary PyTorch codebase (JAX permitted)
  • Test embeddings of protein environments, and further improve clinical likelihood enrichment
  • Iterate training and assessment of novel property models for new targets as our data process provides new information
  • Feed the training pipeline on our NVidia DGX Cloud distributed training resources
  • Deploy models as components to our existing inference and design tools
  • Communicate and Coordinate your experiments with the rest of the Machine Learning Team

Experience and Qualifications

Part of Terray’s success is nurtured by a hands-on work environment where everyone is accountable, vested in a vision of excellence, and actively taking part in the success of the business. Terray supports a positive work environment where employees can feel engaged, recognized and empowered to be creative.

Required Qualifications

  • Fluent in PyTorch and/or JAX
  • Experience authoring high performance modules for expressive encoders/decoders and/or generators of structured data manifolds
  • Comfortable manipulating data and models in a heterogeneous, multi-cloud, decentralized infrastructure (sql, no-sql, docker, redis, aws, dgx)  
  • Experience with distributed model training
  • Familiarity with molecular data manifolds is a plus
  • Willingness to demonstrate mastery of ML in a short (< 6 hr) at-home exercise, or with a preponderance of visible work

Compensation Details

$132,000 - $198,000 (annually) depending on seniority; participation in the Company's option plan; 3% 401K contribution; full benefits