Data Infrastructure Engineer: Big Data, Functional Programming, Drug Discovery - remote

Empirico Inc.

Posted 3 years ago • San Diego, CA

Spark Scala Python AmazonWebServices FunctionalProgramming

Empirico, an early-stage biotechnology company, is looking for a talented software engineer who is motivated by the opportunity to build scalable data systems that power the discovery of new medicines. You will work closely with a team of engineers and computational scientists to build and extend Empirico’s data infrastructure, which includes modern cloud-based systems and services that operate on some of the largest biological datasets in the world.

Responsibilities:

Your responsibilities will center on designing and implementing robust, extensible data systems. You will be expected to:

Design and implement scalable data infrastructure and pipelines
Implement scalable algorithms in a distributed systems setting
Collaborate closely with an interdisciplinary team of scientists and engineers to address system pain points
Improve developer efficiency and system quality through emphasis on elegant code
Advocate for systems and engineering practice improvements

Requirements:

2+ years of professional experience designing and developing software on modern distributed data systems
Experience processing and analyzing large and heterogeneous datasets
Strong technical skill set that spans a broad range of technologies, programming languages, and paradigms
Passion for systems thinking and a drive toward elegant, automated solutions to data problems
Experience with Spark and Scala, or another functional programming language, is a plus
Applicants must have authorization to work in the United States
