Data Infrastructure Engineer: Big Data, Functional Programming, Drug Discovery - remote

Empirico Inc.
Posted 3 years ago  • San Diego, CA
Stack Overflow

Empirico, an early-stage biotechnology company, is looking for a talented software engineer who is motivated by the opportunity to build scalable data systems that power the discovery of new medicines. You will work closely with a team of engineers and computational scientists to build and extend Empirico’s data infrastructure, which includes modern cloud-based systems and services that operate on some of the largest biological datasets in the world.

Responsibilities:

Your responsibilities will center on designing and implementing robust, extensible data systems. You will be expected to:

  • Design and implement scalable data infrastructure and pipelines

  • Implement scalable algorithms in a distributed systems setting

  • Collaborate closely with an interdisciplinary team of scientists and engineers to address system pain points

  • Improve developer efficiency and system quality through an emphasis on elegant code

  • Advocate for systems and engineering practice improvements

Requirements:

  • 2+ years of professional experience designing and developing software on modern distributed data systems

  • Experience processing and analyzing large and heterogeneous datasets

  • Strong technical skill set that spans a broad range of technologies, programming languages, and paradigms

  • Passion for systems thinking and a drive toward elegant, automated solutions to data problems

  • Experience with Spark and Scala, or another functional programming language, is a plus

  • Applicants must have authorization to work in the United States