Current Openings

Senior Data Engineer

Job posted: 10/10/23
Full-time Fully remote or hybrid Cambridge, MA

Job Description

Our team at the Broad Institute is seeking a skilled Senior Data Engineer to help us build and maintain the tools necessary for processing and managing large genomic datasets for display in public resources such as gnomADgenebass, SCHEMA, and other resources for visualizing exome association results and genome-wide association study (GWAS) data. The ideal candidate will be familiar with a variety of database technologies (NoSQL and SQL) and be excited about creating data models and data processing graphs that automate regular data generation based on a variety of data inputs. They should also be interested in writing pipelines that process datasets at scale using the Hail Python package. These pipelines should be executable in various environments and scales and integrate with our CI/CD systems.

As a Senior Data Engineer, you will work with a team of software engineers, computational biologists, and researchers. You will also work closely with the front-end software engineering team to ensure applications are fast, reliable, and scalable. Ideally, you are excited to learn about genetics, genomics, biology, and human health.

Requirements:

  • Bachelor's or Master's degree in Computer Science or related field or equivalent experience.

  • At least five years of full-time employment in a Data Engineering role

  • Experience with SQL and NoSQL databases.

  • Experience with Python.

  • Experience building scalable data architectures.

  • Experience with a pipeline tool such as Airflow, Luigi, Prefect, or Dagster.

  • Experience with cloud computing platforms such as AWS or Google Cloud.

  • Highly collaborative attitude and ability to work well in a team setting.

  • Excellent communication skills.

  • Demonstrated attention to detail and analytical skills.

An ideal candidate may have any of the following:

  • Experience with Docker, Kubernetes, and a major cloud provider (Compute, Object Storage, IAM, Functions-as-a-Service) is a plus.

  • Experience with bioinformatics datasets and analyses is a big plus.

What we offer:

  • The chance to work on a project of international significance with a clear impact on families affected by rare genetic disorders.

  • The opportunity to share your work: many of our projects are open source, allowing you to showcase your contributions to the community.

  • Comprehensive benefits package: paid vacation and sick time, health/dental care, matching 401K, commuter benefits, child care.

The key to our success is growing a strong team with a diverse membership who foster a culture of continual learning, and who support the growth and success of one another. Towards this end, we are committed to seeking applications from women and underrepresented groups. We know that many excellent candidates choose not to apply despite their capabilities; please allow us to enthusiastically counter this tendency. If you are a data engineer who is eager to grow professionally, contribute to our team culture, and participate in high-impact, open science, then we encourage you to apply!

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.

EEO is The Law - click here for more info

Equal Opportunity Employer Minorities/Women/Protected Veterans/Disabled

Interested? Click here to apply!