Data Engineer, Informatics

Job Description

Lumen Bioscience is seeking a motivated Data Engineer to build and maintain systems that turn raw scientific data into highly useful, accessible information that supports biologic drug development. The person in this role will architect, construct, and administer component technologies, including analytical databases and cloud data processing pipelines.

The solutions established and maintained by the Data Engineer will collect data from a variety of sources (e.g., laboratory instruments, sensors, probes, LIMS software). These solutions will store, transform, integrate, and make data available in formats useful for additional query and analysis in the cloud. The data will feed webapps, dashboards, reports, and data science tools, helping to generate critical insights across research, development, and manufacturing. 

Essential Duties and Responsibilities

The major tasks for this position are as follows:

  • Employ a variety of languages and tools to combine data sources and create reliable data pipelines
  • Design and implement the structure and functional capabilities of key data resources (e.g., database schemas, data lake layers)
  • Architect appropriate data solutions to meet research and business needs while also ensuring fit within Lumen’s cloud ecosystem
  • Develop scripts that transform data into useful formats for further analysis by researchers
  • Investigate opportunities for additional data acquisition and processing automation, working closely with scientists and bioinformatics team members to define key requirements
  • Assist in the development and implementation of standard data tables, reports, dashboards, and visualizations that support scientific research and business goals
  • Provide recommendations to improve data quality, reliability, and processing efficiency
  • Collaborate with consultants and contractors to provide sound data solutions built for performance, reliability, and security
  • Administer data infrastructure and resources to help ensure data security and integrity
  • Facilitate legacy data migrations
  • Document system design
  • Where appropriate, provide end-user support, documentation, and training

Desired Qualifications and Requirements

Education and Experience Requirements:

  • B.S./B.A. degree in Computer Science or related field, or equivalent work experience
  • Demonstrated experience developing advanced SQL queries and building relational databases
  • Proficiency in shell scripting and Python
  • Demonstrated experience with ETL development
  • In-depth knowledge of cloud computing platforms (e.g., AWS, GCP, or Azure)
  • Experience processing large, multi-dimensional datasets from diverse, distributed sources

Desired Qualifications:

  • Biotechnology experience
  • Cloud data lake and data warehouse experience
  • Exposure to software validation processes

The successful candidate will have the following attributes:

  • Takes pride in building systems and takes responsibility for wrangling data
  • Strong attention to detail
  • Strong analytical skills
  • Quality focus
  • Curious, active learner
  • Takes initiative in driving projects forward
  • Collaborative team player

Application Instructions

Please submit a resume and cover letter by email with the job title in the subject line.

This position is available immediately. Applications will be reviewed upon receipt. Only successful applicants will be contacted.