Lead Scientific Data Engineer (Joint Genome Institute)

Lawrence Berkeley National Laboratory
sick time, 401(k), relocation assistance, remote work
United States, California, Berkeley
1 Cyclotron Road
May 19, 2026

Berkeley Lab's (LBNL) Joint Genome Institute (JGI) has an opening for a Lead Scientific Data Engineer to join the Advanced Analysis Team!

JGI has a long history of generating world-class genomic data to address pressing national energy and environmental security challenges. Building on this expertise, JGI is now helping to define the data foundation for an emerging era of AI-enabled scientific discovery in support of the Genesis Mission. The Advanced Analysis team at JGI builds the core data infrastructure, advanced bioinformatics workflows, and ML/AI data pipelines needed to prepare genomic data for new AI-enabled capabilities. We are looking for a Lead Scientific Data Engineer to help drive the evolution of these systems and shape the platforms JGI will rely on to meet the scale, complexity, and urgency of data-driven science.

This is an exciting and unique opportunity to provide senior technical leadership for some of the core scientific data systems that support JGI operations, genomic data workflows, and AI capabilities. You will be asked to lead the implementation of strategic initiatives across data management, job orchestration, and platform integration. We will trust you to define practical technical roadmaps, guide architecture decisions, and help ensure long-term platform value and scalability. As a senior technical leader at JGI, you will also contribute to cross-team engineering strategy, team culture, and platform evolution across the organization.

This position has an anticipated start date of July 1, 2026.

What You Will Do:

  • Provide senior technical leadership for JGI's core scientific data and compute platforms by developing technical implementation roadmaps, data system architectures, and long-term data system strategy.
  • Lead the design and implementation of production automated systems, APIs, and workflows supporting genomic data movement, metadata management, job orchestration, data access, and large-scale scientific computing.
  • Improve the reliability, scalability, observability, interoperability, and maintainability of shared production data systems while supporting sustainable operations and delivery.
  • Work closely with product managers, scientists, and users to drive cross-team technical alignment and integration decisions that address complex technical challenges and shared priorities.

What We Are Looking For:

  • A Bachelor's Degree (or equivalent knowledge/training) in Computer Science or a related field and a minimum of 12 years of related professional experience with large-scale scientific data and compute infrastructures or an equivalent combination of education and experience.
  • Demonstrated experience leading the design, development, integration, and operation of production software and data systems that support metadata management, workflow orchestration, data lifecycle operations, and broad user data access.
  • Advanced knowledge of data and software engineering fundamentals relevant to data-intensive distributed systems, including system design, concurrency, performance, and testing.
  • Broad experience with databases and data storage technologies including relational databases, object storage, and systems for managing structured, semi-structured, and large-scale data.
  • Experience with data engineering and event-driven technologies such as Airflow or Kafka.
  • Strong experience effectively using AI coding agents such as Claude Code, Codex, or Cursor, including demonstrated judgment in reviewing and validating generated software for correctness, quality, security, maintainability, and suitability for production use.
  • Proficiency in Python and experience with one or more additional programming languages.
  • Excellent communication skills, including experience organizing and presenting complex technical information to varying audiences.
  • Demonstrated ability to lead through influence and bring people together to deliver technical results in complex, interdisciplinary environments, including aligning users, stakeholders, and engineering teams around shared requirements and implementation plans.

Desired Qualifications:

  • A Master's Degree (or equivalent knowledge/training) in Computer Science or a related field.
  • Experience working with genomics, bioinformatics, and/or next-generation sequencing data.
  • Experience with scientific workflow languages or workflow systems such as WDL and Nextflow.
  • Experience with full-stack and front-end application development.
  • Experience working in High Performance Computing (HPC) environments.

Additional Information:

  • Application Date: Priority consideration will be given to candidates who apply with a resume and cover letter by June 1, 2026. Applications will be accepted until the job posting is removed.
  • Appointment Type: This is a full-time, exempt from overtime pay (monthly paid), 2-year (benefits eligible) Term appointment with the possibility of extension or conversion to a Career appointment based upon satisfactory job performance, continuing availability of funds, and ongoing operational needs.
  • Salary Range: This position has a budgeted salary range of $158,808 - $198,492 annually, which fits within the full salary range of $158,808 - $267,996 annually for job code C71.4. It is not typical for an individual to be offered a salary at or near the top of the range for a position. Salary will be commensurate with the final candidate's qualifications and experience, including skills, knowledge, relevant education, and certifications, and will be aligned with the internal peer group.
  • Background Check: This position is subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position. Having a conviction history will not automatically disqualify an applicant from being considered for employment.
  • Work Modality: This position is eligible for a hybrid work schedule - a combination of teleworking and performing work on site at Lawrence Berkeley National Lab located at 1 Cyclotron Road, Berkeley, CA 94720. Individuals working a hybrid schedule must reside within 150 miles of Berkeley Lab. Work schedules are dependent on business needs. In rare cases, full-time telework or remote work modes may be considered. A REAL ID or other acceptable form of identification is required to access Berkeley Lab sites.
  • Relocation Assistance: This position is eligible for relocation assistance.
  • Work Authorization: Applicants must be legally authorized to work in the United States. Berkeley Lab does not provide visa sponsorship for this position.

We're here for the same mission, to bring science solutions to the world. Join our team and YOU will play a key role in our goal to address global challenges! Have a high level of impact and work for an organization associated with 17 Nobel Prizes!

Who is JGI?

The Joint Genome Institute (JGI) is a global leader in genome science, helping shape the future of biological discovery through advanced genomic capabilities, expert support, and large-scale, AI-ready data resources. As a DOE Office of Science user facility supported by the Biological and Environmental Research (BER) program, JGI advances BER's mission to achieve a predictive understanding of complex biological, Earth, and environmental systems in support of the nation's energy and infrastructure security. Through world-class capabilities in genome sequencing, synthesis, transcriptomics, metabolomics, natural products, and data science, JGI supports cutting-edge research on plants, fungi, algae, microorganisms, and microbiomes. JGI is headquartered in Berkeley, CA at Berkeley Lab's Integrative Genomics Building (IGB).

Why join Berkeley Lab?

We invest in our employees by offering a total rewards package you can count on:

  • Exceptional health and retirement benefits, including pension or 401K-style plans
  • A culture where you'll belong - we are invested in our teams!
  • In addition to accruing vacation and sick time, we also have a Winter Holiday Shutdown every year.
  • Parental bonding leave (for both mothers and fathers)
  • Pet insurance

Want to learn more about working at Berkeley Lab? Please visit: careers.lbl.gov

Equal Employment Opportunity Employer: The foundation of Berkeley Lab is our Stewardship Values: Team Science, Service, Trust, Innovation, and Respect; and we strive to build community with these shared values and commitments. Berkeley Lab is an Equal Opportunity Employer. We heartily welcome applications from all who could contribute to the Lab's mission of leading scientific discovery, excellence, and professionalism. In support of our rich global community, all qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, protected veteran status, or other protected categories under State and Federal law.

Misconduct Disclosure Requirement: As a condition of employment, the finalist will be required to disclose if they are subject to any final administrative or judicial decisions within the last seven years determining that they committed any misconduct, are currently being investigated for misconduct, left a position during an investigation for alleged misconduct, or have filed an appeal with a previous employer.
