Technology Safety & Reliability Architect (Integrations)

Peraton
United States, Virginia, Reston
1875 Explorer St
Mar 18, 2026

Job Locations

US




Requisition ID
2026-164635

Position Category
Information Technology

Clearance
Public Trust



Responsibilities

Peraton is seeking a Technology Safety & Reliability Architect to lead the design, governance, and continuous improvement of technology safety, resilience, and system integration assurance for the FAA Brand New Air Traffic Control System (BNATCS) program.

This role will operate at the CIO governance and architecture layer, ensuring that technology platforms, integration activities, and operational services are designed and operated using safe-by-design and resilience-first principles. The Safety & Reliability Architect will establish architectural guardrails, reliability engineering standards, and system integration safety controls to reduce the likelihood and impact of outages, misconfigurations, integration failures, and high-risk operational changes.

The role will work across CIO leadership, CTO engineering teams, and program delivery organizations to ensure technology architecture, integration patterns, and operational platforms meet program safety, reliability, and resilience objectives.

Key Responsibilities may include:

Safety Architecture & Reliability Engineering

  • Define and maintain technology safety and resilience architecture for enterprise platforms across on-prem, cloud, and hybrid environments
  • Establish safety-related non-functional requirements (NFRs) including availability, reliability, recoverability, performance, and maintainability
  • Translate NFRs into implementable engineering controls and architecture guardrails
  • Develop and maintain platform hardening standards and baseline configurations aligned to operational safety objectives
  • Define safe-by-default architecture patterns for platform services and shared infrastructure

System Integration Safety Governance

  • Lead integration safety and reliability governance across systems participating in the BNATCS technology ecosystem
  • Define integration architecture safety patterns for data exchange, service interoperability, and platform dependencies
  • Ensure integration activities meet availability, fault isolation, and operational resilience standards
  • Support the definition of integration validation criteria and readiness reviews prior to major deployments or system transitions
  • Identify and mitigate system-level integration risks impacting program safety, stability, or operational continuity

Resilience Engineering & Operational Stability

  • Design and validate resilience strategies including redundancy, failover, fault isolation, backup/restore, and disaster recovery
  • Ensure monitoring, alerting, and telemetry capabilities support early detection of operational anomalies
  • Define and monitor reliability metrics such as availability, MTTR, incident frequency, and change failure rates
  • Introduce resilience testing practices including failover validation, recovery testing, and operational readiness reviews

Change & Operational Safety Controls

  • Embed safety validation checkpoints into change and release management processes
  • Define readiness criteria for high-risk deployments including rollback procedures and post-change validation
  • Implement risk scoring and change classification models for operational safety governance
  • Ensure major operational changes meet safety and resilience readiness criteria

Incident Management & Reliability Improvement

  • Serve as a stabilization leader during major incidents impacting program technology services
  • Lead root cause analysis and post-incident reviews to identify systemic improvement opportunities
  • Develop and implement corrective actions including automation, configuration, standardization, and architecture improvements
  • Drive measurable improvements in platform stability and reliability

Program Governance & Reporting

  • Conduct architecture reviews, resilience assessments, and configuration audits to validate platform safety posture
  • Provide executive reporting on technology risk posture, resilience maturity, and operational safety trends
  • Partner with CIO and Domain leadership to align technology architecture decisions with program safety objectives


Qualifications

Required Qualifications

  • Minimum of 12 years of experience with a BS/BA, 10 years with an MS/MA, or 7 years with a PhD in Information Technology, Computer Science, Cybersecurity, Systems Engineering, or a related field (or equivalent experience)
  • 7+ years of experience designing and operating enterprise technology platforms with responsibility for availability, reliability, and operational resilience
  • Demonstrated experience defining architecture standards, engineering guardrails, and operational controls across complex enterprise environments
  • Strong systems thinking with the ability to understand dependencies across identity, networking, compute, storage, endpoints, and platform services
  • Experience developing architecture documentation, reference designs, and operational runbooks
  • Experience implementing operational excellence practices, including:
    • Monitoring and observability
    • Incident management
    • Change safety controls
    • Disaster recovery planning and validation
  • Strong stakeholder coordination skills across architecture, engineering, operations, and program leadership teams
  • Excellent written and verbal communication skills for executive briefings and governance documentation
  • US citizenship with the ability to obtain and maintain an FAA Public Trust clearance

Preferred Qualifications

  • Experience applying system safety analysis methods such as:
    • FMEA (Failure Mode and Effects Analysis)
    • Fault Tree Analysis
    • STPA (System Theoretic Process Analysis)
    • Hazard Analysis
  • Familiarity with governance frameworks including:
    • ITIL
    • TOGAF
    • NIST
    • FISMA
    • ISO 27001 / ISO 22301
    • FedRAMP
  • Experience operating in regulated or safety-critical environments
  • Experience designing cloud resilience patterns across AWS or Azure
  • Familiarity with Infrastructure-as-Code and Policy-as-Code practices


Peraton Overview

Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy. As the world's leading mission capability integrator and transformative enterprise IT provider, we deliver trusted, highly differentiated solutions and technologies to protect our nation and allies. Peraton operates at the critical nexus between traditional and nontraditional threats across all domains: land, sea, space, air, and cyberspace. The company serves as a valued partner to essential government agencies and supports every branch of the U.S. armed forces. Each day, our employees do the can't be done by solving the most daunting challenges facing our customers. Visit peraton.com to learn how we're keeping people around the world safe and secure.



Target Salary Range

$112,000 - $179,000. This represents the typical salary range for this position. Salary is determined by various factors, including but not limited to, the scope and responsibilities of the position, the individual's experience, education, knowledge, skills, and competencies, as well as geographic location and business and contract considerations. Depending on the position, employees may be eligible for overtime, shift differential, and a discretionary bonus in addition to base pay.


EEO

EEO: Equal opportunity employer, including disability and protected veterans, or other characteristics protected by law.