Description & Requirements
ENG Security Architecture operates and evolves the enterprise security infrastructure that protects employee-facing services and internal platforms. This role is infrastructure engineering with a Site Reliability Engineering (SRE) mindset applied to security: secure operations, observability, operational automation, and disciplined change control through Git workflows and continuous integration gates. You will own services that must be measurable, stable, and audit-ready under real production load. You are expected to operate independently with minimal ramp-up while raising the bar on reliability and security operations across the organization.
We own these platforms end to end, from management consoles and backend services through endpoint agents on Windows and macOS, with Linux support expected over time. Scope includes security control planes across endpoint controls, secure web and data controls, vulnerability and exposure telemetry, logging pipelines, and identity and privileged access services spanning SaaS and on-premises environments.
We'll trust you to
- Define and operate service level objective (SLO)-backed reliability for security platforms, including dependency mapping, failure mode analysis, resiliency improvements, and rollback readiness
- Build and maintain operational automation for provisioning, validation, drift detection, evidence collection, and exception handling using scripting and orchestration
- Run secure change management for configurations and integrations via Git-based review, continuous integration (CI) gates, environment promotion, ring-based rollout discipline, and rollback readiness
- Establish observability operators can trust, such as health metrics, data freshness and coverage, structured logs, dashboards, and actionable alerting with noise control
- Lead incident response and problem management for security service degradation, from triage and containment through root cause analysis (RCA) and corrective actions
- Partner with infrastructure, identity, endpoint, and corporate technology teams to standardize operational contracts, access patterns, audit artifacts, and runbooks; drive vendor escalations with evidence packs and regression prevention
You'll need to have
- 4+ years of experience operating enterprise infrastructure in production in a security context across SaaS and on-premises dependencies, including on-call incident ownership and root cause analysis (RCA)
- 4+ years building and maintaining production automation in an object-oriented language, typically Python, plus PowerShell or Bash for operational execution; experience integrating with REST APIs and enterprise infrastructure services
- Proven troubleshooting strength across authentication flows, enterprise networking, proxy-constrained environments, certificates, and distributed dependencies
- Hands-on experience operating endpoint agents and security services at scale on Windows and macOS; able to extend operational patterns to Linux environments over time
- Strong familiarity with continuous integration and continuous delivery (CI/CD) pipelines and secure change management for infrastructure and configuration changes using Git workflows, pull request (PR) review discipline, CI gates, controlled promotion between environments, and rollback strategy
- Experience building and operating observability for infrastructure services, including dashboards, alerting, health metrics, and data quality checks with noise control
- Hands-on background supporting identity and privileged access workflows, including least-privilege and time-bound elevation patterns
- Hands-on operation of enterprise security controls: endpoint, secure web and data, vulnerability telemetry, logging pipelines, and privileged access; comfortable with Infrastructure as Code or orchestration frameworks and with converting runbooks into automated checks
We’d love to see
- Track record of defining service level objectives (SLOs) and error budgets for security infrastructure services and using them to drive prioritization
- Experience building self-healing automation with closed-loop validation, drift remediation, and audit-ready evidence
- Demonstrated ability to lead cross-organization reliability and security initiatives, setting standards and influencing roadmaps
- Experience performing threat modeling and architecture reviews for internal platforms and translating findings into concrete control requirements
- Familiarity with automation hardening and secrets and key management: artifact signing, dependency controls, secrets storage and rotation, and certificate or key lifecycle management at scale
We offer one of the most comprehensive and generous benefits plans available, with a range of total rewards that may include merit increases, incentive compensation (exempt roles only), paid holidays, paid time off, medical, dental, vision, short- and long-term disability benefits, 401(k) + match, life insurance, and various wellness programs, among others. The Company does not provide benefits directly to contingent workers/contractors and interns.