Turquoise Health is building its Platform Engineering function from the ground up and this is your chance to shape it. As our first Senior Platform Operations Engineer, you'll lay the foundation for how we build, deploy, and operate infrastructure that supports real-world healthcare outcomes.
This is a high-ownership role on a new team. You'll establish the practices, tooling, and standards that the rest of engineering will rely on, from observability and reliability to deployment workflows and scalability. If you're energized by greenfield problems, comfortable navigating ambiguity, and motivated to automate and simplify before being asked, we'd love to build something great together.
Responsibilities:
Own the Platform Infrastructure Manage and scale our container environment on Amazon EKS, implement GitOps workflows using ArgoCD, and maintain CI/CD pipelines through GitHub Actions to ensure that deployments are fast, consistent, and automated
Build for Reliability Define and track SLIs and SLOs, lead incident response including on-call rotations, root cause analysis, and post-mortems, and contribute to disaster recovery planning to keep our systems highly available
Drive Observability Design and maintain our monitoring and logging stack using Datadog, Sentry, and CloudWatch — giving engineering teams clear visibility into system health and performance before problems reach users
Shape the Platform's Future Collaborate on architectural decisions, build internal tooling and self-service workflows that make the platform easier to operate, and contribute meaningfully to how we scale and evolve our infrastructure
What You’ll Bring:
6+ years in SRE, DevOps, or Cloud Infrastructure, 2+ years in a Senior role with a proven track record of building and operating production systems at scale
Confident working with core AWS services (VPC, IAM, EKS, RDS) and a strong understanding of cloud networking and security best practices
Expert in using Infrastructure as code with Terraform, CloudFormation, or Crossplane
Proficient with GitHub and GitHub Actions as a core component of your CI/CD and automation pipelines- not just for source control
Experienced with running Kubernetes clusters in production and managing application deployments through GitOps workflows (ArgoCD/Flux) and Helm Charts
Proficient with observability tooling such as Datadog, Sentry, CloudWatch, Grafana to include building alerts, dashboards, and log pipelines
Experience writing solid Python scripts to glue systems together, automate infrastructure tasks, or handle custom workflows
Comfortable working independently in a remote setup, asking questions when needed, and keeping momentum without being micromanaged
Bachelor’s degree in Computer Science, Engineering, or equivalent experience
Nice to haves:
Certifications: AWS, Kubernetes, Terraform or Python
Benefits:
Competitive pay with equity options
Stellar health care plan options (Medical, Dental & Vision), with FSA, DCFSA, & HSA options
Company-sponsored disability & life insurance
Unlimited PTO
401(k) + 4% Matching
Fully remote work + flexible working hours
$750 work-from-home setup budget
Paid biannual in-person company summits
Quarterly $150 co-hanging stipend to meet up with coworkers
Monthly $100 health and wellness benefit
Generous paid family leave
Annual $1,200 learning & development stipend