About this team and role:

A Staff Operations Engineer leads the design, reliability, and evolution of hybrid-cloud and workplace infrastructure. This senior role spans teams, drives complex initiatives, sets technical direction, and ensures systems are scalable, secure, and efficient.

This role combines technical execution with leadership, shaping architecture both directly and collaboratively.

What you’ll do:

Domain Architecture & Technical Direction

Own and evolve architecture within a defined infrastructure domain
Design and implement scalable, reliable systems spanning multiple teams or environments
Establish and promote best practices, patterns, and standards within the domain
Contribute to medium- and long-term technical strategy (typically 6-18 months)

Complex Problem Solving & Execution

Lead delivery of ambiguous, high-impact infrastructure projects
Break down elaborate system problems into implementable solutions
Drive migrations, re-architectures, and performance/reliability improvements
Remain hands-on with critical systems and implementations

Cross-Team Collaboration

Work across teams (IT, SRE, Security, Service Owners) to unify solutions.
Influence technical decisions through design reviews and collaboration
Ensure systems integrate cleanly across infrastructures (office, DC, cloud)

Reliability, Operations & Scaling

Improve system reliability through monitoring, alerting, and operational design
Contribute to defining SLIs/SLOs and capacity planning within the domain
Participate in and lead root cause analysis for complex incidents
Decrease operational toil through automation and system improvements

Infrastructure & Networking Depth

Design and support core infrastructure components (compute, DNS, networking, identity, etc.)
Drive improvements in performance, scalability, and dependability
Contribute deep expertise in at least one area (e.g., DNS, network architecture, cloud infra)

Automation & Tooling

Build and improve automation using scripting and Infrastructure as Code
Contribute to internal tooling and platform improvements
Promote repeatable, standardized approaches to system management

Mentorship & Technical Guidance

Mentor engineers and guide system design and troubleshooting
Raise the technical quality of the team through reviews and shared practices
Act as a go-to resource within the domain

Documentation & Operational Clarity

Maintain clear documentation, diagrams, and runbooks for systems owned
Ensure systems are understandable and operable by others
Contribute to knowledge sharing across teams

What you’ll bring:

6+ years of experience in systems engineering or infrastructure roles
Strong experience designing and operating production infrastructure
Solid expertise in: VMware, Cisco UCS, Application/Network Loadbalancers, Linux/Unix Operating Systems, Networking fundamentals (DNS, TCP/IP, routing, firewalls), Data center environments
Demonstrated ability to lead complex technical work across teams

Staff Operations Engineer

TrulyRemote Verified

Technical Requirements

About this team and role:

What you’ll do:

What you’ll bring:

Similar Jobs

Senior Radar Software Engineer

Senior Test Tooling Engineer

Embedded Software Engineer

Sales Development Rep - North America (Must be located in India)

Staff Engineer, AI Platform & Architecture