Location
Remote
Salary
Not specified
Type
Full-time
Posted
Today
via LinkedIn
Job Description
Feldspar & Flint LLC is a Recruiting & Staffing firm that specializes in operational strategy across core business functions.
Lead the evolution of enterprise reliability engineering as a Principal SRE. This role offers the opportunity to define strategy, build scalable reliability practices, mentor high-performing engineers, and influence the design of resilient cloud-native platforms that support mission-critical business operations.
Responsibilities
- Establish and scale a modern Site Reliability Engineering function aligned with product delivery and platform objectives.
- Define and drive reliability strategies, including observability, performance engineering, and resilience initiatives.
- Champion service level frameworks, error budgets, and operational excellence practices across engineering teams.
- Design standards, patterns, and metrics that support highly available, scalable, and performant systems.
- Lead the adoption of effective monitoring, alerting, and end-to-end service visibility capabilities.
- Mentor and develop SRE professionals while fostering a culture of continuous improvement and technical excellence.
- Collaborate with architecture, security, operations, and development teams to deliver reliable cloud-based solutions.
- Evaluate and optimize tooling, processes, and platform roadmaps to improve operational effectiveness and long-term scalability.
Qualifications
- 5+ years of experience implementing Site Reliability Engineering practices and reliability-focused solutions.
- Strong expertise with cloud platforms such as AWS, Azure, or Google Cloud and modern distributed architectures.
- Demonstrated experience leading high-performing engineering teams and driving large-scale reliability initiatives.
- Background in software engineering, infrastructure, platform engineering, DevOps, or related technical disciplines.
- Experience working with APIs, microservices, automation, observability, or cloud-native technologies.
- Proven ability to influence stakeholders, solve complex technical challenges, and thrive in collaborative environments.