Location
London Area, United Kingdom
Salary
Not specified
Type
Full-time
Posted
Today
Job Description
About the Company
We're working with a fast-growing, computer vision SaaS company. This is a very complex platform that is private by design and have various clients across different industries, with different use cases and criteria for their AI-driven insights and detection.
About the Role
They're looking for an SRE to join a small, high-ownership infrastructure team as they scale. You'll be joining as a core member of the SRE function at a pivotal point - the foundational infrastructure is in place, but there's a significant amount of work still to do. More clusters, more architecture, observability, incident response, network optimisation for sensor data ingestion, and building out a scalable blueprint for new customer deployments. You'll have real influence over how things are built and the autonomy to drive it. This is not a ticket-closing role. You'll be working closely with backend engineering and product teams, and the expectation is that you think like an owner.
Responsibilities
- Managing and scaling Google Cloud (GKE) infrastructure alongside some on-premise servers
- Building and improving observability, monitoring, and incident response processes
- Automating operational tasks and improving deployment pipelines
- Optimising network performance for real-time camera and sensor data ingestion
- Designing infrastructure blueprints to support enterprise customer deployments, including varying regulatory requirements
- Collaborating closely with ML and backend engineering teams
- Maintaining high-quality documentation as a first-class deliverable
Required Skills
- Solid Google Cloud and GKE experience
- Strong Linux and networking fundamentals
- Comfortable with automation and scripting - you don't need to be a software engineer but you need to be able to code
- Experience with observability tooling and incident response
- Good communicator who can work across engineering, product and leadership
- GitHub familiarity
- Exposure to MLOps or ML inference infrastructure is a real plus
Qualifications
They're looking for someone who has come up through an engineering or DevOps path into SRE, and who has done it at a startup or scale-up where they had strong ownership. If you've spent your career in large enterprise or consultancy environments where someone else built and maintained the infrastructure, this probably isn't the right fit. A computer science degree is not required - what matters is the skillset and the mindset.
Looking for more opportunities?
Browse thousands of graduate jobs and entry-level positions.