Location
Ontario, Canada
Salary
CA$145,000 to CA$185,000
Type
Full-time
Posted
Today
Job Description
About The Company
Litmus is a pioneering growth-stage software company dedicated to building the data foundation that powers industrial AI. Recognizing that AI's effectiveness hinges on real-world, contextualized data, Litmus addresses the critical gap faced by many industrial environments that lack access to actionable operational data. The company's platform, which sits at the intersection of edge computing, AI, and industrial operations, enables manufacturers to access, structure, and use real-time data from machines, systems, and sensors at the edge. Trusted by global giants such as Google, Microsoft, Dell, Oracle, and Mitsubishi, Litmus empowers some of the world's largest companies to run operations in real time, reduce downtime, and optimize production processes. Backed by leading investors, Litmus is at the forefront of the shift toward software-defined manufacturing, making industrial AI a tangible reality.
About The Role
Litmus is seeking a highly experienced Senior DevOps Leader to own and transform the company's DevOps function into an AI-enabled engineering discipline. Reporting directly to the Head of Technology, this role involves leading a distributed team across North America and India, overseeing the entire DevOps landscape, and driving technical excellence and innovation. The successful candidate will inherit a robust technical foundation, including self-hosted GitLab for CI/CD, multi-cloud infrastructure on AWS and GCP, Kubernetes workloads, and an on-premises VMware estate. Primary responsibilities include elevating platform maturity, reducing delivery friction, and making strategic technical decisions to support rapid scaling and operational reliability. This role offers an opportunity to work at the intersection of platform engineering, cloud infrastructure, security automation, and AI transformation, shaping the future of industrial AI infrastructure at Litmus.
Qualifications
- 5+ years of progressive DevOps or platform engineering experience, including at least 2 years in a technical leadership or staff role.
- Deep hands-on experience with GitLab CI/CD, including self-hosted GitLab administration, pipeline management, and security integrations.
- Proficiency in production Kubernetes environments, preferably EKS, with expertise in cluster upgrades, node management, networking, and reliability engineering.
- Strong multi-cloud infrastructure experience across AWS and GCP/Azure, including IAM, VPC networking, EKS, and cost optimization.
- Extensive experience with Infrastructure as Code using Terraform, including module design, drift detection, and automation pipelines.
- Knowledge of identity and access management, including federating VMware vCenter, AWS, and Azure AD/Entra ID with SSO and group-based access controls.
- Hands-on security tooling experience such as vulnerability scanning (Qualys or equivalent), secrets management (1Password, Vault), and SBOM/CVE pipelines.
- Proficiency in scripting languages like Bash or Python for automation tasks.
- Excellent written and verbal communication skills, capable of creating clear design documents and engaging with cross-functional teams and leadership.
- Experience with AI tooling within engineering workflows, including pipeline diagnostics, incident response, and developer productivity tools.
Responsibilities
- Lead and mentor a distributed DevOps team across North America and India, including a dedicated security-focused sub-team.
- Serve as the primary decision-maker for DevOps architecture, tooling, prioritization, and standards.
- Collaborate with Engineering, QA, and Product teams to streamline delivery processes and improve key metrics such as lead time, deployment frequency, and MTTR.
- Manage and enhance the self-hosted GitLab platform, including upgrades, runner fleet management, and pipeline maturity.
- Drive the evolution of CI/CD capabilities, integrating security and static analysis tools, container scanning, and infrastructure testing.
- Oversee EKS cluster operations, including upgrades, node management, networking, and reliability improvements.
- Manage multi-cloud infrastructure, optimize costs, and oversee resource lifecycle across AWS and GCP.
- Lead the migration from legacy on-premises infrastructure to cloud-native solutions where appropriate.
- Implement and enforce security standards, including SSO federation, vulnerability management, secrets handling, and data security protocols.
- Build and own the internal developer platform, improving developer experience through automation and self-service tools.
- Lead observability initiatives, including metrics collection, alerting, and performance monitoring, fostering a metrics-driven culture.
- Drive AI-enabled transformations within the DevOps landscape, integrating AI tools for automation, diagnostics, incident response, and cost optimization.
- Establish governance frameworks for AI tooling, ensuring security, compliance, and effective use of AI capabilities in engineering workflows.
Benefits
- Competitive base salary ranging from CA$145,000 to CA$185,000, commensurate with experience.
- Comprehensive benefits package including health, dental, and vision coverage.
- Equity participation opportunities to share in the company's growth and success.
- Professional development allowance to support continuous learning and certification.
- Flexible work arrangements to promote work-life balance.
- Collaborative and inclusive company culture that values curiosity, ownership, and impact.
Equal Opportunity
Litmus is proud to be an equal opportunity workplace and is committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or any other basis protected by federal, state, or local law.