We are Lenovo. We do what we say. We own what we do. We WOW our customers.

Lenovo is a US$69 billion revenue global technology powerhouse, ranked #196 in the Fortune Global 500, and serving millions of customers every day in 180 markets. Focused on a bold vision to deliver Smarter Technology for All, Lenovo has built on its success as the world’s largest PC company with a full-stack portfolio of AI-enabled, AI-ready, and AI-optimized devices (PCs, workstations, smartphones, tablets), infrastructure (server, storage, edge, high performance computing and software defined infrastructure), software, solutions, and services. Lenovo’s continued investment in world-changing innovation is building a more equitable, trustworthy, and smarter future for everyone, everywhere. Lenovo is listed on the Hong Kong stock exchange under Lenovo Group Limited (HKSE: 992) (ADR: LNVGY).

This transformation together with Lenovo’s world-changing innovation is building a more inclusive, trustworthy, and smarter future for everyone, everywhere. To find out more visit www.lenovo.com, and read about the latest news via our StoryHub.

About Our Team

Lenovo is building

Quantum

, a next‑generation hybrid AI platform that spans Windows, Android, and cloud. As part of this initiative, we are growing the reliability engineering organization that powers

Qira

, Lenovo’s cross‑device Personal AI.

We are hiring

Site Reliability Engineers (SREs)

to strengthen the reliability, observability, and operational excellence of Qira’s AI systems acrossdevice, edge, and cloud. Depending on your strengths, you may be aligned to areas such as Observability, Operations, or Service Reliability.

Qira works with the speed and creativity of a startup inside Lenovo —you’llhelp build foundational systems with clarity, ownership, and modern engineering practices.

Location:

On-site in Chicago, IL. Hybrid (3 days on-site, 2 days remote)

What You Might Work On

As an SRE, you maybe responsible fora subset of the following, depending on team placement and skill alignment:

Reliability Systems Engineering

Support the reliability, availability, and performance of distributed systems across cloud, edge, and device environments.
Help define, measure, and monitorSLIs and SLOsfor core Qira services.
Identifyreliability risks and collaborate with senior engineers on mitigation plans.

Operational Excellence

Participate in on‑call rotations andassistwith incident response and post‑incident reviews.
Contribute improvements to runbooks, automation, and tooling that reduce alert noise and operational toil.
Help enhance detection, alerting, and response workflows.

Observability Insight

Implement and improve telemetry usingOpenTelemetry,Grafana, and related tools.
Build dashboards and tools that improve visibility into system health and AI service behavior.
Ensure observability data is complete,accurate, and actionable.

Deployments Change Safety

Support safe, reliable deployment workflows including canaries, staged rollouts, and automated rollbacks.
Assistin improving CI/CD systems and deployment tooling.

Collaboration Best Practices

Work closely with senior SREs, DevOps engineers, AI/ML teams, and platform engineers.
Contribute to reliability reviews, operational readiness checks, and cross‑team projects.
Advocate for modern SRE and DevOps practices within the organization.

Basic Qualifications

4\+ yearsof experience inSite Reliability Engineering, DevOps, Platform Engineering, or production systems operations.
Bachelor’s Degree in Computer Science, Engineering, or related technical field (or equivalent practical experience).
Foundational experience supportingdistributed systemsin production.
Ability to write scripts or tools in Python, Go, Bash, or similar languages.
Solid understanding of Linux systems, networking basics, and system performance fundamentals.
Experience with cloud platforms (Azure preferred, AWS or GCP acceptable).
Familiarity with monitoring/observability (metrics, logs, tracing).
Experience with containers and Kubernetes.

Preferred Qualifications

Experience withOpenTelemetryinstrumentation and telemetry pipelines.
Hands‑on experience withGrafana, Prometheus, Loki, or Tempo.
Exposure toAI/ML systems, inference services, or data‑intensive workloads.
Experience contributing to CI/CD processes and deployment automation.
Familiarity with hybridarchitecturesspanningdevice,edge, andcloud.
Passion for automation, reliability, and operational excellence.

What Success Looks Like

Systems become easier tooperate, observe, and trust.
Alerts are moreaccurateand actionable.
On‑call load decreases through thoughtful automation and improvements.
Deployment workflows become more reliable and repeatable.
You grow toward deeper ownership and technical leadership within thereliabilityengineering organization.

The base salary budgeted range for this position is $120K - $150K. Individuals may also be considered for bonus and/or commission.

Lenovo’s various benefits can be found on www.lenovobenefits.com.

We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, religion, sexual orientation, gender identity, national origin, status as a veteran, and basis of disability or any federal, state, or local protected class.

Site Reliability Engineer

Job Description

Looking for more opportunities?