
APIGEE X SITE RELIABILITY ENGINEER - VOIS

Aggregate function: Shared Services

Business Area: Technology _VOIS

Posting Country: India

Date Posted: 9 Apr 2026

Full Time / Part Time: Full Time

Contract Type: Permanent

We challenge and innovate in order to connect people, businesses, and communities across the world. Delighting our customers and earning their loyalty drive us, and we experiment, learn fast and get it done, together.

With us, you can truly be yourself and belong, share inspiration, embrace new opportunities, thrive, and make a real difference.

Who we are

VOIS (Vodafone Intelligent Solutions) is a strategic arm of Vodafone Group Plc, creating value for customers by delivering intelligent solutions through Talent, Technology & Transformation.
We are the largest shared services organisation in the global telco industry, with 30,000 FTE. Our portfolio of next-generation solutions and services is designed in partnership with customers across Vodafone Group, local markets, and partner markets to simplify and drive growth. With our strategic partner Accenture, we work alongside our Vodafone customers and other telco and tech companies to drive transformation, meet the challenges of our industry, and ensure we stay relevant and resilient. This partnership is a unique, industry-first model that brings together the best of in-house and third-party capability.
We work with customers across 28 countries from 10 VOIS locations: Albania, Egypt, Hungary, India, Romania, Spain, Turkey, UK, Germany, Ireland, and with a network of teams in Czech Republic, Italy, Greece, and Portugal.
#VOIS #BeUnrivalled #CreateTheFuture

About this Role

We are seeking an Apigee X Site Reliability Engineer (SRE) with strong production experience operating APIs at scale. This role is focused on ensuring the reliability, performance, and resilience of Apigee X–backed services that support critical customer journeys. The individual will take ownership of monitoring, observability, incident response, and reliability engineering practices, with a clear mandate to improve SLO attainment, reduce mean time to recovery, minimise incident recurrence, and continuously reduce operational toil.

What you will do

Define, implement, and maintain Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for Apigee-backed services, including availability, latency, error rates, and throughput
Establish SLO targets, manage error budgets, and own reliability reporting cadence
Design, implement, and continuously tune alerting strategies across the API platform to reduce noise and improve actionability
Classify and route alerts by severity (P1/P2/P3) based on customer impact and SLO burn rates
Implement alert correlation patterns, including authentication failures, quota spikes, and backend target failures
Own and enhance operational dashboards covering Golden Signals and dependency health using Datadog, with adaptability to future observability tools
Build and maintain dashboards for traffic, latency, error rates, backend dependencies, DNS health, certificate expiry, and authentication providers
Create SLO burn-rate views and identify top impacted API proxies
Proactively identify anomalies and performance degradation trends such as p95 latency drift, rising 429 responses, backend timeouts, and token failures
Analyse seasonality patterns and establish intelligent baseline thresholds
Produce weekly and monthly reliability reports covering SLO performance, major incidents, recurring root causes, change failure rate, and MTTR
Implement and maintain synthetic monitoring and user journey checks for critical API flows, including authentication, API invocation, and backend dependencies
Participate in 24x7 on-call rotations and lead incident response and problem management activities

Who you are

An experienced reliability or production support professional with strong hands-on expertise in the Apigee platform, particularly Apigee X
Proficient in custom reporting and advanced debugging within Apigee environments
Experienced with APM and observability tools, including creating dashboards, alerts, and monitors (Datadog preferred)
Comfortable operating in production environments and responding to incidents with a structured, customer-impact-focused approach
Knowledgeable in modern cloud technologies and distributed systems
Familiar with Agile ways of working and collaborative, cross-functional delivery
Educated to bachelor’s degree level in Computer Science or Computer Engineering, or with equivalent practical experience

Not a Perfect Fit?

Concerned you may not meet every requirement? Vodafone is committed to creating an inclusive workplace where everyone can thrive. If you are excited about this role but your experience does not align exactly with every aspect of the job description, you are encouraged to apply. You may be the right candidate for this or another opportunity, and the recruitment team will support you in exploring where your skills fit best.

What’s in it for you

The opportunity to work on large-scale, business-critical API platforms supporting high-impact customer journeys
Exposure to advanced reliability engineering practices within a global technology organisation
Collaboration with diverse, cross-functional teams across markets and partners
A role with clear ownership, influence, and measurable outcomes in platform reliability and resilience

What skills you will learn

Advanced SRE practices including error budgets, burn-rate alerting, and reliability governance
Deep operational insight into Apigee X runtime behaviour and API performance optimisation
Enhanced observability and monitoring design skills across complex, distributed systems
Incident leadership, problem management, and continuous improvement techniques at scale

VOIS Equal Opportunity Employer Commitment

Vodafone recognises and celebrates the value of diversity in building a workforce that reflects the customers and communities it serves. No form of discrimination is tolerated. This includes, but is not limited to, discrimination based on race, colour, age, veteran status, gender identity, gender expression, sexual orientation, pregnancy, maternity or parental status, ethnicity, disability, religion or belief, political affiliation, trade union membership, nationality, citizenship, indigenous status, medical condition, HIV status, neurodiversity, social origin, cultural background, marital or civil partnership status, or socio-economic background.

Join Us

At Vodafone, we’re working hard to build a better future. A more connected, inclusive and sustainable world. As a dynamic global community, it's our human spirit, together with technology, that empowers us to achieve this.

Alert

Apply for Vodafone jobs only through the official Vodafone Careers website to avoid job scams and fraud.

Follow us on social media and #StayConnected

LinkedIn: https://www.linkedin.com/company/vois/
Facebook: https://www.facebook.com/voisglobal
Instagram: https://www.instagram.com/voisglobal

Vodafone is committed to attracting, developing and retaining the very best people by offering a motivating and inclusive workplace in which talent is truly recognised and rewarded. We are committed to promoting Inclusion for All with the belief that diversity plays an important role in the success of our business. We actively encourage everyone to consider becoming a part of our journey.
