Jobs on TAL
All jobsOnsiteEngineeringai/ml infra4-8 yearsprometheus
OnsiteSeniorai/ml infra

AI Solutions and Platforms Observability Engineer

PepsiCoHyderabad, Telangana, IndiaPosted 19 May 2026

This Senior Observability Engineer role at PepsiCo focuses on building and maintaining a scaled observability platform for agentic AI workflows. The engineer will deploy telemetry pipelines, implement distributed tracing for complex agent loops, and manage FinOps for AI workloads. The role requires deep expertise in OpenTelemetry, Kubernetes, and IaC, alongside experience monitoring LLM-specific signals. This position sits at the intersection of AI platform development and reliability engineering.

Matched by TAL

50k new jobs listed every day. Install TAL to find more jobs like this.

Install TAL

Experience

4-8 years

Function

Engineering

Work mode

Onsite, India

Company

Tier 2

What you will work on

This Senior Observability Engineer role at PepsiCo focuses on building and maintaining a scaled observability platform for agentic AI workflows. The engineer will deploy telemetry pipelines, implement distributed tracing for complex agent loops, and manage FinOps for AI workloads. The role requires deep expertise in OpenTelemetry, Kubernetes, and IaC, alongside experience monitoring LLM-specific signals. This position sits at the intersection of AI platform development and reliability engineering.

TAL's take

Quality 65/1005/5 clarityTier 2 company

Role offers high-impact exposure to cutting-edge Agentic AI observability at a large scale, though the company is non-tech native.

The JD provides a highly specific, well-structured list of responsibilities and technical requirements focused on AI agent telemetry.

Salaries at PepsiCo

26.0 LPA average

Based on 37 Grapevine salary entries for PepsiCo.

View all salaries

Engineering

0 - 2 years | L5

12 LPA average

Range: 12 - 12 LPA

Engineering

8 - 10 years | L7

28 LPA average

Range: 28 - 28 LPA

Engineering

10 - 12 years | L8

38 LPA average

Range: 38 - 38 LPA

Other roles

0 - 2 years | L5

7 LPA average

Range: 5 - 9 LPA

Must haves

  • Hands-on experience with observability tools like Prometheus, Grafana, or Datadog
  • Deep working knowledge of OpenTelemetry instrumentation
  • Experience observing agentic AI systems and RAG pipelines
  • Strong Kubernetes experience including Helm and operators
  • Strong automation skills in Python, Bash, or Go
  • Experience with Infrastructure-as-Code tools like Terraform
  • Production operations and debugging distributed systems

Tools and skills

prometheusgrafanaelasticsplunkdatadogopentelemetrykuberneteshelmpythonbashgoterraformbicepcloudformation

Nice to have: crew.ai, langchain, semantic kernel, autogen.

About the company

Global consumer goods corporation with significant internal technology operations but not a primary engineering-first tech firm.

Posts mentioning PepsiCo