Onsite · Mid Level · ai/ml infra

AI Infrastructure Engineer

Hitya Global · Bengaluru, Karnataka, India · Posted 20 May 2026

Experience

3-5 years

Function

Engineering

Work mode

Onsite, India

Company

Tier 2

What you will work on

Hitya Global is looking for an AI Infrastructure Engineer to design and implement multi-agent orchestration and scalable inference pipelines. The role involves optimizing latency, managing AI conversation flows, and building real-time streaming infrastructure. Candidates must have production experience with high-concurrency systems, AI model serving, and asynchronous architectures. You will own critical AI infrastructure decisions and contribute to the growth of financial AI tools.

TAL's take

Quality 55/100 · 5/5 clarity · Tier 2 company

Solid tier-2 startup with clear technical requirements and defined scope for AI infrastructure.

Highly specific technical requirements and responsibilities centered on AI inference and system architecture.

Must haves

  • 3 to 5 years building production systems handling >10k concurrent users
  • Proven experience with async/event-driven architectures
  • Hands-on experience scaling ML/AI inference in production
  • Deep understanding of caching strategies
  • Experience with message queues and real-time communication protocols
  • Built systems integrating multiple LLM/AI models in production

Tools and skills

Redis, TensorFlow Serving, Triton, message queues, WebSockets, async/event-driven architectures

About the company

unfamiliar company, default mid-tier
