Jobs on TAL
All jobsOnsiteEngineeringai/ml infra5+ yearspython
OnsiteSeniorai/ml infra

Byteridge - DevOps/MLOps Engineer

ByteridgeMumbai, IndiaPosted 17 May 2026

Byteridge is hiring a Rapid Prototyping Engineer to lead AI infrastructure and model optimization projects for strategic customers. The role focuses on deploying, fine-tuning, and optimizing large language models using AWS services like SageMaker and custom GPU silicon. Candidates must have deep expertise in ML infrastructure, distributed training, and CUDA-level optimizations. This is a highly technical role requiring strong Python skills and a background in production ML systems.

Matched by TAL

50k new jobs listed every day. Install TAL to find more jobs like this.

Install TAL

Experience

5+ years

Function

Engineering

Work mode

Onsite, India

Company

Tier 2

What you will work on

Byteridge is hiring a Rapid Prototyping Engineer to lead AI infrastructure and model optimization projects for strategic customers. The role focuses on deploying, fine-tuning, and optimizing large language models using AWS services like SageMaker and custom GPU silicon. Candidates must have deep expertise in ML infrastructure, distributed training, and CUDA-level optimizations. This is a highly technical role requiring strong Python skills and a background in production ML systems.

Must haves

  • Bachelor's degree in CS, Engineering, or equivalent
  • 5+ years of experience in ML infrastructure, model deployment, or GPU computing
  • Strong programming skills in Python
  • Experience with ML frameworks such as PyTorch, TensorFlow, or JAX
  • Deep understanding of LLM architectures and training methodologies
  • Hands-on experience deploying large language models in production

Tools and skills

pythonpytorchtensorflowjaxawsamazon sagemakercuda

Nice to have: quantization, pruning, distillation.

About the company

Established mid-sized IT services and technology solutions firm without flagship Tier-1 status.