Onsite | Senior | AI/ML infra

Byteridge - DevOps/MLOps Engineer

Byteridge | Mumbai, India | Posted 17 May 2026

Byteridge is hiring a Rapid Prototyping Engineer to lead AI infrastructure and model optimization projects for strategic customers. The role focuses on deploying, fine-tuning, and optimizing large language models using AWS services like SageMaker and custom GPU silicon. Candidates must have deep expertise in ML infrastructure, distributed training, and CUDA-level optimizations. This is a highly technical role requiring strong Python skills and a background in production ML systems.

Experience: 5+ years
Function: Engineering
Work mode: Onsite, India
Company: Tier 2

What you will work on

Must haves

Bachelor's degree in CS, Engineering, or equivalent
5+ years of experience in ML infrastructure, model deployment, or GPU computing
Strong programming skills in Python
Experience with ML frameworks such as PyTorch, TensorFlow, or JAX
Deep understanding of LLM architectures and training methodologies
Hands-on experience deploying large language models in production

Tools and skills

Python, PyTorch, TensorFlow, JAX, AWS, Amazon SageMaker, CUDA

Nice to have: quantization, pruning, distillation.

About the company

Established mid-sized IT services and technology solutions firm without flagship Tier-1 status.