Byteridge - DevOps/MLOps Engineer
Byteridge is hiring a Rapid Prototyping Engineer to lead AI infrastructure and model optimization projects for strategic customers. The role focuses on deploying, fine-tuning, and optimizing large language models using AWS services like SageMaker and custom GPU silicon. Candidates must have deep expertise in ML infrastructure, distributed training, and CUDA-level optimizations. This is a highly technical role requiring strong Python skills and a background in production ML systems.
50k new jobs listed every day. Install TAL to find more jobs like this.

Experience
5+ years
Function
Engineering
Work mode
Onsite, India
Company
Tier 2
What you will work on
Byteridge is hiring a Rapid Prototyping Engineer to lead AI infrastructure and model optimization projects for strategic customers. The role focuses on deploying, fine-tuning, and optimizing large language models using AWS services like SageMaker and custom GPU silicon. Candidates must have deep expertise in ML infrastructure, distributed training, and CUDA-level optimizations. This is a highly technical role requiring strong Python skills and a background in production ML systems.
Must haves
- Bachelor's degree in CS, Engineering, or equivalent
- 5+ years of experience in ML infrastructure, model deployment, or GPU computing
- Strong programming skills in Python
- Experience with ML frameworks such as PyTorch, TensorFlow, or JAX
- Deep understanding of LLM architectures and training methodologies
- Hands-on experience deploying large language models in production
Tools and skills
Nice to have: quantization, pruning, distillation.
About the company
Established mid-sized IT services and technology solutions firm without flagship Tier-1 status.