Jobs on TAL
All jobsOnsiteEngineeringai/ml infraExperience not specifiedpytorch
OnsiteMid Levelai/ml infra

AI Engineer – Model Optimization & Acceleration

AMDBengaluru, Karnataka, IndiaPosted 20 May 2026

AMD is seeking an AI Engineer in Bengaluru to focus on optimizing and deploying machine learning models across heterogeneous hardware platforms. The role involves working with generative models, vision systems, and multi-modal architectures to improve inference latency and throughput. You will apply quantization and model compression techniques while profiling performance on GPUs and NPUs. This is a technical role centered on production-ready AI systems at the intersection of software and hardware.

Matched by TAL

50k new jobs listed every day. Install TAL to find more jobs like this.

Install TAL

Experience

Experience not specified

Function

Engineering

Work mode

Onsite, India

Company

Tier 1

What you will work on

AMD is seeking an AI Engineer in Bengaluru to focus on optimizing and deploying machine learning models across heterogeneous hardware platforms. The role involves working with generative models, vision systems, and multi-modal architectures to improve inference latency and throughput. You will apply quantization and model compression techniques while profiling performance on GPUs and NPUs. This is a technical role centered on production-ready AI systems at the intersection of software and hardware.

TAL's take

Quality 80/1005/5 clarityTier 1 company

Role at a top-tier semiconductor firm focusing on high-impact AI model optimization for hardware acceleration.

The JD clearly defines the technical stack, the specific nature of the optimization work, and the targeted hardware platforms.

Salaries at AMD

45.7 LPA average

Based on 3 Grapevine salary entries for AMD.

View all salaries

Other roles

2 - 4 years

20 LPA average

Range: 20 - 20 LPA

Other roles

12 - 14 years

76 LPA average

Range: 76 - 76 LPA

Other roles

16 - 18 years

41 LPA average

Range: 41 - 41 LPA

Must haves

  • Strong proficiency in PyTorch or equivalent framework
  • Proficiency in Python and C++
  • Experience with GPU or hardware acceleration like CUDA or ROCm
  • Solid understanding of deep learning models including transformers and CNNs
  • Knowledge of model optimization, quantization, and performance tuning

Tools and skills

pytorchonnxpythonc++cudarocm

Nice to have: edge ai, embedded deployment, generative ai, multi-modal ai, distributed inference.

About the company

AMD is a major global semiconductor company with a well-established engineering brand.

Posts mentioning AMD