Remote, Mid Level, ai/ml infra

AI Research Engineer (Kernel & Inference Optimization)

Jobgether, India. Posted 20 May 2026

This AI Research Engineer role involves optimizing model serving architectures for high throughput and low latency. The position focuses on building scalable inference pipelines for cloud and edge environments. Candidates will conduct benchmarking, profiling, and performance validation to improve AI infrastructure. The role offers full remote flexibility within a research-driven, global team.

Experience

Not specified

Function

Engineering

Work mode

Remote, India

Company

Tier 2

What you will work on

TAL's take

Quality 45/100, 4/5 clarity, Tier 2 company

Role involves interesting technical challenges in AI inference, but the company context is obscured by the Jobgether platform intermediary.

Responsibilities and requirements are clearly defined around inference optimization, though specific technical stack details are omitted.

Must haves

Strong experience in AI/ML engineering
Deep understanding of model deployment architectures
Expertise in optimizing latency, throughput, scalability, and memory footprint
Hands-on experience with performance monitoring, benchmarking, and profiling
Experience building AI systems across cloud or edge environments

Tools and skills

ai/ml engineering, inference optimization, model serving, ai systems performance, performance monitoring, benchmarking, profiling

About the company

Jobgether is a recruitment platform acting as an intermediary; the end client is not disclosed.