AI Research Engineer (Kernel & Inference Optimization)
This AI Research Engineer role involves optimizing model serving architectures for high throughput and low latency. The position focuses on building scalable inference pipelines for cloud and edge environments. Candidates will conduct benchmarking, profiling, and performance validation to improve AI infrastructure. The role offers full remote flexibility within a research-driven, global team.
50k new jobs listed every day. Install TAL to find more jobs like this.

Experience
Experience not specified
Function
Engineering
Work mode
Remote, India
Company
Tier 2
What you will work on
This AI Research Engineer role involves optimizing model serving architectures for high throughput and low latency. The position focuses on building scalable inference pipelines for cloud and edge environments. Candidates will conduct benchmarking, profiling, and performance validation to improve AI infrastructure. The role offers full remote flexibility within a research-driven, global team.
TAL's take
Role involves interesting technical challenges in AI inference, but the company context is obscured by the Jobgether platform intermediary.
Responsibilities and requirements are clearly defined around inference optimization, though specific technical stack details are omitted.
Must haves
- Strong experience in AI/ML engineering
- Deep understanding of model deployment architectures
- Expertise in optimizing latency, throughput, scalability, and memory footprint
- Hands-on experience with performance monitoring, benchmarking, and profiling
- Experience building AI systems across cloud or edge environments
Tools and skills
About the company
Jobgether is a recruitment platform acting as an intermediary; the end client is not disclosed.