Senior/Lead Data Engineer - PySpark/Azure Databricks
NetConnectGlobal is hiring a Senior/Lead Data Engineer to design and optimize scalable enterprise data platforms in the AI/ML infrastructure domain. Responsibilities include building distributed pipelines, optimizing Spark performance, and supporting LLMOps and RAG initiatives. The role requires deep expertise in Databricks, PySpark, and CI/CD automation. This position involves technical leadership, including mentoring junior engineers and implementing data governance.
50k new jobs listed every day. Install TAL to find more jobs like this.

Experience
5+ years
Function
Engineering
Work mode
Onsite, India
Company
Tier 2
What you will work on
NetConnectGlobal is hiring a Senior/Lead Data Engineer to design and optimize scalable enterprise data platforms in the AI/ML infrastructure domain. Responsibilities include building distributed pipelines, optimizing Spark performance, and supporting LLMOps and RAG initiatives. The role requires deep expertise in Databricks, PySpark, and CI/CD automation. This position involves technical leadership, including mentoring junior engineers and implementing data governance.
TAL's take
The role offers clear technical leadership scope and high-impact work with modern AI/ML data stacks, despite being at a tier 2 firm.
The JD is crisp, provides a clear tech stack, and lists well-defined responsibilities relevant to a specialized lead data engineer role.
Must haves
- 5+ years of relevant experience
- Expertise in Python and PySpark
- Proficiency in Databricks and Delta Lake
- Experience with Spark performance optimization
- Ability to build GitLab CI/CD pipelines
- Knowledge of AI/ML workflows and RAG pipelines
Tools and skills
About the company
NetConnectGlobal is a well-established IT services and staffing organization, fitting the tier 2 criteria for established mid-sized firms.