HybridSeniordata infra

Senior Data Engineer

Version 1Pune, IndiaPosted 19 May 2026

Version 1 is seeking a Senior Data Engineer to join their data practice, focusing on building scalable, high-performance data platforms. The role involves designing and operating batch and streaming data pipelines using Databricks and cloud-native AWS services. Candidates are expected to possess strong Python or Java skills, advanced SQL, and deep experience with distributed data processing systems. This position offers long-term career growth within a large, award-winning technology services firm.

Matched by TAL

50k new jobs listed every day. Install TAL to find more jobs like this.

Install TAL

Experience

5+ years

Function

Engineering

Work mode

Hybrid, India

Company

Tier 2

What you will work on

TAL's take

Quality 60/1005/5 clarityTier 2 company

Solid tier-2 company with a well-defined role in their data practice and clear technical expectations.

The role has a very clear focus on Databricks and data pipeline development with well-defined responsibilities.

Salaries at Version 1

29.0 LPA average

Based on 2 Grapevine salary entries for Version 1.

View all salaries

Other roles

8 - 10 years

18 LPA average

Range: 18 - 18 LPA

Other roles

12 - 14 years

40 LPA average

Range: 40 - 40 LPA

Must haves

5+ years of hands-on data engineering experience
Strong programming skills in Python and/or Java
Advanced SQL skills
Proven experience building and running data pipelines in production
Solid understanding of distributed systems and scalable data architectures
Hands-on experience with AWS data services
Experience with Databricks or similar distributed data processing platforms

Tools and skills

pythonjavasqlawsdatabrickssparkci/cd

Nice to have: ml, ai, bi.

About the company

Established technology services company with over 3000 employees and strong partner status, fitting the definition of an established mid-stage or large-scale services firm.

Posts mentioning Version 1

Grapevine January 13th update

Thank you for all the love and support over the last few weeks. :) Grapevine is just 5 weeks old and we are a very small team :) But we are trying address feature requests as they come from you all. A few things from us: 1. We recommend you update the app from the play/app store for the best experience 2. As per popular demand - timestamps have now been introduced in the app (available on latest version) 3. New (and hopefully better) algo for the popular feed has been rolled out Lastly, please feel free to drop in your feedback and feature requests in this post. We will try to implement everything we can.

News Discussion7444

Title: Data Scientist (~2 YOE) – Built planning tools, pipelines, and AI system. Need honest feedback on profile.

Hi all, Looking for practical feedback on my profile before I start applying. I’ll keep this structured so it’s easier to evaluate. --- 1) Planning Tools / Web Applications Problem: Forecasting workflows were fragmented and heavily Excel-driven: - Multiple data sources (orders, shipments, different forecast versions) - Manual merging, lookups, and adjustments - No way to simulate scenarios or compare forecasts cleanly - Different planners using different methods → inconsistency What I built: - Two internal applications for planning workflows: - A planning tool integrating 8+ data sources - A forecasting simulator supporting multi-level editing (high → granular) Key capabilities: - Real-time scenario simulation - Side-by-side comparison of multiple forecast types - Hierarchical adjustments across levels - SQL write-back for persistence Scale: - Processes ~150K+ records per cycle - Used in monthly planning cycles by multiple teams Impact: - Removed fragmented Excel workflows - Enabled consistent decision-making across users - Reduced manual effort and improved visibility into forecast behavior --- 2) Automation & Data Pipelines Problem: Core workflows were manual and repetitive: - Multi-file Excel processing - Data cleaning + merging across systems - Version tracking errors - High effort per cycle (1–4 hours depending on workflow) What I built: - Multiple pipelines automating end-to-end workflows Examples: - Large-scale consolidation pipeline: - Input: ~1M+ rows across 20+ files - Output: clean, unified dataset (~75% reduction) - 2nd pipeline: - Replaced a 23-step manual process - Standardized inconsistent formats across datasets - 3rd automation processing: - Automated unpivoting, enrichment, and version tracking Impact: - Reduced processing time from hours → minutes per cycle - Eliminated manual errors (copy-paste, lookup mistakes) - Standardized workflows across users --- 3) Power BI / Monitoring Problem: Recent data (orders/shipments) showed inconsistencies, but: - No visibility into changes over time - Hard to identify where data drift was happening What I built: - Power BI dashboards with: - Hierarchical filters - Drill-down views - Month-over-month comparison Scale: - ~30K+ records analyzed Impact: - Enabled early detection of data inconsistencies - Helped planners validate inputs before forecasting - Improved trust in upstream data --- 4) Side Project (AI System) What I built: - AI-powered job assistant system Features: - Scrapes job postings - Scores relevance using LLMs - Generates tailored resume points and outreach messages - Tracks applications Tech: - FastAPI backend - LLM routing (cloud + local fallback) - SQLite storage Goal: - Build a system-driven workflow (not just model usage) --- My concern Most of my work sits at the intersection of: - forecasting - data systems - workflow automation I’m trying to move into: 👉 Applied Data Scientist / Product-oriented roles --- Questions 1. Does this profile look too niche (forecasting-heavy)? 2. Does “building systems around data” help or hurt for DS roles? 3. What’s the biggest gap you see (if any)? Would really appreciate honest feedback. Thanks.

Data Scientists40

Let's Mainframe! Important Interview questions for a Mainframe developer

Questions asked in Mainframe interviews for a mid level experienced candidate- 1. What is checkpoint restart logic and how will you code it in your program? 2. What is file parallel processing in Cobol programs? 3. What is commit logic in programs? 4. Explain the use of Cursor. 5. Explain these abends- -305, -803,-904,-911. 6. What is real time use of GDG concept? 7. INREC FIELD ,INCLUDE COND Sort examples. 8. What is Array and how is it used, what are subscript and index? 9. What are search and search all and how are they coded? 10.Why is version control tool used? (next is how do you deploy a component from dev to prod?) 11.How do u schedule a job using job scheduling tool? 12.What is your daily task and how do u get requirement? 13.Have u worked on a project from end to end? 14.Challenges you faced while delivering a project. 15.Have you interacted with stakeholders? 16.What is SDLC process? 17.Have u ever done estimation of any requirement? 18.Difference between continue and next sentence. 19.Redefines real time examples. There are more but these are frequently asked questions- Hope it helps!

Software Engineers50