About this role
Skillset
- Strong expertise in GCP (BigQuery, Composer, Dataproc, Dataflow, Pub/Sub)
- Hands-on experience with Databricks for large-scale data processing
- Strong programming skills in SQL and PySpark
- Experience with Oracle and SQL Server databases
- Strong understanding of data modelling (dimensional, medallion, UDM)
- Experience in batch and streaming pipeline development
- Knowledge of data ingestion, CDC, and orchestration frameworks
- Familiarity with data governance, quality, and lineage
- Exposure to CI/CD and DevOps practices
- Strong leadership and stakeholder management skills

Detailed Responsibilities
- Design and build scalable batch and streaming data pipelines
- Develop data ingestion and transformation frameworks
- Implement CDC and incremental data loading strategies
- Work on GCP platforms including BigQuery, Dataflow, Dataproc, and Pub/Sub
- Build and manage workflows using Cloud Composer (Airflow)
- Implement metadata-driven and reusable pipeline frameworks
- Ensure data quality, validation, and monitoring
- Drive migration from Oracle/SQL Server to cloud platforms
- Convert legacy SQL and ETL logic to BigQuery/PySpark
- Collaborate with analytics and reporting teams
- Lead and mentor data engineering teams
- Optimise pipelines for performance and cost
- Troubleshoot and resolve pipeline/data issues
- Support advanced analytics and AI/ML use cases

Location: Chennai / Bengaluru / Hyderabad
Experience: 8+ years