![Profile](https://weekday-user-pictures.s3.ap-south-1.amazonaws.com/profile-images/sumit-agrawal01.jpg)
Sumit Agrawal
Lead Data Engineer at Gojek
Sumit Agrawal is a seasoned Senior Software Engineer at Gojek with over 10 years of relevant experience in the IT industry. He has a diverse background in storage, processing, and analytics of various datasets, making him an expert in designing and developing scalable ETL workflows. Sumi
11.6
Years of Experience
Education
Savitribai Phule Pune University, Government Polytechnic Jalgaon, S. G. S. Highschool Pachora
Companies
Gojek, Continuity 1 Business Science Consulting, TCS
Experience
2021 - Present
Gojek
Lead Data Engineer
Lead stakeholder engagement for data warehousing solutions, optimizing decision-making, accessibility, and compliance. Spearhead initiatives for data quality improvement and microservice-based platform development. Implement real-time fraud detection and voucher allocation solutions using Kafka and Flink. Optimize queries for Spark, Hive, Redshift, and other frameworks.
2020 - 2021
Gojek
Senior Data Engineer
Data modeling and development for robust data warehouse solutions. Lead data migration projects from AWS Spark to GCP BigQuery, ensuring seamless integration and efficient data processing.
2017 - 2020
Continuity 1 Business Science Consulting
Data Engineer
At Continuity1, managed end-to-end delivery of data solutions, catering to diverse stakeholder needs with limited resources. Led development of an efficient data pipeline with Spring Batch, AWS EMR, Redshift, and Metabase. Implemented optimizations for improved timeliness and data quality.
2012 - 2017
TCS
Big Data Engineer
On Nielsen Digital Ad Ratings, supported data processing tasks, resolving issues and transitioning Java MapReduce jobs to Spark. Utilized Cloudera Hadoop, Hive, Impala, Spark, Oracle DB, Sqoop, and Oozie for seamless data processing. At Invesco, facilitated migration of data pipelines from Sybase to Oracle, emphasizing thorough testing and understanding of both systems' features in the banking sector.
66 Skills
Airflow
Amazon Elastic MapReduce (EMR)
Amazon Redshift
Amazon S3
Amazon Web Services (AWS)
Apache
Apache Airflow
Apache Flink
Apache Hive
Apache Kafka
Apache Oozie
Apache Spark
Apache Spark
Apache Sqoop
Big Data
CI/CD
Cloudera
Cloudera Impala
Core Java
Core Java
Dagger
Dagger (Software)
Data Engineering
Data Migration
Data Modeling
Data Processing
Data Warehouse
Data Warehousing
Design
Docker
Education
Elasticsearch
Extract, Transform, Load (ETL)
GitHub
GitLab
Golang
Google Cloud Platform (GCP)
Hadoop
HDFS
Hive
Hive
Integration
Java
Jenkins
Kafka
Kubernetes
Logstash
Machine Learning (ML)
MapReduce
Microservices
Oozie
Optimization
Problem Solving
PySpark
Python
Quality Assurance (QA)
Shell Scripting
Spring
Spring Batch
Spring Batch
Sqoop
Stored Procedures
Sybase
Test
Testing
Web
Education
2009 - 2012
Savitribai Phule Pune University
Bachelor of Engineering
Information Technology
2006 - 2009
Government Polytechnic Jalgaon
Diploma in Information Technology
2001 - 2006
S. G. S. Highschool Pachora
Secondary School Certificate