Profile

Sumit Agrawal

Lead Data Engineer at Gojek

Sumit Agrawal is a seasoned Senior Software Engineer at Gojek with over 10 years of relevant experience in the IT industry. He has a diverse background in storage, processing, and analytics of various datasets, making him an expert in designing and developing scalable ETL workflows. Sumi

Show more

11.6

Years of Experience

Education

savitribai phule pune university, government polytechnic jalgaon, s. g. s. highschool pachora

Companies

gojek, gojek, continuity 1 business science consulting, tcs

Reach out to Sumit Agrawal via Email, InMail and SMS drip

by installing Chrome extension

Sumit's contact details

Email

Email (Verified)

sumXXXXXXXXXXXXXXXXXXXXXom

Email

Mobile Number

+91XXXXXXXX12

Experience

  • img

    2021 - Present

    gojek

    Lead Data Engineer

    Lead stakeholder engagement for data warehousing solutions, optimizing decision-making, accessibility, and compliance. Spearhead initiatives for data quality improvement and microservice-based platform development. Implement real-time fraud detection and voucher allocation solutions using Kafka and Flink. Optimize queries for Spark, Hive, Redshift, and other frameworks.

  • img

    2020 - 2021

    gojek

    Senior Data Engineer

    Data modeling and development for robust data warehouse solutions. Lead data migration projects from AWS Spark to GCP BigQuery, ensuring seamless integration and efficient data processing.

  • img

    2017 - 2020

    continuity 1 business science consulting

    Data Enginner

    At Continuity1, managed end-to-end delivery of data solutions, catering to diverse stakeholder needs with limited resources. Led development of efficient data pipeline with Spring Batch, AWS EMR, Redshift, and Metabase. Implemented optimizations for improved timeliness and data quality.

  • img

    2012 - 2017

    tcs

    Big Data Engineer

    In Nielsen Digital Ad Ratings, supported data processing tasks, resolving issues and transitioning Java map reduce jobs to Spark. Utilized Cloudera Hadoop, Hive, Impala, Spark, Oracle DB, Sqoop, and Oozie for seamless data processing. In Invesco, facilitated migration of data pipelines from Sybase to Oracle, emphasizing thorough testing and understanding of both systems' features in the banking sector.

Experience

66 Skills

Airflow

Amazon Elastic MapReduce (EMR)

Amazon Redshift

Amazon S3

Amazon Web Services (AWS)

apache

Apache Airflow

Apache Flink

Apache Hive

Apache Kafka

Apache Oozie

Apache Spark

Apache Spark

Apache Sqoop

Big Data

CI/CD

Cloudera

Cloudera Impala

Core Java

Core Java

Dagger

Dagger (Software)

Data Engineering

Data Migration

Data Modeling

Data Processing

Data Warehouse

Data Warehousing

Design

Docker

Education

ElasticSearch

Extract, Transform, Load (ETL)

GitHub

Gitlab

Golang

Google Cloud Platform (GCP)

Hadoop

HDFS

Hive

Hive

Integration

Java

Jenkins

Kafka

Kubernetes

Logstash

Machine Learning (ML)

MapReduce

Microservices

oozie

optimization

Problem solving

pyspark

Python

quality assurance (QA)

Shell Scripting

Spring

Spring Batch

spring batch

Sqoop

Stored Procedures

sybase

test

testing

Web

Education

  • img

    2009 - 2012

    savitribai phule pune university

    Bachlor's of Engineering.

    Information Technology

  • img

    2006 - 2009

    government polytechnic jalgaon

    Diploma in Information Technology

  • img

    2001 - 2006

    s. g. s. highschool pachora

    Secondary School Certificate