Learn Data Skills
Beta
Nimish Bajaj

Nimish Bajaj

Data Engineer

Apple

Technologies

My Portfolio Highlights

My New Course

Data Manipulation in SQL

Quantitative guru, enlightening the world with data-driven wisdom.

My Work

Take a look at my latest work.

course

Data Manipulation in SQL

course

Writing Functions in Python

course

Introduction to NumPy

DataCamp Course Completion

Take a look at all the courses I’ve completed on DataCamp.

My Work Experience

Where I've interned and worked during my career.

Apple | May 2022 - Present

Software Engineer Intern

N/A

University of Florida | Jan 2022 - Present

Graduate Research Assistant

Conducting research on methods for extracting and visualizing semantic differences in textual inputs.
Show More

LTI - Larsen & Toubro Infotech | Oct 2020 - Jul 2021

Senior Software Engineer

Created the Spark Framework for Machine Learning Automation (AutoML). It's built to be scalable and efficient for a variety of tasks (binary/multi- class classification and regression) on tabular datasets with a variety of characteristics, including numeric, categorical, dates, texts, and so on. Within three months, the framework was developed and integrated into the frontend of L&T's LymByc product, and it was used by five major clients for key driver analysis, data exploration, and insights development. Designed and implemented a Model Management System (MMS) for storing and retrieving PySpark models through S3. Leading the project to develop the Auto-Tune framework, which tunes Spark tasks on clusters automatically. For tuning Spark workloads, the approach employs a Heuristics-based approach (Rule-based approach) and an Optimization-based strategy (Machine Learning). Without requiring any human intervention, AutoTuning saves 30% of cluster resources and significantly improves the Spark Job success rate. It is used extensively in L&T's Lymbyc product. Extensively worked with Spark, AWS EMR, and AWS S3

Quaero | Jan 2020 - Oct 2020

Machine Learning Engineer

Now acquired by CSG Designed and built a complete package for handling end-to-end machine learning tasks, including data preprocessing, advanced feature development, cross validation, and hyperparameter tuning for various models. Also allows the user to generate model training and profiling reports in order to assess model outcomes and uncover insights not apparent from the initial dataset. Developed scalable and modular microservices and optimized APIs utilizing multi threading in Python, reducing response time to less than 1 second Developed mechanisms to launch, monitor, and terminate stateless Spark Clusters thereby saving 30\% in VM cost Built ETL workflows on Spark achieving a 5X improvement from traditional Python workflow performance

Mu Sigma Inc. | Sep 2017 - Oct 2019

Decision Scientist - Mu Sigma Innovation Lab

help multiple global enterprise clients turn raw data into actionable insights. Creating and executing data pipelines and streamlining data operations. I built a big data pipeline for a telecom client to generate key insights from users' web interactions data. For this project, I conducted extensive research into the Lambda architecture for processing both real-time and batch data. I built a real-time processing pipeline using AWS Kinesis and Spark Streaming to process the data at a rate of over 1 million records per second. Reduced query times by pre-computing batch and real-time views of the data, resulting in a significant decrease in query runtime from over 5 seconds to 300 milliseconds on average. Created python notebooks and packages to solve NLP problems including intent classification, entity extraction, and topic modeling to be used across the organization Built MuSigma’s Artificial Intelligence-based assistant which acts as a layer of intelligence over MuSigma's CMS Developed and maintained several backend services for client needs using REST APIs. Improved algorithms and experimented with ML models for intent classification

My Education

Take a look at my formal education

Master of Science - MS, Computer ScienceUniversity of Florida | 2023
Bachelor of Technology (B.Tech.), Computer ScienceMaharaja Surajmal Institute Of Technology | 2017
High School in GeneralKendriya Vidyalaya | 2013

About Me

Nimish Bajaj

I am a data engineer and I love to build scalable data solutions.

Powered by

  • Work
  • Courses
  • Experience
  • Education
  • About Me
  • Create Your Data Portfolio for Free