Learn Data Skills
Gustavo Ferratti

Gustavo Ferratti

Certified

ML Engineer

NTT Data | Brazil

Technologies

My Portfolio Highlights

My New Certification

Data Scientist Professional

My New Track

Data Engineer

My New Project

Performing a Code Review

From Data Chaos to Strategical Intelligence =]

My Work

Take a look at my latest work.

article

Social Structure, Power, and Language: a Critical Analysis of Big Tech CEOs

PythonTheory
github

ML Engineer - Datathon

PythonGitDocker
github

MTG Mana Curve Prediction

PythonGitShell

My Certifications

These are the industry credentials that I’ve earned.

Data Scientist Professional

Data Scientist Professional

DataCamp Course Completion

Take a look at all the courses I’ve completed on DataCamp.

My Work Experience

Where I've interned and worked during my career.

Ipiranga | Feb 2025 - Present

Data Scientist

As a Data Scientist, I have applied my knowledge of Programming, Statistics, and Machine Learning to develop solutions in the context of strategic pricing. I work with predictive models, including classification, regression, boosting, and anomaly detection. My main activities include: 1) Development of an Internal Data Science Library (IPIpy): Collaborating on the development and maintenance of an internal library to streamline data processing within the data science management team (I/O packages in Pandas and Spark, feature engineering, integration with MLFlow, etc.). 2) Predictive Modeling: Developing and reviewing predictive models, ensuring they meet both technical and business requirements. I primarily use libraries such as Scikit-learn, Scipy, and XGBoost. 3) Feature Selection and Engineering: Selecting and developing new features to enhance model performance, applying techniques like encoding and domain-based engineering. 4) Quantile Forecasting: Building models that not only estimate the point value of a variable but also provide confidence intervals, helping to better understand prediction uncertainty and align the model with different strategies. 5) Model Optimization: Continuously tuning and improving models using techniques such as cross-validation and hyperparameter tuning. 6) Model Explainability: Using techniques like SHAP to communicate how models arrive at their predictions, creating visualizations that highlight the most relevant variables. 7) Deployment on Azure ML Experiments: Implementing Machine Learning models in production. 8) Experiment Management: Tracking model versions and preventing drift using MLFlow.
Show More

PlosOne | Apr 2024 - Present

Academic Reviewer

I work as an academic reviewer for the scientific journal PLOS ONE, contributing to ensuring the quality and methodological rigor of research from various fields of knowledge. I do this by participating in the peer review process (double-blind review). I primarily evaluate original research involving the fields of sociology, computer science, and linguistics. PLOS ONE is a journal that emphasizes the accessibility of science, publishing innovative articles freely and without access barriers.

Ipiranga | Aug 2023 - Feb 2025

Analytics Engineer

I work as an Analytics Engineer in the Strategic Pricing team at Ipiranga Combustíveis, ensuring the excellence of Machine Learning models in production and the robustness of the data pipeline. My main activities include: 1) Data Architecture Migration: Assisting in the transition of legacy projects to a new architecture based on the Kedro framework. 2) Notebook Productization: Transforming experimental notebooks into production-grade data pipelines, ensuring scalability and consistency in the production environment. 3) Model Retraining and Versioning: Managing and retraining production models with complete lifecycle tracking using tools like MLFlow. 4) Code Quality and Optimization: Continuously reviewing code with a focus on clean code principles, Zen of Python, and performance optimizations (modularization, downcasting, type hinting, teardown, etc.). 5) ETL with PySpark: Developing and optimizing ETL pipelines using PySpark to process large volumes of data in a distributed environment, such as Databricks. 6) Data Testing and Quality Assurance: Creating unit and integration tests with pytest to validate models and ensure the integrity and reliability of data pipelines. 7) Documentation and Knowledge Management: Leading the creation of a robust documentation culture in Confluence, fostering the sharing of practices and insights within the Data Science team. 8) Cross-Team Integration: Acting as a bridge between Strategic Pricing, Data Science, Data Engineering, and Machine Learning Operations teams, ensuring alignment and seamless end-to-end processes. I strive to combine data science, data engineering, and best practices in software development to deliver efficient, sustainable, and high-impact business results.

UC Santa Barbara | Mar 2022 - Present

Visiting Researcher

I was a visiting scholar at the Technology Management Program, an interdisciplinary graduate program linked to the College of Engineering at the UCSB. I was advised by Prof. Jéssica Santana, an outstanding scholar known by her work on Entrepreneurial Failure. During that time, I developed a project about Diversity, Equity, and Inclusion (DEI) to understand and minimize the cases of academic failure, especially those involving underrepresented minorities (URM). The project had a quantitative stage involving data analytics with Power Bi and Python, and a qualitative stage involving interviews and participant observation. I also helped Prof. Santana building up her entrepreneurial failure website, attended her Entrepreneurship classes as a guest student, and learnt from her the best practices of multidisciplinary studies.

PECEGE ESALQ/USP - São Paulo | Jan 2022 - Jan 2024

Research Advisor

Academic advising on final course projects in Data Science and Digital Business MBA programs.

Simplicode | Jan 2020 - Jan 2021

Programming Professor

I was a Python private professor for teenagers all over Brazil who wanted to learn in online classes how to maker their own scripts, games, and apps.

Lab Dados || Data Lab | Jan 2020 - Present

Co-Founder

I am one of the co-founders of Data Lab. Data Lab is a group affiliated with the Organizational Studies Core (NEO) at UFScar, whose mission is to disseminate methodological and technological knowledge in the field of Data Science to other areas such as Engineering, Social Sciences, Economics, and Administration. My roles included strategic planning, content creation and curation, event promotion, and building a strong network of researchers.

UFSCar - Universidade Federal de São Carlos | Jan 2019 - Dec 2023

CAPES Ph.D. Scholarship Holder

In my Ph.D., I combine quantitative approaches from the Data Science field (Text Mining, NLP) with more qualitative ones from the Organizational Studies area (narratologies, storytelling). Throughout the three articles that together structure my doctoral thesis, I use mixed methods to investigate: 1. Jungian archetypes on fantastic literature; 2. The narratives of Big Tech Companies CEOs on Twitter; 3. The discourse of Brazilian press about technological entrepreneurship. Advisor: Mário Sacomano Neto. Co-Advisor: Silvio Eduardo Alvarez Candido Undergraduate researchers I advise: Ramon Roque and Júlia Ortolani.

Trevisan Escola de Negócios | Jan 2018 - Jan 2019

MBA Professor

I was hired to teach in an in-company course at Serasa Experian for an MBA class in Risk Management and Controllership. I taught the subject of IT Governance with a workload of 16 hours.

UFSCar - Universidade Federal de São Carlos | Jan 2017 - Jan 2019

CAPES Master's Degree Scholarship Holder

My master's degree sought to explore the controversies existing in a prominent startup in the IT segment through the Actor-Network Theory (ANT). Combining ethnography with discourse analysis, I was able to come up with critical questions about a growing startup model in Brazil. This model involves controversies as: reconciling a playful and relaxed image with precarious work practices, declaring to be worried about DEI policies while privileging socially dominant groups, affirming that profit is less important than humanitarian values and always prioritize the financial aspect in decision-making processes. Advisor: Mário Sacomano Neto

Raízen | May 2015 - Jul 2016

Operations Supervisor

I was Supervisor of Logistics Distribution and Trading Operations for the Raízen group (a joint venture between Shell and Cosan). Among my main activities were: • Product quality control; • Management of internal vehicle supply logistics; • Control of periodic maintenance of equipment and assets; • Supervision of the work of the Loading Operators; • Consulting and technical assistance to Station Leaders in the Araraquara Regional. • Management of local infrastructure investments (OPEX); • Administrative control of the terminal (costs, overtime) • Activities aimed at adhering to Health, Safety and Environment plans, with observations to prevent incidents, safety dialogues and committees of good practices; • Purchases and direct negotiations with suppliers obeying the competition guidelines established in the Authorities Manual;

Raízen | Jan 2014 - Apr 2015

Commercial Analyst

I worked at the Paulínia Fuel distribution terminal as a commercial analyst, more specifically, in the business-to-business market. I performed back office services, supporting two Sales teams, namely: New Business Brazil and B2B São Paulo. In my customer service experience, I had contact with: mining companies, power plants, thermal plants, carriers, industries, retailers and others. Among the activities that I developed in this period are: elaboration of commercial proposals, monitoring of customers, cost control, follow up of sales, economic studies, prospecting for new customers and market analysis.

Universidade Estadual Paulista Júlio de Mesquita Filho | Nov 2011 - Nov 2013

Electronic Engineer

The Baja SAE project is a competition between Higher Education Institutions of Engineering that challenges students through the development of an offroad vehicle. The objective of each team is to build a prototype of a robust recreational vehicle, off-road and single-seater, aiming at its commercialization to the enthusiastic amateur public. The vehicle must be safe, easily transportable, and simple to operate. The vehicle must also be able to overcome rough terrains without significant damage. I participated in the project on the Pac Baja team. Our team had the direct contribution of 25 students of Mechanical Engineering, 3 of Electrical Engineering (including myself), and financial support from more than 12 sponsors. Our advisor was Prof. Luiz Daré Neto.

Universidade Estadual Paulista Júlio de Mesquita Filho | Jan 2013 - Jan 2013

Research Assistant

I was a member of Prof. Paulo Aguiar research team, dedicating 8 hours a week to studies and experiments at the Signal Processing Laboratory. During this period, I was able to expand my knowledge in machining techniques, software programming, artificial intelligence, and process modeling. My studies collaborated to the developing an intelligent system capable of classifying burn levels in flat grinding machining processes.

My Education

Take a look at my formal education

Specialist in ML EngineeringFIAP | 2025
Bachelor in Business AdministrationCentro Universitário Senac | 2022
Ph.D. in Industrial EngineeringUFSCar - Universidade Federal de São Carlos | 2022
Master's degree in Industrial EngineeringUFSCar - Universidade Federal de São Carlos | 2018
MBA in Business ManagementUniversidade de São Paulo | 2017
Bachelor's degree in Electrical and Electronic EngineeringUNESP - Universidade Estadual Paulista | 2014

About Me

Gustavo Ferratti

Hi, I'm Gustavo! I'm a data scientist and analytics engineer with strong experience in data analysis, both professionally and academically. Nice to meet you =]

Powered by

  • Work
  • Certifications
  • Courses
  • Experience
  • Education
  • About Me
  • Create Your Data Portfolio for Free