Thiago Baptista

Data Engineer

AlmavivA Solutions | Vitória / Brazil

My Portfolio Highlights

My New Track

Associate Data Engineer

My New Track

Data Analyst

My New Track

GitHub Foundations

Senior Data Engineer with 25+ years of experience designing, modeling, and building Analytics, Big Data, and Data Lakehouse solutions across various industries.

My Work

Take a look at my latest work.

course

Introduction to Docker

course

Databricks Concepts

SQL, Git
course

Foundations of PySpark

DataCamp Course Completion

Take a look at all the courses I’ve completed on DataCamp.

My Work Experience

Where I've interned and worked during my career.

Almaviva Solutions | Dec 2024 - Present

Senior Data Engineer @ Prodesp

Working in GDAP (Digital Office of Public Administration), directly serving the Governor’s Office of the State of São Paulo. Responsible for integrating, consolidating, and making state data available on a unified platform to support government decision-making, optimize public policies, and back strategic high-impact initiatives.

  • Databricks and Spark: Built and optimized scalable data pipelines to integrate multiple sources using Data Lakehouse architectures to unify and democratize data access across the state government.
  • Azure Storage Account: Managed data in a cloud environment, ensuring high availability, security, and performance for critical operations.
  • CI/CD with Azure DevOps and Git: Developed fully automated data pipelines for ingestion, transformation, and deployment, fostering robust governance and efficient version control.
  • SQL & Python/PySpark: Created high-performance solutions to transform and model large-scale data, enabling the construction of Data Warehouses and the integration of structured and semi-structured data.
  • Power BI: Designed semantic models and interactive dashboards focused on strategic use by government managers, enabling integrated views and near real-time analysis.
  • Kanban with Flow Metrics: Employed agile practices for planning and tracking deliverables, ensuring predictability, efficiency, and tangible impact on operations.

Almaviva Solutions | Mar 2024 - Dec 2024

Senior Database Administrator @ Prodesp

Ensured the efficiency, integrity, and availability of critical databases supporting government programs focused on employability and economic development, directly assisting the State of São Paulo’s Department of Economic Development.

  • Advanced SQL Server: Developed and optimized views, complex queries, CTEs, stored procedures, scalable functions, triggers, and T-SQL scripts for systems like Qualifica-SP, Meu Emprego, and Via Rápida.
  • Systems Sustainability: Worked across production, staging (homologation), and development environments, performing proactive monitoring, implementing corrections through GMUD (change management requests), and handling SLA-based tickets using IBM Maximo (SmartCloud Control Desk 2).
  • Process Automation: Created Python scripts for data extraction, analysis, and integration, enhancing efficiency and eliminating repetitive manual tasks.
  • Collaboration with Agile Teams: Participated in daily stand-ups and continuous alignment with development teams, promoting integrated and high-performance database solutions.
  • Data Integration and Governance: Ensured integrity, security, and compliance in critical initiatives, focusing on relational modeling, query performance, monitoring, and solving complex issues.

Elever Vision | Apr 2014 - Mar 2024

Senior Data Engineer

For nearly 10 years, led Data Engineering initiatives for various clients, ranging from retail (supermarkets) and manufacturing (aluminum) to media and entertainment, focusing on the creation of data warehouses, ETL orchestration, and analytics in both on-premises and Azure cloud platforms.

  • Data Warehousing & Analytics: Built architectures for Data Lakes and Data Warehouses using Azure Storage, Azure SQL Database, and on-premises (SQL Server, PostgreSQL) environments, enabling advanced, unified analyses for decision-making support.
  • ETL Pipeline Orchestration: Developed data ingestion and transformation processes with Azure Data Factory, Apache Airflow, and SSIS, connecting multiple sources (including sales systems, ERPs, and digital marketing data via Google Tag Manager/Analytics).
  • Automation & Python Scripts: Leveraged Python (pandas) for cleaning, enriching, and analyzing large datasets; created T-SQL and PL/pgSQL routines to optimize queries and stored procedures in SQL Server and PostgreSQL databases.
  • CI/CD & Data Governance: Set up integration and continuous delivery pipelines using Azure DevOps and Git/GitHub, ensuring version control, automated testing, and governance throughout the data lifecycle.
  • Visualization & Agile Methodologies: Developed Power BI dashboards to support business areas; adopted Kanban and JIRA for planning, execution, and progress tracking, focusing on efficiency and collaboration.

Vale | Mar 2010 - Dec 2013

Data Engineer

Served as a Data Engineer and technical reference for a large-scale BI and Data Warehousing project involving railways and ports. Defined architectural standards, ingestion strategies, and data modeling for a multidisciplinary team of 15 professionals. The project consolidated data from multiple sources (SAP, Oracle ERP, Cognos, logistics operating systems, among others) to support FP&A (Financial Planning & Analysis) processes, focusing on demand forecasting, results calculation, and cost allocation.

  • Data Warehouse Construction in SQL Server: Developed multidimensional modeling (fact and dimension tables), complex queries, stored procedures, and T-SQL functions, ensuring performance and reliability for critical analyses.
  • On-Premises ETL Orchestration: Created extraction and transformation processes with SSIS, integrating data from internal and external railway and port systems, emphasizing automation and governance.
  • BI Solution Architecture: Established standards for data ingestion, consolidation, and availability, enabling advanced financial analyses (including optimistic/pessimistic scenarios) for logistics capacity and budget forecasting.
  • Automation & Advanced VBA/Excel: Created VBA tools for generating interactive dashboards, calculating results, and allocating costs, integrated with the Data Warehouse, optimizing workflow in commercial and planning areas.
  • Team Management & Project Methodologies: Served as a technical leader for a multidisciplinary team, coordinating demands from various stakeholders (finance, commercial, operations) and monitoring timelines while adhering to development and data quality best practices.

Fundação Ceciliano Abel de Almeida | Feb 2000 - Mar 2010

Data Engineer

A nonprofit organization affiliated with the Federal University of Espírito Santo (UFES), active in project management, university extension, public exams, and graduate studies. In this role, I focused on creating a Business Intelligence layer over the Sapiens ERP, leveraging SQL Server to deliver advanced analytics and managerial reports in an on-premises environment.

  • BI Architecture & SQL Server: Built and maintained data structures (tables, views, procedures, and T-SQL functions) for integrated analytics, ensuring performance and data security.
  • On-Premises Database Administration: Handled capacity planning, backup routines, performance monitoring, disaster recovery, and optimizations for high availability.
  • Process Automation with Visual Basic: Created routines for data extraction, transformation, and loading (ETL) from the Sapiens ERP, minimizing manual interventions and reducing failure risks.
  • Data Modeling & Analysis: Organized and normalized financial and operational data, enabling reliable reporting and strategic insights for various areas of the Foundation.
  • Interactive Reports with Crystal Reports: Developed dashboards and control panels to measure KPIs, providing visibility into financial, managerial, and academic performance.

Multi - Cia da Informação | Oct 1996 - Dec 1999

Developer

Worked on the development of commercial applications and financial modules for multiple clients, using Visual Basic and relational databases (Access and SQL). Although the main focus was on transactional systems (accounts payable/receivable, billing, etc.), I gained valuable experience in practices now associated with Data Engineering.

  • Requirement Gathering: Conducted client visits to identify functional needs and propose improvements to ERP modules, covering everything from finance processes to commercial control.
  • Database Modeling & Architecture: Defined database schemas, performed data normalization, and wrote SQL queries, ensuring integrity and efficiency in data storage and retrieval.
  • Commercial Application Development: Implemented billing systems via bank slips for Internet Service Providers, in addition to accounts payable/receivable functionality and management reports.
  • Process Automation: Built Visual Basic routines to automate repetitive tasks, streamlining workflows and reducing manual errors.
  • Support & Maintenance: Fixed bugs, enhanced applications based on new business requirements, and facilitated go-live processes with end users.

My Education

Take a look at my formal education

Bachelor's degree, Computer Science (Ciência da Computação), Universidade Federal do Espírito Santo | 2010
