Learn Data Skills
Beta
Jorge Alejandro Cruz Rivera

Jorge Alejandro Cruz Rivera

Certified

Data Engineer

NTT Data

Technologies

My Portfolio Highlights

My New Track

Data Engineer

My New Track

Python Programming

My New Track

Importing & Cleaning Data

Data virtuoso, playing the strings of information to create harmonious insights.

My Work

Take a look at my latest work.

DataLab

Exploring the Bitcoin Cryptocurrency Market

track

Data Engineer

track

Data Manipulation

track

Python Fundamentals

project

Analyzing Industry Carbon Emissions

project

Cleaning Bank Marketing Campaign Data

project

Exploring London's Travel Network

project

Building a Retail Data Pipeline

project

Performing a Code Review

DataLab

Project

track

Python Programming

DataLab

Project: Exploring London's Travel Network

DataLab

Project: Designing a Bank Marketing Database

DataLab

Project: Investigating Netflix Movies

track

Importing & Cleaning Data

DataLab

Project: Consolidating Employee Data

DataLab

The GitHub History of the Scala Language

DataLab

Project: Performing a Code Review

DataLab

Project: Exploring NYC Public School Test Result Scores

DataLab

Analyzing Industry Carbon Emissions

My Certifications

These are the industry credentials that I’ve earned.

Data Engineer Associate

Data Engineer Associate

DataCamp Course Completion

Take a look at all the courses I’ve completed on DataCamp.

My Work Experience

Where I've interned and worked during my career.

NTT Data | Nov 2023 - Present

Data Engineer

Big Data Development team lead. Development of Python SDK’s for use of development platform team. Development of data pipelines to extract, transform and load data from multiple origins into multiple sinks in async, distributed and parallel way (ETL’s, ELT’s, EL’s). Review for queries optimization. Reviews of code optimization (ML Models, data pipelines, general purpose applications). Gathering of requirements with the business for all development projects. Code testing (unit, functional, integration, stress and end-to-end). Estimate, planning of projects and scopes. Sprint planning. SDLC. Attention of development issues. Attention to extraordinary development requests. Processes Optimization. Maintain the health and stability of the cluster, FS and YARN queues. Research and POC’s of new tools and technologies. Advice on good coding practices and infrastructure utilization. Cluster resources configuration and optimization. Code versioning in git. Tracking of requirements and recording in Jira's backlog. Schedule and launch jobs with autosys. Development of scripts in Python for Web Scrapping of CDP API’s. Getting data and uploading to and from cloud repositories (such as S3). Development of processes for reporting and monitoring. Raise and follow up on requests for permissions to FIDs. Raise and follow up on requests for permissions to DBs, schemas, tables, directories, etc. Post-implementation testing of tools. Deploying code between environments. Transmit good coding practices to developers in the team. Creation of documentation of processes and developments. Troubleshooting processes and tools. Logical and physical design of tables (normalized and denormalized) and handling different types of schemas (including the star schema and snowflake schema). Code migration and adaptation due to version updates. DB’s, schemas and tables migration.
Show More

FERROMEX | Mar 2019 - Nov 2023

Data Engineer | Software Engineer | IT Project Leader

•Development and management of projects assigned to the area of operating systems, with a focus on the Back-End with the technologies of: C# and Java. •Management of different database managers such as Oracle Developer, MySQL and SQL Server, PostgreSQL and their administration. •Development of stored procedures, CTE's and views. •Design of tables (normalized) and handling different types of schemas (including the star schema). •Estimate, planning of projects and scopes. •Tracking of requirements and recording in Jira's backlog. •Planning models and architecture. •Testing (unit, comprehensive and functional). •Code publications on different types of servers and scripts for databases. •Development of ETL's to create dashboards in Power BI (Use of Power Query and DAX). •Development of REST API's. •Code publication (programmed tasks, web applications and web API's). Code versioning in GIT. •Configuration of IIS and publication of applications in IIS. •Development of multipurpose console applications. •Optimization of queries and stored procedures. •Development of MVC applications using ASP.NET. •Migration of Oracle DB, MySQL and SQL Server. •Development of Python scripts for big data using dask, Pandas, NumPy and PySpark and process automation. •MongoDB •Job programming with Airflow, Cron, Task Scheduler. •Development of scripts in Python for Web Scrapping. •Clustering of servers to execute tasks in a parallel asynchronous way. •Obtaining raw data of different file types such as: Pickle, .csv, .xlsx, .xls, .txt, json, XML, Parquet, AVRO, ORC, etc., with gigabytes of data, their transformation according to business rules and their loading into databases (SQL/NoSQL) in a matter of seconds. •Development of Middlewares in C# for connection of data sources. •SSIS, SSAS, SSRS. •Getting data and uploading to and from cloud repositories (such as S3, Azure, Onedrive). •Development of dashboards in Power BI. •Visio, Project •WebHooks

CompuSoluciones | Nov 2018 - Mar 2019

Back - End Team Lead

• Responsible for the administration and development of the Talentry platform, with a focus on the Back-End with the technologies of: C#, and management of the SQL Server data manager. • Project planning, estimates, code publications on the server and databases.

CompuSoluciones | Jul 2018 - Nov 2018

Software Development Engineer

• Development of the Talentry platform, with a focus on the Back-End with the technologies of: C#, and management of the SQL Server data manager.

CompuSoluciones | Nov 2017 - Jul 2018

Software Development Engineer

• Development of the Click Suscribe platform (Compusoluciones electronic commerce) with a focus on both the Front-End part, as well as the Back-End with the technologies: Node JS, Angular, C#, and management of MySQL data managers and SQL Server.

Citibanamex | Dec 2015 - Nov 2017

System Analyst

• Development of the Back-End of applications with the technologies: C#, SQL Server, Access, VBA.

My Education

Take a look at my formal education

Bachelor's Degree in Computer ScienceUniversidad de Guadalajara | 2016

About Me

Jorge Alejandro Cruz Rivera

Jorge Alejandro hasn't filled in a bio text

Powered by

  • Work
  • Certifications
  • Courses
  • Experience
  • Education
  • About Me
  • Create Your Data Portfolio for Free