Thomas Collins


Certified

Senior Data Analyst

RELX Group


My Portfolio Highlights

My New Course

Supervised Learning with scikit-learn

Data Scientist & Analyst with 9+ years of experience in Python, SQL, and PySpark. Skilled in data analytics, machine learning, and model development, delivering insights that drive measurable business outcomes across research, finance, and e-commerce domains.

My Work

Take a look at my latest work.

DataLab

Project: Analyze International Debt Statistics
Project: Analyzing Crime in Los Angeles
Project: Organizing Medical Transcriptions with the OpenAI API
Project: Factors that Fuel Student Performance
Project: Detecting Cybersecurity Threats using Deep Learning
Project: From Data to Dollars - Predicting Insurance Charges
Project: Planning a Trip to Paris with the OpenAI API
Project: Topic Analysis of Clothing Reviews with Embeddings
Project: Cleaning Bank Marketing Campaign Data
Project: Building a Calorie Intake Calculator
Project: Exploring Trends in American Baby Names
Project: Building a Retail Data Pipeline
Project: Exploring London's Travel Network
Project: Data-Driven Product Management: Conducting a Market Analysis
Project: Extracting TV Data Insights
Project: Clustering Antarctic Penguin Species
Project: Analyzing Electric Vehicle Charging Habits
Project: Creating Functions to Register App Users
Project: When Was the Golden Era of Video Games?
Project: Classifying Emails using Llama
Project: Producing Soccer Insights for a Sports Media Agency
Project: Cleaning an Orders Dataset with PySpark
Project: Analyzing Industry Carbon Emissions
Project: Uncovering the World's Oldest Businesses
Project: Predictive Modeling for Agriculture
Project: Predicting Credit Card Approvals
Project: Analyzing Unicorn Companies
Project: Predicting Traffic Volume with PyTorch
Project: Visualizing the History of Nobel Prize Winners
Project: Monitoring A Financial Fraud Detection Model
Project: Analyzing Motorcycle Part Sales

My Certifications

These are the industry credentials that I’ve earned.

Python Data Associate
SQL Associate
Data Analyst Associate
AI Engineer for Data Scientists Associate
AI Fundamentals
Data Literacy
Understanding Data Science
Introduction to SQL
Understanding Artificial Intelligence
Introduction to ChatGPT
Understanding Prompt Engineering
Intermediate SQL
Introduction to Python
Intermediate Python
Understanding Data Engineering
Working with the OpenAI API
Understanding Cloud Computing
AI Ethics
Introduction to AI Agents

Other Certificates

Microsoft | Microsoft Certified: Azure Fundamentals | Certification number: 5F626D-28OE24

DataCamp Course Completion

Take a look at all the courses I’ve completed on DataCamp.

My Work Experience

Where I've interned and worked during my career.

Elsevier | Aug 2018 - Dec 2024

Senior Data Analyst

- Created PySpark code on Databricks to generate publication metrics, migrating legacy MySQL logic into scalable workflows.
- Applied NLP and prompt-based methods to improve classification and retrieval of scientific texts.
- Led data mining for the Gender Report 2020 and EU She Figures, delivering insights cited by the CEO and partners including Harvard and NIH.
- Developed and optimized analytical workflows using Python, PySpark, and SQL on Databricks in a Linux environment, reducing manual effort.
- Managed and optimized big data environments (~100M rows, ~30 nested columns), improving scalability and reliability.
- Built secure AWS S3 processes for efficient storage and retrieval of critical datasets, including restricted buckets for personal data in compliance with GDPR.
- Authored database documentation to streamline onboarding and ensure consistent processes.
- Delivered frequent ad hoc data mining projects for the SVP of the team, providing insights that directly informed high-level decision-making.
- Partnered with Marketing to integrate data insights into campaigns, increasing data-driven decision-making.
- Built foundational workflows for scientific document retrieval and classification.
- Developed analytical infrastructure and data assets used across marketing and research analysis.
- Created and maintained production-level PySpark and SQL workflows for large-scale Scopus data mining tasks.

Amenity Analytics | Jan 2018 - Jun 2018

Data Scientist

- Developed NLP models to analyze sentiment in financial texts, improving accuracy of analytical outputs.
- Processed and cleaned large datasets using Python Pandas, optimizing model performance.
- Created visual insights and reports using Tableau, Matplotlib, Seaborn, and Pandas to support business needs.

Kathy Kuo Home | Jul 2017 - Dec 2017

Data Scientist

- Conducted A/B testing and performance analysis to enhance user experience and conversion rates.
- Built and maintained Tableau dashboards connected to live data sources for sales and marketing KPIs.
- Partnered with Marketing teams to improve campaign performance through analytics.

United Capital Source | May 2016 - Jun 2017

Data Scientist

- Extracted data from our CRM using SQL to build models in Python, Excel, and R.
- Ran a weekly data meeting with the heads of our new sales and renewals department.
- Prepared and delivered weekly reports to management using Tableau.
- Built custom modules and created custom functions in our Zoho CRM.

"Population inversion in hyperfine states of Rb with a single nanosecond chirped pulse in the framework of a four-level system"
"Gender imbalances among top-cited scientists across scientific disciplines over time through the analysis of nearly 5.8 million authors"

Democracy Prep Public Schools | Jan 2015 - May 2016

Physics Teacher

- Taught NYS Regents physics.
- Maintained records of lesson plans and assignments on Box.
- Collaborated with the grade team and science department to shape instructional habits and set goals.

Hofstra University | Aug 2014 - May 2016

Adjunct Associate Professor

Taught an introductory course in electromagnetism and wave mechanics and ran a calculus-based physics laboratory session.

City University of New York | Aug 2013 - May 2016

Adjunct Assistant Professor - Physics

N/A

Tutor the People, LLC | Jul 2013 - Sep 2015

Private Tutor

N/A

Numerix | Sep 2014 - Feb 2015

Junior Quantitative Documentation Specialist

N/A

Varsity Tutors LLC | Aug 2012 - Jun 2013

Private Tutor

Private tutor in the subjects of math and physics.

Stevens Institute of Technology | Jan 2007 - Jan 2012

PhD Student

atoms with a laser field of varying frequency. This yielded a system of coupled differential equations that were solved numerically using Mathematica and Fortran.

VIP Parking Services, Inc | Jan 2007 - Jan 2009

Owner, Manager

· I worked as a private tutor throughout the summer of 2008 teaching cal-

Elite Valet Services, Inc | Jul 2004 - Apr 2007

Valet Attendant and Site Manager

Private Physics and Mathematics Tutor

My Education

Take a look at my formal education.

PhD, Physics | Stevens Institute of Technology | 2012
MS, Physics | Stevens Institute of Technology | 2010
Bachelor’s Degree, Intensive BA in Physics with a minor in Mathematics | New York University | 2007
High School Diploma | Massapequa High School | 2002

About Me


Currently looking for a position as a Data Scientist, Data Analyst, or Data Engineer. I am also open to Quantitative Finance positions: I have experience in FinTech as well as academic training in Portfolio Theory and Derivatives Pricing.
