ALEIX MATABACAS

Certified

Data Scientist

Clarivate | Spain

My Portfolio Highlights

My New Track

Data Manipulation

My New Course

Introduction to Python

Results-driven data scientist with expertise in Health Sciences and a solid background in data engineering.

My Work

Take a look at my latest work.

course

Understanding Data Science

certification

AI Fundamentals

My Certifications

These are the industry credentials that I’ve earned.

AI Fundamentals

DataCamp Course Completion

Take a look at all the courses I’ve completed on DataCamp.

My Work Experience

Where I've interned and worked during my career.

Clarivate | Aug 2021 - Present

Senior Data Scientist

Responsible for the end-to-end development of multiple Natural Language Processing (NLP) projects that use Python (spaCy) to extract entities and relationships from data published by sources such as the FDA, the EMA, and JADER. This includes defining data models and automating data gathering with Scrapy.

Maintaining a real-world pharmacovigilance data pipeline in a Linux environment, built on an Oracle database with PL/SQL, Bash, Python, and R scripts. Developing PL/SQL and Python scripts to process high volumes of data that require parallelization or other specialized strategies.

Workflow Automation: Analyzing manual workflows and developing automations and data models that significantly reduce the time spent on them, using KNIME or Microsoft Power Platform (Power Apps, Power Automate, and SharePoint). Maintaining an ensemble model that combines three distinct supervised algorithms in KNIME.

Cross-functional Collaboration: Working with medical and technology teams on various data sources and requirements, such as medical conferences and regulatory documents.

Innovation and Training: Training colleagues in the basics of data science and promoting the use of new technologies such as generative AI by organizing and speaking at training sessions.

Bioinfogate | Aug 2020 - Aug 2021

Data Scientist

Enhancement of the pharmacovigilance data processing pipeline by transitioning from PostgreSQL to Oracle, incorporating PL/SQL scripts, and developing a data model and data ingestion scripts to enable the pipeline to acquire data from various sources. Development of web crawlers in Python using Scrapy and Selenium, along with the data models that will receive the extracted data.

Bioinfogate | Sep 2019 - Aug 2020

Research Intern

Review documentation and scientific articles to understand the state of the art, specifically the statistics commonly used in the field. Adapt publicly available scripts and techniques (Bash, PostgreSQL, and R) to process real data volumes instead of a test subset. Create multiple visualizations using Trelliscope to analyze the results. Unify the various processes into a Python pipeline.

My Education

Take a look at my formal education.

Master's degree in Bioinformatics for Health Sciences | Universitat Pompeu Fabra & Universitat de Barcelona | 2020
Engineer's degree in Biological Systems Engineering | Universitat Politècnica de Catalunya | 2015

About Me

Data scientist with a Health Sciences focus and data engineering skills. Experienced in NLP, workflow automation, and pharmacovigilance. Builds scalable pipelines, leads training in data science and prompt engineering, and fosters team learning.
