Data Science Tool Building

Wes McKinney talks about data science tool building, what it took to get pandas off the ground and how he approaches building “human interfaces to data” to make individuals more productive.

Nov 2018
About Wes McKinney

Guest
Wes McKinney

Since 2007, Wes has been developing data analysis software, mostly for use in the Python programming language. His primary objectives have been improving user productivity, increasing performance and efficiency, and enhancing data interoperability. He is best known for creating the pandas project and writing the book Python for Data Analysis. Since 2015, he has been focused on the Apache Arrow project. He also contributed to Apache Kudu (incubating) and Apache Parquet (where he is a PMC member). He was the co-founder and CEO of DataPad. He later spent a couple of years leading efforts to bring Python and Hadoop together at Cloudera. In 2018, Wes founded Ursa Labs, a not-for-profit open source development group in partnership with RStudio. That same year, he became a Member of The Apache Software Foundation.

Host
Hugo Bowne-Anderson

Hugo is a data scientist, educator, writer and podcaster at DataCamp. His main interests are promoting data & AI literacy, helping to spread data skills through organizations and society, and doing amateur stand-up comedy in NYC.
