Subscribe now. Save 33% on DataCamp and make data science your superpower.

Offer ends in  days  hrs  mins  secs
Daniel Chen
Daniel Chen

Data Science Consultant at Lander Analytics

Daniel is a Software Carpentry instructor and a doctoral student in Genetics, Bioinformatics, and Computational Biology at Virginia Tech, where he works in the Social and Decision Analytics Laboratory under the Biocomplexity Institute. He received his MPH at the Mailman School of Public Health in Epidemiology and is interested in integrating hospital data in order to perform predictive health analytics and build clinical support tools for clinicians. An advocate of open science, he aspires to bridge data science with epidemiology and health care.

See More
Collaborator(s)
  • Hugo Bowne-Anderson

    Hugo Bowne-Anderson

  • Eunkyung Park

    Eunkyung Park

Course Description

Python and R have seen immense growth in popularity in the "Machine Learning Age". They both are high-level languages that are easy to learn and write. The language you use will depend on your background and field of study and work. R is a language made by and for statisticians, whereas Python is a more general purpose programming language. Regardless of the background, there will be times when a particular algorithm is implemented in one language and not the other, a feature is better documented, or simply, the tutorial you found online uses Python instead of R. In either case, this would require the R user to work in Python to get his/her work done, or try to understand how something is implemented in Python for it to be translated into R. This course helps you cross the R-Python language barrier.

  1. 1

    The Basics

    Free

    Learn about some of the most important data types (integers, floats, strings, and booleans) and data structures (lists, dictionaries, numpy arrays, and pandas DataFrames) in Python and how they compare to the ones in R.

  2. Control flow, Loops, and Functions

    This chapter covers control flow statements (if-else if-else), for loops and shows you how to write your own functions in Python!

  3. Pandas

    In this chapter you will learn more about one of the most important Python libraries, Pandas. In addition to DataFrames, pandas provides several data manipulation functions and methods.

  4. Plotting

    You will learn about the rich ecosystem of visualization libraries in Python. This chapter covers matplotlib, the core visualization library in Python along with the pandas and seaborn libraries.

  5. Capstone

    As a final capstone, you will apply your Python skills on the NYC Flights 2013 dataset.

info