Skip to main content

Working with Geospatial Data in Python

This course will show you how to integrate spatial data into your Python Data Science workflow.

Start Course for Free
4 Hours16 Videos53 Exercises7,802 Learners4500 XP

Create Your Free Account



By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA. You confirm you are at least 16 years old (13 if you are an authorized Classrooms user).

Loved by learners at thousands of companies

Course Description

A good proportion of the data out there in the real world is inherently spatial. From the population recorded in the national census, to every shop in your neighborhood, the majority of datasets have a location aspect that you can exploit to make the most of what they have to offer. This course will show you how to integrate spatial data into your Python Data Science workflow. You will learn how to interact with, manipulate and augment real-world data using their geographic dimension. You will learn to read tabular spatial data in the most common formats (e.g. GeoJSON, shapefile, geopackage) and visualize them in maps. You will then combine different sources using their location as the bridge that puts them in relation to each other. And, by the end of the course, you will be able to understand what makes geographic data unique, allowing you to transform and repurpose them in different contexts.

  1. 1

    Introduction to geospatial vector data


    In this chapter, you will be introduced to the concepts of geospatial data, and more specifically of vector data. You will then learn how to represent such data in Python using the GeoPandas library, and the basics to read, explore and visualize such data. And you will exercise all this with some datasets about the city of Paris.

    Play Chapter Now
    Geospatial data
    50 xp
    Restaurants in Paris
    100 xp
    Adding a background map
    100 xp
    Introduction to GeoPandas
    50 xp
    Explore the Paris districts (I)
    100 xp
    Explore the Paris districts (II)
    100 xp
    The Paris restaurants as a GeoDataFrame
    100 xp
    Exploring and visualizing spatial data
    50 xp
    Visualizing the population density
    100 xp
    Using pandas functionality: groupby
    100 xp
    Plotting multiple layers
    100 xp
  2. 3

    Projecting and transforming geometries

    In this chapter, we will take a deeper look into how the coordinates of the geometries are expressed based on their Coordinate Reference System (CRS). You will learn the importance of those reference systems and how to handle it in practice with GeoPandas. Further, you will also learn how to create new geometries based on the spatial relationships, which will allow you to overlay spatial datasets. And you will further practice this all with Paris datasets!

    Play Chapter Now
  3. 4

    Putting it all together - Artisanal mining sites case study

    In this final chapter, we leave the Paris data behind us, and apply everything we have learnt up to now on a brand new dataset about artisanal mining sites in Eastern Congo. Further, you will still learn some new spatial operations, how to apply custom spatial operations, and you will get a sneak preview into raster data.

    Play Chapter Now




mari-07494695-96a1-4a02-800a-956e6fd8c0caMari Nazarysara-billenSara Billen
Joris Van den Bossche Headshot

Joris Van den Bossche

Open Source Software Developer; Core Developer of Pandas, GeoPandas and scikit-learn

Joris is an open source python enthusiast and currently working as a freelance developer and teacher. Joris has an academic background in air quality research at Ghent University and VITO (Belgium), and recently, he worked at the Université Paris-Saclay Center for Data Science (at Inria), working both on data science projects as contributing to Pandas and scikit-learn. He regularly gives Python data analysis workshops and is a core contributor to Pandas and the maintainer of GeoPandas.
See More
Dani Arribas-Bel Headshot

Dani Arribas-Bel

Senior Lecturer in Geographic Data Science

Dani Arribas-Bel is interested in computers, cities, and data. He is a senior lecturer in Geographic Data Science at the Department of Geography and Planning of the University of Liverpool. He holds honorary positions at the University of Chicago's Center for Spatial Data Science, the Center for Geospatial Sciences of the University of California Riverside, and the Smart Cities Chair of Universitat the Barcelona. Dani's research interests lie at the intersection of urban studies, computational methods and new forms of data. He is member of the development team of PySAL, the Python library for spatial analysis.
See More

What do other learners have to say?

I've used other sites—Coursera, Udacity, things like that—but DataCamp's been the one that I've stuck with.

Devon Edwards Joseph
Lloyds Banking Group

DataCamp is the top resource I recommend for learning data science.

Louis Maiden
Harvard Business School

DataCamp is by far my favorite website to learn from.

Ronald Bowers
Decision Science Analytics, USAA