Interactive Course

Working with Geospatial Data in Python

This course will show you how to integrate spatial data into your Python Data Science workflow.

  • 4 hours
  • 16 Videos
  • 53 Exercises
  • 1,820 Participants
  • 4,500 XP

Loved by learners at thousands of top companies:

forrester-grey.svg
dell-grey.svg
lego-grey.svg
t-mobile-grey.svg
3m-grey.svg
axa-grey.svg

Course Description

A good proportion of the data out there in the real world is inherently spatial. From the population recorded in the national census, to every shop in your neighborhood, the majority of datasets have a location aspect that you can exploit to make the most of what they have to offer. This course will show you how to integrate spatial data into your Python Data Science workflow. You will learn how to interact with, manipulate and augment real-world data using their geographic dimension. You will learn to read tabular spatial data in the most common formats (e.g. GeoJSON, shapefile, geopackage) and visualize them in maps. You will then combine different sources using their location as the bridge that puts them in relation to each other. And, by the end of the course, you will be able to understand what makes geographic data unique, allowing you to transform and repurpose them in different contexts.

  1. 1

    Introduction to geospatial vector data

    Free

    In this chapter, you will be introduced to the concepts of geospatial data, and more specifically of vector data. You will then learn how to represent such data in Python using the GeoPandas library, and the basics to read, explore and visualize such data. And you will exercise all this with some datasets about the city of Paris.

  2. Projecting and transforming geometries

    In this chapter, we will take a deeper look into how the coordinates of the geometries are expressed based on their Coordinate Reference System (CRS). You will learn the importance of those reference systems and how to handle it in practice with GeoPandas. Further, you will also learn how to create new geometries based on the spatial relationships, which will allow you to overlay spatial datasets. And you will further practice this all with Paris datasets!

  3. Spatial relationships

    One of the key aspects of geospatial data is how they relate to each other in space. In this chapter, you will learn the different spatial relationships, and how to use them in Python to query the data or to perform spatial joins. Finally, you will also learn in more detail about choropleth visualizations.

  4. Putting it all together - Artisanal mining sites case study

    In this final chapter, we leave the Paris data behind us, and apply everything we have learnt up to now on a brand new dataset about artisanal mining sites in Eastern Congo. Further, you will still learn some new spatial operations, how to apply custom spatial operations, and you will get a sneak preview into raster data.

  1. 1

    Introduction to geospatial vector data

    Free

    In this chapter, you will be introduced to the concepts of geospatial data, and more specifically of vector data. You will then learn how to represent such data in Python using the GeoPandas library, and the basics to read, explore and visualize such data. And you will exercise all this with some datasets about the city of Paris.

  2. Spatial relationships

    One of the key aspects of geospatial data is how they relate to each other in space. In this chapter, you will learn the different spatial relationships, and how to use them in Python to query the data or to perform spatial joins. Finally, you will also learn in more detail about choropleth visualizations.

  3. Projecting and transforming geometries

    In this chapter, we will take a deeper look into how the coordinates of the geometries are expressed based on their Coordinate Reference System (CRS). You will learn the importance of those reference systems and how to handle it in practice with GeoPandas. Further, you will also learn how to create new geometries based on the spatial relationships, which will allow you to overlay spatial datasets. And you will further practice this all with Paris datasets!

  4. Putting it all together - Artisanal mining sites case study

    In this final chapter, we leave the Paris data behind us, and apply everything we have learnt up to now on a brand new dataset about artisanal mining sites in Eastern Congo. Further, you will still learn some new spatial operations, how to apply custom spatial operations, and you will get a sneak preview into raster data.

What do other learners have to say?

Devon

“I've used other sites, but DataCamp's been the one that I've stuck with.”

Devon Edwards Joseph

Lloyd's Banking Group

Louis

“DataCamp is the top resource I recommend for learning data science.”

Louis Maiden

Harvard Business School

Ronbowers

“DataCamp is by far my favorite website to learn from.”

Ronald Bowers

Decision Science Analytics @ USAA

Joris Van den Bossche
Joris Van den Bossche

Open Source Software Developer; Core Developer of Pandas, GeoPandas and scikit-learn

Joris is an open source python enthusiast and currently working as a freelance developer and teacher. Joris has an academic background in air quality research at Ghent University and VITO (Belgium), and recently, he worked at the Université Paris-Saclay Center for Data Science (at Inria), working both on data science projects as contributing to Pandas and scikit-learn. He regularly gives Python data analysis workshops and is a core contributor to Pandas and the maintainer of GeoPandas.

See More
Dani Arribas-Bel
Dani Arribas-Bel

Senior Lecturer in Geographic Data Science

Dani Arribas-Bel is interested in computers, cities, and data. He is a senior lecturer in Geographic Data Science at the Department of Geography and Planning of the University of Liverpool. He holds honorary positions at the University of Chicago's Center for Spatial Data Science, the Center for Geospatial Sciences of the University of California Riverside, and the Smart Cities Chair of Universitat the Barcelona. Dani's research interests lie at the intersection of urban studies, computational methods and new forms of data. He is member of the development team of PySAL, the Python library for spatial analysis.

See More
Collaborators
  • Mari Nazary

    Mari Nazary

  • Sara Billen

    Sara Billen

Icon Icon Icon professional info