DataCamp ranked 36th on Deloitte's 2019 Technology Fast 500™! Learn More
Premium Project

The Hottest Topics in Machine Learning

Use Natural Language Processing on NIPS papers to uncover the trendiest topics in machine learning research.

Start Project
  • 8 tasks
  • 3,732 participants
  • 1,500 XP

Project Description

Neural Information Processing Systems (NIPS) is one of the top machine learning conferences in the world where groundbreaking work is published. In this Project, you will analyze a large collection of NIPS research papers from the past decade to discover the latest trends in machine learning. The techniques used here to handle large amounts of data can be applied to other text datasets as well.

Familiarity with Python and pandas is required to complete this Project, as well as experience with Natural Language Processing in Python (sklearn specifically). Check out DataCamp's pandas Foundations, Manipulating DataFrames with pandas, and Natural Language Processing Fundamentals in Python courses to get familiar or brush up on your skills.

Project Tasks

  • 1Loading the NIPS papers
  • 2Preparing the data for analysis
  • 3Plotting how machine learning has evolved over time
  • 4Preprocessing the text data
  • 5 A word cloud to visualize the preprocessed text data
  • 6 Prepare the text for LDA analysis
  • 7Analysing trends with LDA
  • 8The future of machine learning
Instructor Avatar
Lars Hulstaert

Data Scientist at Microsoft

Lars is a Data Scientist at Microsoft where he helps enterprise customers with their machine learning projects. He obtained an MPhil. in Machine Learning from the University of Cambridge, a BSc. and MEng. in Computer Science from Ghent University. Lars is passionate about enabling others to achieve more by being data-driven. As he believes that education is key, he writes blogposts on Medium, makes content for Datacamp and mentors students on Springboard after-hours. You can follow him on Twitter at @larshulstaert.

See More

Technology

  • Python LogoPython
  • Topics

    Data ManipulationData VisualizationMachine LearningProbability & StatisticsImporting & Cleaning Data