Course
Feature Engineering with PySpark
- AdvancedSkill Level
- 4.8+
- 439
Learn the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering.
Data Manipulation
Follow short videos led by expert instructors and then practice what you’ve learned with interactive exercises in your browser.
or
By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.Course
Learn the gritty details that data scientists are spending 70-80% of their time on; data wrangling and feature engineering.
Data Manipulation
Course
Learn how to manipulate data and create machine learning feature sets in Spark using SQL in Python.
Data Manipulation
Course
Learn about ARIMA models in Python and become an expert in time series analysis.
Machine Learning
Course
Learn how to calculate meaningful measures of risk and performance, and how to compile an optimal portfolio for the desired risk and return trade-off.
Applied Finance
Course
Create more accurate and reliable RAG systems with Graph RAG and hybrid RAG.
Artificial Intelligence
Course
Learn how to make GenAI models truly reflect human values while gaining hands-on experience with advanced LLMs.
Artificial Intelligence
Course
Learn how to work with streaming data using serverless technologies on AWS.
Cloud
Course
Learn how to load, transform, and transcribe speech from raw audio files in Python.
Data Manipulation
Course
Learn how to use RNNs to classify text sentiment, generate sentences, and translate text between languages.
Artificial Intelligence
Course
Extend your regression toolbox with the logistic and Poisson models and learn to train, understand, and validate them, as well as to make predictions.
Probability & Statistics
Course
Explore latent variables, such as personality, using exploratory and confirmatory factor analyses.
Probability & Statistics
Course
This course covers everything you need to know to build a basic machine learning monitoring system in Python
Machine Learning
Course
Master Databricks with Python: learn to authenticate, manage clusters, automate jobs, and query AI models programmatically.
Artificial Intelligence
Course
Learn how to approach and win competitions on Kaggle.
Machine Learning
Course
Learn key techniques to optimize Java performance, from algorithm efficiency to JVM tuning and multithreading.
Software Development
Course
Manage the complexity in your code using object-oriented programming with the S3 and R6 systems.
Software Development
Course
In this course youll learn techniques for performing statistical inference on numerical data.
Probability & Statistics
Course
Prepare for your next statistics interview by reviewing concepts like conditional probabilities, A/B testing, the bias-variance tradeoff, and more.
Probability & Statistics
Course
Learn to build pipelines that stand the test of time.
Machine Learning
Course
In this course youll learn how to leverage statistical techniques for working with categorical data.
Probability & Statistics
Course
Build AI agentic workflows that can plan, search, remember, and collaborate, using LlamaIndex.
Artificial Intelligence
Course
Build real-world applications with Python—practice using OOP and software engineering principles to write clean and maintainable code.
Software Development
Course
In this course youll learn how to perform inference using linear models.
Probability & Statistics
Course
Sharpen your knowledge and prepare for your next interview by practicing Python machine learning interview questions.
Machine Learning
Course
Learn to use AI as a senior engineering partner for code analysis, performance optimization, security, and software architecture decisions.
Artificial Intelligence
Course
Learn to analyze Airbnb data using SQL in Databricks, create dashboards, and derive actionable insights.
Importing & Cleaning Data
Course
Learn tools and techniques to leverage your own big data to facilitate positive experiences for your users.
Machine Learning
Course
Learn how to write recursive queries and query hierarchical data structures.
Software Development
Course
Learn how to write effective tests in Java using JUnit and Mockito to build robust, reliable applications with confidence.
Software Development
Course
Develop a better intuition for advanced probability, risk assessment, and simulation techniques to make data-driven business decisions with confidence.
Probability & Statistics
Data science is an area of expertise focused on gaining information from data. Using programming skills, scientific methods, algorithms, and more, data scientists analyze data to form actionable insights.
You’ll need to learn a programming language such as Python or R and master the principles of math and statistics. Knowledge of data analysis methods and data science tools is also essential. There are many ways to learn data science. As well as formal means of education, such as a degree or university study, there are plenty of other resources to help you learn at your own pace. As well as online courses and tutorials, there are books, videos, and more.
As well as knowledge of mathematics and statistics, data scientists need programming skills in languages such as Python, R, and SQL. Additionally, data science requires the ability to work with large data sets, knowledge of data visualization, data wrangling, and database management. Skills in machine learning and deep learning can also be useful.
In a professional capacity, almost every industry can use data science to some degree. Healthcare organizations use data science to detect and cure diseases, while finance companies use it to detect and prevent fraud. All kinds of industries use data science for marketing, such as building recommendation systems and analyzing customer churn.
Yes, data science is among the fastest-growing sectors in the US and worldwide. It’s also one of the best-paid careers out there. According to data from Payscale, experience data scientists earn an average of $97,609 and have a satisfaction rating of four stars out of five in the US.
There are a few things to consider here. First, data science degrees can be competitive to get onto, often requiring consistently high grades. Similarly, many of the skills required for data science require a lot of study and patience. It can take several months to master all of the necessary basics, as well as a lot of practical experience to secure an entry-level position.
Yes, you’ll need some coding experience in languages such as Python, R, SQL, Java, and C/C++. However, due to its relatively simple syntax, Python programming language is often the preferred choice among newcomers.
For a person with no prior coding experience and/or mathematical background, it can typically take 7 to 12 months of intensive studies to be at the level of an entry-level data scientist. However, it is important to remember that learning only the theoretical basis of data science may not make you a real data scientist.
Once you’ve mastered the foundations of data science, you can then specialize in a variety of areas, including machine learning, artificial intelligence, big data analysis, business analytics and intelligence, data mining, and more.
Make progress on the go with our mobile courses and daily 5-minute coding challenges.