Learn to analyze, plot, and model multivariate data.
By continuing you accept the Terms of Use and Privacy Policy, that your data will be stored outside of the EU, and that you are 16 years or older.
When working with data that contains many variables, we are often interested in studying the relationship between these variables using multivariate statistics. In this course, you'll learn ways to analyze these datasets. You will also learn about common multivariate probability distributions, including the multivariate normal, the multivariate-t, and some multivariate skew distributions. You will then be introduced to techniques for representing high dimensional data in fewer dimensions, including principal component analysis (PCA) and multidimensional scaling (MDS).
In this introduction to multivariate data, you will learn how to read and summarize it. You will learn how to summarize multivariate data using descriptive statistics, such as the mean vector, variance-covariance, and correlation matrices. You'll then explore plotting techniques to provide insights into multivariate data.
This chapter introduces a host of probability distributions to model non-normal data. In particular, you will be introduced to multivariate t-distributions, which can model heavier tails and are a generalization of the univariate Student's t-distribution. You will be introduced to various skew distributions, which are specifically designed to model data that are right or left skewed.
This chapter will introduce you to the most important and widely used multivariate probability distribution, the multivariate normal. You will learn how to generate random samples from a multivariate normal distribution and how to calculate and plot the densities and probabilities under this distribution. You will also learn how to test if a dataset follows multivariate normality.
In the final chapter, you will be introduced to techniques for analyzing high dimensional data, including principal component analysis (PCA) and multidimensional scaling (MDS). You will also learn to implement these techniques by analyzing data.
In this introduction to multivariate data, you will learn how to read and summarize it. You will learn how to summarize multivariate data using descriptive statistics, such as the mean vector, variance-covariance, and correlation matrices. You'll then explore plotting techniques to provide insights into multivariate data.
This chapter will introduce you to the most important and widely used multivariate probability distribution, the multivariate normal. You will learn how to generate random samples from a multivariate normal distribution and how to calculate and plot the densities and probabilities under this distribution. You will also learn how to test if a dataset follows multivariate normality.
This chapter introduces a host of probability distributions to model non-normal data. In particular, you will be introduced to multivariate t-distributions, which can model heavier tails and are a generalization of the univariate Student's t-distribution. You will be introduced to various skew distributions, which are specifically designed to model data that are right or left skewed.
In the final chapter, you will be introduced to techniques for analyzing high dimensional data, including principal component analysis (PCA) and multidimensional scaling (MDS). You will also learn to implement these techniques by analyzing data.
“I've used other sites, but DataCamp's been the one that I've stuck with.”
Devon Edwards Joseph
Lloyd's Banking Group
“DataCamp is the top resource I recommend for learning data science.”
Louis Maiden
Harvard Business School
“DataCamp is by far my favorite website to learn from.”
Ronald Bowers
Decision Science Analytics @ USAA