What is the best statistical programming language? Infograph

A feature that all programming communities have in common is the endless debate about why their language of choice is better, more advanced, faster, holier, etc. In today's data science community these discussions seem omnipresent, with advocates of SAS, SPSS, R, Python, Julia, etc. battling and challenging each other on every online medium over which is the best statistical programming language. (Side note: these 'data driven' debates are often a good example of how you can prove anything with statistics.) While these debates are a good thing for the community and for the languages themselves, they unfortunately also have a negative effect on those who are just at the beginning of their data analytics career. Biased opinions on all sides of the table make it difficult for new data analysts to see the forest for the trees when choosing a statistical programming language.

An infograph for each statistical programming language

Especially for this new group of data analysts (and future debaters), as well as for everyone else who is interested in learning data science or an additional statistical language, we created the infograph 'Statistical Language Wars'. It gives a basic comparison of statistical programming languages such as SAS, R and SPSS to show how they stack up, and so provides a clearer starting point.

Source: blog.datacamp.com

We'll make sure to regularly update this infograph based on the feedback you provide, and we will definitely consider creating new infographs that focus on other players such as Python and Julia. Feel free to share!

Embed Code:

[code] <a href="http://blog.datacamp.com/statistical-language-wars-the-infograph/" ><img src="http://datacamp.wpengine.com/wp-content/uploads/2014/05/infograph.png" alt="Statistical language wars: SAS vs R vs SPSS" /></a><br/>Source: <a href="http://blog.datacamp.com">blog.datacamp.com</a><br/> [/code]



