What is the best statistical programming language? Infograph

A feature that all programming communities have in common is the endless debate about why their programming language of choice is better, more advanced, faster, holier, etc. In today's data science community these discussions seem omnipresent, with advocates of SAS, SPSS, R, Python, Julia, etc. challenging each other on every online medium over which is the best statistical programming language. (Side note: these 'data-driven' debates are often a good example of how you can prove anything with statistics.) While these debates are a good thing for the community and for the languages themselves, they unfortunately also have a negative effect on those who are just at the beginning of their data analytics career. Biased opinions on all sides of the table make it difficult for new data analysts to see the forest for the trees when choosing a statistical programming language.

An infograph for each statistical programming language

Especially for this new group of data analysts (and future debaters), as well as for everyone else who is interested in learning data science or an additional statistical language, we created the infograph 'Statistical Language Wars'. It gives a basic comparison of statistical programming languages such as SAS, R and SPSS to see how they stack up, and so provides a clearer starting point.

Infograph: statistical programming language (Source: blog.datacamp.com)

We'll make sure to regularly update this infograph based on the feedback you provide, and we will definitely consider creating some new infographs that focus more on other players such as Python and Julia. Feel free to share!

Embed Code:

[code] <a href="http://blog.datacamp.com/statistical-language-wars-the-infograph/" ><img src="http://datacamp.wpengine.com/wp-content/uploads/2014/05/infograph.png" alt="Statistical language wars: SAS vs R vs SPSS" /></a><br/>Source: <a href="http://blog.datacamp.com">blog.datacamp.com</a><br/> [/code]



