big data

Tutorials (3)

must read
importing & cleaning data
+2

Generating Realistic Random Datasets with Trumania

Why do data scientists and data engineers work with synthetic data? How do they obtain it? Discover Trumania, a scenario-based random dataset generator library.
must read
importing & cleaning data
+2
25
25
machine learning
+3

Apache Spark Tutorial: ML with PySpark

Apache Spark tutorial introduces you to big data processing, analysis and ML with PySpark.
87
87

Cheat Sheets (2)

python
+2

PySpark Cheat Sheet: Spark DataFrames in Python

June 15th, 2017This PySpark SQL cheat sheet is your handy companion to Apache Spark DataFrames in Python and includes code samples.

Open Courses (0)

There are currently no open courses in big data, create your own.

Podcast (5)

Blog Posts (12)

Find out how to build a healthy and sustainable data strategy that will move the needle for your company.

Tech Thoughts (0)

There are currently no tech thought posts in big data.