STEP BY STEP
DATA SCIENCE
FRIENDLY DATA SCIENCE TUTORIALS
Home
R
Python
SQL
More
Data wrangling with pandas
Learn how to use the pandas package in python to explore, select, filter and sort your data, create new variables and produce summary statistics.
An introduction to scikit-learn in python
A tutorial on how to use one of the main machine learning packages in python to create and compare powerful predictive models and modelling pipelines.
LightFM tutorial building a wine recommender in Python
Learn how to build a hybrid matrix factorisation model to create recommendations in Python with the LightFM package.
DBSCAN and t-SNE tutorial in Python
Use density based clustering in Python to spot credit card fraud and visualise the results with t-SNE.
Product associations with Yule's Q and hierarchical clustering in Python
Learn about the Yule's Q measure of association and use hierarchical clustering to visualise product clusters.
Building a segmentation in Python with k-means clustering and PCA
Learn how to build a segmentation in Python using the k-means algorithm and principal component analysis from the sklearn package.
Joining data in Python with pandas
Learn how to merge data in Python using the pandas package.
Market Basket Analysis in Python with pandas
Learn how to perform market basket analysis using pandas and understand from first principles how the associations are calculated.