"The function of good software is to make the complex appear to be simple."
Grady Booch


This blog collects experiences, experiments and insights from the world of Data Science.

The posts are a series of personal notes and projects, but I have tried to structure them so that anyone can read them.

Recent Projects


CardinalityKit

Advanced Cardinality Estimation for Privacy-Preserving Analytics


Easy tagging and automatic tagging

The goal of this project is to make tagging easier. I decided to create a Python package that speeds up image and text tagging.


Getting emails from Yellow Pages

In this article we focus on web scraping with two programming languages (R and Python) to try to extract email addresses from the Yellow Pages website.


Recycling, a statistical classification problem

Using Machine Learning, Deep Learning and Ensemble Learning techniques, we want to create a bin that combines image recognition with readings from selected sensors (for example ultrasonic response, photoresistors and weight) to decide whether a discarded object belongs with paper, plastic, or another category.


Analysis of risk factors for an injury

This study aims to analyze which risk factors most influence the healing time of an injury, and which factors can prevent the injury from healing at all.


Film history analysis

This project carries out an exploratory analysis of the film industry to identify interesting aspects of the field.


Predicting the result of a football match

After analyzing the last 25 years of football results for all teams in the five main European leagues, I decided to create a Bayesian model to predict the result of a football match and then, using Machine Learning techniques, to identify the matches that can be predicted with minimal error.


Articles and insights