Toggle navigation
Menu
Home
All projects
Contact
"If you can’t explain it simply, you don’t understand it well enough"
Albert Einstein
This is a blog with experiences, experiments and insights in the world of Data Science.
They are actually a series of personal notes and projects but I tried to structure them in a way that they could be readable by anyone.
Articles and insights
Machine Learning and statistics
Forecasting, smoothing, webscraping, visualization, Machine Learning, statistics, psychometrics and many others are the tools available to a Data Scientist.
This collection of articles does not aim to explain all the knowledge of statistics and Machine Learning nor to explain the main topics that make up this vast world, but simply includes some topics that I consider important and interesting.
SQL: the foundation of databases
SQL, which stands for Structured Query Language, is a language for interacting with data stored in a relational database.
You can think of a relational database as a collection of tables. A table is just a set of rows and columns, like a spreadsheet, that represents exactly one type of entity.
Excel data analysis
Excel is a very popular tool for its ease of use but many functions that Excel offers are very often unknown to an average user.
This collection of articles aims to consolidate people who already have a minimum basic use of Excel on some topics.
All projects
CardinalityKit
Advanced Cardinality Estimation for Privacy-Preserving Analytics
Posted by
Giacomo Saccaggi
Scomp Link
The Astromech Arm for Your Python Projects
May the code be with you
Posted by
Giacomo Saccaggi
Easy tagging and automatic tagging
The goal of this project is to make tagging procedures easier. I decided to create a Python package that allows me to speed up any image and text tagging procedures.
Posted by
Giacomo Saccaggi
Getting emails from Yellow Pages
In this article we will focus on Web Scraping procedures with two programming languages (R and Python) to try to get emails from the Yellow Pages website.
Posted by
Giacomo Saccaggi
Recycling, a statistical classification problem
Using Machine Learning, Deep Learning and Ensemble Learning techniques, we want to create a bin that, thanks to image recognition and also exploiting other information from selected sensors (for example ultrasonic response, photoresistors, weight, etc.), is able to understand if the thrown object should be placed in paper, plastic, etc.
Posted by
Giacomo Saccaggi
Analysis of risk factors for an injury
This study aims to analyze which risk factors most influence the healing time of an injury, and also to analyze which possible factors can determine the non-healing of the injury.
Posted by
Giacomo Saccaggi
Film history analysis
This project focuses on the film industry on which an exploratory analysis is carried out to identify interesting elements of this field.
Posted by
Giacomo Saccaggi
Predicting the result of a football match
After analyzing the results of the last 25 years of football competitions of all teams from the 5 main European leagues, I decided to create a Bayesian model that could predict the result of a football match and then, using Machine Learning techniques, identify some matches that could be predicted with minimal error.
Posted by
Giacomo Saccaggi
Pseudo-random number generation
Basic methods of random and pseudo-random number generation.
Posted by
Giacomo Saccaggi
Evolution of statistics and data
From classical statistics to Machine Learning: evolution of data.
Posted by
Giacomo Saccaggi
Time Series
Stationarity and non-stationarity of a time series
Multivariate stochastic processes: VARMA and VAR models
Cointegration
Posted by
Giacomo Saccaggi
Monte Carlo Method
Monte Carlo Method with variance reduction techniques: control variables and antithetic variables.
Posted by
Giacomo Saccaggi
Data Science?
What is data science.
Posted by
Giacomo Saccaggi
Ensemble Learning
Stacked models and ensemble learning techniques.
Posted by
Giacomo Saccaggi
Fisher's Exact Test
Hypothesis testing for non-parametric statistics with two dichotomous nominal variables.
Posted by
Giacomo Saccaggi
Network Analysis
Network analysis in R
Network analysis in Python
Posted by
Giacomo Saccaggi
Forecasting Theory
Fundamentals of forecasting theory.
Posted by
Giacomo Saccaggi
Convolutional Neural Network
Convolutional neural networks: definition and elements.
Posted by
Giacomo Saccaggi
Correlation vs Causation
The difference between correlation and causality.
Posted by
Giacomo Saccaggi
Market Analysis and Marketing
Schematic notions on the steps in market research.
Posted by
Giacomo Saccaggi
Large Scale Testing
The multiple testing problem.
Posted by
Giacomo Saccaggi
UCM Components
Trend, Cycle, Seasonality and Noise components of Unobserved Components Models.
Posted by
Giacomo Saccaggi
Boosting
Combining weak classifiers to build a stronger one with boosting techniques.
Posted by
Giacomo Saccaggi
Neural Network and Multi-Layer Perceptron (MLP)
Introduction to neural networks and the Multi-Layer Perceptron architecture.
Posted by
Giacomo Saccaggi
Regression Splines
Non-parametric regression technique for modeling non-linearities and interactions.
Posted by
Giacomo Saccaggi
Record Linkage
Record Linkage and Knowledge Discovery.
Posted by
Giacomo Saccaggi
Bayesian Statistics
Introduction to the Bayesian approach
Prior selection
Hyperparameter selection methods
Predictive inference with the Bayesian approach
Posterior synthesis
Posted by
Giacomo Saccaggi
UCM Models in State Space Form
1. Introduction to State Space models
2. Introduction to UCM models in State Space form with the KFAS library
3. State Space models estimation of unknown parameters
4. Application of State Space models estimation and auxiliary residuals
5. Forecasting, filtering and smoothing
Posted by
Giacomo Saccaggi
The Steps of an Analysis
Summary of the steps of an analysis, from data to prediction.
Posted by
Giacomo Saccaggi
Psychometrics: Myers Briggs Personality Test
Psychometrics and the Myers Briggs personality test as used in market analysis.
Posted by
Giacomo Saccaggi
From Spatial Statistics to Geostatistics
0. Introduction to geostatistical data
1. Introduction to spatial statistics
2. Spatial data in R
3. Spatial Point Processes: The Poisson process
4. Spatial Point Processes: The test for CSR
5. Spatial Point Processes: Estimation of the intensity function
6. Introduction to Geostatistics, large and small scale variability
7. Geostatistical Model
8. Exploratory analysis EDA and ESDA
9. Spatial prediction and kriging in R
Posted by
Giacomo Saccaggi
Competizioni bee viva
Ames House Price
OK Cupid
Bike sharing