Statistical distribution of dataΒΆ

Using Pandas .describe() method and plotting a histogram, we will have an overview of the distribution of each variable and possible outliers and missing values. This will be useful in the next step when we will work on cleaning the data, deciding whether to delete an entry that has a missing value or fill in with the mean/median, for example.

Alternative text Alternative text