Week 4 – Monday

I performed an analysis on a dataset with a focus on outlier detection with five-number summary. I defined a function, detect_outliers(data) where I used the z-score method to get the outliers. I kept the threshold to 3 standard deviations from mean. I calculated quartiles (Q1 and Q3) and the interquartile range (IQR), to understand the central tendency and spread of the data. I then obtained the upper and lower fences.

Lastly, using seaborn,  I made a boxplot to get the distribution and potential outliers. I also learnt how to implement z-test and t-test using python which I will be trying tomorrow.

Leave a Reply

Your email address will not be published. Required fields are marked *