diff --git a/src/data/roadmaps/data-analyst/content/107-data-cleaning/102-finding-outliers.md b/src/data/roadmaps/data-analyst/content/107-data-cleaning/102-finding-outliers.md index f2bed3b34..3cac6dd7f 100644 --- a/src/data/roadmaps/data-analyst/content/107-data-cleaning/102-finding-outliers.md +++ b/src/data/roadmaps/data-analyst/content/107-data-cleaning/102-finding-outliers.md @@ -1,3 +1,7 @@ # Finding Outliers -In the field of data analysis, data cleaning is an essential and preliminary step. This process involves correcting or removing any errors, inaccuracy, or irrelevance present in the obtained raw data, making it more suitable for analysis. One crucial aspect of this process is "finding outliers". Outliers are unusual or surprising data points that deviate significantly from the rest of the data. While they may be the result of mere variability or error, they will often pull the aggregate data towards them, skewing the results and impeding the accuracy of data analysis. Therefore, identifying and appropriately handling these outliers is crucial to ensure the reliability of subsequent data analysis tasks. \ No newline at end of file +In the field of data analysis, data cleaning is an essential and preliminary step. This process involves correcting or removing any errors, inaccuracy, or irrelevance present in the obtained raw data, making it more suitable for analysis. One crucial aspect of this process is "finding outliers". Outliers are unusual or surprising data points that deviate significantly from the rest of the data. While they may be the result of mere variability or error, they will often pull the aggregate data towards them, skewing the results and impeding the accuracy of data analysis. Therefore, identifying and appropriately handling these outliers is crucial to ensure the reliability of subsequent data analysis tasks. + +Visit the following resources to learn more: + +- [@article@Outliers]([https://www.mathsisfun.com/data/outliers.html)