Outliers are values that can reveal key points within the data you are analyzing and can help you look for inconsistencies and errors in your statistical processes. In this post, we explain what outliers are and how they can help you identify important values in your data.
What are outliers?
Outliers are abnormal data points within a data set. In simple terms, an outlier is an extremely high or extremely low value compared to the nearest data point and to the rest of the values in the data you are working with.
Outliers are noticeable extreme values within chart patterns.
In this graph we can see an outlier: example 2 is far below its peers. Of course, this is a very exaggerated example of what outliers really look like. Being able to find them depends on your knowledge of the data and the way in which it is collected.
For example, your data may look like this:
5, 10, 24, 42, 50, 120, 260, 45
Here, just by looking, you can probably tell that the outliers would be 120 and 260, as they are two points that sit far away from the others.
But you can also have them as follows:
2, 15, 48, 30, 22, 17, 65, 10, 22, 39, 5
You might think that the outlier is 2 or 5, but in reality the outlier would be 65, the value that sits furthest from its nearest neighbor.
How to identify them?
Now that you know what outliers are, you may be interested in knowing how to identify them within your data. The truth is that there are no specific or definitive rules for identifying outliers. Although there is no single mathematical definition, there are some things you can keep in mind to devise ways of examining the data and finding them. Here are some of these ways:
Analyze the data
If you have a short list of data, the simplest way to identify outliers is to review and organize the data you have.
One of the easiest ways to do this is to sort the data from lowest to highest; this way you can spot the values that are much higher or much lower than the others.
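As a rough illustration of the sorting approach, here is a minimal Python sketch applied to the second list from earlier in this post. The helper name `largest_gap` is our own, and looking for the widest gap between neighboring values is just one assumed heuristic for a first look, not a formal test.

```python
def largest_gap(values):
    """Sort the values and return (ordered, low, high) for the widest gap between neighbors."""
    ordered = sorted(values)
    # Pair each value with its successor and keep the pair that is furthest apart.
    low, high = max(zip(ordered, ordered[1:]), key=lambda pair: pair[1] - pair[0])
    return ordered, low, high

data = [2, 15, 48, 30, 22, 17, 65, 10, 22, 39, 5]
ordered, low, high = largest_gap(data)
print(ordered)    # [2, 5, 10, 15, 17, 22, 22, 30, 39, 48, 65]
print(low, high)  # 48 65
```

Once the list is sorted, the jump from 48 to 65 is the widest in the data, which is why 65 stands out more than 2 or 5.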
Graphs
Making the data visible in some way is always one of the best options. There are many types of graphs you can use to plot the data you have. You can also combine graphs with other methods to identify outliers more easily.
Some of the graphs you can use are histograms, scatter plots and bar graphs, among others.
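To give an idea of how a histogram exposes outliers, here is a small sketch that draws one as plain text using only the Python standard library. The function name `text_histogram` and the bin width of 50 are assumptions chosen for this example, and the code assumes non-negative whole numbers; in practice you would probably use a plotting library instead.

```python
from collections import Counter

def text_histogram(values, bin_width):
    """Return one text line per bin, with one '#' per value that falls in that bin."""
    # Map every value to the lower edge of its bin and count the values per bin.
    counts = Counter((v // bin_width) * bin_width for v in values)
    lines = []
    for start in range(min(counts), max(counts) + bin_width, bin_width):
        label = f"{start:>4}-{start + bin_width - 1:<4}"
        # Bins with no values get an empty bar (Counter returns 0 for them).
        lines.append(f"{label} {'#' * counts[start]}")
    return lines

data = [5, 10, 24, 42, 50, 120, 260, 45]
histogram_lines = text_histogram(data, bin_width=50)
print("\n".join(histogram_lines))
```

Most of the values pile up in the first bin, while 260 sits alone in the last one, separated from the rest by two empty bins.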
Calculations
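Another way is to use the interquartile range (IQR) method. The IQR is the distance between the first and third quartiles, that is, the spread of the middle half of the data, and values that fall far outside that range are candidates for outliers.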
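The interquartile range approach can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `iqr_outliers` is our own, the multiplier `k=1.5` is a common rule of thumb rather than a fixed rule, and the quartiles are computed with the "inclusive" convention of the standard `statistics` module (other conventions give slightly different limits).

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return the values that fall outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # Assumption: "inclusive" quartiles; other quartile conventions shift the limits slightly.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]

data = [5, 10, 24, 42, 50, 120, 260, 45]
print(iqr_outliers(data))         # [260]
print(iqr_outliers(data, k=1.0))  # [120, 260]
```

With the usual multiplier of 1.5, only 260 is flagged in the first list from this post; 120 is flagged only with a stricter multiplier such as 1.0. This is a good reminder that there is no single definitive rule and that the result depends on the criterion you choose.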