Categorical Variables Visualization

The four most common representations of categorical variables are:

  • Frequency Distribution Tables
  • Bar Charts
  • Pie Charts
  • Pareto Diagrams

 

Finding intervals

The formula for this is:

(largest number – smallest number) / number of desired intervals

It’s a good idea to round them up if provided a decimal. After this the objective is to figure out how many times the frequency occurs in the data set provided.

The relative frequency is the frequency of a given interval as part of the total:

Relative Frequency = Frequency / Total Frequency

 

This can later be applied to the Histogram is the information holds a continuity, as opposed to the bar chart that shows different categories.

The scatter plot is able to find outliners, or cluster data between two variables.

 

 

Posted in Blog Posts.