Send Close Add comments: (status displays here)
Got it!  This site "www.robinsnyder.com" uses cookies. You consent to this by clicking on "Got it!" or by continuing to use this website.  Note: This appears on each machine/browser from which this site is accessed.
Summarizing data : The M&M Problem
by RS  admin@robinsnyder.com : 1024 x 640


1. Summarizing data : The M&M Problem
M&M's is a registered trademark of Mars, Inc. For more information, see http://www.mms.com (as of 2004-01-11).

2. A typical package of M&M's
M&M sample packageAn M&M represents one data sample from one package of M&M's. Here, it is assumed that a package of M&M's contains 60 M&M's.

The M&M colors represent nominal data in that there is preferred ordering or numerical value that can be associated with the M&M's.

3. A package in text form
Here is a typical package contents in text form.
Red Yellow Brown Green Green Blue Green Brown Orange Red Yellow Red Brown Red Blue Yellow Green Yellow Yellow Blue Yellow Brown Orange Red Green Orange Green Red Green Red Brown Brown Orange Red Blue Red Red Yellow Green Orange Red Blue Orange Red Red Blue Red Red Red Blue Blue Yellow Yellow Red Red Blue Blue Red Orange Red


4. Package in list form
Here is the raw data for this problem as a list of colors.
Red Yellow Brown Green Green Blue Green Brown Orange Red Yellow Red Brown Red Blue Yellow Green Yellow Yellow Blue Yellow Brown Orange Red Green Orange Green Red Green Red Brown Brown Orange Red Blue Red Red Yellow Green Orange Red Blue Orange Red Red Blue Red Red Red Blue Blue Yellow Yellow Red Red Blue Blue Red Orange Red


5. Getting data into the computer
Often, a TPS (Transaction Processing System) is used to automatically collect the data and put it into the computer. It is better to automate the collection of data rather than use a human to manually enter data into the computer. What is a TPS and why is a TPS important?


6. Python program
One could write a Python program to summarize the data by color.

There are some built-in ways to summarize the data. Here is the Python code [#1]

Here is the output of the Python code.

pivot table

7. Pareto principle: 80-20 rules
The Pareto chart is named after Vilfredo Pareto, 1848-1923, an Italian economist.

8. Pareto principle: 80-20 rules
Pie chart for 80-20 ruleVilfredo Pareto (1848-1923) made the observation that 20% of the population owned 80% of the wealth.

This idea has been called Pareto's principle and has been applied in many other areas.

Information sign More: Pareto principle: 80-20 rules

9. Pareto chart
Pareto chartA Pareto chart is a combination chart that shows an individual count or frequency using a column bar chart, ordered in decreasing order from left to right, and a cumulative distribution using a line chart.

10. Pareto chart
To get a Pareto chart, the following changes need to be made. Here is the Python code [#2]

Here is the output of the Python code.

pivot table

11. Summary
The above has shows some ways of summarizing nominal data using the M&M problem.

12. End of page

by RS  admin@robinsnyder.com : 1024 x 640