Estimating Population Measures of Center

5 min

Teacher Prep
Setup
Groups of 2. 1 minute quiet work time followed by partner and whole-class discussion

Narrative

This Warm-up asks students to decide whether to use the mean or median based on the distribution of the data. As students compare groups in this section, the choice of measure of center will be important.

Launch

Arrange students in groups of 2. Give students 1 minute quiet work time, followed by 2 minutes to discuss their work with a partner. Then follow with a whole-class discussion.

Student Task

Would you use the median or mean to describe the center of each data set? Explain your reasoning.

heights of 50 basketball players

&lt;p&gt;Histogram from 66 to 80 by 2’s. Height in inches. Beginning at 66 up to but not including 68, height of bar at each interval is 2,3,13,17,7,5,1.&lt;/p&gt;<br>
 

ages of 30 people at a family dinner party

&lt;p&gt;Histogram from 5 to 50 by 5’s. Age in years. Beginning at 5 up to but not including 10, height of bar at each interval is 2, 3, 1, 1, 2, 3, 3, 4, 11&lt;/p&gt;<br>
 

 

backpack weights of sixth-grade students

&lt;p&gt;Dot plot from 0 to 16, by 2’s. backpack weight in kilograms. &lt;/p&gt;<br>
 
<p>Dot plot from 0 to 16, by 1’s. backpack weight in kilograms. beginning at 0, number of dots above each increment is: 4, 7, 9, 12, 7, 6, 3, 3, 2, 1, 0, 0, 0, 0, 0, 0, 1.</p>  

number of books students read over summer break

&lt;p&gt;Box plot from 0 to 16 by 2’s. Number of books. Whisker from 2 to 5. Box from 5 to 10 with vertical line at 8. Whisker from 10 to 15.&lt;/p&gt;<br>
<br>
&lt;p&gt;&lt;br&gt;<br>
 &lt;/p&gt;<br>
 

 

Sample Response

  • Basketball players: Mean. Sample reasoning: The distribution is symmetric. 
  • Ages at a party: Median. Sample reasoning: The distribution is not symmetric. 
  • Backpacks: Median. Sample reasoning: The point at 16 would affect the mean much more than the median.
  • Books: The mean can be used if it is known because the data set is approximately symmetric, but if what's given on the box plot is the only thing known, then the median should be used.
Activity Synthesis (Teacher Notes)

Select students to share their chosen measure of center and reasoning for their choice. Ask students what measures of variability should be used with each measure of center. Ask students,

  • “How does the symmetry of the distribution affect which measure of center we might choose to represent what is typical for a group?” (If the distribution is not symmetric, median is often a better choice for the measure of center because values that are very far from the center of the distribution do not affect the median as much as they do the mean. If the distribution is symmetric, the mean is better because it uses information from all of the data collected rather than ignoring some values to find the middle.)
  • “What measures of variability should be used with each measure of center?” (Mean absolute deviation, or MAD, is used with mean and interquartile range, while IQR is used with median.)
Standards
Building On
  • 6.SP.5.d·Relating the choice of measures of center and variability to the shape of the data distribution and the context in which the data were gathered.
  • 6.SP.B.5.d·Relating the choice of measures of center and variability to the shape of the data distribution and the context in which the data were gathered.
Building Toward
  • 7.SP.2·Use data from a random sample to draw inferences about a population with an unknown characteristic of interest. Generate multiple samples (or simulated samples) of the same size to gauge the variation in estimates or predictions. <em>For example, estimate the mean word length in a book by randomly sampling words from the book; predict the winner of a school election based on randomly sampled survey data. Gauge how far off the estimate or prediction might be.</em>
  • 7.SP.4·Use measures of center and measures of variability for numerical data from random samples to draw informal comparative inferences about two populations. <em>For example, decide whether the words in a chapter of a seventh-grade science book are generally longer than the words in a chapter of a fourth-grade science book.</em>
  • 7.SP.A.2·Use data from a random sample to draw inferences about a population with an unknown characteristic of interest. Generate multiple samples (or simulated samples) of the same size to gauge the variation in estimates or predictions. <span>For example, estimate the mean word length in a book by randomly sampling words from the book; predict the winner of a school election based on randomly sampled survey data. Gauge how far off the estimate or prediction might be.</span>
  • 7.SP.B.4·Use measures of center and measures of variability for numerical data from random samples to draw informal comparative inferences about two populations. <span>For example, decide whether the words in a chapter of a seventh-grade science book are generally longer than the words in a chapter of a fourth-grade science book.</span>

5 min

10 min

15 min