Tasks 07-01 - Descriptive Statistics
Section 07: Probability & Statistics
Problem 1: Measures of Central Tendency (x)
For the dataset: \(15, 22, 18, 25, 22, 19, 22, 28, 17, 22\)
- Calculate the mean.
- Find the median.
- Find the mode.
- Which measure best represents the “typical” value? Why?
Problem 2: Variance and Standard Deviation (x)
For the dataset: \(8, 12, 15, 11, 14\)
- Calculate the mean.
- Calculate the sample variance.
- Calculate the sample standard deviation.
Problem 3: Range and IQR (x)
For the dataset: \(42, 55, 63, 48, 71, 59, 45, 67, 52, 58, 61, 49\)
- Find the range.
- Find Q1 (first quartile).
- Find Q3 (third quartile).
- Calculate the interquartile range (IQR).
Problem 4: Outlier Detection (xx)
For the dataset: \(25, 28, 30, 32, 27, 29, 31, 85, 26, 30\)
- Calculate Q1, Q3, and IQR.
- Determine the lower and upper fences for outliers.
- Are there any outliers? If so, which value(s)?
- Recalculate the mean with and without outliers.
Problem 5: Frequency Distribution (x)
Test scores for 20 students: \(65, 72, 78, 85, 91, 68, 74, 82, 88, 95, 71, 77, 83, 89, 73, 79, 84, 92, 76, 81\)
- Create a frequency table using intervals: 65-74, 75-84, 85-94, 95-100
- Calculate the relative frequency for each interval.
- What percentage of students scored between 75 and 84?
Problem 6: Comparing Datasets (xx)
Two sales teams’ weekly sales (in units):
Team A: \(45, 52, 48, 55, 50\) Team B: \(30, 70, 45, 60, 45\)
- Calculate the mean for each team.
- Calculate the standard deviation for each team.
- Which team is more consistent? Why?
- Which team would you prefer to manage? Justify your answer.
Problem 7: Five-Number Summary (xx)
Monthly revenue data (in thousands Euro): \(120, 145, 132, 158, 175, 142, 138, 165, 155, 148, 162, 170\)
- Find the five-number summary (Min, Q1, Median, Q3, Max).
- Calculate the IQR.
- Describe the shape of the distribution based on the five-number summary.
Problem 8: Grouped Data (xxx)
Employee salaries (in thousands Euro) at a company are grouped:
| Salary Range | Frequency |
|---|---|
| 30-39 | 8 |
| 40-49 | 15 |
| 50-59 | 22 |
| 60-69 | 12 |
| 70-79 | 3 |
- Estimate the mean salary using midpoints.
- Find the modal class.
- Estimate the median class.
- Calculate the relative frequency for each class.
Problem 9: Business Application (xx)
A quality control manager measures the diameter of manufactured bolts (in mm):
\(10.02, 9.98, 10.05, 9.97, 10.01, 10.03, 9.99, 10.02, 10.00, 9.96, 10.04, 10.01\)
Target diameter: 10.00 mm with tolerance ±0.05 mm
- Calculate the mean diameter.
- Calculate the standard deviation.
- Are all bolts within specification?
- If bolts outside tolerance are rejected, what is the reject rate?
Problem 10: Coefficient of Variation (xx)
Compare the variability of these two datasets using the coefficient of variation:
Dataset X (prices in Euro): \(50, 55, 45, 60, 40\) Dataset Y (prices in cents): \(5000, 5500, 4500, 6000, 4000\)
- Calculate mean and standard deviation for both datasets.
- Calculate the coefficient of variation (CV = s/mean × 100%) for both.
- Which dataset has more relative variability?
Problem 11: Percentiles (xxx)
For the dataset: \(12, 15, 18, 22, 25, 28, 31, 35, 38, 42, 45, 48, 52, 55, 58, 62, 65, 68, 72, 75\)
- Find the 25th percentile (P25).
- Find the 75th percentile (P75).
- Find the 90th percentile (P90).
- If a value is at the 60th percentile, how many values are below it?
Problem 12: Comprehensive Analysis (xxxx)
A store tracks daily customer counts for 30 days:
\(42, 58, 65, 38, 71, 45, 52, 67, 55, 48, 63, 72, 44, 59, 68, 51, 56, 74, 41, 62, 49, 57, 69, 46, 54, 70, 43, 60, 66, 50\)
- Calculate all measures of central tendency (mean, median, mode).
- Calculate range, variance, standard deviation, and IQR.
- Construct the five-number summary.
- Identify any outliers using the 1.5 × IQR rule.
- Create a frequency distribution with 5 equal-width classes.
- What can you conclude about the store’s daily customer traffic?