
Section 07: Probability & Statistics
Test your understanding of Integration
Find \(\int x \cdot e^x \, dx\) using integration by parts.
Evaluate \(\int_0^1 (2x + 1) \, dx\)
A company’s marginal profit is \(MP(x) = 60 - 2x\). Find the profit function if \(P(0) = -100\).
Find the area between \(y = x\) and \(y = x^2\) from \(x = 0\) to \(x = 1\).
Section 07 covers essential exam topics:
Probability accounts for approximately 25% of the Feststellungsprüfung!
This is foundational material - brief coverage to prepare for probability!
How do we summarize a data set with a single number?
Three Measures of Center
Mean (Mittelwert): \(\bar{x} = \frac{\sum x_i}{n}\)
Median (Zentralwert): Middle value when data is sorted
Mode (Modalwert): Most frequently occurring value
Monthly sales (in thousands €) for a store:
\[12, 15, 14, 18, 15, 22, 15, 16, 14, 19\]
Mean: \[\bar{x} = \frac{12 + 15 + 14 + 18 + 15 + 22 + 15 + 16 + 14 + 19}{10} = \frac{160}{10} = 16\]
Median: Sort: \(12, 14, 14, 15, 15, 15, 16, 18, 19, 22\)
Middle values: \(\frac{15 + 15}{2} = 15\)
Mode: \(15\) (appears 3 times)

Two datasets can have the same mean but different spreads:
Dataset A: \(48, 49, 50, 51, 52\) (mean = 50)
Dataset B: \(10, 30, 50, 70, 90\) (mean = 50)
We need measures to quantify this difference!
Simplest measure of spread:
\[\text{Range} = \text{Maximum} - \text{Minimum}\]
Dataset A: Range \(= 52 - 48 = 4\)
Dataset B: Range \(= 90 - 10 = 80\)
Range only uses two values - sensitive to outliers!
Variance (Varianz)
Population variance: \[\sigma^2 = \frac{\sum (x_i - \mu)^2}{N}\]
Sample variance: \[s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1}\]
Standard Deviation (Standardabweichung)
\[\sigma = \sqrt{\sigma^2} \quad \text{or} \quad s = \sqrt{s^2}\]
Data: \(4, 8, 6, 5, 3, 2, 8, 9, 2, 5\) (n = 10)
Step 1: Calculate mean \[\bar{x} = \frac{4+8+6+5+3+2+8+9+2+5}{10} = \frac{52}{10} = 5.2\]
Step 2: Calculate deviations squared \[(4-5.2)^2 + (8-5.2)^2 + ... = 1.44 + 7.84 + 0.64 + 0.04 + 4.84 + 10.24 + 7.84 + 14.44 + 10.24 + 0.04 = 57.6\]
Step 3: Variance and SD \[s^2 = \frac{57.6}{9} = 6.4 \quad \Rightarrow \quad s = \sqrt{6.4} \approx 2.53\]
Raw data: Test scores of 30 students
\[65, 72, 78, 81, 65, 73, 85, 92, 78, 72, 65, 88, 91, 73, 78, 82, 76, 72, 85, 78, 65, 73, 82, 79, 88, 73, 78, 85, 92, 78\]
Question: How can we summarize this data effectively?
| Score Range | Frequency | Relative Frequency |
|---|---|---|
| 60-69 | 4 | 4/30 = 13.3% |
| 70-79 | 14 | 14/30 = 46.7% |
| 80-89 | 9 | 9/30 = 30.0% |
| 90-99 | 3 | 3/30 = 10.0% |
| Total | 30 | 100% |
Relative frequency = Frequency / Total = Probability interpretation!

Five-Number Summary
Interquartile Range (IQR): \(\text{IQR} = Q3 - Q1\)
IQR contains the middle 50% of the data!

Outliers are values that fall outside:
\[\text{Lower fence: } Q1 - 1.5 \times \text{IQR}\] \[\text{Upper fence: } Q3 + 1.5 \times \text{IQR}\]
Example: If \(Q1 = 65\), \(Q3 = 85\), then IQR \(= 20\)
Any value below 35 or above 115 would be an outlier.
A factory measures the diameter of manufactured bolts (in mm):
\[10.2, 10.1, 10.0, 10.3, 9.9, 10.1, 10.0, 10.2, 10.1, 10.0\]
Target: 10.0 mm with tolerance ±0.3 mm
Calculate:
If we assume normal distribution, approximately 99.7% of bolts will be within \(\bar{x} \pm 3s = 10.09 \pm 0.36\) mm, which is within tolerance!
Weekly sales data for 8 weeks (in €1000):
\[45, 52, 48, 55, 62, 50, 48, 56\]
| Measure | Value | Interpretation |
|---|---|---|
| Mean | €52,000 | Average weekly sales |
| Median | €51,000 | Typical week |
| Std Dev | €5,300 | Sales variability |
| Range | €17,000 | Max spread |
Work in pairs
Problem 1: Customer wait times (minutes): \(3, 5, 2, 8, 4, 6, 3, 7, 2, 10\)
Problem 2: Create a frequency table for exam scores: \(75, 82, 91, 78, 85, 68, 73, 88, 95, 79, 82, 76, 84, 90, 77\)
Key connection:
\[\text{Relative Frequency} \approx \text{Probability}\]
Example: If 30% of customers wait more than 5 minutes, then the probability that a randomly selected customer waits more than 5 minutes is approximately 0.30.
This is the frequentist interpretation of probability - probability equals long-run relative frequency!
Coming Next
Session 07-02: Basic Probability Concepts - sample spaces, events, and probability rules!
This material is foundational - make sure you’re comfortable before moving to probability!
Session 07-01 - Descriptive Statistics Essentials | Dr. Nikolai Heinrichs & Dr. Tobias Vlćek | Home