AP Statistics U1 Note

Exam Structure
40 MCQ + 6 FRQ
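1.1 Data Analysis (classifying data)
Individual: an object (person, animal, or thing) described by the data
Variable: a characteristic of an individual (e.g., gender, frequency)
Value: one of the possible outcomes of a variable, e.g., Male or Female for gender; one time, often, or hardly for frequency
Count: the number of individuals that take a given value
Types of variables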
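Categorical Variable:
enum (one of several named categories)
bool (yes / no)
Quantitative Variable:
int (whole-number counts)
double (measurements that can take decimal values)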
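Distribution: what values a variable takes and how often it takes each value
Population: the entire group of individuals we want information about
Sample: the part of the population we actually collect data from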
Analyze Variable
How to spot a misleading graph?
- missing axis labels or scale
- the x- or y-axis is cut off or starts at an unusual value
- pictures are used instead of bars
Categorical:
Pie chart / Bar chart / Two-way table
Quantitative:
histogram / boxplot / dot plot / stem plot
Marginal distribution and Conditional distribution (categorical variables, two-way table)
Frequency | Male | Female | Total |
---|---|---|---|
one time | num | num (c) | num (a) |
often | num | num | num |
hardly | num | num | num |
Total | num | num (d) | num (b) |
Marginal distribution:
The proportion of all individuals that fall in each category of one variable, ignoring the other variable: a row or column total divided by the grand total (e.g., a / b or d / b)
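Conditional distribution:
The proportion of individuals taking each value of one variable among only those individuals that satisfy a condition on the other variable. The denominator is the total for that condition (a row or column total): c / d is the proportion of Females who answered "one time", and c / a is the proportion of "one time" respondents who are Female. Dividing a cell count by the grand total (b) gives a joint relative frequency rather than a conditional one.
Relative frequency:
A count divided by a total. It can be either of the two above, depending on what the question asks; either way the denominator is a total.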
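A minimal sketch of how these proportions can be computed from a two-way table. The counts and the names `table`, `marginal_rows`, and `conditional_on_column` are hypothetical and only serve as an illustration.

```python
# Hypothetical two-way table: rows = frequency, columns = gender.
table = {
    "one time": {"Male": 10, "Female": 20},
    "often":    {"Male": 30, "Female": 15},
    "hardly":   {"Male": 10, "Female": 15},
}

def grand_total(table):
    """Total number of individuals (b in the table above)."""
    return sum(sum(cells.values()) for cells in table.values())

def marginal_rows(table):
    """Marginal distribution of the row variable: row total / grand total."""
    total = grand_total(table)
    return {row: sum(cells.values()) / total for row, cells in table.items()}

def conditional_on_column(table, column):
    """Distribution of the row variable among individuals in one column only."""
    column_total = sum(cells[column] for cells in table.values())  # d
    return {row: cells[column] / column_total for row, cells in table.items()}

print(marginal_rows(table))                    # a / b for each row
print(conditional_on_column(table, "Female"))  # c / d for each row
```

Each distribution sums to 1, which is a quick check that the right denominator was used.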
Other graphs to describe data
1.2 Displaying quantitative data with graphs
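Distribution (shape)
Describing a distribution (4 aspects: SCSO)
Shape
Normal distribution (symmetric)
the peak is in the middle
Skewed to the left
the left tail is much longer than the right
the peak is on the right
Skewed to the right
the right tail is much longer than the left
the peak is on the left
Source: medium.com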
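Center
the value that splits the data roughly into two halves
usually the mean or the median
also called the "typical" value
Spread (variability)
how dispersed the data are: widely scattered data have a large spread, tightly clustered data a small one
usually measured by the range or the standard deviation (see Standard Deviation in 1.3)
Outliers
unusually large or unusually small values
How to determine: see "Identify outliers" in 1.3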
An example
The data show that the distribution is roughly symmetric (Reason). Therefore, it is approximately a normal distribution (Shape). The center of the data is between 227 and 252 (Center). The data vary from 56 to 422 (Spread). We do not see any observations outside the overall pattern of the distribution (Outliers).
Plots
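Q: Which graph should you choose?
A: Name the graph, then justify it: state what type of data you have and why that graph displays its distribution best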
Dot plot
Stem plot
Always include a key!
Key: a short note explaining what the displayed values represent
e.g.:
Key: 7|7 = 77 apples each day
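A small sketch of building a stem plot with its key. The function name `stem_plot`, the data, and the unit are hypothetical, and the sketch assumes non-negative whole numbers with the ones digit as the leaf.

```python
from collections import defaultdict

def stem_plot(values, unit="apples each day"):
    """Return the lines of a stem plot (tens = stem, ones = leaf) plus a key."""
    stems = defaultdict(list)
    for v in sorted(values):
        stems[v // 10].append(v % 10)
    lines = []
    # Walk every stem from smallest to largest so empty stems still appear.
    for stem in range(min(stems), max(stems) + 1):
        leaves = " ".join(str(leaf) for leaf in stems.get(stem, []))
        lines.append(f"{stem} | {leaves}".rstrip())
    example = max(values)
    lines.append(f"Key: {example // 10}|{example % 10} = {example} {unit}")
    return lines

data = [77, 52, 61, 63, 85, 58, 63, 70]  # made-up data
for line in stem_plot(data):
    print(line)
```

Unlike a histogram, the stem plot keeps every individual value visible, which is why the key is needed to read them back correctly.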
Histogram
frequency histogram / relative frequency histogram (more useful)
- shows frequency
- no gaps between bars
Drawbacks
- individual data values are lost; only intervals are shown
- the exact median cannot be found, only the interval that contains it
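A minimal sketch of how a frequency histogram and a relative frequency histogram are derived from the same data. The helper `histogram_counts`, the data, and the bin settings are hypothetical.

```python
def histogram_counts(values, start, width, n_bins):
    """Count how many values fall in each interval [left, left + width)."""
    counts = [0] * n_bins
    for v in values:
        index = int((v - start) // width)
        if 0 <= index < n_bins:
            counts[index] += 1
    return counts

data = [52, 58, 61, 63, 63, 70, 77, 85]  # made-up data
counts = histogram_counts(data, start=50, width=10, n_bins=4)
relative = [c / len(data) for c in counts]  # relative frequency = count / total

for i, (c, r) in enumerate(zip(counts, relative)):
    left = 50 + 10 * i
    print(f"[{left}, {left + 10}): {'#' * c}  count={c}  relative={r:.3f}")
```

From the counts alone we can tell that the median lies in the interval [60, 70), but not its exact value, which illustrates the drawbacks listed above.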
1.3 Describing Quantitative data with numbers
Measuring Center
Mean
Median
Mode
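As an illustration, the three measures of center can be computed with Python's standard `statistics` module. The data set is made up and deliberately contains one very large value.

```python
import statistics

data = [2, 3, 3, 4, 5, 5, 5, 40]  # made-up data with one large value

mean = statistics.mean(data)      # sum of the values divided by their count
median = statistics.median(data)  # middle value (average of the two middle values here)
mode = statistics.mode(data)      # most frequent value

print(mean, median, mode)
```

The single large value pulls the mean well above the median, which is why the median is usually preferred for skewed data or data with outliers.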
Measuring Spread
Range
maximum - minimum
IQR (a measure of spread)
Q3 - Q1
Identify outliers:
a value is an outlier if it is below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR
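A minimal sketch of the usual 1.5 × IQR rule for flagging outliers. The data are made up, and the quartiles use the median-of-halves convention, which is an assumption here; software packages often use slightly different conventions. The helper names are hypothetical.

```python
import statistics

def quartiles(values):
    """Q1 and Q3 as the medians of the lower and upper halves of the data.

    For an odd number of values the overall median is left out of both halves.
    """
    data = sorted(values)
    half = len(data) // 2
    return statistics.median(data[:half]), statistics.median(data[-half:])

def find_outliers(values):
    """Values below Q1 - 1.5 * IQR or above Q3 + 1.5 * IQR."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if v < low or v > high]

data = [2, 3, 3, 4, 5, 5, 5, 40]
q1, q3 = quartiles(data)
print(q1, q3, q3 - q1)      # Q1, Q3, IQR
print(find_outliers(data))  # values outside the fences
```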
Box Plot
- number line + label
- determine minimum, Q1, Q2 (median), Q3, and maximum
- a line connecting minimum and maximum
- a box between Q1 and Q3
- a line at Q2 (the median)
- usually use the median and IQR to measure the center and spread
What a box plot tells us: only the five values above
What it cannot tell us: the sample size or the sample mean
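To see why a box plot hides the sample size and the mean, here is a small sketch with two made-up data sets that share the same five values. The function `five_number_summary` is a hypothetical helper that uses the median-of-halves convention for the quartiles (an assumption).

```python
import statistics

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum)."""
    data = sorted(values)
    half = len(data) // 2
    q1 = statistics.median(data[:half])   # median of the lower half
    q3 = statistics.median(data[-half:])  # median of the upper half
    return data[0], q1, statistics.median(data), q3, data[-1]

a = [1, 2, 3, 4, 5, 6, 7]
b = [1, 2, 2, 2, 4, 4, 6, 6, 6, 6, 7]

print(five_number_summary(a), len(a), statistics.mean(a))
print(five_number_summary(b), len(b), statistics.mean(b))
```

Both data sets would produce exactly the same box plot, even though they differ in size and in mean.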
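Standard Deviation
s: sample standard deviation vs. σ: population standard deviation
Def: how far the data values typically lie from the mean
When to use: only when the mean is used to describe the center
Not resistant: outliers can change its value substantially
Variance
the square of the standard deviation (s² for a sample, σ² for a population)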
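A short sketch of the sample and population versions using Python's standard `statistics` module. The data are made up; the manual calculation is included only to show where the n - 1 comes from.

```python
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data

mean = statistics.mean(data)
squared_deviations = sum((x - mean) ** 2 for x in data)

population_variance = statistics.pvariance(data)  # divide by n
population_sd = statistics.pstdev(data)           # sigma
sample_variance = statistics.variance(data)       # divide by n - 1
sample_sd = statistics.stdev(data)                # s

# The same sample variance computed by hand.
manual_sample_variance = squared_deviations / (len(data) - 1)

# Standard deviation is not resistant: one outlier inflates it.
sd_with_outlier = statistics.stdev(data + [40])

print(population_variance, population_sd)
print(sample_variance, sample_sd, sd_with_outlier)
```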
- Title: AP Statistics U1 Note
- Author: 零风PLFJY
- Created at : 2025-09-20 17:32:37
- Updated at : 2025-10-17 15:21:11
- Link: https://blog.plfjy.top/ap-statistics/ap-statistics-u1-note/
- License: This work is licensed under CC BY-NC-SA 4.0.