输入“/”快速插入内容
Synced Block
Unit1: Exploring One-Variable Data (15-23%)
Categorical Data
Categorical Data 分类变量 is shown in two-way tables and bar graphs, analyzing proportions.
- Data represents groups or categories
- Measured in proportions
- Example: eye color
The association between two categorical variables
We say that there is an association between two variables if knowing the value of one variable helps predict the value of the other. If knowing the value of one variable does not help you predict the value of the other, then there is no association between the variables.
Bar Chart
The columns are positioned over a label that represents a categorical variable.
Segmented Bar Chart - Bivariate Categorical Data Visualization
No Association means (Independent):
The conditional distributions of opinion about becoming rich would be the same for males and females. The segmented bar graphs for the two genders would look the same, too.
Side by Side Bar Chart - Bivariate Categorical Data Visualization
Mosaic Plot - Bivariate Categorical Data Visualization
Graphic Two Way Table with percentages
We can use mosaic plots to draw conclusions about relationships between two categorical variables.
Quantitative Data
Quantitative Data 数值变量 is displayed in histograms, dotplots, box plots, stem and leaf plots, and scatter plots.
- Measured or counted variables
- Measure in means
- Example: height, age
Histogram
The columns are positioned over a label that represents a quantitative variable.
For a histogram -> make sure you approximate the mean (e.g.10%-12.5%) and use words like “no more” / “approximately” when describing range.