The Data
We will use a subset of the diamonds dataset that comes with the ggplot2 package. This dataset contains the prices and other attributes of almost 54,000 diamonds. Review ?diamonds to learn about the variables we will be using.
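library(ggplot2) # The diamonds dataset ships with ggplot2
data("diamonds")
set.seed(1410) # Make the sample reproducible
# Draw 1,000 row indices at random (without replacement) and keep those rows
dsmall <- diamonds[sample(nrow(diamonds), 1000), ]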
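Before plotting, it can help to confirm that the sample looks as expected. The following is a minimal sketch, assuming ggplot2 is installed; it rebuilds dsmall so that it runs on its own, then checks the sample's size and lists its variables. The choice of price as the column to summarise is only an illustration.

```r
library(ggplot2)

data("diamonds")
set.seed(1410) # Same seed as above, so the same rows are drawn
dsmall <- diamonds[sample(nrow(diamonds), 1000), ]

dim(dsmall)            # Number of rows and columns in the sample
names(dsmall)          # Variable names, described in ?diamonds
summary(dsmall$price)  # Quick look at the spread of prices in the sample
```

Because sample() draws without replacement by default, no diamond appears twice in dsmall, and the sample keeps every column of the full dataset.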