Recipes + Model comparison
Oct 20 - 21, 2022
The data contain the price and other attributes of a random sample of 1000 diamonds. This sample was drawn from the diamonds data frame in the tidyverse R package.
The variables in this example are
price: Price in US dollars
carat: Weight of the diamond (units = carats)
cut: Quality of the cut of the diamond (Fair, Good, Ideal)
The goal is to use the carat and cut to understand variability in the price.
Does the plot show evidence of an interaction effect between carats and cut?
| term | estimate |
|---|---|
| (Intercept) | -2473.555 |
| carat | 6383.913 |
| cutGood | 60.018 |
| cutIdeal | 0.044 |
| carat:cutGood | 982.952 |
| carat:cutIdeal | 2132.642 |
cutGood in the context of the data.carat:cutGood in the context of the data.Fair cut diamonds. What is the intercept? What is the slope of carat ?Ideal cut diamonds. What is the intercept? What is the slope of carat?Lecture notes:
Tidy Modeling in R - Chapter 8: Feature engineering with recipes

