Data Engineering FoundationsChapter 152

Answers

Section 1 of 2-~ 12 min read-Synced from Cuantum content
  1. b) Variance Thresholding
  1. c) Reindexing the data to a regular frequency and using forward-fill or backward-fill
  1. c) The proportion of the dataset’s variance captured by each principal component
  1. c) Encoding features with cyclical patterns, like day of the week
  1. c) They rely on model training to determine feature importance
  1. b) Principal Component Analysis (PCA)
  1. c) Correlation Thresholding
  1. c) When interaction effects between features need to be captured
  1. c) It may over-penalize features, potentially leading to underfitting
  1. b) To ensure that selected features generalize well to new data