Book contents
- Frontmatter
- Dedication
- Contents
- Preface
- Part One Fundamentals
- 1 Introduction
- 2 Features, Combined: Normalization, Discretization and Outliers
- 3 Features, Expanded: Computable Features, Imputation and Kernels
- 4 Features, Reduced: Feature Selection, Dimensionality Reduction and Embeddings
- 5 Advanced Topics: Variable-Length Data and Automated Feature Engineering
- Part Two Case Studies
- Bibliography
- Index
3 - Features, Expanded: Computable Features, Imputation and Kernels
from Part One - Fundamentals
Published online by Cambridge University Press: 29 May 2020
Summary
This chapter deals with feature expansion and imputation, with a particular emphasis on computable features. While poor domain modelling may result in too many features being added to the model, there are times when substantial value can be gained by generating new features from existing ones; any excess features can then be removed using the feature selection techniques discussed in the next chapter. Computable Features are particularly useful when we know the underlying ML model is unable to perform certain operations over the features, such as multiplying them (e.g., when the ML involves a simple linear model). Another type of feature expansion involves calculating a best-effort approximation of values missing in the data (Feature Imputation). The most straightforward expansion happens when the raw data contains multiple items of information under a single column (Decomposing Complex Features). The chapter concludes by borrowing ideas from a technique used in SVMs called the kernel trick: the types of projections that practitioners have found useful can be applied directly, without the use of kernels.
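The two expansions named above can be illustrated together. The sketch below is not from the chapter itself; it is a minimal NumPy example, with made-up toy data, of mean imputation followed by a computable feature (the product of two columns, an interaction a linear model cannot derive on its own):

```python
import numpy as np

# Toy data: two features, one missing value (NaN).
X = np.array([[1.0, 2.0],
              [3.0, np.nan],
              [5.0, 6.0]])

# Feature imputation: replace each NaN with its column mean
# (a best-effort approximation of the missing value).
col_means = np.nanmean(X, axis=0)
X_filled = np.where(np.isnan(X), col_means, X)

# Computable feature: append the product x1 * x2 so that a
# linear model can use the interaction directly.
product = (X_filled[:, 0] * X_filled[:, 1]).reshape(-1, 1)
X_expanded = np.hstack([X_filled, product])

print(X_expanded)
# Row with the imputed value becomes [3.0, 4.0, 12.0].
```

The same effect could be obtained with scikit-learn's `SimpleImputer` and `PolynomialFeatures`; the point is only that the new column is computed from existing ones, not collected from new data.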
- The Art of Feature Engineering: Essentials for Machine Learning, pp. 59-78. Publisher: Cambridge University Press. Print publication year: 2020.