RWTH Aachen University
In the design of machine-learning solutions, a critical and often the most resourceful task is that of feature engineering, for which recipes and tooling have been developed. In this paper, the authors embark on the establishment of database foundations for feature engineering. They propose a formal framework for classification, in the context of a relational database, towards investigating the application of database and knowledge management to assist with the task of feature engineering. They demonstrate the usefulness of this framework by formally defining two key algorithmic challenges.