Genetic programming-based automated machine learning approach to solve regression problems

Research output: Contribution to journalArticlepeer-review

Abstract

Automated machine learning aims to optimize machine learning pipelines automatically given a dataset, task type and a target variable. This research analyzes the use of genetic programming to perform automated feature engineering in regression problems. It introduces a methodology to perform feature selection and to construct new features departing from the original feature set by combining and selecting features in the leaf nodes of the genetic programming tree. A multiple feature generation technique is proposed, where three different feature sets are tested with linear regression, Random Forest regressor and Gradient Boosting regressor. The proposed approach is applied to an industrial process dataset where the target variable is an indicator of the performance of the process. The experimental results reveal the ability of the method to reduce the cardinality of the original feature set while maintaining the performance of the learning models. Moreover, they show the ability of the newly constructed feature to better discriminate the target variable.
Original languageEnglish
Pages (from-to)19-25
Number of pages7
JournalComputers and Informatics
Volume3
Issue number1
Publication statusPublished - 30 Jun 2023

Keywords

  • Automated machine learning
  • Feature engineering
  • Genetic programming
  • Predicted analytics

Fingerprint

Dive into the research topics of 'Genetic programming-based automated machine learning approach to solve regression problems'. Together they form a unique fingerprint.

Cite this