LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers

Official implementation of LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers.

📄 Overview

LLM-FE is a novel framework that leverages Large Language Models (LLMs) as evolutionary optimizers to automate feature engineering for tabular datasets. LLM-FE iteratively generates and refines features using structured prompts, selecting high-impact transformations based on model performance. This approach enables the discovery of interpretable and high-quality features, enhancing the performance of various machine learning models across diverse classification and regression tasks.

⚙️ Installation

To run the code, create a conda environment and install the dependencies using requirements.txt:

conda create -n llmfe python=3.11.7
conda activate llmfe
pip install -r requirements.txt

🔧 Usage

In run_llmfe.sh file, set the OPENAI API key under

export API KEY = <ENTER YOUR API KEY>

To run the LLM-FE pipeline on a sample dataset:

bash run_llmfe.sh

📝 Citation

@article{abhyankar2025llm,
  title={LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers},
  author={Abhyankar, Nikhil and Shojaee, Parshin and Reddy, Chandan K},
  journal={arXiv preprint arXiv:2503.14434},
  year={2025}
}

📄 License

This repository is licensed under MIT licence.

This work is built on top of other open source projects like FunSearch and LLM-SR. We thank the original contributors of these works for open-sourcing their valuable source codes.

📬 Contact Us

For any questions or issues, you are welcome to open an issue in this repo, or contact us at nikhilsa@vt.edu and parshinshojaee@vt.edu.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
data		data
llm_engine		llm_engine
llmfe		llmfe
logs		logs
prompts		prompts
specs		specs
LICENSE		LICENSE
README.md		README.md
evaluation.ipynb		evaluation.ipynb
llmfe.jpg		llmfe.jpg
main.py		main.py
optimization_utils.py		optimization_utils.py
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
run_llmfe.sh		run_llmfe.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers

📄 Overview

⚙️ Installation

🔧 Usage

📝 Citation

📄 License

📬 Contact Us

About

Uh oh!

Releases

Packages

Languages

License

nikhilsab/LLMFE

Folders and files

Latest commit

History

Repository files navigation

LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers

📄 Overview

⚙️ Installation

🔧 Usage

📝 Citation

📄 License

📬 Contact Us

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages