Skip to content

andriusvo/data-cleaner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Turing Data Cleaner

Welcome to the Turing Data Cleaner App! This project is a web application built with Python and Streamlit that provides a platform to clean the data.

Features

  • CSV & XLSX Files upload: Upload CSV and XLSX files to process them
  • Detect & Remove outliers Detect and remove outliers
  • Remove duplicates Remove duplicates from file
  • Drop missing values Drop invalid rows
  • Fill missing values Fill missing values with mean

Preview live

Click here to preview


Getting Started

Follow these steps to set up and run the application on your local machine.

Prerequisites

Ensure you have the following installed:

  • Python 3.8 or later
  • pip (Python package manager)

Installation

  1. Clone the repository:

    git clone git@github.com:TuringCollegeSubmissions/avoito-AE.3.5.git turing-data-cleaner
    cd turing-data-cleaner
  2. Create a virtual environment:

    python -m venv venv
    source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
  3. Install the dependencies:

    pip install -r requirements.txt
  4. Change secrets:

    Adjust secrets.toml with API Keys for specific LLMs

Running the App

  1. Launch the Streamlit app:

    streamlit run cleaner.py
  2. Open your browser and go to:

    http://localhost:8501

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages