Web Scraping and Data Analysis

Project at Universidad Politecnica de Madrid.

Scrapper.py : This python file show a code that uses Selenium and BeautifulSoup to scrape car listings from https://heycar.com/uk/autos/make/audi/page/ .It navigates through multiple pages of listings, extracts car details, and saves them to a CSV file called ' dataset.csv' with columns for car name, type, mileage, year, condition, and price. Finally, it handles any errors and closes the browser after scraping.
Data_visualization.ipynb : This Jupyter Notebook shows a code that use python libraries as matplotlib.pyplot for plotting, pandas for data manipulation, drive from google.colab to access files from Google Drive, and seaborn for advanced data visualization.
dataset.csv : This is the dataset scraped by the python file "Scrapper.py".
Web Scraping and Data visualization: This is the presentation the group made for the video.

To execute the project, it is necessary to upload all files to a Google Drive. There are two sequential methods to proceed:

Initially, the "Scrapper.py" file can be executed to retrieve the dataset. Subsequently, the "Data_visualization.ipynb" can be executed.
Alternatively, the dataset already uploaded within the drive can be utilized, allowing direct execution of the "Data_visualization.ipynb" file.

This procedure ensures the proper execution of the project, facilitating data access and processing.

Gabriel-Cosmin Bolbotina Sajipan Sivatheesan Francesco Della Valle Gaetano Saviano Jan Durinec

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Data_visualization.ipynb		Data_visualization.ipynb
README.md		README.md
Scrapper.py		Scrapper.py
Web Scraping and Data visualization.pptx		Web Scraping and Data visualization.pptx
dataset.csv		dataset.csv

Provide feedback