Skip to content

adongardive/Module_11

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

Analysis of Factors Influencing Used Car Prices

This project aims to analyze a dataset of used cars to understand which factors influence their prices. The goal is to provide actionable insights and recommendations to a used car dealership, helping them understand what consumers value most in a used car.

Project Overview

This analysis uses a dataset containing information on 426,000 used cars. The data has been sampled from a larger dataset to ensure efficient processing. The objective is to identify key attributes that determine car prices and to suggest strategies for pricing and inventory management.

Methodology

The analysis follows the CRISP-DM (Cross-Industry Standard Process for Data Mining) framework, a widely recognized approach for handling data projects. The following steps were taken:

  1. Business Understanding: Defining the problem and the objectives from the dealership's perspective.
  2. Data Understanding: Exploring the dataset to identify key characteristics and potential influencing factors.
  3. Data Preparation: Cleaning and organizing the data for analysis.
  4. Modeling: Applying statistical models and machine learning techniques to understand the relationship between car attributes and their prices.
  5. Evaluation: Assessing the models' performance to ensure reliability and validity.
  6. Deployment: Formulating recommendations based on the analysis to guide the dealership's decision-making.

Key Findings

The analysis identifies several factors that significantly impact the price of used cars. The findings will help the dealership prioritize these attributes in their pricing strategies and better manage their inventory to meet customer preferences.

Tools and Libraries

  • Python
  • Pandas
  • NumPy
  • Scikit-learn

Getting Started

To replicate this analysis:

  1. Ensure you have Python installed (version 3.6 or higher is recommended).
  2. Install the necessary libraries (pandas, numpy, scikit-learn) using pip:
    pip install pandas numpy scikit-learn
  3. Open the notebook in a Jupyter environment or any Python IDE and run the cells sequentially. You can view and run the Jupyter notebook for this project directly on GitHub: View the Notebook
  4. Github does not allow for files greater than 25MB, so the data folder has the vehicle.csv file zipped to reduce file size

Contributing

Contributions are welcome! Please feel free to submit a pull request or open an issue for any improvements or bug fixes.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published