Geely Auto, a Chinese automobile company, plans to enter the US market by setting up local manufacturing to compete with US and European car manufacturers. They hired an automobile consulting company to identify the factors influencing car pricing in the American market, which may differ from the Chinese market.
Preparation : The dataset includes a CarName variable, which contains the car company's name and the car model. Use only the car company name as an independent variable for model building.
Identify significant variables in predicting car prices. Assess how well these variables describe car prices.
Develop a model to predict car prices based on available independent variables. This model will help management understand price variations and inform design and strategy decisions to meet specific price targets. Additionally, it will provide insights into the pricing dynamics of the US market.
A Linear Regression Model will be used to identify significant variables affecting car prices and assess their predictive power. The model will help understand price variations and inform strategic decisions for entering the US market.
Linear regression is a statistical technique used to model and analyze the relationship between a dependent variable and one or more independent variables. It predicts the value of the dependent variable based on the linear combination of the independent variables. The goal is to find the best-fitting line that minimizes the sum of the squared differences between the observed and predicted values.
- Numpy (for data manipulation)
- Pandas (for data manipulation)
- Matplotlib (for data visualization)
- Seaborn (for data visualization)
- Statsmodels (for data modeling)
- Scikit-Learn (for data modeling)
- Importing the required libraries
- Importing and reading dataset
- Understanding and Visualizing the data
- Exploratory Data Analysis (EDA)
- Preparing the data for modeling
- Training the Model
- Predicting the training data
- Residual Analysis
- Predicting the test data
- Getting the model accuracy
- Sharing Solution to the objectives