Task4

Predicting insurance claim amounts

import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder, StandardScaler
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, mean_squared_error
import matplotlib.pyplot as plt
import seaborn as sns

Load the dataset

df = pd.read_csv('insurance.csv')

Display the first few rows of the dataset

print(df.head())

Encode categorical features

le = LabelEncoder()
df['sex'] = le.fit_transform(df['sex'])
df['smoker'] = le.fit_transform(df['smoker'])
df['region'] = le.fit_transform(df['region'])
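Label encoding assigns arbitrary integers to each category, which a linear model reads as an ordering. For a column with more than two categories, such as region, one-hot encoding is a common alternative. The snippet below is only a sketch on hypothetical toy rows (not taken from insurance.csv), and the category values shown are assumptions for illustration:

```python
import pandas as pd

# Hypothetical toy rows standing in for insurance.csv (not real data)
toy = pd.DataFrame({
    "sex": ["female", "male", "male"],
    "smoker": ["yes", "no", "no"],
    "region": ["southwest", "southeast", "northwest"],
})

# One-hot encode the multi-category column so no ordering is implied
encoded = pd.get_dummies(toy, columns=["region"], dtype=int)

# Binary columns can be mapped directly to 0/1
encoded["sex"] = (encoded["sex"] == "male").astype(int)
encoded["smoker"] = (encoded["smoker"] == "yes").astype(int)

print(encoded)
```

Whether this changes the error metrics for this dataset is not something the README reports; it would need to be checked by rerunning the evaluation.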

Define features and target

X = df.drop(['charges'], axis=1)
y = df['charges']

Split data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

Scale features

scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)
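The scaler is fitted on the training split only and then reused on the test split, so the test data does not influence the scaling statistics. A minimal sketch with made-up numbers (the arrays are hypothetical, not from the dataset) shows what that means in practice:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Hypothetical single-feature splits, for illustration only
X_train_toy = np.array([[1.0], [2.0], [3.0]])
X_test_toy = np.array([[2.0]])

scaler_toy = StandardScaler()
X_train_scaled = scaler_toy.fit_transform(X_train_toy)  # learns mean and std from train
X_test_scaled = scaler_toy.transform(X_test_toy)        # reuses the train mean and std

print("train mean learned by scaler:", scaler_toy.mean_[0])
print("scaled test value:", X_test_scaled[0, 0])
```

A test value equal to the training mean maps to zero, because it is centered with the training statistics rather than its own.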

Train a linear regression model

model = LinearRegression()
model.fit(X_train, y_train)
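Once fitted, a LinearRegression exposes its learned weights through coef_ and intercept_. The sketch below uses hypothetical toy data generated from a known rule (y = 1 + 2*x1 + 3*x2) to show how those attributes can be read; it is not the model trained above:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical toy data following y = 1 + 2*x1 + 3*x2 exactly
X_toy = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
y_toy = 1.0 + 2.0 * X_toy[:, 0] + 3.0 * X_toy[:, 1]

toy_model = LinearRegression()
toy_model.fit(X_toy, y_toy)

# One coefficient per feature column, plus a single intercept
print("coefficients:", toy_model.coef_)
print("intercept:", toy_model.intercept_)
```

With standardized features, as in the scaling step above, each coefficient can be read as the change in predicted charges per one standard deviation of that feature.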

Make predictions

y_pred = model.predict(X_test)

Evaluate model performance

mae = mean_absolute_error(y_test, y_pred)
rmse = np.sqrt(mean_squared_error(y_test, y_pred))
print("Mean Absolute Error (MAE):", mae)
print("Root Mean Squared Error (RMSE):", rmse)
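To make the two metrics concrete, they can be computed by hand with NumPy. The values below are hypothetical and chosen only to keep the arithmetic easy to follow:

```python
import numpy as np

# Hypothetical true and predicted charges, for illustration only
y_true_toy = np.array([100.0, 200.0, 300.0])
y_pred_toy = np.array([110.0, 190.0, 330.0])

errors = y_pred_toy - y_true_toy

# MAE: average size of the errors, ignoring sign
mae_toy = np.mean(np.abs(errors))

# RMSE: square root of the average squared error, which weights large misses more
rmse_toy = np.sqrt(np.mean(errors ** 2))

print("MAE:", mae_toy)
print("RMSE:", rmse_toy)
```

RMSE is never smaller than MAE on the same predictions, and a large gap between them suggests that a few predictions are far off.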

Visualize impact of BMI on insurance charges

plt.figure(figsize=(8, 6))
sns.scatterplot(x='bmi', y='charges', data=df)
plt.title('Impact of BMI on Insurance Charges')
plt.show()

Visualize impact of age on insurance charges

plt.figure(figsize=(8, 6))
sns.scatterplot(x='age', y='charges', data=df)
plt.title('Impact of Age on Insurance Charges')
plt.show()

Visualize impact of smoking status on insurance charges

plt.figure(figsize=(8, 6))
sns.boxplot(x='smoker', y='charges', data=df)
plt.title('Impact of Smoking Status on Insurance Charges')
plt.show()

Abdullahi8852/Task4

Folders and files

Latest commit

History

Repository files navigation

Task4

Load the dataset

Display the first few rows of the dataset

Encode categorical features

Define features and target

Split data into training and testing sets

Scale features

Train a linear regression model

Make predictions

Evaluate model performance

Visualize impact of BMI on insurance charges

Visualize impact of age on insurance charges

Visualize impact of smoking status on insurance charges

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages