Mammographic Masses Prediction README

Overview:

This script utilizes various machine learning models to predict the severity of mammographic masses based on features such as age, mass shape, margin, and density.

Libraries Used:

pandas
numpy
sklearn (for preprocessing, model selection, and various classifiers)
tensorflow (for Keras neural network model)

Dataset:

The dataset used in this script has the following columns:

BI-RADS assessment
Age
Mass Shape
Margin
Density
Severity

The data is loaded from a local path (/Users/amath/Downloads/MLCourse-2/mammographic_masses.data.txt).

Data Preprocessing:

Columns with unknown values (?) are treated as NaN.
Rows with any NaN values are removed from the dataset.
Features are scaled using StandardScaler from scikit-learn.

Machine Learning Models Used:

Decision Tree Classifier
Random Forest Classifier
Support Vector Machine (with various kernels: linear, rbf, sigmoid, poly)
K-Nearest Neighbors (tested for k values ranging from 1 to 50)
Multinomial Naive Bayes
Logistic Regression
Neural Network (using Keras)

Neural Network Architecture:

Input layer with 64 neurons (corresponding to 4 features)
Dropout layer with 50% dropout rate
Hidden layer with 64 neurons and ReLU activation
Dropout layer with 50% dropout rate
Output layer with 1 neuron and sigmoid activation (binary classification)

Results:

After training each model, the script prints the accuracy of the model using 10-fold cross-validation.

How to Use:

Ensure you have all the necessary libraries installed.
Replace the dataset path with the correct path on your machine.
Run the script. After execution, you'll see the accuracy results for each model.

Future Enhancements:

Hyperparameter tuning for improved model accuracy.
Exploration of additional preprocessing steps, like feature engineering.
Inclusion of visualizations to understand the significance of each feature.
Saving the best-performing model for future predictions.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Mammogram.py		Mammogram.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mammographic Masses Prediction README

Overview:

Libraries Used:

Dataset:

Data Preprocessing:

Machine Learning Models Used:

Neural Network Architecture:

Results:

How to Use:

Future Enhancements:

About

Releases

Packages

Languages

amath95/Mammogram

Folders and files

Latest commit

History

Repository files navigation

Mammographic Masses Prediction README

Overview:

Libraries Used:

Dataset:

Data Preprocessing:

Machine Learning Models Used:

Neural Network Architecture:

Results:

How to Use:

Future Enhancements:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages