This project focuses on detecting deepfake speech using a combination of Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN).
Data Preparation
- Audio files are organized into a directory structure with 'data/fake' and 'data/real' subdirectories.
- The `generateDataCSV.py` script is used to generate CSV files for organizing the audio dataset into training, validation, and evaluation sets, as sketched below.
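For orientation, here is a minimal sketch of what such a script might do. The `.wav` extension, the 80/10/10 split, and the `filepath`/`label` column names are assumptions for illustration, not confirmed details of `generateDataCSV.py`:

```python
import csv
import random
from pathlib import Path

# Collect (filepath, label) pairs from the two class directories.
rows = []
for label in ("real", "fake"):
    rows += [(str(p), label) for p in Path("data", label).glob("*.wav")]
random.shuffle(rows)

# Assumed 80/10/10 train/validate/evaluate split.
n = len(rows)
splits = {
    "train.csv": rows[: int(0.8 * n)],
    "validate.csv": rows[int(0.8 * n) : int(0.9 * n)],
    "evaluate.csv": rows[int(0.9 * n) :],
}

out_dir = Path("csvFilesReduced")
out_dir.mkdir(exist_ok=True)
for name, subset in splits.items():
    with open(out_dir / name, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["filepath", "label"])  # assumed column names
        writer.writerows(subset)
```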
Data Preprocessing
- The `train1.py` script preprocesses the audio files to extract MFCC features, as sketched below.
- MFCC features are saved to disk for future use.
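A minimal sketch of the MFCC extraction step, assuming librosa is used; the sample rate (16 kHz) and coefficient count (13) are illustrative defaults, not values taken from `train1.py`:

```python
import librosa
import numpy as np

# Load one clip; 16 kHz and 13 coefficients are assumed settings.
y, sr = librosa.load("data/real/example.wav", sr=16000)
mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)  # shape: (n_mfcc, frames)

# Save as (frames, n_mfcc) so time is the leading axis for the RNN.
np.save("example_mfcc.npy", mfcc.T)
```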
Model Training
- The `train1.py` script defines a CNN-RNN model and trains it on the preprocessed data (an illustrative model definition follows below).
- The trained model is evaluated on the validation and test data.
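An illustrative CNN-RNN stack in Keras: convolutions capture local spectral patterns, and an LSTM models temporal structure across frames. Layer sizes and the exact topology here are assumptions; the authoritative definition lives in `train1.py`:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_cnn_rnn(time_steps: int, n_mfcc: int) -> tf.keras.Model:
    """Hypothetical CNN-RNN binary classifier over MFCC sequences."""
    model = models.Sequential([
        layers.Input(shape=(time_steps, n_mfcc)),
        # 1-D convolutions over time learn local spectral patterns.
        layers.Conv1D(64, kernel_size=3, padding="same", activation="relu"),
        layers.MaxPooling1D(pool_size=2),
        layers.Conv1D(128, kernel_size=3, padding="same", activation="relu"),
        layers.MaxPooling1D(pool_size=2),
        # The recurrent layer summarizes longer-range temporal structure.
        layers.LSTM(64),
        layers.Dropout(0.3),
        layers.Dense(1, activation="sigmoid"),  # 1 = fake, 0 = real (assumed)
    ])
    model.compile(optimizer="adam",
                  loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model
```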
Model Evaluation
- The `eval.py` script evaluates the trained model on the test data, as sketched below.
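A sketch of what the evaluation step boils down to, assuming a saved Keras model and test arrays on disk; the artifact names (`model.h5`, `X_test.npy`, `y_test.npy`) are placeholders, not paths taken from `eval.py`:

```python
import numpy as np
import tensorflow as tf

model = tf.keras.models.load_model("model.h5")   # assumed model path
X_test = np.load("X_test.npy")                   # assumed feature array
y_test = np.load("y_test.npy")                   # assumed label array

loss, accuracy = model.evaluate(X_test, y_test)
print(f"test loss: {loss:.4f}  test accuracy: {accuracy:.4f}")
```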
Running the Application
- The `app.py` script uses the trained model to classify audio files and provides a web-based user interface built with Streamlit (see the sketch below).
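A condensed sketch of what `app.py` might look like. The model path, the 13-coefficient/300-frame input shape, and the preprocessing must match whatever was used at training time; all of those are assumptions here:

```python
import librosa
import numpy as np
import streamlit as st
import tensorflow as tf

MODEL_PATH = "model.h5"        # assumed model artifact
N_MFCC, MAX_FRAMES = 13, 300   # assumed input shape used at training time

@st.cache_resource
def load_model():
    return tf.keras.models.load_model(MODEL_PATH)

st.title("Deepfake Speech Detection")
uploaded = st.file_uploader("Upload a WAV file", type=["wav"])
if uploaded is not None:
    st.audio(uploaded)
    y, sr = librosa.load(uploaded, sr=16000)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=N_MFCC).T  # (frames, n_mfcc)
    # Pad or truncate to the fixed length the model expects.
    if mfcc.shape[0] < MAX_FRAMES:
        mfcc = np.pad(mfcc, ((0, MAX_FRAMES - mfcc.shape[0]), (0, 0)))
    else:
        mfcc = mfcc[:MAX_FRAMES]
    prob = float(load_model().predict(mfcc[np.newaxis, ...])[0][0])
    st.write(f"Estimated probability of being fake: {prob:.1%}")
```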
Place your audio dataset in the following directory structure:
```
/path/to/root/dataset/
├── data
│   ├── fake
│   └── real
├── generateDataCSV.py
├── train1.py
├── eval.py
└── app.py
```
- Run the `generateDataCSV.py` script to generate CSV files for organizing the audio dataset: `python generateDataCSV.py`
- The generated CSV files will be saved in the `csvFilesReduced` directory as `evaluate.csv`, `train.csv`, and `validate.csv`.
- Run the `train1.py` script to train the model: `python train1.py`
  - Alternatively, you may run the `trainProcessedSample.py` script to train the model; here the features of a large dataset have already been extracted: `python trainProcessedSample.py`
- Run the `eval.py` script to evaluate the trained model: `python eval.py`
- Run the `app.py` script to start the application: `streamlit run app.py`
- This is a basic model implementation. Feel free to modify and enhance it based on your requirements and dataset.