This Streamlit app allows users to upload an audio file, transcribe the audio using OpenAI's Whisper ASR, and then summarize the transcription using OpenAI's GPT-3.5-turbo language model.
- Upload audio files in supported formats (m4a, mp3, webm, mp4, mpga, wav, and mpeg)
- Transcribe audio using OpenAI's Whisper ASR
- Summarize transcribed audio using OpenAI's GPT-3.5-turbo
- Provide custom API key and select the desired model
- Python 3.6 or higher
- Streamlit
- OpenAI Python library (v0.27.0 or higher)
- Clone the repository:
git clone https://github.com/StanGirard/speechdigest
- Change into the repository's directory:
cd speechdigest
- Install the required packages:
pip install -r requirements.txt
- Set your OpenAI API key as an environment variable:
export OPENAI_API_KEY="your-api-key"
- Run the Streamlit app:
streamlit run app.py
The app will now be accessible at http://localhost:8501
in your browser.
- Open the app in your browser.
- Enter your OpenAI API key and select the desired model (GPT-4 or GPT-3.5-turbo) using the input fields.
- Upload an audio file in a supported format using the file uploader.
- The app will transcribe the audio using Whisper ASR and display the transcription.
- The app will then summarize the transcription using the selected language model and display the summary.
Feel free to submit issues, feature requests, or pull requests. We appreciate any contribution to improve the app.