heyfoz/python-openai-whisper

Python OpenAI Whisper Speech to Text Transcription

This Python script provides a simple interface for transcribing audio files with the OpenAI API's speech-to-text endpoint, which is powered by the Whisper model. The transcript is printed to the console as plain text or in WebVTT (VTT) format.

Installation

Before using the script, make sure to install the OpenAI Python client library. You can install it using pip:

pip install openai

Usage

Clone this repository:

git clone https://github.com/heyfoz/python_openai_whisper.git

Navigate to the project directory:

cd python_openai_whisper

Run the script:

python whisper_local.py

Follow the prompts to enter the file path of the audio file and choose the desired response format (text or vtt).
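For orientation, the flow described above might look roughly like the sketch below. It is not the contents of whisper_local.py: the function names (normalize_format, transcribe, main) and the prompt wording are hypothetical, and the sketch assumes version 1.x of the openai package, the whisper-1 model name, and an API key supplied through the OPENAI_API_KEY environment variable.

```python
ALLOWED_FORMATS = ("text", "vtt")


def normalize_format(choice):
    """Return the cleaned-up response format, or raise ValueError if unsupported."""
    fmt = choice.strip().lower()
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"Unsupported format {choice!r}; choose 'text' or 'vtt'.")
    return fmt


def transcribe(path, response_format="text"):
    """Send one audio file to the transcription endpoint and return the transcript."""
    # Imported here so the helper above works even without the package installed.
    from openai import OpenAI

    client = OpenAI()  # assumption: picks up OPENAI_API_KEY from the environment
    with open(path, "rb") as audio_file:
        result = client.audio.transcriptions.create(
            model="whisper-1",  # assumption: hosted Whisper model name
            file=audio_file,
            response_format=normalize_format(response_format),
        )
    # For "text" and "vtt" the result is expected to be plain text already.
    return str(result)


def main():
    """Prompt for a file path and a format, then print the transcript."""
    path = input("Path to the audio file: ").strip()
    fmt = normalize_format(input("Response format (text or vtt): "))
    print(transcribe(path, fmt))
```

Calling main() reproduces the two prompts; validating the format before the request means a typo fails locally rather than after the upload.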

Note: The script requires an OpenAI API key; see the OpenAI API documentation for how to obtain and configure one. Audio files are limited to 25 MB each.
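Both requirements in the note can be checked before any upload is attempted. The following is a minimal sketch rather than part of the script: check_ready and MAX_UPLOAD_BYTES are hypothetical names, the key is assumed to live in the OPENAI_API_KEY environment variable, and 25 MB is interpreted here as 25 * 1024 * 1024 bytes, which may differ slightly from how the API counts it.

```python
import os

# Assumption: "25 MB" is read as 25 * 1024 * 1024 bytes.
MAX_UPLOAD_BYTES = 25 * 1024 * 1024


def check_ready(path, environ=None):
    """Return a list of human-readable problems; an empty list means ready."""
    if environ is None:
        environ = os.environ
    problems = []

    # Assumption: the API key is provided through this environment variable.
    if not environ.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is not set.")

    if not os.path.isfile(path):
        problems.append(f"No such file: {path}")
    elif os.path.getsize(path) > MAX_UPLOAD_BYTES:
        size_mb = os.path.getsize(path) / (1024 * 1024)
        problems.append(f"File is {size_mb:.1f} MB; the limit is 25 MB.")

    return problems
```

A caller could print each returned message and exit early, so an oversized file or a missing key is reported without making a request.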

License

This project is licensed under the MIT License - see the LICENSE file for details.
