Audio and Video Processing with Google Cloud Speech-to-Text and ffmpeg

This project preprocesses audio, uploads it to Google Cloud Storage, transcribes it using Google Cloud Speech-to-Text, generates SRT subtitles, and adds these subtitles to a video file.

Prerequisites

Node.js installed on your machine.
Google Cloud account and project set up.
ffmpeg installed on your machine.
Environment variables configured in a .env file.

Environment Variables

Create a .env file in the root directory of your project and add the following variables:

GOOGLE_CLOUD_PROJECT_ID=your-project-id
GOOGLE_CLOUD_KEY_FILE=path-to-your-service-account-json-file
GOOGLE_CLOUD_STORAGE_BUCKET_NAME=your-bucket-name

Installation

Clone the repository:

git clone https://github.com/mmiddletonn/VideoTranscription.git
cd VideoTranscription

Install the necessary dependencies:
```
npm install
```

Usage

Run the script with the following command:

node index.js

The script performs the following steps:

Preprocess Audio: Converts and preprocesses the audio file (audio.mp4) to remove noise and normalize levels, saving it as audio.mp3.
Upload Audio: Uploads the preprocessed audio file (audio.mp3) to Google Cloud Storage.
Transcribe Audio: Transcribes the audio file using Google Cloud Speech-to-Text and generates an SRT file (subtitles.srt) with word-level timestamps.
Add Subtitles to Video: Adds the generated subtitles to the original video file (audio.mp4), producing an output video file (output_video_with_audio.mp4).

Functions

preprocessAudio(inputFilePath, outputFilePath)

Preprocesses the input audio file and converts it to MP3 format.

inputFilePath: Path to the input audio file (e.g., audio.mp4).
outputFilePath: Path to the output MP3 file (e.g., audio.mp3).

uploadFile()

Uploads the preprocessed audio file (audio.mp3) to Google Cloud Storage.

transcribeAudio()

Transcribes the audio file stored in Google Cloud Storage and generates an SRT file (subtitles.srt) with subtitles.

addSubtitlesToVideo(inputVideo, subtitlesFile, outputVideo)

Adds subtitles to the video file.

inputVideo: Path to the input video file (e.g., audio.mp4).
subtitlesFile: Path to the SRT subtitles file (e.g., subtitles.srt).
outputVideo: Path to the output video file (e.g., output_video_with_audio.mp4).

Error Handling

Errors encountered during each step are logged to the console.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Audio and Video Processing with Google Cloud Speech-to-Text and ffmpeg

Prerequisites

Environment Variables

Installation

Usage

Functions

preprocessAudio(inputFilePath, outputFilePath)

uploadFile()

transcribeAudio()

addSubtitlesToVideo(inputVideo, subtitlesFile, outputVideo)

Error Handling

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

mmiddletonn/VideoTranscription

Folders and files

Latest commit

History

Repository files navigation

Audio and Video Processing with Google Cloud Speech-to-Text and ffmpeg

Prerequisites

Environment Variables

Installation

Usage

Functions

preprocessAudio(inputFilePath, outputFilePath)

uploadFile()

transcribeAudio()

addSubtitlesToVideo(inputVideo, subtitlesFile, outputVideo)

Error Handling

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages