Live Voice Translation Application

A real-time voice translation web application built with Laravel, React, and Inertia.js. Users speak in one language and, once recording stops, hear the translation played back in another. English, Spanish, and French are supported.

Video Tutorial

This project was created while recording the following video:

Features

🎤 Real-time Voice Recording: Record audio directly in the browser using the MediaRecorder API
🌍 Multi-language Support: Translate between English, Spanish, and French
🔄 Automatic Translation: Automatically translates and generates audio when recording stops
📊 Performance Benchmarking: Track individual API call timings (Whisper, GPT, TTS)
🎵 Audio Playback: Listen to translated audio directly in the browser
📜 Translation History: View and replay previous translations
🔀 Ultra-Low Latency TTS: Murf.ai Falcon engine with ~130ms time-to-first-audio
🎨 Modern UI: Built with React, Tailwind CSS, and Radix UI components

Technology Stack

Backend

Laravel 12: PHP framework
OpenAI Whisper API: Speech-to-text transcription
OpenAI GPT API: Text translation (gpt-4o-mini)
Murf.ai Falcon API: Ultra-low latency text-to-speech (~130ms TTFA)

Frontend

React 19: UI library
Inertia.js v2: Server-driven single-page applications
Tailwind CSS v4: Utility-first CSS framework
TypeScript: Type-safe JavaScript
Radix UI: Accessible component primitives

Requirements

PHP 8.2 or higher
Composer
Node.js 18+ and npm
MySQL, PostgreSQL, or SQLite database
OpenAI API key (for Whisper and GPT)
Murf.ai API key (for Falcon TTS)

Installation

1. Clone the Repository

git clone <repository-url>
cd laravel-murfai-falcon

2. Install PHP Dependencies

composer install

3. Install Node Dependencies

npm install

4. Environment Configuration

Copy the .env.example file to .env:

cp .env.example .env

Generate the application key:

php artisan key:generate

5. Configure Environment Variables

Edit the .env file and add your API keys:

# Application
APP_NAME="Voice Translation"
APP_URL=http://localhost:8000

# Database
DB_CONNECTION=mysql
DB_HOST=127.0.0.1
DB_PORT=3306
DB_DATABASE=voice_translation
DB_USERNAME=your_username
DB_PASSWORD=your_password

# OpenAI API
OPENAI_API_KEY=your_openai_api_key_here
OPENAI_MODEL_WHISPER=whisper-1
OPENAI_MODEL_TRANSLATION=gpt-4o-mini
OPENAI_MODEL_TTS=tts-1

# Murf.ai Falcon API (for TTS)
MURF_API_KEY=your_murf_api_key_here
MURF_API_URL=https://global.api.murf.ai/v1

6. Database Setup

Run the migrations:

php artisan migrate

7. Create Storage Link

Create a symbolic link for public storage:

php artisan storage:link

8. Build Frontend Assets

For development:

npm run dev

For production:

npm run build

9. Start the Development Server

Use the development script, which starts the Laravel server and the Vite dev server together:

composer run dev

Or manually:

# Terminal 1: Laravel server
php artisan serve

# Terminal 2: Vite dev server (if not using composer run dev)
npm run dev

Configuration

Text-to-Speech (Murf Falcon)

This application uses Murf.ai Falcon for ultra-low latency text-to-speech:

Model: FALCON
Format: MP3
Sample Rate: 24000 Hz
Channel: MONO
Style: Conversation
Time-to-first-audio: ~130ms

Murf Falcon Available Languages

Murf Falcon API supports 13 languages with 18 dialects and 150+ voices:

Language	Dialects/Variants
English	US/Canada, UK, Australia, India, Scottish
Spanish	Mexico, Spain
French	France
German	-
Italian	-
Hindi	-
Portuguese	Brazil
Dutch	-
Korean	-
Chinese	Mandarin
Bengali	-
Tamil	-
Polish	-

Murf Falcon Documentation

Falcon Model Documentation - Ultra-low latency TTS model details
Voice Library & Explorer - Preview and select from 150+ voices
Full API Documentation - Complete API reference

Supported Languages (This App)

Currently configured for:

English (en) - Voice: en-US-matthew
Spanish (es) - Voice: es-ES-carla
French (fr) - Voice: fr-FR-axel
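
The language-to-voice mapping above can be expressed as a small lookup. This is a minimal TypeScript sketch: the codes and voice IDs come from the list above, while the names `VOICES` and `voiceFor` are illustrative and not taken from the project (where the app actually keeps this mapping is not shown here).

```typescript
// Language codes and voice IDs are copied from the list above.
// `VOICES`, `isLanguageCode`, and `voiceFor` are hypothetical names.
type LanguageCode = "en" | "es" | "fr";

const VOICES: Record<LanguageCode, string> = {
  en: "en-US-matthew",
  es: "es-ES-carla",
  fr: "fr-FR-axel",
};

// Narrows an arbitrary string to one of the configured language codes.
function isLanguageCode(value: string): value is LanguageCode {
  return Object.prototype.hasOwnProperty.call(VOICES, value);
}

// Returns the configured voice for a language, or throws for unknown codes.
function voiceFor(language: string): string {
  if (!isLanguageCode(language)) {
    throw new Error(`Unsupported target language: ${language}`);
  }
  return VOICES[language];
}
```

Adding a language would then mean adding one entry to the map, provided Murf Falcon offers a voice for it.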

Usage

1. Access the Application

Navigate to http://localhost:8000 in your browser.

2. Authentication

You need to be authenticated to use the translation feature. Register or log in to your account.

3. Start Translating

Go to the Translations page (/translations)
Select your source language (or use "Auto-detect")
Select your target language
Click "Start Recording"
Speak into your microphone
Click "Stop Recording" when finished
The application will automatically:
- Transcribe your speech using Whisper
- Translate the text using GPT
- Generate audio using the configured TTS provider (Murf Falcon)
- Display the results and play the translated audio
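
The automatic steps above amount to a sequential pipeline in which each stage feeds the next. The following TypeScript sketch illustrates that flow with per-stage timing. It rests on assumptions: the `Stages` interface, `timed`, and `runPipeline` are hypothetical names standing in for the Whisper, GPT, and TTS calls, and the real services are PHP classes under app/Services that are not shown here.

```typescript
// Hypothetical stage signatures; stand-ins for the Whisper, GPT, and TTS calls.
interface Stages {
  transcribe(audio: Uint8Array): Promise<string>;
  translate(text: string, target: string): Promise<string>;
  synthesize(text: string, target: string): Promise<Uint8Array>;
}

interface PipelineResult {
  originalText: string;
  translatedText: string;
  audio: Uint8Array;
  apiTimings: { transcribe: number; translate: number; synthesize: number };
  processingTime: number;
}

// Runs one async step and reports its duration in whole milliseconds.
async function timed<T>(
  step: () => Promise<T>,
  now: () => number,
): Promise<[T, number]> {
  const start = now();
  const value = await step();
  return [value, Math.round(now() - start)];
}

// Runs the three stages in order; the clock is injectable for testing.
async function runPipeline(
  audio: Uint8Array,
  target: string,
  stages: Stages,
  now: () => number = () => Date.now(),
): Promise<PipelineResult> {
  const [originalText, transcribe] = await timed(() => stages.transcribe(audio), now);
  const [translatedText, translate] = await timed(
    () => stages.translate(originalText, target),
    now,
  );
  const [speech, synthesize] = await timed(
    () => stages.synthesize(translatedText, target),
    now,
  );
  return {
    originalText,
    translatedText,
    audio: speech,
    apiTimings: { transcribe, translate, synthesize },
    processingTime: transcribe + translate + synthesize,
  };
}
```

Because the stages run strictly one after another, the total latency is the sum of the three calls, which is why the per-stage timings are worth tracking separately.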

4. View Translation History

All translations are saved and displayed in the history section below the main interface. You can:

View original and translated text
See processing times and API benchmarks
Replay audio from previous translations

API Endpoints

Translation Routes

All routes require authentication (auth middleware).

GET /translations - Display translation interface
POST /translations - Create a new translation
GET /translations/{translation} - Get a specific translation
GET /translations/{translation}/audio - Serve the translated audio file

Translation Request Format

{
  "audio": "<File>",
  "source_language": "auto|en|es|fr",
  "target_language": "en|es|fr"
}
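
Since `audio` is a file, the request is presumably sent as multipart/form-data rather than as a JSON body. The language fields can be checked on the client before uploading. The sketch below mirrors only the allowed values shown above; `validateLanguages` is a hypothetical helper, and the server-side rules in StoreTranslationRequest.php are not shown here and may differ.

```typescript
// Allowed values are copied from the request format above.
const SOURCE_LANGUAGES: readonly string[] = ["auto", "en", "es", "fr"];
const TARGET_LANGUAGES: readonly string[] = ["en", "es", "fr"];

// Returns a list of human-readable problems; an empty list means the fields are valid.
function validateLanguages(source: string, target: string): string[] {
  const errors: string[] = [];
  if (SOURCE_LANGUAGES.indexOf(source) === -1) {
    errors.push(`source_language must be one of: ${SOURCE_LANGUAGES.join(", ")}`);
  }
  if (TARGET_LANGUAGES.indexOf(target) === -1) {
    errors.push(`target_language must be one of: ${TARGET_LANGUAGES.join(", ")}`);
  }
  return errors;
}
```

Note that "auto" is accepted only as a source language; a target language must always be chosen explicitly.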

Translation Response Format

{
  "success": true,
  "translation": {
    "id": 1,
    "original_text": "Hello, how are you?",
    "translated_text": "Hola, ¿cómo estás?",
    "source_language": "en",
    "target_language": "es",
    "audio_url": "http://localhost:8000/storage/translations/generated/...",
    "processing_time": 2345,
    "api_timings": {
      "transcribe": 850,
      "translate": 1200,
      "synthesize": 295
    },
    "created_at": "2025-01-15T10:30:00Z"
  }
}
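
A typed view of this response helps catch shape mismatches early on the client. The interfaces below are derived from the sample above. The parser is a hedged sketch with a hypothetical name (`parseTranslationResponse`); it checks only a few fields, and the project's own types in resources/js/types/translation.ts may look different.

```typescript
interface ApiTimings {
  transcribe: number;
  translate: number;
  synthesize: number;
}

interface Translation {
  id: number;
  original_text: string;
  translated_text: string;
  source_language: string;
  target_language: string;
  audio_url: string;
  processing_time: number;
  api_timings: ApiTimings;
  created_at: string;
}

interface TranslationResponse {
  success: boolean;
  translation: Translation;
}

function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null;
}

// Parses a JSON string and verifies a few key fields before trusting the shape.
function parseTranslationResponse(json: string): TranslationResponse {
  const data: unknown = JSON.parse(json);
  if (!isRecord(data) || data["success"] !== true || !isRecord(data["translation"])) {
    throw new Error("Unexpected translation response shape");
  }
  const t = data["translation"];
  if (
    typeof t["id"] !== "number" ||
    typeof t["translated_text"] !== "string" ||
    typeof t["audio_url"] !== "string" ||
    !isRecord(t["api_timings"])
  ) {
    throw new Error("Translation payload is missing required fields");
  }
  return data as unknown as TranslationResponse;
}
```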

Performance Benchmarking

The application tracks and displays individual API call timings:

Whisper Time: Time taken for speech-to-text transcription
GPT Time: Time taken for text translation
TTS Time: Time taken for text-to-speech synthesis
Total Processing Time: Sum of the three timings above

These metrics are displayed in the UI and stored in the database for each translation.
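
To make the relationship between these metrics concrete, the sketch below totals the three timings and reports which stage dominates. The field names follow the `api_timings` object in the response format; the helper names (`totalTime`, `slowestStage`, `stageShares`) are hypothetical and not taken from the project.

```typescript
interface ApiTimings {
  transcribe: number; // Whisper time in ms
  translate: number;  // GPT time in ms
  synthesize: number; // TTS time in ms
}

type Stage = keyof ApiTimings;
const STAGES: Stage[] = ["transcribe", "translate", "synthesize"];

// Total processing time is the sum of the three stage timings.
function totalTime(t: ApiTimings): number {
  return t.transcribe + t.translate + t.synthesize;
}

// Returns the stage with the largest timing (the first one wins on ties).
function slowestStage(t: ApiTimings): Stage {
  return STAGES.reduce((a, b) => (t[b] > t[a] ? b : a));
}

// Each stage's share of the total, as a whole-number percentage.
function stageShares(t: ApiTimings): Record<Stage, number> {
  const total = totalTime(t);
  const pct = (ms: number): number =>
    total === 0 ? 0 : Math.round((ms / total) * 100);
  return {
    transcribe: pct(t.transcribe),
    translate: pct(t.translate),
    synthesize: pct(t.synthesize),
  };
}
```

With the sample timings from the response format (850, 1200, and 295 ms), the total is 2345 ms and the translation step accounts for roughly half of it.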

Project Structure

laravel-murfai-falcon/
├── app/
│   ├── Contracts/
│   │   └── TextToSpeechService.php
│   ├── Http/
│   │   ├── Controllers/
│   │   │   └── TranslationController.php
│   │   └── Requests/
│   │       └── StoreTranslationRequest.php
│   ├── Models/
│   │   └── Translation.php
│   ├── Providers/
│   │   └── AppServiceProvider.php
│   └── Services/
│       ├── MurfFalconService.php
│       ├── OpenAITranslationService.php
│       ├── OpenAITTSService.php
│       └── OpenAIWhisperService.php
├── config/
│   └── services.php
├── database/
│   └── migrations/
│       └── *_create_translations_table.php
├── resources/
│   ├── js/
│   │   ├── components/
│   │   │   └── translation-history.tsx
│   │   ├── hooks/
│   │   │   └── useAudioRecorder.ts
│   │   ├── pages/
│   │   │   └── translations/
│   │   │       └── index.tsx
│   │   ├── types/
│   │   │   └── translation.ts
│   │   └── lib/
│   │       └── api.ts
│   └── views/
│       └── app.blade.php
└── storage/
    └── app/
        └── public/
            └── translations/
                ├── original/
                └── generated/

Troubleshooting

Audio Recording Issues

Microphone not working: Ensure you've granted microphone permissions in your browser
No audio file created: Check the browser console for errors and confirm that your browser supports the MediaRecorder API
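
Recording failures often come from a browser that does not support the requested audio format. One way to guard against this is to probe formats before creating the recorder. The sketch below is an assumption-laden illustration: `pickRecordingMimeType` is a hypothetical helper, the candidate list is a guess rather than what useAudioRecorder.ts actually uses, and the support check is passed in as a function so the logic can run outside a browser.

```typescript
// Candidate list is an assumption; adjust it to the formats your backend accepts.
const DEFAULT_CANDIDATES: readonly string[] = [
  "audio/webm;codecs=opus",
  "audio/webm",
  "audio/mp4",
];

// Returns the first candidate the environment supports, or null if none match.
function pickRecordingMimeType(
  isTypeSupported: (mimeType: string) => boolean,
  candidates: readonly string[] = DEFAULT_CANDIDATES,
): string | null {
  for (const candidate of candidates) {
    if (isTypeSupported(candidate)) {
      return candidate;
    }
  }
  return null;
}
```

In a browser, after confirming that `typeof MediaRecorder !== "undefined"`, the check can be supplied as `(type) => MediaRecorder.isTypeSupported(type)`. A `null` result means none of the candidates is usable and the UI should report that recording is unavailable.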

API Errors

OpenAI API errors: Verify that your OPENAI_API_KEY is correct and that the associated account has sufficient credits
Murf.ai API errors: Verify your MURF_API_KEY is correct and the Falcon model is available on your plan
Transcription fails: Ensure the audio file is clear and contains speech

Storage Issues

Audio files not accessible: Run php artisan storage:link to create the symbolic link
Permission errors: Ensure the storage/app/public directory is writable

Frontend Build Issues

Changes not reflected: Run npm run build or npm run dev to rebuild assets
TypeScript errors: Run npm run types to check for type errors

Development

Running Tests

php artisan test

Code Formatting

# PHP
vendor/bin/pint

# JavaScript/TypeScript
npm run format

Linting

# JavaScript/TypeScript
npm run lint

License

This project is open-sourced software licensed under the MIT license.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Support

For issues and questions, please open an issue on the repository.
