Skip to content

Conversation

@stratus-ss
Copy link
Contributor

This PR adds the ability to save and reuse speech recognition transcripts, dramatically improving workflow efficiency when processing the same audio multiple times.

Features Added

Core Functionality:

  • --save-transcript: Automatically saves generated transcripts as JSON
  • --input-transcript: Explicitly loads a saved transcript
  • --force-retranscribe: Forces new transcription even if cached transcript exists
  • Automatic detection: When --save-transcript is used, existing transcripts are automatically detected and reused

As per the readme you can test with both the included test or

# First run: generates transcript (slow)
monkeyplug -i input/Witch_mother1.m4b -o output/clean.m4a --save-transcript

# Second run: automatically reuses output/clean_transcript.json (fast!)
monkeyplug -i input/Witch_mother1.m4b -o output/clean.m4a --save-transcript

# Use different swear list with same transcript
monkeyplug -i input/Witch_mother1.m4b -o output/strict.m4a \
  --input-transcript output/clean_transcript.json -w swear_list.json

- Add --save-transcript flag to save transcripts as JSON
- Add --input-transcript flag to reuse existing transcripts
- Add --force-retranscribe flag to ignore cached transcripts
- Implement automatic transcript detection and reuse
- Include test audio file for real-world validation
@stratus-ss
Copy link
Contributor Author

This PR needs to be merged before I can proceed with other PRs so I will hold here in case there are adjustments that need to be made

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd prefer to not have binary files in the repo. If you'd remove this file from the PR (I know your test uses it; so if you'd like to keep it in your fork or something that's of course your choice) I'll go ahead and pull it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants