Releases · wolfgitpr/HubertFA

30 Dec 10:27

wolfgitpr

v0.0.7

521281f

v0.0.7: Onnx Model for Mandarin&Japanese&English Singing Voice and automatically identify AP and EP. Latest

Latest

Code version: v0.0.7

Language used: Chinese Mandarin, Japanese Romanization (Mandarin&Japanese)
Using the dictionary: ds-zh-pinyin-lite/japanese_dict_full/ds_cmudict-07b
Scope of application: Singing voice
Non acoustic automatic recognition: AP, EP (not recommend)
Release date: 2025-12-30

Migrate CSV Transcription Files

The completed diffsinger dataset has been created. Use this script to migrate the CSV format annotations of the old dictionary to the new dictionary.

python scripts/migrate_dict.py csv [TRANSCRIPTION_CSV] \
    --source-dict [SOURCE_DICT] \
    --target-dict [TARGET_DICT] \
    [--save-path [OUTPUT_CSV] ] \
    [--overwrite]

- `TRANSCRIPTION_CSV`: Path to CSV file with transcriptions.

- `--source-dict`: Path to source dictionary file.

- `--target-dict`: Path to target dictionary file.

- `--save-path`: Path to save migrated file (defaults to original file).

- `--overwrite`: Overwrite existing file (optional flag).

Data contributors

白烁
风羽翼Tsubasa
烛曦遥Haruka
夜燐Yarin
芸青岩
绮萱

Full Changelog: v0.0.6...v0.0.7

Assets 3

17 Dec 13:04

wolfgitpr

v0.0.6

0d77172

v0.0.6: Onnx Model for Mandarin&Japanese&English Singing Voice and automatically identify AP and EP.

Code version: v0.0.6

Language used: Chinese Mandarin, Japanese Romanization (Mandarin&Japanese)
Using the dictionary: opencpop-expression/japanese_dict_full/ds_cmudict-07b
Scope of application: Singing voice
Non acoustic automatic recognition: AP, EP (not recommend)
Release date: 2025-12-17

Onnx infer

pip install -r requirements_onnx.txt

python onnx_infer.py --onnx_path xxx --wav_folder xxx_wav --language zh ...

args:
- --onnx_path / -m: Path to Onnx models.
- --wav_folder / -wf: Input folder path. (default: segments)
- --out_path / -o: Path to the output label.
- --language / -l: Designated language, zh ja en yue. (default: zh)
- --non_speech_phonemes / -np: Non speech phonemes. (default: AP, optional AP,EP)
- --pad_times / -pt: The number of times to pad blank audio before reasoning. (default: 1)
- --pad_length / -pl: The max length of blank audio on the pad before inference. (default: 5)
- --dictionary / -d: Custom dictionary path.

Data contributors

白烁
风羽翼Tsubasa
烛曦遥Haruka
夜燐Yarin
芸青岩
绮萱

Full Changelog: v0.0.5...v0.0.6

Assets 3

08 Dec 14:16

wolfgitpr

v0.0.5

6e8c864

v0.0.5 fixed: Onnx Model for Mandarin&Japanese&English Singing Voice and automatically identify AP and EP.

Code version: v0.0.5 fixed

Language used: Chinese Mandarin, Japanese Romanization (Mandarin&Japanese)
Using the dictionary: opencpop-expression/apanese_dict_full/ds_cmudict-07b. txt
Scope of application: Singing voice
Non acoustic automatic recognition: AP, EP (not recommend)
Release date: 2025-12-8

onnx infer

pip install -r requirements_onnx.txt

python onnx_infer.py --onnx_path xxx --wav_folder xxx_wav --language zh ...

Data contributors

白烁
风羽翼Tsubasa
烛曦遥Haruka
夜燐Yarin
芸青岩
绮萱

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Code version: v0.0.7

Migrate CSV Transcription Files

Data contributors

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Code version: v0.0.6

Onnx infer

Data contributors

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Code version: v0.0.5 fixed

onnx infer

Data contributors

Uh oh!

Releases: wolfgitpr/HubertFA

v0.0.7: Onnx Model for Mandarin&Japanese&English Singing Voice and automatically identify AP and EP.

Code version: v0.0.7

Migrate CSV Transcription Files

Data contributors

Uh oh!

v0.0.6: Onnx Model for Mandarin&Japanese&English Singing Voice and automatically identify AP and EP.

Code version: v0.0.6

Onnx infer

Data contributors

Uh oh!

v0.0.5 fixed: Onnx Model for Mandarin&Japanese&English Singing Voice and automatically identify AP and EP.

Code version: v0.0.5 fixed

onnx infer

Data contributors

Uh oh!