Compare commits

..

No commits in common. "b54ced2147b8f08e5fcee06312c63737fccf87d5" and "5f4ee2cd669cd869182d9e5bd57b191e22154af4" have entirely different histories.

1 changed files with 12 additions and 8 deletions

View File

@ -7,14 +7,16 @@ metrics:
- wer - wer
- cer - cer
tags: tags:
- en
- audio - audio
- automatic-speech-recognition - automatic-speech-recognition
- en
- hf-asr-leaderboard
- mozilla-foundation/common_voice_6_0
- robust-speech-event
- speech - speech
- xlsr-fine-tuning-week - xlsr-fine-tuning-week
- robust-speech-event
- mozilla-foundation/common_voice_6_0
- hf-asr-leaderboard
- hf-asr-leaderboard
- hf-asr-leaderboard
license: apache-2.0 license: apache-2.0
model-index: model-index:
- name: XLSR Wav2Vec2 English by Jonatas Grosman - name: XLSR Wav2Vec2 English by Jonatas Grosman
@ -61,9 +63,9 @@ model-index:
value: 11.01 value: 11.01
--- ---
# Fine-tuned XLSR-53 large model for speech recognition in English # Wav2Vec2-Large-XLSR-53-English
Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English using the train and validation splits of [Common Voice 6.1](https://huggingface.co/datasets/common_voice). Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English using the [Common Voice](https://huggingface.co/datasets/common_voice).
When using this model, make sure that your speech input is sampled at 16kHz. When using this model, make sure that your speech input is sampled at 16kHz.
This model has been fine-tuned thanks to the GPU credits generously given by the [OVHcloud](https://www.ovhcloud.com/en/public-cloud/ai-training/) :) This model has been fine-tuned thanks to the GPU credits generously given by the [OVHcloud](https://www.ovhcloud.com/en/public-cloud/ai-training/) :)
@ -156,9 +158,11 @@ python eval.py --model_id jonatasgrosman/wav2vec2-large-xlsr-53-english --datase
If you want to cite this model you can use this: If you want to cite this model you can use this:
```bibtex ```bibtex
@misc{grosman2021xlsr53-large-english, @misc{grosman2021wav2vec2-large-xlsr-53-english,
title={Fine-tuned {XLSR}-53 large model for speech recognition in {E}nglish}, title={XLSR Wav2Vec2 English by Jonatas Grosman},
author={Grosman, Jonatas}, author={Grosman, Jonatas},
publisher={Hugging Face},
journal={Hugging Face Hub},
howpublished={\url{https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english}}, howpublished={\url{https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english}},
year={2021} year={2021}
} }