Compare commits

..

No commits in common. "b54ced2147b8f08e5fcee06312c63737fccf87d5" and "5f4ee2cd669cd869182d9e5bd57b191e22154af4" have entirely different histories.

1 changed files with 12 additions and 8 deletions

View File

@ -7,14 +7,16 @@ metrics:
- wer
- cer
tags:
- en
- audio
- automatic-speech-recognition
- en
- hf-asr-leaderboard
- mozilla-foundation/common_voice_6_0
- robust-speech-event
- speech
- xlsr-fine-tuning-week
- robust-speech-event
- mozilla-foundation/common_voice_6_0
- hf-asr-leaderboard
- hf-asr-leaderboard
- hf-asr-leaderboard
license: apache-2.0
model-index:
- name: XLSR Wav2Vec2 English by Jonatas Grosman
@ -61,9 +63,9 @@ model-index:
value: 11.01
---
# Fine-tuned XLSR-53 large model for speech recognition in English
# Wav2Vec2-Large-XLSR-53-English
Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English using the train and validation splits of [Common Voice 6.1](https://huggingface.co/datasets/common_voice).
Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English using the [Common Voice](https://huggingface.co/datasets/common_voice).
When using this model, make sure that your speech input is sampled at 16kHz.
This model has been fine-tuned thanks to the GPU credits generously given by the [OVHcloud](https://www.ovhcloud.com/en/public-cloud/ai-training/) :)
@ -156,9 +158,11 @@ python eval.py --model_id jonatasgrosman/wav2vec2-large-xlsr-53-english --datase
If you want to cite this model you can use this:
```bibtex
@misc{grosman2021xlsr53-large-english,
title={Fine-tuned {XLSR}-53 large model for speech recognition in {E}nglish},
@misc{grosman2021wav2vec2-large-xlsr-53-english,
title={XLSR Wav2Vec2 English by Jonatas Grosman},
author={Grosman, Jonatas},
publisher={Hugging Face},
journal={Hugging Face Hub},
howpublished={\url{https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english}},
year={2021}
}