Compare commits

..

10 Commits

Author SHA1 Message Date
Jonatas Grosman b54ced2147 Update README.md 2022-12-14 02:02:32 +00:00
Jonatas Grosman 2337bf53f5 Update README.md 2022-08-18 19:05:58 +00:00
Jonatas Grosman 95977151cc Update README.md 2022-07-27 23:37:25 +00:00
Jonatas Grosman 2f923b5935 Update README.md 2022-07-14 19:32:29 +00:00
Jonatas Grosman 5d103dcbb7 Update README.md 2022-06-22 17:04:24 +00:00
Jonatas Grosman c3a4b82f0d Update README.md 2022-06-22 16:16:09 +00:00
Jonatas Grosman 04ecb5902a Update README.md 2022-06-22 16:08:05 +00:00
Anton Lozhkov b2d3244635 Upload README.md 2022-03-23 18:24:41 +00:00
Anton Lozhkov 3d87c53491 Upload README.md 2022-03-23 18:23:57 +00:00
Anton Lozhkov bb14734247 Update README.md 2022-03-23 18:23:19 +00:00
1 changed files with 8 additions and 12 deletions

View File

@ -7,16 +7,14 @@ metrics:
- wer - wer
- cer - cer
tags: tags:
- en
- audio - audio
- automatic-speech-recognition - automatic-speech-recognition
- en
- hf-asr-leaderboard
- mozilla-foundation/common_voice_6_0
- robust-speech-event
- speech - speech
- xlsr-fine-tuning-week - xlsr-fine-tuning-week
- robust-speech-event
- mozilla-foundation/common_voice_6_0
- hf-asr-leaderboard
- hf-asr-leaderboard
- hf-asr-leaderboard
license: apache-2.0 license: apache-2.0
model-index: model-index:
- name: XLSR Wav2Vec2 English by Jonatas Grosman - name: XLSR Wav2Vec2 English by Jonatas Grosman
@ -63,9 +61,9 @@ model-index:
value: 11.01 value: 11.01
--- ---
# Wav2Vec2-Large-XLSR-53-English # Fine-tuned XLSR-53 large model for speech recognition in English
Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English using the [Common Voice](https://huggingface.co/datasets/common_voice). Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English using the train and validation splits of [Common Voice 6.1](https://huggingface.co/datasets/common_voice).
When using this model, make sure that your speech input is sampled at 16kHz. When using this model, make sure that your speech input is sampled at 16kHz.
This model has been fine-tuned thanks to the GPU credits generously given by the [OVHcloud](https://www.ovhcloud.com/en/public-cloud/ai-training/) :) This model has been fine-tuned thanks to the GPU credits generously given by the [OVHcloud](https://www.ovhcloud.com/en/public-cloud/ai-training/) :)
@ -158,11 +156,9 @@ python eval.py --model_id jonatasgrosman/wav2vec2-large-xlsr-53-english --datase
If you want to cite this model you can use this: If you want to cite this model you can use this:
```bibtex ```bibtex
@misc{grosman2021wav2vec2-large-xlsr-53-english, @misc{grosman2021xlsr53-large-english,
title={XLSR Wav2Vec2 English by Jonatas Grosman}, title={Fine-tuned {XLSR}-53 large model for speech recognition in {E}nglish},
author={Grosman, Jonatas}, author={Grosman, Jonatas},
publisher={Hugging Face},
journal={Hugging Face Hub},
howpublished={\url{https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english}}, howpublished={\url{https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english}},
year={2021} year={2021}
} }