Compare commits
10 Commits
5f4ee2cd66
...
b54ced2147
Author | SHA1 | Date |
---|---|---|
|
b54ced2147 | |
|
2337bf53f5 | |
|
95977151cc | |
|
2f923b5935 | |
|
5d103dcbb7 | |
|
c3a4b82f0d | |
|
04ecb5902a | |
|
b2d3244635 | |
|
3d87c53491 | |
|
bb14734247 |
20
README.md
20
README.md
|
@ -7,16 +7,14 @@ metrics:
|
||||||
- wer
|
- wer
|
||||||
- cer
|
- cer
|
||||||
tags:
|
tags:
|
||||||
- en
|
|
||||||
- audio
|
- audio
|
||||||
- automatic-speech-recognition
|
- automatic-speech-recognition
|
||||||
|
- en
|
||||||
|
- hf-asr-leaderboard
|
||||||
|
- mozilla-foundation/common_voice_6_0
|
||||||
|
- robust-speech-event
|
||||||
- speech
|
- speech
|
||||||
- xlsr-fine-tuning-week
|
- xlsr-fine-tuning-week
|
||||||
- robust-speech-event
|
|
||||||
- mozilla-foundation/common_voice_6_0
|
|
||||||
- hf-asr-leaderboard
|
|
||||||
- hf-asr-leaderboard
|
|
||||||
- hf-asr-leaderboard
|
|
||||||
license: apache-2.0
|
license: apache-2.0
|
||||||
model-index:
|
model-index:
|
||||||
- name: XLSR Wav2Vec2 English by Jonatas Grosman
|
- name: XLSR Wav2Vec2 English by Jonatas Grosman
|
||||||
|
@ -63,9 +61,9 @@ model-index:
|
||||||
value: 11.01
|
value: 11.01
|
||||||
---
|
---
|
||||||
|
|
||||||
# Wav2Vec2-Large-XLSR-53-English
|
# Fine-tuned XLSR-53 large model for speech recognition in English
|
||||||
|
|
||||||
Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English using the [Common Voice](https://huggingface.co/datasets/common_voice).
|
Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English using the train and validation splits of [Common Voice 6.1](https://huggingface.co/datasets/common_voice).
|
||||||
When using this model, make sure that your speech input is sampled at 16kHz.
|
When using this model, make sure that your speech input is sampled at 16kHz.
|
||||||
|
|
||||||
This model has been fine-tuned thanks to the GPU credits generously given by the [OVHcloud](https://www.ovhcloud.com/en/public-cloud/ai-training/) :)
|
This model has been fine-tuned thanks to the GPU credits generously given by the [OVHcloud](https://www.ovhcloud.com/en/public-cloud/ai-training/) :)
|
||||||
|
@ -158,11 +156,9 @@ python eval.py --model_id jonatasgrosman/wav2vec2-large-xlsr-53-english --datase
|
||||||
If you want to cite this model you can use this:
|
If you want to cite this model you can use this:
|
||||||
|
|
||||||
```bibtex
|
```bibtex
|
||||||
@misc{grosman2021wav2vec2-large-xlsr-53-english,
|
@misc{grosman2021xlsr53-large-english,
|
||||||
title={XLSR Wav2Vec2 English by Jonatas Grosman},
|
title={Fine-tuned {XLSR}-53 large model for speech recognition in {E}nglish},
|
||||||
author={Grosman, Jonatas},
|
author={Grosman, Jonatas},
|
||||||
publisher={Hugging Face},
|
|
||||||
journal={Hugging Face Hub},
|
|
||||||
howpublished={\url{https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english}},
|
howpublished={\url{https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english}},
|
||||||
year={2021}
|
year={2021}
|
||||||
}
|
}
|
||||||
|
|
Loading…
Reference in New Issue