Update README.md
This commit is contained in:
parent
8858eda923
commit
17cf17c4ec
|
@ -8,15 +8,8 @@ model_index:
|
||||||
name: wav2vec2-lg-xlsr-en-speech-emotion-recognition
|
name: wav2vec2-lg-xlsr-en-speech-emotion-recognition
|
||||||
---
|
---
|
||||||
|
|
||||||
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
||||||
should probably proofread and complete it, then remove this comment. -->
|
|
||||||
|
|
||||||
# Speech Emotion Recognition By Fine-Tuning Wav2Vec 2.0
|
# Speech Emotion Recognition By Fine-Tuning Wav2Vec 2.0
|
||||||
|
|
||||||
|
|
||||||
__Important:__ This model it's not yet implementable due to missing built-in functions in HuggingFace for speech classification tasks. I'm working on the instructions of how to use it and a repository where the code will be available soon. Thank you anyway!
|
|
||||||
|
|
||||||
|
|
||||||
The model is a fine-tuned version of [jonatasgrosman/wav2vec2-large-xlsr-53-english](https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english) for a Speech Emotion Recognition (SER) task.
|
The model is a fine-tuned version of [jonatasgrosman/wav2vec2-large-xlsr-53-english](https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english) for a Speech Emotion Recognition (SER) task.
|
||||||
|
|
||||||
The dataset used to fine-tune the original pre-trained model is the [RAVDESS dataset](https://zenodo.org/record/1188976#.YO6yI-gzaUk). This dataset provides 1440 samples of recordings from actors performing on 8 different emotions in English, which are:
|
The dataset used to fine-tune the original pre-trained model is the [RAVDESS dataset](https://zenodo.org/record/1188976#.YO6yI-gzaUk). This dataset provides 1440 samples of recordings from actors performing on 8 different emotions in English, which are:
|
||||||
|
|
Loading…
Reference in New Issue