Update README.md
parent be3f459aeb
commit ebe7999a05
@@ -13,7 +13,14 @@ should probably proofread and complete it, then remove this comment. -->
 
 # wav2vec2-lg-xlsr-en-speech-emotion-recognition
 
-This model is a fine-tuned version of [jonatasgrosman/wav2vec2-large-xlsr-53-english](https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english) on an unkown dataset.
+This model is a fine-tuned version of [jonatasgrosman/wav2vec2-large-xlsr-53-english](https://huggingface.co/jonatasgrosman/wav2vec2-large-xlsr-53-english) for a Speech Emotion Recognition (SER) task.
+
+The dataset used to fine-tune the original pre-trained model is the [RAVDESS dataset](https://zenodo.org/record/1188976#.YO6yI-gzaUk). This dataset provides 1440 samples of recordings from actors performing on 8 different emotions in English, which are:
+
+```python
+emotions = ['angry', 'calm', 'disgust', 'fearful', 'happy', 'neutral', 'sad', 'surprised']
+```
+
 It achieves the following results on the evaluation set:
 - Loss: 0.5023
 - Accuracy: 0.8223
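
The README text added in this commit lists eight emotion labels. As a minimal sketch of how a classifier's per-class scores might be mapped back to those labels, the snippet below picks the highest-scoring index. It assumes the model's output order matches the order of the list in the README, which the commit does not confirm. The helper `predict_emotion` and the example scores are hypothetical and are not part of the model's API.

```python
# Emotion labels as listed in the README.
# ASSUMPTION: the model's output index order matches this list order.
EMOTIONS = ['angry', 'calm', 'disgust', 'fearful', 'happy', 'neutral', 'sad', 'surprised']


def predict_emotion(scores):
    """Return the label with the highest score (hypothetical helper)."""
    if len(scores) != len(EMOTIONS):
        raise ValueError(
            f"expected {len(EMOTIONS)} scores, got {len(scores)}"
        )
    # Index of the largest score; ties resolve to the first such index.
    best_index = max(range(len(scores)), key=lambda i: scores[i])
    return EMOTIONS[best_index]


# Made-up scores for illustration only, not real model output.
example_scores = [0.10, 0.05, 0.02, 0.03, 0.60, 0.10, 0.05, 0.05]
print(predict_emotion(example_scores))  # happy
```

When working with the actual checkpoint, the label mapping stored in the model's own configuration should take precedence over this hard-coded list.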