From 9eb0395187a5d52ffd0ca04cbd66ab7996f1b781 Mon Sep 17 00:00:00 2001
From: Johannes Wagner
Date: Wed, 6 Apr 2022 13:28:08 +0000
Subject: [PATCH] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 3ab0835..369ec6d 100644
--- a/README.md
+++ b/README.md
@@ -16,7 +16,7 @@ license: cc-by-nc-sa-4.0
 
 The model expects a raw audio signal as input and outputs predictions for arousal, dominance and valence in a range of approximately 0...1. In addition, it also provides the pooled states of the last transformer layer. The model was created by fine-tuning [ Wav2Vec2-Large-Robust](https://huggingface.co/facebook/wav2vec2-large-robust) on [MSP-Podcast](https://ecs.utdallas.edu/research/researchlabs/msp-lab/MSP-Podcast.html) (v1.7). The model was pruned from 24 to 12 transformer layers before fine-tuning. An [ONNX](https://onnx.ai/") export of the model is available from [doi:10.5281/zenodo.6221127](https://zenodo.org/record/6221127). Further details are given in the associated [paper](https://arxiv.org/abs/2203.07378).
 
-# How to
+# Usage
 
 ```python
 import numpy as np