---
language: "en"
tags:
- sentiment
- emotion
- twitter

widget:
- text: "Oh wow. I didn't know that."
- text: "This movie always makes me cry.."
- text: "Oh Happy Day"

---

## Description

This model classifies emotions in English text. It was trained on 6 diverse datasets (listed in the Appendix) and predicts 7 emotions:

1) anger
2) disgust
3) fear
4) joy
5) neutral
6) sadness
7) surprise

The model is a fine-tuned checkpoint of DistilRoBERTa-base. The emotions reflect Ekman's 6 universal emotions, plus a neutral class.

## Application

a) Run the emotion model on a single text example with 3 lines of code, using Hugging Face's pipeline command on Google Colab:

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/j-hartmann/emotion-english-distilroberta-base/blob/main/simple_emotion_pipeline.ipynb)
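For readers who want to try the single-text case outside Colab, a minimal sketch follows. It is not the notebook's code. It assumes a recent version of the `transformers` library (with a backend such as PyTorch) is installed, and that the model's Hub ID matches the GitHub repository name in the link above, `j-hartmann/emotion-english-distilroberta-base`. The helper names `load_classifier` and `top_emotion` are hypothetical.

```python
# Assumption: the Hub model ID mirrors the GitHub repository name linked above.
MODEL_ID = "j-hartmann/emotion-english-distilroberta-base"


def load_classifier(model_id=MODEL_ID):
    """Build a text-classification pipeline that returns a score for every label."""
    # Imported lazily so top_emotion() can be used without transformers installed.
    from transformers import pipeline

    return pipeline("text-classification", model=model_id, top_k=None)


def top_emotion(result):
    """Return (label, score) of the highest-scoring emotion for one text.

    Depending on the transformers version, the result for a single string is
    either a list of {"label", "score"} dicts or that list wrapped in another list.
    """
    scores = result[0] if result and isinstance(result[0], list) else result
    best = max(scores, key=lambda item: item["score"])
    return best["label"], best["score"]


# Usage (downloads the model on the first call):
# classifier = load_classifier()
# print(top_emotion(classifier("Oh wow. I didn't know that.")))
```

Passing `top_k=None` asks the pipeline for the scores of all 7 classes rather than only the best one, which is useful when the top two emotions are close.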

b) Run the emotion model on multiple examples and full datasets (e.g., .csv files) on Google Colab:

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/j-hartmann/emotion-english-distilroberta-base/blob/main/emotion_prediction_example.ipynb)
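For multiple texts or a .csv file, a minimal local sketch might look like the following. It is an illustration under stated assumptions, not the notebook's code: the CSV is assumed to have a header row with a column named `text`, the file name `reviews.csv` is made up, the Hub ID is assumed to match the GitHub repository name, and the helpers `read_texts` and `label_texts` are hypothetical.

```python
import csv

# Assumption: the Hub model ID mirrors the GitHub repository name linked above.
MODEL_ID = "j-hartmann/emotion-english-distilroberta-base"


def read_texts(csv_path, column="text"):
    """Read one text column from a CSV file that has a header row."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return [row[column] for row in csv.DictReader(handle)]


def label_texts(texts, classifier):
    """Pair each text with its highest-scoring emotion label.

    `classifier` is any callable that maps a list of strings to one list of
    {"label", "score"} dicts per string, as a transformers pipeline
    created with top_k=None does for list input.
    """
    results = classifier(texts)
    return [
        (text, max(scores, key=lambda item: item["score"])["label"])
        for text, scores in zip(texts, results)
    ]


# Usage (requires transformers; downloads the model on the first call):
# from transformers import pipeline
# classifier = pipeline("text-classification", model=MODEL_ID, top_k=None)
# for text, label in label_texts(read_texts("reviews.csv"), classifier):
#     print(label, "|", text)
```

Keeping the classifier as a parameter makes the labeling step easy to test with a stand-in function before loading the real model.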

## Contact

Please reach out to jochen.hartmann@uni-hamburg.de if you have any questions or feedback.

Thanks to Samuel Domdey and chrsiebert for their support in making this model available.

## Appendix

The table below lists the datasets used for training and shows which emotions each one covers. All datasets contain English text.

|Name|anger|disgust|fear|joy|neutral|sadness|surprise|
|---|---|---|---|---|---|---|---|
|Crowdflower (2016)|Yes|-|-|Yes|Yes|Yes|Yes|
|Emotion Dataset, Elvis et al. (2018)|Yes|Yes|Yes|Yes|-|Yes|Yes|
|GoEmotions, Demszky et al. (2020)|Yes|Yes|Yes|Yes|Yes|Yes|Yes|
|ISEAR, Vikash (2018)|Yes|Yes|Yes|Yes|-|Yes|-|
|MELD, Poria et al. (2019)|Yes|Yes|Yes|Yes|Yes|Yes|Yes|
|SemEval-2018, EI-reg, Mohammad et al. (2018)|Yes|-|Yes|Yes|-|Yes|-|

The datasets represent a diverse set of text types. Specifically, they contain emotion labels for texts from Twitter, Reddit, student self-reports, and utterances from TV dialogues. As MELD (Multimodal EmotionLines Dataset) extends the EmotionLines dataset, EmotionLines itself is not included here.