Adding generation config file(s)

Update TF weights (#3 )
- Update TF weights (c95076622ddb2378f0bbbcd48d80cb65af803099)
5 changed files with 101 additions and 9 deletions
--- a/README.md
+++ b/README.md
@@ -1,22 +1,68 @@
 ---
 tags:
 - translation
 license: cc-by-4.0
 ---
 ### opus-mt-en-de
 * source languages: en
 * target languages: de
 *  OPUS readme: [en-de](https://github.com/Helsinki-NLP/OPUS-MT-train/blob/master/models/en-de/README.md)
-*  dataset: opus
+## Table of Contents
-* model: transformer-align
+- [Model Details](#model-details)
 - [Uses](#uses)
 - [Risks, Limitations and Biases](#risks-limitations-and-biases)
 - [Training](#training)
 - [Evaluation](#evaluation)
 - [Citation Information](#citation-information)
 - [How to Get Started With the Model](#how-to-get-started-with-the-model)
 ## Model Details
 **Model Description:**
 - **Developed by:** Language Technology Research Group at the University of Helsinki
 - **Model Type:** Translation
 - **Language(s):**  
  - Source Language: English
  - Target Language: German 
 - **License:** CC-BY-4.0
 - **Resources for more information:**
  - [GitHub Repo](https://github.com/Helsinki-NLP/OPUS-MT-train)
 ## Uses
 #### Direct Use
 This model can be used for translation and text-to-text generation.
 ## Risks, Limitations and Biases
 **CONTENT WARNING: Readers should be aware this section contains content that is disturbing, offensive, and can propagate historical and current stereotypes.**
 Significant research has explored bias and fairness issues with language models (see, e.g., [Sheng et al. (2021)](https://aclanthology.org/2021.acl-long.330.pdf) and [Bender et al. (2021)](https://dl.acm.org/doi/pdf/10.1145/3442188.3445922)).
 Further details about the dataset for this model can be found in the OPUS readme: [en-de](https://github.com/Helsinki-NLP/OPUS-MT-train/blob/master/models/en-de/README.md)
 #### Training Data
 ##### Preprocessing
 * pre-processing: normalization + SentencePiece
 * dataset: [opus](https://github.com/Helsinki-NLP/Opus-MT)
 * download original weights: [opus-2020-02-26.zip](https://object.pouta.csc.fi/OPUS-MT-models/en-de/opus-2020-02-26.zip)
 * test set translations: [opus-2020-02-26.test.txt](https://object.pouta.csc.fi/OPUS-MT-models/en-de/opus-2020-02-26.test.txt)
 ## Evaluation
 #### Results
 * test set scores: [opus-2020-02-26.eval.txt](https://object.pouta.csc.fi/OPUS-MT-models/en-de/opus-2020-02-26.eval.txt)
-## Benchmarks
+
 #### Benchmarks
 | testset               | BLEU  | chr-F |
 |-----------------------|-------|-------|
@@ -34,3 +80,30 @@ tags:
 | newstest2019-ende.en.de 	| 40.9 	| 0.654 |
 | Tatoeba.en.de 	| 47.3 	| 0.664 |
 ## Citation Information
 ```bibtex
@InProceedings{TiedemannThottingal:EAMT2020,
  author = {J{\"o}rg Tiedemann and Santhosh Thottingal},
  title = {{OPUS-MT} -- {B}uilding open translation services for the {W}orld},
  booktitle = {Proceedings of the 22nd Annual Conference of the European Association for Machine Translation (EAMT)},
  year = {2020},
  address = {Lisbon, Portugal}
 }
 ```
 ## How to Get Started With the Model
 ```python
 from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
 tokenizer = AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-de")
 model = AutoModelForSeq2SeqLM.from_pretrained("Helsinki-NLP/opus-mt-en-de")
 ```
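
The snippet above only loads the tokenizer and model. A minimal sketch of actually translating text could look like the following. It assumes `transformers`, PyTorch, and `sentencepiece` are installed; the helper names `batched` and `translate` and the `batch_size` default are illustrative and not part of the model card.

```python
from typing import Iterator, List


def batched(items: List[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `items` with at most `size` elements."""
    if size < 1:
        raise ValueError("size must be >= 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def translate(
    texts: List[str],
    model_name: str = "Helsinki-NLP/opus-mt-en-de",
    batch_size: int = 8,  # hypothetical default, tune for available memory
) -> List[str]:
    """Translate English sentences to German, one batch at a time."""
    # Imported lazily so the helpers above can be used without the library.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    outputs: List[str] = []
    for batch in batched(texts, batch_size):
        # Pad to the longest sentence in the batch and return PyTorch tensors.
        encoded = tokenizer(batch, return_tensors="pt", padding=True, truncation=True)
        # generate() falls back to the repository's stored generation settings.
        generated = model.generate(**encoded)
        outputs.extend(tokenizer.batch_decode(generated, skip_special_tokens=True))
    return outputs
```

Calling `translate(["How are you today?"])` downloads the weights on first use and returns a list with one German string per input sentence.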
--- a/config.json
+++ b/config.json
@@ -53,7 +53,7 @@
  "pad_token_id": 58100,
  "scale_embedding": true,
  "static_position_embeddings": true,
-  "transformers_version": "4.4.0.dev0",
+  "transformers_version": "4.9.0.dev0",
  "use_cache": true,
  "vocab_size": 58101
 }
--- a/flax_model.msgpack
+++ b/flax_model.msgpack
--- a/generation_config.json
+++ b/generation_config.json
@@ -0,0 +1,16 @@
 {
  "_from_model_config": true,
  "bad_words_ids": [
    [
      58100
    ]
  ],
  "bos_token_id": 0,
  "decoder_start_token_id": 58100,
  "eos_token_id": 0,
  "forced_eos_token_id": 0,
  "max_length": 512,
  "num_beams": 4,
  "pad_token_id": 58100,
  "transformers_version": "4.27.0.dev0"
 }
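
As a rough illustration of how the values in this new file relate to each other, the sketch below parses the same JSON with the standard library and runs a few sanity checks. The checks are assumptions chosen for illustration (token ids within the `vocab_size` of 58101 from `config.json`, the pad token banned via `bad_words_ids`, decoding started from the pad token); they are not validation that `transformers` itself performs, and `check_generation_config` is a hypothetical helper.

```python
import json

# Contents of generation_config.json as added in this commit.
GENERATION_CONFIG = json.loads("""
{
  "_from_model_config": true,
  "bad_words_ids": [[58100]],
  "bos_token_id": 0,
  "decoder_start_token_id": 58100,
  "eos_token_id": 0,
  "forced_eos_token_id": 0,
  "max_length": 512,
  "num_beams": 4,
  "pad_token_id": 58100,
  "transformers_version": "4.27.0.dev0"
}
""")

TOKEN_ID_KEYS = (
    "bos_token_id",
    "eos_token_id",
    "forced_eos_token_id",
    "pad_token_id",
    "decoder_start_token_id",
)


def check_generation_config(cfg: dict, vocab_size: int) -> list:
    """Return a list of human-readable problems; empty if none were found."""
    problems = []
    # Every special token id should index into the vocabulary.
    for key in TOKEN_ID_KEYS:
        token_id = cfg.get(key)
        if token_id is not None and not 0 <= token_id < vocab_size:
            problems.append(f"{key}={token_id} is outside the vocabulary")
    # The pad token should never be emitted, so it is expected in bad_words_ids.
    if [cfg.get("pad_token_id")] not in cfg.get("bad_words_ids", []):
        problems.append("pad token is not listed in bad_words_ids")
    # In this config the decoder is primed with the pad token.
    if cfg.get("decoder_start_token_id") != cfg.get("pad_token_id"):
        problems.append("decoder_start_token_id differs from pad_token_id")
    return problems


# vocab_size is taken from config.json in the same repository.
problems = check_generation_config(GENERATION_CONFIG, vocab_size=58101)
print(problems or "no problems found")
```

Running the same function with a deliberately small `vocab_size` is a quick way to confirm the range check fires.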
--- a/tf_model.h5
+++ b/tf_model.h5
Author	SHA1	Message	Date
Joao Gante	338a95cccb	Adding generation config file(s)	2023-01-24 15:02:01 +00:00
joaogante	bb3e01d60e	Update TF weights (#3 ) - Update TF weights (c95076622ddb2378f0bbbcd48d80cb65af803099)	2022-09-05 09:45:24 +00:00
Jörg Tiedemann	61a2efe1dd	Update README.md	2022-07-14 08:57:07 +00:00
tiedeman	86b3ded812	Model Card (#1 ) - Model Card (0161e478c48dbb4dd22fb4a03a04816533fb56f2) Co-authored-by: Ezi Ozoani <Ezi@users.noreply.huggingface.co>	2022-07-14 08:56:40 +00:00
Julien Chaumond	6c00b328d3	metadata: add license	2021-09-09 23:34:54 +02:00
Patrick	8050bfaae7	add model	2021-07-13 16:26:10 +00:00
Patrick	16d135d56a	allow flax	2021-07-13 16:19:22 +00:00
Patrick	86d3eec33d	remove wrong flax	2021-07-13 16:19:12 +00:00
Patrick	a1a0ba3a50	allow flax	2021-07-13 16:06:40 +00:00
patrickvonplaten	dae267e3c5	add flax model	2021-07-09 17:22:57 +01:00