PygmalionAI/pygmalion-350m is a forked repo from huggingface. License: None
Go to file
0x000011b@waifu.club d65832d913 revert KAI chatmode config until related fixes land on stable version 2023-01-11 23:44:13 +00:00
.gitattributes initial commit 2022-12-20 22:04:33 +00:00
README.md Update README.md 2022-12-21 16:33:04 +00:00
config.json revert KAI chatmode config until related fixes land on stable version 2023-01-11 23:44:13 +00:00
merges.txt Upload main model (test) 2022-12-20 22:15:03 +00:00
pytorch_model.bin Upload pytorch_model.bin 2022-12-20 22:41:49 +00:00
special_tokens_map.json Upload main model (test) 2022-12-20 22:15:03 +00:00
tokenizer_config.json Upload main model (test) 2022-12-20 22:15:03 +00:00
vocab.json Upload main model (test) 2022-12-20 22:15:03 +00:00

README.md

language thumbnail tags inference
en
convAI
conversational
false

pygmalion-350m

Model description

This is a proof-of-concept fine-tune of Facebook's OPT-350M model optimized for dialogue, to be used as a stepping stone to higher parameter models.

Disclaimer: NSFW data was included in the fine-tuning of this model. Although SFW inputs will usually result in SFW outputs, you are advised to chat at your own risk. This model is not suitable for use by minors.

Fine-tuning process

This model was much easier than expected to create.

We used the ColossalAI library to fine-tune the OPT-350M model originally trained by Facebook on The Pile. Though our initial dataset was sets of dialogue gathered from various sources totaling about 50 MB in size, early training runs revealed that the model converged after only 7% of the dataset was passed through. To alleviate this, we massively reduced the size of the dataset to only 273 KB.

ColossalAI's magic allowed for something incredible: this entire model was fine-tuned on a singular GPU with only 6 GB (!) of VRAM. Fine-tuning took less than an hour to complete.