Commit Graph

42 Commits

Author SHA1 Message Date
duzx16 2200e2bc52 Add pad_token_id in config.json; Fix position_ids in ChatGLMModel; Add batch position_ids 2023-03-29 21:52:46 +08:00
duzx16 db2249979c Change padding side 2023-03-29 21:25:40 +08:00
duzx16 5c64357295 Set ignore_index for CrossEntropyLoss 2023-03-29 21:19:38 +08:00
duzx16 8127ab6abf Support batch training 2023-03-29 21:15:30 +08:00
duzx16 fbda1206cb Merge branch 'main' into dev_pt (Conflicts: modeling_chatglm.py) 2023-03-29 20:37:39 +08:00
duzx16 812f43f9ff Add p-tuning v2 2023-03-29 20:22:57 +08:00
duzx16 096f3de6b4 Fix context length in get_position_ids 2023-03-28 17:37:46 +08:00
duzx16 4a9b711e61 Close CPU fusion on Mac 2023-03-23 22:43:06 +08:00
duzx16 d2bbc82a2c Fix Chinese punctuation 2023-03-22 14:37:21 +08:00
duzx16 2449bdc9d8 Add English 2023-03-21 23:27:46 +08:00
songxxzp 1b54948bb2 Fix typo in tokenization_chatglm.py 2023-03-19 22:52:12 +08:00
duzx16 2460dc2430 Remove hardcode bos_token_id 2023-03-19 14:56:15 +08:00
duzx16 42095d42ff Add support for streaming output 2023-03-19 14:31:26 +08:00
duzx16 220f772e9a Fix overflow in FP16 2023-03-16 09:26:05 +08:00
duzx16 f9f74fda55 Set is_parallelizable to False 2023-03-16 00:30:43 +08:00
duzx16 8492687842 Remove image tokens when decoding 2023-03-16 00:24:42 +08:00
duzx16 0d8b08d1bb Revert "Update quantization" (This reverts commit 0101afdfa7.) 2023-03-15 21:31:05 +08:00
duzx16 0101afdfa7 Update quantization 2023-03-15 21:18:24 +08:00
duzx16 c3dece3f01 Add logit processor for NaN or Inf scores 2023-03-15 18:14:34 +08:00
duzx16 9d1509a1ad Fix default history argument 2023-03-14 18:38:49 +08:00
duzx16 bcb053bda6 Merge branch 'main' of https://huggingface.co/THUDM/chatglm-6b 2023-03-14 14:49:47 +08:00
duzx16 d4832e8142 Add support for float32 2023-03-14 14:49:14 +08:00
duzx16 a034f2a1ed Update requirements 2023-03-14 12:19:54 +08:00
duzx16 3d0afeeed7 Update requirements 2023-03-14 12:15:19 +08:00
Zeng Aohan 534e8b7045 Update README.md 2023-03-14 03:18:04 +00:00
Zeng Aohan 6331ebe869 Update README.md 2023-03-14 03:17:32 +00:00
Zeng Aohan 4cfd1ea181 Update README.md 2023-03-13 18:44:47 +00:00
duzx16 ac23f2a8f2 Update requirements 2023-03-14 02:21:04 +08:00
duzx16 27d477b51f Update README 2023-03-14 02:15:35 +08:00
duzx16 0f573d111a Update README 2023-03-14 02:11:53 +08:00
duzx16 cd8041ea53 Fix past_key_values 2023-03-14 02:08:43 +08:00
duzx16 65bb3f00a7 add ice_text.model 2023-03-14 01:50:11 +08:00
duzx16 e8b5e82fbe delete ice_text.model 2023-03-14 01:49:46 +08:00
duzx16 9671e13e0e Merge branch 'main' of https://huggingface.co/THUDM/chatglm-6b 2023-03-14 01:48:56 +08:00
duzx16 c4575e73d0 Update tokenizer 2023-03-14 01:32:34 +08:00
Zeng Aohan 0772a2f0c8 Update README.md 2023-03-13 17:19:13 +00:00
Zeng Aohan 7424d1fead Update README.md 2023-03-13 17:18:06 +00:00
Zeng Aohan 452e0aa476 Update README.md 2023-03-13 17:17:31 +00:00
Zeng Aohan eb1daff875 Update README.md 2023-03-13 17:16:28 +00:00
Zeng Aohan a77bce8299 Update README.md 2023-03-13 17:15:46 +00:00
Sengxian fc937144cc Add model file 2023-03-14 00:29:56 +08:00
Sengxian d11c6aaed8 Add chatglm-6b 2023-03-14 00:21:01 +08:00