1 changed files with 3 additions and 9 deletions
--- a/README.md
+++ b/README.md
@ -2,22 +2,16 @@
 tags:
 - vision
 widget:
- src: https://huggingface.co/OFA-Sys/chinese-clip-vit-base-patch16/resolve/main/festival.jpg
+- src: feastival.jpg
-  candidate_labels: 灯笼, 鞭炮, 对联
+  candidate_labels: 灯笼, 对联, 年糕, 鞭炮
  example_title: festival
 - src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/cat-dog-music.png
  candidate_labels: 音乐表演, 体育运动
  example_title: cat & dog
 - src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/football-match.jpg
  candidate_labels: 梅西, C罗, 马奎尔
  example_title: football
 ---
 # Chinese-CLIP-ViT-Base-Patch16
 ## Introduction
-This is the base-version of the Chinese CLIP, with ViT-B/16 as the image encoder and RoBERTa-wwm-base as the text encoder. Chinese CLIP is a simple implementation of CLIP on a large-scale dataset of around 200 million Chinese image-text pairs. For more details, please refer to our technical report https://arxiv.org/abs/2211.01335 and our official github repo https://github.com/OFA-Sys/Chinese-CLIP (Welcome to star! 🔥🔥)
+This is the base-version of the Chinese CLIP, with ViT-B/16 as the image encoder and RoBERTa-wwm-base as the text encoder. Chinese CLIP is a simple implementation of CLIP on a large-scale dataset of around 200 million Chinese image-text pairs. For more details, please refer to our technical report https://arxiv.org/abs/2211.01335 and our official github repo https://github.com/OFA-Sys/Chinese-CLIP
 ## Use with the official API
 We provide a simple code snippet to show how to use the API of Chinese-CLIP to compute the image & text embeddings and similarities.