Compare commits
No commits in common. "36e679e65c2a2fead755ae21162091293ad37834" and "113606e42be3fad3c3856ac41f0af888f25dc07c" have entirely different histories.
36e679e65c...113606e42b
README.md (12 lines changed)
@@ -2,22 +2,16 @@
 tags:
 - vision
 widget:
-- src: https://huggingface.co/OFA-Sys/chinese-clip-vit-base-patch16/resolve/main/festival.jpg
-  candidate_labels: 灯笼, 鞭炮, 对联
+- src: feastival.jpg
+  candidate_labels: 灯笼, 对联, 年糕, 鞭炮
   example_title: festival
-- src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/cat-dog-music.png
-  candidate_labels: 音乐表演, 体育运动
-  example_title: cat & dog
-- src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/football-match.jpg
-  candidate_labels: 梅西, C罗, 马奎尔
-  example_title: football
 ---
 
 
 # Chinese-CLIP-ViT-Base-Patch16
 
 ## Introduction
-This is the base-version of the Chinese CLIP, with ViT-B/16 as the image encoder and RoBERTa-wwm-base as the text encoder. Chinese CLIP is a simple implementation of CLIP on a large-scale dataset of around 200 million Chinese image-text pairs. For more details, please refer to our technical report https://arxiv.org/abs/2211.01335 and our official github repo https://github.com/OFA-Sys/Chinese-CLIP (Welcome to star! 🔥🔥)
+This is the base-version of the Chinese CLIP, with ViT-B/16 as the image encoder and RoBERTa-wwm-base as the text encoder. Chinese CLIP is a simple implementation of CLIP on a large-scale dataset of around 200 million Chinese image-text pairs. For more details, please refer to our technical report https://arxiv.org/abs/2211.01335 and our official github repo https://github.com/OFA-Sys/Chinese-CLIP
 
 ## Use with the official API
 We provide a simple code snippet to show how to use the API of Chinese-CLIP to compute the image & text embeddings and similarities.
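The README's own snippet is not part of this hunk. As a rough illustration of the final step the last context line mentions (turning image and text embeddings into similarities), the sketch below scores one image embedding against one text embedding per candidate label using cosine similarity followed by a softmax. It is not the Chinese-CLIP API: the helper names, the 3-dimensional embeddings, and the `logit_scale` value of 100.0 are all made up for illustration, and a real model would produce the embeddings and learn its own scale.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Dot product of the two L2-normalized vectors."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def zero_shot_probs(image_embedding, text_embeddings, logit_scale=100.0):
    """Probability of each label given one image embedding.

    logit_scale is a hypothetical temperature chosen for this sketch.
    """
    sims = [cosine_similarity(image_embedding, t) for t in text_embeddings]
    return softmax([logit_scale * s for s in sims])


# Hypothetical embeddings; a real encoder would output much longer vectors.
image_embedding = [0.9, 0.1, 0.0]
text_embeddings = {
    "灯笼": [1.0, 0.0, 0.0],
    "对联": [0.0, 1.0, 0.0],
    "年糕": [0.0, 0.0, 1.0],
    "鞭炮": [0.5, 0.5, 0.0],
}

labels = list(text_embeddings)
probs = zero_shot_probs(image_embedding, [text_embeddings[name] for name in labels])
best_label = labels[probs.index(max(probs))]
print(best_label, [round(p, 4) for p in probs])
```

The labels reuse the `candidate_labels` from the widget entry on the new side of the diff. Normalizing before the dot product keeps the score independent of embedding magnitude, which is why the comparison reduces to cosine similarity.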