diff --git a/README.md b/README.md
index 70733a6..6acb695 100644
--- a/README.md
+++ b/README.md
@@ -51,7 +51,7 @@ However, if you are not satisfied with only using the API, feel free to check ou
## Results
-### MUGE Text-to-Image Retrieval
+**MUGE Text-to-Image Retrieval**:
Setup | Zero-shot | Finetune |
@@ -59,64 +59,87 @@ However, if you are not satisfied with only using the API, feel free to check ou
Metric | R@1 | R@5 | R@10 | MR | R@1 | R@5 | R@10 | MR |
-
- WukongViT-B | 33.4 | 59.3 | 69.7 | 54.1 | 39.2 | 66.9 | 77.4 | 61.2 |
-
- R2D2ViT-B | - | - | - | - | 47.4 | 75.1 | 83.5 | 68.7 |
-
-
- CN-CLIPViT-B | 52.1 | 76.7 | 84.4 | 71.1 | 58.4 | 83.6 | 90.0 | 77.4 |
-
-
-
-
-### Flickr30K-CN Retrieval
-
-
- Task | Text-to-Image | Image-to-Text |
+ Wukong | 42.7 | 69.0 | 78.0 | 63.2 | 52.7 | 77.9 | 85.6 | 72.1 |
-
- Setup | Zero-shot | Finetune | Zero-shot | Finetune |
-
-
- Metric | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 |
-
-
- WukongViT-B | 45.7 | 73.8 | 82.2 | 67.6 | 89.6 | 94.2 | 66.2 | 88.7 | 94.3 | 83.9 | 97.6 | 99.0 |
-
- R2D2ViT-B | - | - | - | 78.3 | 94.6 | 97.0 | - | - | - | 92.6 | 99.1 | 99.8 |
-
-
- CN-CLIPViT-B | 62.7 | 86.9 | 92.8 | 79.1 | 94.8 | 97.4 | 74.6 | 93.5 | 97.1 | 93.5 | 99.0 | 99.5 |
-
-
-
-
-### COCO-CN Retrieval
-
-
- Task | Text-to-Image | Image-to-Text |
+ R2D2 | 49.5 | 75.7 | 83.2 | 69.5 | 60.1 | 82.9 | 89.4 | 77.5 |
-
- Setup | Zero-shot | Finetune | Zero-shot | Finetune |
-
-
- Metric | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 |
-
-
- WukongViT-B | 49.2 | 79.4 | 87.9 | 67.0 | 91.4 | 96.7 | 48.3 | 77.8 | 88.8 | 65.8 | 90.3 | 96.6 |
-
- R2D2ViT-B | - | - | - | 75.1 | 94.2 | 98.1 | - | - | - | 76.1 | 95.3 | 98.5 |
-
-
- CN-CLIPViT-B | 62.2 | 86.6 | 94.9 | 77.0 | 97.1 | 99.0 | 57.0 | 84.1 | 93.6 | 77.4 | 96.2 | 98.9 |
-
+ CN-CLIP | 63.0 | 84.1 | 89.2 | 78.8 | 68.9 | 88.7 | 93.1 | 83.6 |
+
+**Flickr30K-CN Retrieval**:
+
+
+ Task | Text-to-Image | Image-to-Text |
+
+
+ Setup | Zero-shot | Finetune | Zero-shot | Finetune |
+
+
+ Metric | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 |
+
+
+ Wukong | 51.7 | 78.9 | 86.3 | 77.4 | 94.5 | 97.0 | 76.1 | 94.8 | 97.5 | 92.7 | 99.1 | 99.6 |
+
+
+ R2D2 | 60.9 | 86.8 | 92.7 | 84.4 | 96.7 | 98.4 | 77.6 | 96.7 | 98.9 | 95.6 | 99.8 | 100.0 |
+
+
+ CN-CLIP | 71.2 | 91.4 | 95.5 | 83.8 | 96.9 | 98.6 | 81.6 | 97.5 | 98.8 | 95.3 | 99.7 | 100.0 |
+
+
+
+
+**COCO-CN Retrieval**:
+
+
+ Task | Text-to-Image | Image-to-Text |
+
+
+ Setup | Zero-shot | Finetune | Zero-shot | Finetune |
+
+
+ Metric | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 |
+
+
+ Wukong | 53.4 | 80.2 | 90.1 | 74.0 | 94.4 | 98.1 | 55.2 | 81.0 | 90.6 | 73.3 | 94.0 | 98.0 |
+
+
+ R2D2 | 56.4 | 85.0 | 93.1 | 79.1 | 96.5 | 98.9 | 63.3 | 89.3 | 95.7 | 79.3 | 97.1 | 98.7 |
+
+
+ CN-CLIP | 69.2 | 89.9 | 96.1 | 81.5 | 96.9 | 99.1 | 63.0 | 86.6 | 92.9 | 83.5 | 97.3 | 99.2 |
+
+
+
+
+**Zero-shot Image Classification**:
+
+
+ Task | CIFAR10 | CIFAR100 | DTD | EuroSAT | FER | FGVC | KITTI | MNIST | PC | VOC |
+
+
+ GIT | 88.5 | 61.1 | 42.9 | 43.4 | 41.4 | 6.7 | 22.1 | 68.9 | 50.0 | 80.2 |
+
+
+ ALIGN | 94.9 | 76.8 | 66.1 | 52.1 | 50.8 | 25.0 | 41.2 | 74.0 | 55.2 | 83.0 |
+
+
+ CLIP | 94.9 | 77.0 | 56.0 | 63.0 | 48.3 | 33.3 | 11.5 | 79.0 | 62.3 | 84.0 |
+
+
+ Wukong | 95.4 | 77.1 | 40.9 | 50.3 | - | - | - | - | - | - |
+
+
+ CN-CLIP | 96.0 | 79.7 | 51.2 | 52.0 | 55.1 | 26.2 | 49.9 | 79.4 | 63.5 | 84.9 |
+
+
+
+
## Citation
If you find Chinese CLIP helpful, feel free to cite our paper. Thanks for your support!