diff --git a/README.md b/README.md index bfaedc9..b9ae64c 100644 --- a/README.md +++ b/README.md @@ -29,6 +29,8 @@ This is an image captioning model trained by @ydshieh in [flax ](https://github. ```python from transformers import VisionEncoderDecoderModel, ViTFeatureExtractor, AutoTokenizer +import torch +from PIL import Image model = VisionEncoderDecoderModel.from_pretrained("nlpconnect/vit-gpt2-image-captioning") feature_extractor = ViTFeatureExtractor.from_pretrained("nlpconnect/vit-gpt2-image-captioning")