Pillow
jax[cpu]
flax
git+https://github.com/ydshieh/transformers.git@flax_vision_encoder_decoder