Vision Transformer models in vit-keras

I have used vit_b32 and vit_b16 models in vit-keras. What are the other available models and their input image sizes in vit-keras? Are there any combined models (ResNet + Vit) available? Thanks