"Understanding Why ViT Trains Badly on Small Datasets: An Intuitive Perspective. (arXiv:2302.03751v1 [cs.CV])" — A visual intuition to help understand why ViT has a significantly lower evaluation accuracy when trained on small datasets when compared to ResNet-18 with a similar number of parameters.