It is shown that the simple pre-training task of predicting which caption goes with which image is an efficient and scalable way to learn SOTA image representations from scratch on a dataset of 400 million (image, text) pairs.
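The caption-matching task can be illustrated with a minimal sketch. The text does not specify the training objective, so the symmetric cross-entropy over a cosine-similarity matrix used here is an assumption (one common way to set up such a task), as are the function names `caption_matching_loss` and `predict_captions` and the temperature value `0.07`. The embeddings are assumed to come from separate image and text encoders that are not shown.

```python
import numpy as np


def _normalize(x):
    # Scale each row to unit length so dot products become cosine similarities.
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def _diagonal_cross_entropy(logits):
    # Row-wise softmax cross-entropy where the correct class of row i is column i.
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))


def caption_matching_loss(image_emb, text_emb, temperature=0.07):
    """Illustrative loss for a batch where image i is paired with caption i.

    The temperature is an assumed hyperparameter, not a value from the text.
    """
    logits = _normalize(image_emb) @ _normalize(text_emb).T / temperature
    image_to_text = _diagonal_cross_entropy(logits)    # pick the caption for each image
    text_to_image = _diagonal_cross_entropy(logits.T)  # pick the image for each caption
    return 0.5 * (image_to_text + text_to_image)


def predict_captions(image_emb, text_emb):
    # For each image, return the index of the most similar caption.
    return np.argmax(_normalize(image_emb) @ _normalize(text_emb).T, axis=1)
```

With correctly paired embeddings the loss is small and `predict_captions` returns each image's own index; shuffling the captions raises the loss, which is the signal that drives the representations to align.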