Sort:  

If it can generate coherent stories based on image, that is already impressive.