Flickr30k Entities Examples

With more than 30K images and 5 sentences per image, this dataset now provides bounding box annotations for entities mentioned in the text.