Detect and locate objects in images and video
Open-vocabulary — type any text to detect custom objects