Le principe de cette IA est de localiser certains éléments sur des images.
Ci-dessous une implémentation de démonstration.
"person"
"bicycle"
"car"
"motorcycle"
"airplane"
"bus"
"train"
"truck"
"boat"
"traffic light"
"fire hydrant"
"stop sign"
"parking meter"
"bench"
"bird"
"cat"
"dog"
"horse"
"sheep"
"cow"
"elephant"
"bear"
"zebra"
"giraffe"
"backpack"
"umbrella"
"handbag"
"tie"
"suitcase"
"frisbee"
"skis"
"snowboard"
"sports ball"
"kite"
"baseball bat"
"baseball glove"
"skateboard"
"surfboard"
"tennis racket"
"bottle"
"wine glass"
"cup"
"fork"
"knife"
"spoon"
"bowl"
"banana"
"apple"
"sandwich"
"orange"
"broccoli"
"carrot"
"hot dog"
"pizza"
"donut"
"cake"
"chair"
"couch"
"potted plant"
"bed"
"dining table"
"toilet"
"tv"
"laptop"
"mouse"
"remote"
"keyboard"
"cell phone"
"microwave"
"oven"
"toaster"
"sink"
"refrigerator"
"book"
"clock"
"vase"
"scissors"
"teddy bear"
"hair drier"
"toothbrush"