Over the last decade, use of Deep Learning in many applications produced results that are comparable to and some cases surpassing human expert performance. The application domains include diagnosing diseases, finance, agriculture, search engines, robot vision, others. In this paper, we proposing an architecture utilizes image/video captioning methods Natural Language Processing systems generate...