نتایج جستجو برای: level feature

تعداد نتایج: 1283638  

2004
Lizuo Jin Shin'ichi Satoh Fuminori Yamagishi Duy Dinh Le Masao Sakauchi

This paper describes our participation in the NIST TERCVID 2004 retrieval evaluation. In the first-year effort for the TERCVID project, we only tackle Person X detection of the high-level feature extraction task. We design an automatic Person X detector using frontal faces in videos solely. We illustrate the architecture of Person X detector and the evaluation results in this paper.

2006
Mohamed Faouzi BenZeghiba Christian Wellekens

This report proposes and compares a number of tandem-like feature extraction schemes. The proposed schemes use relative phone posteriors as confidence measures estimated from the MLP outputs directly or using Gamma function. The analysis of variances shows that the proposed tandem-like features discriminate better between phone classes than the conventional tandem features. But these capabiliti...

2011
Guylaine Le-Jan Yannick Benezeth Guillaume Gravier Frédéric Bimbot

We present in this paper a study on auditory feature spaces for speech-driven face animation. The goal is to provide solid analytic ground to underscore the description capability of some well-known features with relation to lipsync. A set of various audio features describing the temporal and spectral shape of speech signal has been computed on annotated audio extracts. The dimension of the inp...

2007
Sercan Aksoy Pinar Duygulu Sahin Cem Aksoy E. Aydin D. Günaydin K. Hadimh L. Koç Y. Olgun C. Orhan G. Yakin

We describe our fourth participation, that includes two high-level feature extraction runs, and one manual search run, to the TRECVID video retrieval evaluation. All of these runs have used a system trained on the common development collection. Only visual information, consisting of color, texture and edge-based low-level features, was used.

Journal: :Journal of telecommunications and information technology 2022

Automatic creation of image descriptions, i.e. captioning images, is an important topic in artificial intelligence (AI) that bridges the gap between computer vision (CV) and natural language processing (NLP). Currently, neural networks are becoming increasingly popular images researchers looking for more efficient models CV sequence-sequence systems. This study focuses on a new caption generati...

Journal: :International Journal of Informatics and Communication Technology (IJ-ICT) 2020

Journal: :The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2018

Journal: :IEEE Geoscience and Remote Sensing Letters 2022

We present a mcthod to lcarn diverse group of object categories from an unordcrcd point set. propose our Pyramid Point network, which uses dense pyramid structure instead thc traditional ’U’ shape, typically seen in semantic segmentation networks. This gives second look, allowing network revisit different layers creating various leveis on the for feature propagation. introduce Foc...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید