Layered Interpretation of Street View Images
نویسندگان
چکیده
We propose a layered street view model to encode both depth and semantic information on street view images for autonomous driving. Recently, stixels, stix-mantics, and tiered scene labeling methods have been proposed to model street view images. We propose a 4-layer street view model, a compact representation over the recently proposed stix-mantics model. Our layers encode semantic classes like ground, pedestrians, vehicles, buildings, and sky in addition to the depths. The only input to our algorithm is a pair of stereo images. We use a deep neural network to extract the appearance features for semantic classes. We use a simple and an efficient inference algorithm to jointly estimate both semantic classes and layered depth values. Our method outperforms other competing approaches in Daimler urban scene segmentation dataset. Our algorithm is massively parallelizable, allowing a GPU implementation with a processing speed about 9 fps.
منابع مشابه
End-to-End Interpretation of the French Street Name Signs Dataset
We introduce the French Street Name Signs (FSNS) Dataset consisting of more than a million images of street name signs cropped from Google Street View images of France. Each image contains several views of the same street name sign. Every image has normalized, title case folded ground-truth text as it would appear on a map. We believe that the FSNS dataset is large and complex enough to train a...
متن کاملDIFFUSE CONTRAST ENHANCEMENT ON MR IMAGES IN BRAIN INFARCTION: \"PSEUDOTUMOR SIGN\"
The purpose of this study was to describe the pattern of diffuse enhancement seen on contrast-enhanced MR images in patients with subacute infarction. A retrospective study of 104 patients with the diagnosis of stroke who had undergone contrast-enhanced MR scanning within2 weeks of the inciting neurological event revealed 66 patients who demonstrated different patterns of contrast-enhanceme...
متن کاملLOX Framework: Designing Human Computation Games to Update Street Views
Although the Web has abundant information, it does not necessarily contain the latest, most recently updated information. In particular, interactive map websites and the accompanying street view applications often have outdated information because street views change constantly and are very costly to update. In this work, we propose the LOX (Labeling and O/X) framework – a scalable human comput...
متن کاملCataloging Public Objects Using Aerial and Street-Level Images - Urban Trees
In this section we provide the form of the projection functions Pv(`, c) that convert from geographic locations to pixel locations in aerial view and street view images. We give the form of the inverse function P−1 v (`′, c) that converts from pixel locations to geographic coordinates. Aerial images: Aerial view imagery in Google maps is represented using a Web Mercator projection, a type of cy...
متن کاملI-45: Important Points in Interpretation of Sonographic Images of Female Pelvis (Imaging Case Review)
Ultrasonography represents the method of choice in the investigation of the female pelvis. An accurate interpretation of the images must take into consideration the specific features of the uterus, ovaries and fallopian tubes. The present case review aims to demonstrate important points in interpretation and management of the female pelvis images.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1506.04723 شماره
صفحات -
تاریخ انتشار 2015