FFNet: Frequency Fusion Network for Semantic Scene Completion

نویسندگان

چکیده

Semantic scene completion (SSC) requires the estimation of 3D geometric occupancies objects in scene, along with object categories. Currently, many methods employ RGB-D images to capture and semantic information objects. These use simple but popular spatial- channel-wise operations, which fuse RGB depth data. Yet, they ignore large discrepancy data uncertainty measurements To solve this problem, we propose Frequency Fusion Network (FFNet), a novel method for boosting by better utilizing FFNet explicitly correlates frequency domain, different from features directly extracted convolution operation. Then, network uses correlated guide feature learning RG- B images, respectively. Moreover, accounts properties components RGB- D features. It has learnable elliptical mask decompose learned attending various frequencies facilitate correlation process We evaluate intensively on public SSC benchmarks, where surpasses state-of- the-art methods. The code package is available at https://github.com/alanWXZ/FFNet.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semantic Scene Completion Combining Colour and Depth: preliminary experiments

Semantic scene completion is the task of producing a complete 3D voxel representation of volumetric occupancy with semantic labels for a scene from a single-view observation. We built upon the recent work of Song et al. [13], who proposed SSCnet, a method that performs scene completion and semantic labelling in a single end-to-end 3D convolutional network. SSCnet uses only depth maps as input, ...

متن کامل

Recursive Context Propagation Network for Semantic Scene Labeling

We propose a deep feed-forward neural network architecture for pixel-wise semantic scene labeling. It uses a novel recursive neural network architecture for context propagation, referred to as rCPN. It first maps the local visual features into a semantic space followed by a bottom-up aggregation of local information into a global representation of the entire image. Then a top-down propagation o...

متن کامل

ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans

We introduce ScanComplete, a novel data-driven approach for taking an incomplete 3D scan of a scene as input and predicting a complete 3D model along with per-voxel semantic labels. The key contribution of our method is its ability to handle large scenes with varying spatial extent, managing the cubic growth in data size as scene size increases. To this end, we devise a fully-convolutional gene...

متن کامل

Semantic Reasoning for Scene Interpretation

In this paper, we propose a hierarchical architecture for representing scenes, covering 2D and 3D aspects of visual scenes as well as the semantic relations between the different aspects. We argue that labeled graphs are a suitable representational framework for this representation and demonstrate its potential by two applications. As a first application, we localize lane structures by the sema...

متن کامل

Semantic Context for Nonparametric Scene Parsing and Scene Classification

Our work focuses on different aspects of image representations as related to a variety of scene understanding tasks. We are interested in simple patch based representations as basic primitives and the role of semantic context as provided by different datasets. In our work, we have pursued a nonparametric approach for semantic parsing [5] which uses small patches and simple gradient, color and l...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i3.20156