ED2IF2-Net: Learning Disentangled Deformed Implicit Fields and Enhanced Displacement Fields from Single Images Using Pyramid Vision Transformer
نویسندگان
چکیده
There has emerged substantial research in addressing single-view 3D reconstruction and the majority of state-of-the-art implicit methods employ CNNs as backbone network. On other hand, transformers have shown remarkable performance many vision tasks. However, it is still unknown whether are suitable for reconstruction. In this paper, we propose first end-to-end network based on Pyramid Vision Transformer (PVT), called ED2IF2-Net, which disentangles an field into topological structures recovery surface details to achieve high-fidelity shape ED2IF2-Net uses a encoder extract multi-scale hierarchical local features global vector input single image, fed three separate decoders. A coarse decoder reconstructs vector, deformation iteratively refines using pixel-aligned obtain deformed through multiple blocks (IFDBs), detail predicts enhanced displacement with hybrid attention modules (HAMs). The final output fusion field, four loss terms applied reconstruct structure novel loss, overall after fusion, via Laplacian loss. quantitative results obtained from ShapeNet dataset validate exceptional ED2IF2-Net. Notably, ED2IF2-Net-L stands out top-performing variant, exhibiting highest mean IoU, CD, EMD, ECD-3D, ECD-2D scores, reaching impressive values 61.1, 7.26, 2.51, 6.08, 1.84, respectively. extensive experimental evaluations consistently demonstrate capabilities reconstructing recovering details, all while maintaining competitive inference time.
منابع مشابه
Implicit Extrusion Fields
This paper presents an implicit function fields based approach for the extrusion of 2-dimensional profiles. According to the fields used, the extrusion follows an implicit object shape or performs the transition between combined primitives. The 1-dimensional polynomial spline properties allow the point-by-point control of the profile. Used on the transition of combined primitives, free form fun...
متن کاملDNA-GAN: Learning Disentangled Representations from Multi-Attribute Images
Disentangling factors of variation has always been a challenging problem in representation learning. Existing algorithms suffer from many limitations, such as unpredictable disentangling factors, bad quality of generated images from encodings, lack of identity information, etc. In this paper, we propose a supervised algorithm called DNA-GAN trying to disentangle different attributes of images. ...
متن کاملRegularizationof Displacement Fields for Glaciermonitoring
Synthetic aperture radar (SAR) images provide scattering information which can be used under any weather conditions for glacier monitoring. The purpose is to estimate a displacement field characterizing at each position the local speeds and orientations of the glacier movement. The recent method proposed in [1] builds a vector field from the similarities existing between two SAR images co-regis...
متن کاملReceptive Fields within the Combinatorial Pyramid Framework
A hierarchical structure is a stack of successively reduced image representations. Each basic element of a hierarchical structure is the father of a set of elements in the level below. The transitive closure of this father–child relationship associates to each element of the hierarchy a set of basic elements in the base level image representation. Such a set, called a receptive field, defines t...
متن کاملMyocardial contour detection in MRI tagged images using short-range correlations of displacement fields
Magnetic Resonance Imaging (MRI) is fast becoming an indispensable tool for the diagnosis of heart disease. Myocardial tagging is a cardiac MRI technique that allows one to perform a detailed kinematic analysis of the heart muscle. This allows clinicians to non-invasively identify areas of damaged cardiac muscle tissue. Myocardial tagging involves magnetically marking the heart muscle with a se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13137577