Simultaneous Localization and Mapping (SLAM) is a well-studied area during the past 20 years, yet there is no efficient method for large-scale and long-term indoor/outdoor application.
The robust and efficient feature is very necessary for visual place recognition (VPR). We apply an unsupervised feature learning method, where the raw image is converted into a lower dimension code for place encoding and fast retrieval. The Major challenge for Visual Place Recognition:
- In the real word, appearance are variant and some of them are very similar;
- The same place may have variant appearance under different conditions;
- Dynamic Objects add additional noise for place recognition;
- No label for Visual Place Recognition.
The module framework is an Autoencoder-GAN like framework. The CapsuleNet based Encoder is applied to extract the visual features. The Generative neural networks (GAN) is applied to ensure visual features can capture enough geometry detail to generate realistic images.