# research_semantic_localization **Repository Path**: CSS365_admin/research_semantic_localization ## Basic Information - **Project Name**: research_semantic_localization - **Description**: 语义定位的研究资料 - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 7 - **Created**: 2020-10-08 - **Last Updated**: 2020-12-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # 语义定位 (图像相似性判断,重定位) ## 1. 研究目标 人类的感知没有精确计算自己所在场景的位置信息,仅仅通过场景的相似性来进行定位,既通过目前看到的场景和之前走过的那个场景相似从而完成定位。因此通过深度学习来对场景进行相似性判断,并和地图库中的场景进行比对,可以模拟人的认知过程,从而实现基于视觉(非几何计算)的定位。 主要的研究目标有: 1. 单张图像的相似性判断已经比较多,而序列化的图像的场景相似性判断还没有。研究如何将多张图像所拍摄的场景进行建模,可以构建图,然后在地图(图网络)上进行子图匹配,找到最优的匹配点。 2. 如何更好的生成场景中物体的特征,可以NetVLAD或参考《Image Matching Based on Deep Feature and Spatial Correlation Graph》中的FE-Net。 3. 如何构建图卷积神经网络(Graph Convolutional Networks)来计算相似性。 4. 如何将所经过的场景提取特征并保存到一个图中,从而完成语义地图的构建。 ## 2. 主要思路 研究思路: 1. 先通读一下主要的参考文献,建立对所研究问题的基本认识,了解基本的方法等。 2. 找一些代码运行一下,建立直觉的认识,并熟悉数据集。 3. 可以从基本的深度学习提取特征开始,然后再深入到图卷积神经网络。 具体的研究方法(需要尝试): 1. 使用EdgeBox或者其他方法找到感兴趣的区域,然后提取对象的深度学习特征 - EdgeBox 仅仅提取感兴趣的区域 [论文](https://www.microsoft.com/en-us/research/wp-content/uploads/2014/09/ZitnickDollarECCV14edgeBoxes.pdf),[中文解释](https://blog.csdn.net/wsj998689aa/article/details/39476551) 2. 研究Siamese网络,提高特征的区分能力 (可以参考《Image Matching Based on Deep Feature and Spatial Correlation Graph》中的FE-Net) 3. 研究Graph Convolutional Networks,如何提取网络节点的特征,并做图匹配等。 ## 3. 参考代码 * Improved version of DBow2 (https://github.com/rmsalinas/DBow3) * FBOW (Fast Bag of Words) is an extremmely optimized version of the DBow2/DBow3 libraries (https://github.com/rmsalinas/fbow) * Robust Visual Robot Localization Across Seasons using Network Flows (https://github.com/MHassanNadeem/localization-network-flows) * NetVLAD: CNN architecture for weakly supervised place recognition (https://www.di.ens.fr/willow/research/netvlad/) * Graph Convolutional Networks in PyTorch (https://github.com/tkipf/pygcn) * Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering (https://github.com/mdeff/cnn_graph) * Graph Convolutional Networks (GCNs) (https://github.com/sungyongs/graph-based-nn) ## 4. 参考文献 更多的参考文档可以参考[references](references)目录 ### 4.0 综述 * [2015 Visual Place Recognition: A Survey](references/survey/2015 Visual Place Recognition: A Survey.pdf) * [基于视觉地图的视觉定位](https://www.zhihu.com/column/c_1287353014030585856) ### 4.1 特征提取 * NetVLAD CNN architecture for weakly supervised place recognitiondf * Place Recognition with ConvNet Landmarks: Viewpoint-Robust, Condition-Robust, Training-Free * 2020 Visual search over billions of aerial and satellite images ### 4.2 网络方法 * Image Matching Based on Deep Feature and Spatial Correlation Graph * Location Graphs for Visual Place Recognition * Learning Convolutional Neural Networks for Graphs * Robust Visual Semi-Semantic Loop Closure Detection by a Covisibility Graph and CNN Features * Siamese Network - https://github.com/delijati/pytorch-siamese ### 4.3 图神经网络 * Learning Convolutional Neural Networks for Graphs * Image Matching Based on Deep Feature and Spatial Correlation Graph ### 4.4 多视角的检索 * Lending Orientation to Neural Networks for Cross-view Geo-localization https://github.com/Liumouliu/OriCNN * Optimal Feature Transport for Cross-View Image Geo-Localization https://github.com/shiyujiao/cross_view_localization_CVFT ### 4.5 索引方法 * Tree-based indexing for real-time ConvNet landmark-based visual place recognition * 2020 Visual search over billions of aerial and satellite images ### 4.6 Codes * Keras implementation of the Netvlad for visual place recognition https://github.com/crlz182/Netvlad-Keras * LoST - Visual Place Recognition using Visual Semantics for Opposite Viewpoints across Day and Night https://github.com/oravus/lostX * visual place recognition in changing enviroments https://github.com/PRBonn/vpr_relocalization * PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition, CVPR 2018 https://github.com/mikacuy/pointnetvlad * NetVLAD: CNN architecture for weakly supervised place recognition https://github.com/Relja/netvlad * Visual place recognition from opposing viewpoints under extreme appearance variations https://github.com/oravus/seq2single * Optimal Feature Transport for Cross-View Image Geo-Localization https://github.com/shiyujiao/cross_view_localization_CVFT ### 4.7 Dataset * University1652-Baseline https://github.com/layumi/University1652-Baseline * Places: A 10 million image database for scene recognition * 24/7 Place Recognition by View Synthesis http://www.ok.ctrl.titech.ac.jp/~torii/project/247/