From 9fd0b64928e40709cd31690564d06ab3bfd527df Mon Sep 17 00:00:00 2001 From: Nicola Bernini Date: Mon, 29 Apr 2019 16:11:06 +0200 Subject: [PATCH 1/2] Create d2_net.ipynb --- CNN/ComputerVision/d2_net.ipynb | 2 ++ 1 file changed, 2 insertions(+) create mode 100644 CNN/ComputerVision/d2_net.ipynb diff --git a/CNN/ComputerVision/d2_net.ipynb b/CNN/ComputerVision/d2_net.ipynb new file mode 100644 index 0000000..4cc69da --- /dev/null +++ b/CNN/ComputerVision/d2_net.ipynb @@ -0,0 +1,2 @@ +{"cells":[{"metadata":{},"cell_type":"markdown","source":"\n# Analysis of D2-Net: A Trainable CNN for Joint Description and Detection of Local Features\n\n[D2-Net: A Trainable CNN for Joint Description and Detection of Local Features](https://dsmn.ml/files/d2-net/d2-net.pdf)\n\n\n"},{"metadata":{},"cell_type":"markdown","source":"\n# Abstract and Intro \n\n> In this work we address the problem of finding reliable pixel-level correspondences under difficult imaging conditions.\n\n- Pixel-level correspondences is the fundamental task to solve many relevant problems in geometric computer vision like Visual Odometry, Structure from Motion, Localization, Mapping, Optical Flow, ... \n\n\n## Traditional Approaches \n\n- Rely on manually engineered features detection and description algorithms \n- Consist of the following pipeline \n\nThe 3 steps pipeline \n\n1. Feature Detection \n2. Feature Description \n3. Features Matching \n\n\n## Proposed Approach \n\n- Rely on automatically learned features (no manual engineering)\n- Consists of a 2 steps pipeline \n\n1. Joint Features Detection and Description \n2. Features Matching \n\n\n"},{"metadata":{},"cell_type":"markdown","source":"\n## Features Description \n\nThe Features Description is a mapping like \n\n$$ f(u,v,I_{N}) \\rightarrow s \\in S $$\n\nwith \n- $u,v$ : Feature Image Coordinates \n- $I_{N}$ : Feature Center Neighborhood, representing the appearance (e.g. a $W \\times H$ neighborhood)\n- $S$ : Mixed Spatial and Semantic Space resulting from cartesian product of $W \\times H$ Spatial Domain and a $C$ Semantic Domain depending on the feature description \n\n**NOTE**: \n- The $S$ is essentially a $W \\times H \\times C$ Tensor Space, so the bread and butter of Deep Neural Networks hence of CNN as well \n- However at the same time it also represents the output space of manually engineered feature detection and description pipeline \n"},{"metadata":{},"cell_type":"markdown","source":"\n## Feature Matching \n\nThe Features Matching is typically performed with an Approximate Nearest Neighbor Search in a Spatial + Semantic Space so something like the $S$ Space above described \n\n"},{"metadata":{"_cell_guid":"79c7e3d0-c299-4dcb-8224-4455121ee9b0","collapsed":true,"_uuid":"d629ff2d2480ee46fbb7e2d37f6b5fab8052498a","trusted":false},"cell_type":"code","source":"","execution_count":null,"outputs":[]}],"metadata":{"kernelspec":{"display_name":"Python 3","language":"python","name":"python3"},"language_info":{"name":"python","version":"3.6.4","mimetype":"text/x-python","codemirror_mode":{"name":"ipython","version":3},"pygments_lexer":"ipython3","nbconvert_exporter":"python","file_extension":".py"}},"nbformat":4,"nbformat_minor":1} + From 4e35142ed80e2cc979fbe2bbb8bebe0d713b2b66 Mon Sep 17 00:00:00 2001 From: Nicola Bernini Date: Mon, 29 Apr 2019 16:14:05 +0200 Subject: [PATCH 2/2] Added D2 Net Summary --- CNN/ComputerVision/readme.md | 16 +++++++++++++++- 1 file changed, 15 insertions(+), 1 deletion(-) diff --git a/CNN/ComputerVision/readme.md b/CNN/ComputerVision/readme.md index 1bcbc6c..1c00b24 100644 --- a/CNN/ComputerVision/readme.md +++ b/CNN/ComputerVision/readme.md @@ -5,6 +5,7 @@ The CNN are at the core of NN which can achieve SOTA on the following types of C - Recognition - Detection +- Features - Segmentation - Localization - Mapping @@ -17,7 +18,6 @@ The CNN are at the core of NN which can achieve SOTA on the following types of C - Y2019 - [Summary](Optimal_Approach_for_Image_Recognition_using_Deep_Convolutional_Architecture.ipynb) -Work in progress @@ -27,3 +27,17 @@ Work in progress +# Features + +[D2-Net: A Trainable CNN for Joint Description and Detection of Local Features](https://dsmn.ml/files/d2-net/d2-net.pdf) +- Y2019 (CVPR 2019) +- [Summary](d2_net.ipynb) + + + + + + + +Work in progress +