
Commit 776d638

Merge remote-tracking branch 'upstream/master' into jy-release-notes-17
2 parents b7b56c6 + 3665f92 commit 776d638

10 files changed: +2,123 −278 lines changed

.github/ISSUE_TEMPLATE/feature_request.md

Lines changed: 1 addition & 1 deletion
@@ -2,7 +2,7 @@
 name: Feature request
 about: Suggest an idea for this project
 title: ''
-labels: bug
+labels: enhancement
 assignees: ''

 ---

guide/14-deep-learning/feature_categorization.ipynb

Lines changed: 893 additions & 0 deletions
Large diffs are not rendered by default.

guide/14-deep-learning/how-ssd-works.ipynb

Lines changed: 3 additions & 2 deletions
@@ -166,7 +166,7 @@
     "\n",
     "The grids parameter specifies the size of the grid cell, in this case 4x4. Additionally, we are specifying a zoom level of 1.0 and an aspect ratio of 1.0:1.0. What this essentially means is that the network will create an anchor box for each grid cell, which is the same size as the grid cell (zoom level of 1.0) and is square in shape with an aspect ratio of 1.0:1.0. The output activations along the depth of the final feature map are used to shift and scale (within a reasonable limit) this anchor box so it can approach the actual bounding box of the object even if it doesn’t exactly match with the anchor box. \n",
     "\n",
-    "For more information about the API, please go to the [API reference](https://esri.github.io/arcgis-python-api/apidoc/html/arcgis.learn.html#singleshotdetector)."
+    "For more information about the API, please go to the [API reference](https://esri.github.io/arcgis-python-api/apidoc/html/arcgis.learn.html#singleshotdetector). As `arcgis.learn` is built upon fast.ai, a more detailed explanation of SSD can be found in fast.ai's multi-object detection lesson [5]."
    ]
   },
   {
@@ -177,7 +177,8 @@
     "- [1] Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi: “You Only Look Once: Unified, Real-Time Object Detection”, 2015; <a href='https://arxiv.org/abs/1506.02640'>arXiv:1506.02640</a>.\n",
     "- [2] Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu: “SSD: Single Shot MultiBox Detector”, 2016; <a href='http://arxiv.org/abs/1512.02325'>arXiv:1512.02325</a>.\n",
     "- [3] Zeiler, Matthew D., and Rob Fergus. \"Visualizing and understanding convolutional networks.\" In European Conference on Computer Vision, pp. 818-833. Springer, Cham, 2014.\n",
-    "- [4] Dang Ha The Hien. A guide to receptive field arithmetic for Convolutional Neural Networks. https://medium.com/mlreview/a-guide-to-receptive-field-arithmetic-for-convolutional-neural-networks-e0f514068807"
+    "- [4] Dang Ha The Hien. A guide to receptive field arithmetic for Convolutional Neural Networks. https://medium.com/mlreview/a-guide-to-receptive-field-arithmetic-for-convolutional-neural-networks-e0f514068807\n",
+    "- [5] Howard, Jeremy. Lesson 9: Deep Learning Part 2 2018 - Multi-object detection. https://docs.fast.ai/vision.models.unet.html#Dynamic-U-Net. Accessed 2 September 2019."
    ]
   }
  ],
guide/14-deep-learning/how-unet-works.ipynb

Lines changed: 4 additions & 3 deletions
@@ -66,7 +66,7 @@
     "- The decoder is the second half of the architecture. The goal is to semantically project the discriminative features (lower resolution) learnt by the encoder onto the pixel space (higher resolution) to get a dense classification. The decoder consists of **upsampling** and **concatenation** followed by regular convolution operations. \n",
     "\n",
     "<center><img src=\"../../static/img/unet.png\" height=\"600\" width=\"600\"></center>\n",
-    "<center>Figure 2. U-net architecture. Blue boxes represent multi-channel feature maps, while white boxes represent copied feature maps. The arrows of different colors represent different operations</center>"
+    "<center>Figure 2. U-net architecture. Blue boxes represent multi-channel feature maps, while white boxes represent copied feature maps. The arrows of different colors represent different operations [1]</center>"
    ]
   },
   {
@@ -92,7 +92,7 @@
     "\n",
     "`data` is the data object returned by the prepare_data function. `backbone` is used for creating the base of the UnetClassifier, which is resnet34 by default, while `pretrained_path` points to where a pre-trained model is saved.\n",
     "\n",
-    "The `UnetClassifier` builds a dynamic U-Net from any backbone pretrained on ImageNet, automatically inferring the intermediate sizes. As you might have noticed, U-net requires far fewer parameters than SSD; this is because parameters such as dropout are specified in the encoder, and UnetClassifier creates the decoder part from the given encoder. You can tweak everything in the encoder, and our U-net module creates a decoder to match.\n",
+    "The `UnetClassifier` builds a dynamic U-Net from any backbone pretrained on ImageNet, automatically inferring the intermediate sizes. As you might have noticed, U-net requires far fewer parameters than SSD; this is because parameters such as dropout are specified in the encoder, and UnetClassifier creates the decoder part from the given encoder. You can tweak everything in the encoder, and our U-net module creates a decoder to match [2]. As a result, creating a UnetClassifier requires fewer parameters.\n",
     "\n",
     "For more information about the API, please go to the [API reference](https://esri.github.io/arcgis-python-api/apidoc/html/arcgis.learn.html#unetclassifier)."
    ]
@@ -102,7 +102,8 @@
    "metadata": {},
    "source": [
     "## References\n",
-    "- [1] Olaf Ronneberger, Philipp Fischer, Thomas Brox: U-Net: Convolutional Networks for Biomedical Image Segmentation, 2015; <a href='https://arxiv.org/abs/1505.04597'>arXiv:1505.04597</a>."
+    "- [1] Olaf Ronneberger, Philipp Fischer, Thomas Brox: U-Net: Convolutional Networks for Biomedical Image Segmentation, 2015; <a href='https://arxiv.org/abs/1505.04597'>arXiv:1505.04597</a>.\n",
+    "- [2] Howard, Jeremy. Fastai - Dynamic U-Net. https://www.youtube.com/watch?v=0frKXR-2PBY. Accessed 2 September 2019."
    ]
   }
  ],
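
The `UnetClassifier` paragraph above is easiest to read next to the one-line call it describes. A minimal sketch, assuming data already exported as classified tiles and the default resnet34 backbone; the path and epoch count are placeholders, and the training calls are standard `arcgis.learn` model methods rather than part of this commit.

    from arcgis.learn import prepare_data, UnetClassifier

    # Placeholder path: image chips exported as classified tiles
    data = prepare_data(r'/path/to/classified/tiles')

    # Backbone defaults to resnet34; the decoder is derived automatically from the encoder
    unet = UnetClassifier(data, backbone='resnet34')

    unet.lr_find()       # suggest a learning rate
    unet.fit(10)         # illustrative epoch count
    unet.show_results()  # compare predicted masks with ground truth
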
Lines changed: 126 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,126 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# How the feature classifier works"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Introduction"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "The goal of feature classification is to determine the class of each feature (e.g. building). For instance, it could be used to determine whether a building is damaged after a natural disaster. Feature classification requires two inputs: \n",
+    "- An input raster that contains the spectral bands,\n",
+    "- A feature class that defines the location (e.g. outline or bounding box) of each feature."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "There are two major steps in feature classification. We first export training samples based on the geographical extent of each feature. Once the training samples are exported, they can be used as the training input for a deep learning based classification algorithm to train a feature classifier."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Export training samples\n",
+    "\n",
+    "The process of exporting training samples is slightly different from pixel-based classification and object detection. In feature classification, we extract training samples based on the extent of each individual feature defined by the feature class. For each training sample, there is an associated class label that comes from the feature class attribute table. Optionally, we can also define a buffer size to extract a larger neighbourhood around the feature, so that more spatial context is available to the classification model, which makes distinguishing different classes easier."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "<center><img src=\"../../static/img/feature_classify_example.png\" height=\"700\" width=\"700\"></center>\n",
+    "<center>Figure 1. An example of exporting training data for the feature classifier</center>"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "In the example above, there are three buildings in the original data. The one on the top is damaged and the other two are undamaged. Therefore, the export result would be three training samples with their corresponding labels. Here we used a buffer size of 50 meters so that more surrounding context is available to feed to the model next."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Deep learning based classification algorithm"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Once the training samples are ready, this becomes a standard multi-class image classification problem in computer vision: the process of taking an input image and outputting a class label. Image classification can be solved with convolutional neural networks (CNNs), and there are many CNN-based image classification algorithms. Most of them consist of a backbone CNN architecture (e.g. ResNet, LeNet-5, AlexNet, VGG-16) followed by a [softmax layer](https://en.wikipedia.org/wiki/Softmax_function). Again, you can refresh your CNN knowledge by going through this short paper “[A guide to convolution arithmetic for deep learning](https://arxiv.org/pdf/1603.07285.pdf)”."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Implementation in `arcgis.learn`"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "`arcgis.learn` allows us to define a feature classifier architecture with just a single line of code. For example:\n",
+    "\n",
+    "    feature_classifier = arcgis.learn.FeatureClassifier(data, backbone=None, pretrained_path=None)\n",
+    "\n",
+    "`data` is the data object returned by the prepare_data function. `backbone` is used for creating the base of the classifier, which is resnet34 by default, while `pretrained_path` points to where a pre-trained model is saved.\n",
+    "\n",
+    "For more information about the API, please go to the [API reference](https://esri.github.io/arcgis-python-api/apidoc/html/arcgis.learn.html#featureclassifier)."
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.7.2"
+  },
+  "toc": {
+   "base_numbering": 1,
+   "nav_menu": {},
+   "number_sections": true,
+   "sideBar": true,
+   "skip_h1_title": false,
+   "title_cell": "Table of Contents",
+   "title_sidebar": "Contents",
+   "toc_cell": false,
+   "toc_position": {},
+   "toc_section_display": true,
+   "toc_window_display": false
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
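
The new guide ends at constructing the `FeatureClassifier`. A minimal end-to-end sketch under the same API is shown here; the data path, epoch count, and saved model name are hypothetical, only `prepare_data` and `FeatureClassifier(data, backbone=None, pretrained_path=None)` appear in the guide itself, and the remaining calls are standard `arcgis.learn` model methods rather than part of this commit.

    from arcgis.learn import prepare_data, FeatureClassifier

    # Placeholder path: per-feature chips exported with class labels from the attribute table
    data = prepare_data(r'/path/to/labeled/tiles')

    # Backbone defaults to resnet34, as noted in the guide
    feature_classifier = FeatureClassifier(data)

    feature_classifier.lr_find()                  # suggest a learning rate
    feature_classifier.fit(10)                    # illustrative epoch count
    feature_classifier.show_results()             # inspect sample predictions
    feature_classifier.save('damage-classifier')  # hypothetical model name
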

guide/14-deep-learning/pixel_based_classification.ipynb

Lines changed: 89 additions & 101 deletions
Large diffs are not rendered by default.

samples/04_gis_analysts_data_scientists/Wildfire_analysis_using_Sentinel-2_imagery.ipynb

Lines changed: 1 addition & 0 deletions
Large diffs are not rendered by default.

samples/04_gis_analysts_data_scientists/building_damage_assessment_using_feature_classifier.ipynb

Lines changed: 904 additions & 0 deletions
Large diffs are not rendered by default.

samples/04_gis_analysts_data_scientists/land_cover_classification_using_unet.ipynb

Lines changed: 102 additions & 171 deletions
Large diffs are not rendered by default.
2.35 MB

0 commit comments
