AvaMins
diff --git a/slides/2_26/11-Convolution.key
42.4 MB b/slides/2_26/11-Convolution.key
diff --git a/slides/2_26/11-Convolution.pdf
25.7 MB b/slides/2_26/11-Convolution.pdf
diff --git a/slides/2_26/channels.ipynb
+295 b/slides/2_26/channels.ipynb
diff --git a/slides/2_26/channels.pdf
101 KB b/slides/2_26/channels.pdf
@@ -0,0 +1,295 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {
+    "slideshow": {
+     "slide_type": "slide"
+    }
+   },
+   "source": [
+    "# Multiple Input and Output Channels\n",
+    "\n",
+    "**Multiple Input Channels**\n",
+    "\n",
+    "![Cross-correlation with 2 input channels](http://www.d2l.ai/_images/conv_multi_in.svg)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {
+    "attributes": {
+     "classes": [],
+     "id": "",
+     "n": "1"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "import d2l\n",
+    "from mxnet import nd\n",
+    "\n",
+    "def corr2d_multi_in(X, K):\n",
+    "    # Cross-correlate each channel of X with the matching channel of K\n",
+    "    # (zip iterates over dimension 0, the channel dimension). The * unpacks\n",
+    "    # the list of per-channel results so that nd.add_n can sum them.\n",
+    "    return nd.add_n(*[d2l.corr2d(x, k) for x, k in zip(X, K)])"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {
+    "slideshow": {
+     "slide_type": "slide"
+    }
+   },
+   "source": [
+    "We construct the input array `X` and the kernel array `K` from the diagram above to verify the output of the cross-correlation operation."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {
+    "attributes": {
+     "classes": [],
+     "id": "",
+     "n": "2"
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "\n",
+       "[[ 56.  72.]\n",
+       " [104. 120.]]\n",
+       "<NDArray 2x2 @cpu(0)>"
+      ]
+     },
+     "execution_count": 2,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "X = nd.array([[[0, 1, 2], [3, 4, 5], [6, 7, 8]],\n",
+    "              [[1, 2, 3], [4, 5, 6], [7, 8, 9]]])\n",
+    "K = nd.array([[[0, 1], [2, 3]], [[1, 2], [3, 4]]])\n",
+    "\n",
+    "corr2d_multi_in(X, K)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {
+    "slideshow": {
+     "slide_type": "slide"
+    }
+   },
+   "source": [
+    "**Multiple Output Channels**\n",
+    "\n",
+    "For multiple output channels, we compute one multi-input cross-correlation per output channel and stack the results along a new leading dimension."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {
+    "attributes": {
+     "classes": [],
+     "id": "",
+     "n": "3"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "def corr2d_multi_in_out(X, K):\n",
+    "    # Iterate over dimension 0 of K (the output channels) and cross-correlate each\n",
+    "    # multi-channel kernel k with the input X. nd.stack merges the results along a new axis 0.\n",
+    "    return nd.stack(*[corr2d_multi_in(X, k) for k in K])"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "We construct a convolution kernel with 3 output channels by stacking the kernel array `K` with `K+1` (one added to each element of `K`) and `K+2`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {
+    "attributes": {
+     "classes": [],
+     "id": "",
+     "n": "4"
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "(3, 2, 2, 2)"
+      ]
+     },
+     "execution_count": 4,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "K = nd.stack(K, K + 1, K + 2)\n",
+    "K.shape"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {
+    "slideshow": {
+     "slide_type": "slide"
+    }
+   },
+   "source": [
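+    "With multiple input and output channels combined, `X` has shape (2, 3, 3) and `K` has shape (3, 2, 2, 2), so the output has 3 channels. The first output channel matches the single-output result above."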
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {
+    "attributes": {
+     "classes": [],
+     "id": "",
+     "n": "5"
+    }
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "(2, 3, 3)\n",
+      "(3, 2, 2, 2)\n",
+      "\n",
+      "[[[ 56.  72.]\n",
+      "  [104. 120.]]\n",
+      "\n",
+      " [[ 76. 100.]\n",
+      "  [148. 172.]]\n",
+      "\n",
+      " [[ 96. 128.]\n",
+      "  [192. 224.]]]\n",
+      "<NDArray 3x2x2 @cpu(0)>\n"
+     ]
+    }
+   ],
+   "source": [
+    "print(X.shape)\n",
+    "print(K.shape)\n",
+    "print(corr2d_multi_in_out(X, K))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {
+    "slideshow": {
+     "slide_type": "slide"
+    }
+   },
+   "source": [
+    "## $1\\times 1$ Convolutions\n",
+    "\n",
+    "![1x1 convolutions](http://www.d2l.ai/_images/conv_1x1.svg)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {
+    "attributes": {
+     "classes": [],
+     "id": "",
+     "n": "6"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "def corr2d_multi_in_out_1x1(X, K):\n",
+    "    c_i, h, w = X.shape\n",
+    "    c_o = K.shape[0]\n",
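+    "    X = X.reshape((c_i, h * w))  # Flatten each input channel into a row of h * w pixels.\n",
+    "    K = K.reshape((c_o, c_i))  # Drop the trailing 1 x 1 kernel dimensions.\n",
+    "    Y = nd.dot(K, X)  # Matrix multiplication, as in a fully connected layer.\n",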
+    "    return Y.reshape((c_o, h, w))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {
+    "slideshow": {
+     "slide_type": "slide"
+    }
+   },
+   "source": [
+    "The matrix-multiplication version above is equivalent to cross-correlation with a $1\\times 1$ kernel; the two outputs agree up to floating-point error."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "metadata": {
+    "attributes": {
+     "classes": [],
+     "id": "",
+     "n": "7"
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "True"
+      ]
+     },
+     "execution_count": 7,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "X = nd.random.uniform(shape=(3, 3, 3))\n",
+    "K = nd.random.uniform(shape=(2, 3, 1, 1))\n",
+    "\n",
+    "Y1 = corr2d_multi_in_out_1x1(X, K)\n",
+    "Y2 = corr2d_multi_in_out(X, K)\n",
+    "\n",
+    "(Y1 - Y2).norm().asscalar() < 1e-6"
+   ]
+  }
+ ],
+ "metadata": {
+  "celltoolbar": "Slideshow",
+  "kernelspec": {
+   "display_name": "Python 3",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.7.2"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
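
The notebook above depends on `mxnet` and on `d2l.corr2d`, neither of which is defined in this diff. For readers who want to check the numbers without those packages, the following is a minimal NumPy sketch of the same three functions. It is not part of the commit: the single-channel `corr2d` helper here is an assumed stand-in for `d2l.corr2d` (no padding, stride 1), and the names `K3`, `X_r`, and `K_r` are introduced only for this illustration.

```python
import numpy as np


def corr2d(X, K):
    """Single-channel 2D cross-correlation (assumed stand-in for d2l.corr2d)."""
    h, w = K.shape
    Y = np.zeros((X.shape[0] - h + 1, X.shape[1] - w + 1))
    for i in range(Y.shape[0]):
        for j in range(Y.shape[1]):
            # Elementwise product of the current window with the kernel, then sum.
            Y[i, j] = (X[i:i + h, j:j + w] * K).sum()
    return Y


def corr2d_multi_in(X, K):
    # Sum the per-channel cross-correlations over the input channels.
    return sum(corr2d(x, k) for x, k in zip(X, K))


def corr2d_multi_in_out(X, K):
    # One multi-input cross-correlation per output channel, stacked on axis 0.
    return np.stack([corr2d_multi_in(X, k) for k in K])


def corr2d_multi_in_out_1x1(X, K):
    # A 1x1 convolution expressed as a matrix product over the channel dimension.
    c_i, h, w = X.shape
    c_o = K.shape[0]
    Y = K.reshape(c_o, c_i) @ X.reshape(c_i, h * w)
    return Y.reshape(c_o, h, w)


# Same input and kernel values as in the notebook cells.
X = np.array([[[0, 1, 2], [3, 4, 5], [6, 7, 8]],
              [[1, 2, 3], [4, 5, 6], [7, 8, 9]]], dtype=float)
K = np.array([[[0, 1], [2, 3]], [[1, 2], [3, 4]]], dtype=float)
K3 = np.stack([K, K + 1, K + 2])  # 3 output channels, shape (3, 2, 2, 2)

print(corr2d_multi_in(X, K))
print(corr2d_multi_in_out(X, K3))

# Compare the 1x1 matrix-product version with the general version on random data.
rng = np.random.default_rng(0)
X_r = rng.uniform(size=(3, 3, 3))
K_r = rng.uniform(size=(2, 3, 1, 1))
Y1 = corr2d_multi_in_out_1x1(X_r, K_r)
Y2 = corr2d_multi_in_out(X_r, K_r)
print(np.allclose(Y1, Y2))
```

The explicit double loop in `corr2d` favors readability over speed. The two printed arrays should agree with the outputs recorded in the notebook cells.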