Added TFLite Raspberry Pi (Python) image segmentation sample app.

khanhlvg · copybara-github · commit 91c5436c2886 · 2021-12-08T00:59:27.000-08:00
PiperOrigin-RevId: 414929525
diff --git a/lite/examples/image_segmentation/raspberry_pi/README.md b/lite/examples/image_segmentation/raspberry_pi/README.md
@@ -0,0 +1,115 @@
+# TensorFlow Lite Python image segmentation example with Raspberry Pi.
+
+This example uses [TensorFlow Lite](https://tensorflow.org/lite) with Python on
+a Raspberry Pi to perform real-time image segmentation using images streamed
+from the camera.
+
+At the end of this page, there are extra steps to accelerate the example using
+the Coral USB Accelerator, which increases the inference speed by ~10x.
+
+## Set up your hardware
+
+Before you begin, you need to
+[set up your Raspberry Pi](https://projects.raspberrypi.org/en/projects/raspberry-pi-setting-up)
+with Raspberry Pi OS (preferably updated to Buster).
+
+You also need to
+[connect and configure the Pi Camera](https://www.raspberrypi.org/documentation/configuration/camera.md)
+if you use the Pi Camera. This code also works with USB camera connect to the
+Raspberry Pi.
+
+And to see the results from the camera, you need a monitor connected to the
+Raspberry Pi. It's okay if you're using SSH to access the Pi shell (you don't
+need to use a keyboard connected to the Pi)—you only need a monitor attached to
+the Pi to see the camera stream.
+
+## Install the TensorFlow Lite runtime
+
+In this project, all you need from the TensorFlow Lite API is the `Interpreter`
+class. So instead of installing the large `tensorflow` package, we're using the
+much smaller `tflite_runtime` package.
+
+To install this on your Raspberry Pi, follow the instructions in the
+[Python quickstart](https://www.tensorflow.org/lite/guide/python#install_tensorflow_lite_for_python).
+
+You can install the TFLite runtime using this script.
+
+```
+sh setup.sh
+```
+
+## Download the example files
+
+First, clone this Git repo onto your Raspberry Pi like this:
+
+```
+git clone https://github.com/tensorflow/examples --depth 1
+```
+
+Then use our script to install a couple Python packages, and download the
+`Deeplabv3` model:
+
+```
+cd examples/lite/examples/image_segmentation/raspberry_pi
+
+# The script install the required dependencies and download the TFLite models.
+sh setup.sh
+```
+
+## Run the example
+
+```
+python3 segment.py
+```
+
+*   You can optionally specify the `model` parameter to set the TensorFlow Lite
+    model to be used:
+    *   The default value is `deeplabv3.tflite`
+    *   Image segmentation models from TensorFlow Hub **with metadata** are
+        supported.
+*   You can optionally specify the `displayMode` parameter to change how the
+    segmentation result is displayed:
+    *   Use values: `overlay`, `side-by-side`.
+    *   The default value is `overlay`.
+*   Example usage:
+
+```
+python3 main.py
+  --model somemodel.tflite
+  --displayMode side-by-side
+```
+
+**Overlay mode** ![Overlay Image](overlay_mode.png)
+
+**Side-by-side mode** ![Side-by-side Image](sidebyside_mode.png)
+
+For more information about executing inferences with TensorFlow Lite, read
+[TensorFlow Lite inference](https://www.tensorflow.org/lite/guide/inference).
+
+## Speed up model inference (optional)
+
+If you want to significantly speed up the inference time, you can attach an
+[Coral USB Accelerator](https://coral.withgoogle.com/products/accelerator)—a USB
+accessory that adds the
+[Edge TPU ML accelerator](https://coral.withgoogle.com/docs/edgetpu/faq/) to any
+Linux-based system.
+
+If you have a Coral USB Accelerator, you can run the sample with it enabled:
+
+1.  First, be sure you have completed the
+    [USB Accelerator setup instructions](https://coral.withgoogle.com/docs/accelerator/get-started/).
+
+2.  Run the image segmentation script using the EdgeTPU TFLite model and enable
+    the EdgeTPU option.
+
+```
+python3 main.py \
+  --enableEdgeTPU
+  --model deeplabv3_edgetpu.tflite
+```
+
+You should see significantly faster inference speeds.
+
+For more information about creating and running TensorFlow Lite models with
+Coral devices, read
+[TensorFlow models on the Edge TPU](https://coral.withgoogle.com/docs/edgetpu/models-intro/).
diff --git a/lite/examples/image_segmentation/raspberry_pi/overlay_mode.png b/lite/examples/image_segmentation/raspberry_pi/overlay_mode.png
diff --git a/lite/examples/image_segmentation/raspberry_pi/requirements_pypi.txt b/lite/examples/image_segmentation/raspberry_pi/requirements_pypi.txt
@@ -0,0 +1,3 @@
+argparse
+numpy>=1.20.0
+opencv-python~=4.5.3.56
diff --git a/lite/examples/image_segmentation/raspberry_pi/requirements_tflite.txt b/lite/examples/image_segmentation/raspberry_pi/requirements_tflite.txt
@@ -0,0 +1,2 @@
+--extra-index-url https://google-coral.github.io/py-repo/
+tflite-runtime==2.5.0.post1
diff --git a/lite/examples/image_segmentation/raspberry_pi/segment.py b/lite/examples/image_segmentation/raspberry_pi/segment.py
@@ -0,0 +1,216 @@
+# Copyright 2021 The TensorFlow Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+"""Main script to run image segmentation."""
+
+import argparse
+import sys
+import time
+from typing import List
+
+import cv2
+from image_segmenter import ColoredLabel
+from image_segmenter import ImageSegmenter
+from image_segmenter import ImageSegmenterOptions
+import numpy as np
+import utils
+
+# Visualization parameters
+_FPS_AVERAGE_FRAME_COUNT = 10
+_FPS_LEFT_MARGIN = 24  # pixels
+_LEGEND_TEXT_COLOR = (0, 0, 255)  # red
+_LEGEND_BACKGROUND_COLOR = (255, 255, 255)  # white
+_LEGEND_FONT_SIZE = 1
+_LEGEND_FONT_THICKNESS = 1
+_LEGEND_ROW_SIZE = 20  # pixels
+_LEGEND_RECT_SIZE = 16  # pixels
+_LABEL_MARGIN = 10
+_OVERLAY_ALPHA = 0.5
+_PADDING_WIDTH_FOR_LEGEND = 150  # pixels
+
+
+def run(model: str, display_mode: str, num_threads: int, enable_edgetpu: bool,
+        camera_id: int, width: int, height: int) -> None:
+  """Continuously run inference on images acquired from the camera.
+
+  Args:
+      model: Name of the TFLite image segmentation model.
+      display_mode: Name of mode to display image segmentation.
+      num_threads: Number of CPU threads to run the model.
+      enable_edgetpu: Whether to run the model on EdgeTPU.
+      camera_id: The camera id to be passed to OpenCV.
+      width: The width of the frame captured from the camera.
+      height: The height of the frame captured from the camera.
+  """
+
+  # Initialize the image segmentation model.
+  options = ImageSegmenterOptions(
+      num_threads=num_threads, enable_edgetpu=enable_edgetpu)
+  segmenter = ImageSegmenter(model_path=model, options=options)
+
+  # Variables to calculate FPS
+  counter, fps = 0, 0
+  start_time = time.time()
+
+  # Start capturing video input from the camera
+  cap = cv2.VideoCapture(camera_id)
+  cap.set(cv2.CAP_PROP_FRAME_WIDTH, width)
+  cap.set(cv2.CAP_PROP_FRAME_HEIGHT, height)
+
+  # Continuously capture images from the camera and run inference.
+  while cap.isOpened():
+    success, image = cap.read()
+    if not success:
+      sys.exit(
+          'ERROR: Unable to read from webcam. Please verify your webcam settings.'
+      )
+
+    counter += 1
+    image = cv2.flip(image, 1)
+
+    # Segment with each frame from camera.
+    segmentation_result = segmenter.segment(image)
+
+    # Convert the segmentation result into an image.
+    seg_map_img, found_colored_labels = utils.segmentation_map_to_image(
+        segmentation_result)
+
+    # Resize the segmentation mask to be the same shape as input image.
+    seg_map_img = cv2.resize(
+        seg_map_img,
+        dsize=(image.shape[1], image.shape[0]),
+        interpolation=cv2.INTER_NEAREST)
+
+    # Visualize segmentation result on image.
+    overlay = visualize(image, seg_map_img, display_mode, fps,
+                        found_colored_labels)
+
+    # Calculate the FPS
+    if counter % _FPS_AVERAGE_FRAME_COUNT == 0:
+      end_time = time.time()
+      fps = _FPS_AVERAGE_FRAME_COUNT / (end_time - start_time)
+      start_time = time.time()
+
+    # Stop the program if the ESC key is pressed.
+    if cv2.waitKey(1) == 27:
+      break
+    cv2.imshow('image_segmentation', overlay)
+
+  cap.release()
+  cv2.destroyAllWindows()
+
+
+def visualize(input_image: np.ndarray, segmentation_map_image: np.ndarray,
+              display_mode: str, fps: float,
+              colored_labels: List[ColoredLabel]) -> np.ndarray:
+  """Visualize segmentation result on image.
+
+  Args:
+      input_image: The [height, width, 3] RGB input image.
+      segmentation_map_image: The [height, width, 3] RGB segmentation map image.
+      display_mode: How the segmentation map should be shown. 'overlay' or
+        'side-by-side'.
+      fps: Value of fps.
+      colored_labels: List of colored labels found in the segmentation result.
+
+  Returns:
+      Input image overlaid with segmentation result.
+  """
+  # Show the input image and the segmentation map image.
+  if display_mode == 'overlay':
+    # Overlay mode.
+    overlay = cv2.addWeighted(input_image, _OVERLAY_ALPHA,
+                              segmentation_map_image, _OVERLAY_ALPHA, 0)
+  elif display_mode == 'side-by-side':
+    # Side by side mode.
+    overlay = cv2.hconcat([input_image, segmentation_map_image])
+  else:
+    sys.exit(f'ERROR: Unsupported display mode: {display_mode}.')
+
+  # Show the FPS
+  fps_text = 'FPS = ' + str(int(fps))
+  text_location = (_FPS_LEFT_MARGIN, _LEGEND_ROW_SIZE)
+  cv2.putText(overlay, fps_text, text_location, cv2.FONT_HERSHEY_PLAIN,
+              _LEGEND_FONT_SIZE, _LEGEND_TEXT_COLOR, _LEGEND_FONT_THICKNESS)
+
+  # Initialize the origin coordinates of the label.
+  legend_x = overlay.shape[1] + _LABEL_MARGIN
+  legend_y = overlay.shape[0] // _LEGEND_ROW_SIZE + _LABEL_MARGIN
+
+  # Expand the frame to show the label.
+  overlay = cv2.copyMakeBorder(overlay, 0, 0, 0, _PADDING_WIDTH_FOR_LEGEND,
+                               cv2.BORDER_CONSTANT, None,
+                               _LEGEND_BACKGROUND_COLOR)
+
+  # Show the label on right-side frame.
+  for colored_label in colored_labels:
+    rect_color = colored_label.color
+    start_point = (legend_x, legend_y)
+    end_point = (legend_x + _LEGEND_RECT_SIZE, legend_y + _LEGEND_RECT_SIZE)
+    cv2.rectangle(overlay, start_point, end_point, rect_color,
+                  -_LEGEND_FONT_THICKNESS)
+
+    label_location = legend_x + _LEGEND_RECT_SIZE + _LABEL_MARGIN, legend_y + _LABEL_MARGIN
+    cv2.putText(overlay, colored_label.label, label_location,
+                cv2.FONT_HERSHEY_PLAIN, _LEGEND_FONT_SIZE, _LEGEND_TEXT_COLOR,
+                _LEGEND_FONT_THICKNESS)
+    legend_y += (_LEGEND_RECT_SIZE + _LABEL_MARGIN)
+
+  return overlay
+
+
+def main():
+  parser = argparse.ArgumentParser(
+      formatter_class=argparse.ArgumentDefaultsHelpFormatter)
+  parser.add_argument(
+      '--model',
+      help='Name of image segmentation model.',
+      required=False,
+      default='deeplabv3.tflite')
+  parser.add_argument(
+      '--displayMode',
+      help='Mode to display image segmentation.',
+      required=False,
+      default='overlay')
+  parser.add_argument(
+      '--numThreads',
+      help='Number of CPU threads to run the model.',
+      required=False,
+      default=4)
+  parser.add_argument(
+      '--enableEdgeTPU',
+      help='Whether to run the model on EdgeTPU.',
+      action='store_true',
+      required=False,
+      default=False)
+  parser.add_argument(
+      '--cameraId', help='Id of camera.', required=False, default=0)
+  parser.add_argument(
+      '--frameWidth',
+      help='Width of frame to capture from camera.',
+      required=False,
+      default=640)
+  parser.add_argument(
+      '--frameHeight',
+      help='Height of frame to capture from camera.',
+      required=False,
+      default=480)
+  args = parser.parse_args()
+
+  run(args.model, args.displayMode, int(args.numThreads),
+      bool(args.enableEdgeTPU), int(args.cameraId), args.frameWidth,
+      args.frameHeight)
+
+
+if __name__ == '__main__':
+  main()
diff --git a/lite/examples/image_segmentation/raspberry_pi/setup.sh b/lite/examples/image_segmentation/raspberry_pi/setup.sh
@@ -0,0 +1,28 @@
+#!/bin/bash
+
+if [ $# -eq 0 ]; then
+  DATA_DIR="./"
+else
+  DATA_DIR="$1"
+fi
+
+# Install Python dependencies
+python3 -m pip install -r requirements_pypi.txt
+python3 -m pip install -r requirements_tflite.txt
+
+# Download TF Lite models with metadata.
+FILE=${DATA_DIR}/deeplabv3.tflite
+if [ ! -f "$FILE" ]; then
+  curl \
+    -L 'https://tfhub.dev/tensorflow/lite-model/deeplabv3/1/metadata/2?lite-format=tflite' \
+    -o ${FILE}
+fi
+
+FILE=${DATA_DIR}/deeplabv3_edgetpu.tflite
+if [ ! -f "$FILE" ]; then
+  curl \
+    -L 'https://storage.googleapis.com/download.tensorflow.org/models/tflite/edgetpu/deeplabv3_mnv2_dm05_pascal_quant_edgetpu.tflite' \
+    -o ${FILE}
+fi
+
+echo -e "Downloaded files are in ${DATA_DIR}"
diff --git a/lite/examples/image_segmentation/raspberry_pi/sidebyside_mode.png b/lite/examples/image_segmentation/raspberry_pi/sidebyside_mode.png

Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,3 @@`
	`1`	`+argparse`
	`2`	`+numpy>=1.20.0`
	`3`	`+opencv-python~=4.5.3.56`
Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,2 @@`
	`1`	`+--extra-index-url https://google-coral.github.io/py-repo/`
	`2`	`+tflite-runtime==2.5.0.post1`