
Image Inferencing using ML Models on Altair® AI Edge™ Devices

One of the most common use cases for Altair® AI Edge™ devices is scoring (inferencing) an image with a machine learning model. The two most common types of inferencing are:

  • Image Classification: categorizing the whole image based on a pre-trained model.
  • Object Detection (also known as Bounding Box Detection): locating a rectangular region of an image that contains an object.

For example, suppose your Altair® AI Edge™ device takes a picture like this:

wikimedia-seagull

If you provided this image as input to an image classification model where the possible categories were "day" or "night", it would most likely return the result "day":

{"class": "day"}

Note that it would make no reference to "sky" or "bird" because, in this example, the model was only trained to classify images as either "day" or "night".

If you provided this image as input to an object detection model that was trained on detecting birds, it would likely return a label "bird" along with bounding box coordinates (in pixels) of a rectangle enclosing the bird in the picture:

{"class": "bird", "x": 988, "y": 464, "width": 1436, "height": 1936}

Latency

One of the most important metrics to keep in mind when you start scoring images (or video) on Altair® AI Edge™ devices is latency, the time between when you make a request and when you receive a response. Altair® AI Edge™ devices are currently designed only for high-latency use cases (typically 30 seconds or greater). Later versions of Altair® AI Edge™ will be able to reduce this latency to less than 1 second.
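
Because latency dominates how a deployment feels in practice, it is worth measuring explicitly when you test a scoring endpoint. A minimal sketch, assuming you already have an HTTP endpoint for a deployed process (the URL below is a placeholder):

import time
import requests

ENDPOINT = "https://example.com/your-deployed-endpoint"  # placeholder URL, not a real endpoint

start = time.perf_counter()
response = requests.post(ENDPOINT, timeout=120)  # allow generous time for high-latency responses
latency_seconds = time.perf_counter() - start

print(f"Status: {response.status_code}, latency: {latency_seconds:.1f} s")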

Tutorial: Object Detection on an Altair® AI Edge™ Device using YOLOv10

The easiest way to get started with image inferencing is to use a pre-trained model. We will use the standard open-source YOLOv10 single-shot object detection model, which detects one or more objects from the classes in the standard COCO Dataset.
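
If you want to see for yourself what the model expects before building the process, you can inspect the ONNX file locally with onnxruntime. A minimal sketch, assuming you have downloaded yolov10b.onnx from the URL used later in this tutorial:

import onnxruntime as ort

# Model file downloaded from
# https://altair-ai-edge.s3.amazonaws.com/rapidminer/models/yolov10b.onnx
session = ort.InferenceSession("yolov10b.onnx", providers=["CPUExecutionProvider"])

# The model expects a batch of 3-channel 640 x 640 images in NCHW layout,
# which is why the tutorial process rescales and pads every captured image.
print("Input :", session.get_inputs()[0].name, session.get_inputs()[0].shape)
print("Output:", session.get_outputs()[0].name, session.get_outputs()[0].shape)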

In order to follow this tutorial, please first ensure that your Altair® AI Edge™ device is online and connected to IoT Studio, and that you have the AI Edge Toolbox extension successfully installed in AI Studio.

Note: this is a long tutorial. Get yourself a bag of Haribo Goldbears and reward yourself for each step you accomplish. You will be more successful and have an amazing sugar high when you're done!

  1. Create a new AI Hub project called ai-edge-tutorial.

image-inference-0.png

  2. Open AI Studio and connect to the new ai-edge-tutorial project.

image-inference-1.png

image-inference-2.png

  3. Create two new folders in the project repo ai-edge-tutorial:

     • ai-edge-processes - where you will save processes that will eventually reside on your Altair® AI Edge™ device

     • ai-studio-processes - where you will save processes that will be used locally in AI Studio to communicate with your Altair® AI Edge™ device

image-inference-3.png

  4. Open a new AI Studio process, copy the process XML below to your clipboard, and paste it into your new process in AI Studio.

Sample Process XML for Altair® AI Edge™ Image Inferencing

<?xml version="1.0" encoding="UTF-8"?><process version="10.4.003">
  <context>
    <input/>
    <output/>
    <macros>
      <macro>
        <key>threshold</key>
        <value>0.2</value>
      </macro>
      <macro>
        <key>model_name</key>
        <value>hustvl/yolos-tiny</value>
      </macro>
    </macros>
  </context>
  <operator activated="true" class="process" compatibility="10.4.003" expanded="true" name="Process">
    <parameter key="logverbosity" value="init"/>
    <parameter key="random_seed" value="2001"/>
    <parameter key="send_mail" value="never"/>
    <parameter key="notification_email" value=""/>
    <parameter key="process_duration_for_mail" value="30"/>
    <parameter key="encoding" value="SYSTEM"/>
    <process expanded="true">
      <operator activated="true" class="handle_exception" compatibility="10.4.003" expanded="true" height="82" name="Get model" width="90" x="45" y="136">
        <parameter key="add_details_to_log" value="false"/>
        <process expanded="true">
          <operator activated="true" class="open_file" compatibility="10.4.003" expanded="true" height="68" name="Open File" width="90" x="179" y="34">
            <parameter key="resource_type" value="file"/>
            <parameter key="filename" value="%{tempdir}/yolov10b.onnx"/>
          </operator>
          <connect from_op="Open File" from_port="file" to_port="out 1"/>
          <portSpacing port="source_in 1" spacing="0"/>
          <portSpacing port="sink_out 1" spacing="0"/>
          <portSpacing port="sink_out 2" spacing="0"/>
        </process>
        <process expanded="true">
          <operator activated="true" class="open_file" compatibility="10.4.003" expanded="true" height="68" name="Open File (3)" width="90" x="112" y="34">
            <parameter key="resource_type" value="URL"/>
            <parameter key="url" value="https://altair-ai-edge.s3.amazonaws.com/rapidminer/models/yolov10b.onnx"/>
          </operator>
          <operator activated="true" class="write_file" compatibility="10.4.003" expanded="true" height="68" name="Write File" width="90" x="246" y="34">
            <parameter key="resource_type" value="file"/>
            <parameter key="filename" value="%{tempdir}/yolov10b.onnx"/>
            <parameter key="mime_type" value="application/octet-stream"/>
          </operator>
          <connect from_op="Open File (3)" from_port="file" to_op="Write File" to_port="file"/>
          <connect from_op="Write File" from_port="file" to_port="out 1"/>
          <portSpacing port="source_in 1" spacing="0"/>
          <portSpacing port="sink_out 1" spacing="0"/>
          <portSpacing port="sink_out 2" spacing="0"/>
        </process>
      </operator>
      <operator activated="true" class="ai_edge_toolbox:get_image_edge" compatibility="1.3.006" expanded="true" height="82" name="Get Image (Edge)" width="90" x="45" y="238">
        <parameter key="file_name" value="img.png"/>
        <parameter key="file_handling" value="delete from temp directory"/>
        <parameter key="autofocus" value="true"/>
        <parameter key="lens_position" value="1.0"/>
      </operator>
      <operator activated="true" class="image_processing:write_image" compatibility="0.5.000" expanded="true" height="82" name="Write Image" width="90" x="179" y="238">
        <parameter key="file_type" value="jpg"/>
      </operator>
      <operator activated="true" class="python_scripting:python_transformer" compatibility="10.1.002" expanded="true" height="82" name="Scoring Image" width="90" x="313" y="136">
        <parameter key="editable" value="true"/>
        <parameter key="operator" value="{&#10;  &quot;name&quot;: &quot;Custom Python Transformer&quot;,&#10;  &quot;dropSpecial&quot;: false,&#10;  &quot;parameters&quot;: [&#10;    {&#10;      &quot;name&quot;: &quot;threshold&quot;,&#10;      &quot;type&quot;: &quot;real&quot;&#10;    }&#10;  ],&#10;  &quot;inputs&quot;: [&#10;    {&#10;      &quot;name&quot;: &quot;onnx&quot;,&#10;      &quot;type&quot;: &quot;file&quot;&#10;    },&#10;    {&#10;      &quot;name&quot;: &quot;image&quot;,&#10;      &quot;type&quot;: &quot;file&quot;&#10;    }&#10;  ],&#10;  &quot;outputs&quot;: [&#10;    {&#10;      &quot;name&quot;: &quot;scores&quot;,&#10;      &quot;type&quot;: &quot;table&quot;&#10;    }&#10;  ]&#10;}.import pandas as pd&#10;import numpy as np&#10;import cv2&#10;import onnxruntime as ort&#10;from typing import Tuple, Optional&#10;&#10;&#10;def pre_process(image_object: str, max_dimension: int) -&gt; Tuple[np\.ndarray, np\.ndarray, float, Tuple[int, int]]:&#10;&#10;    # image_object is the image as an input file IOObject&#10;    # max_dimension is the width/height of the image required by the model&#10;&#10;    # Read the image&#10;    image = cv2\.imread(image_object)&#10;&#10;    # Get the original dimensions&#10;    original_height, original_width = image\.shape[:2]&#10;&#10;    # Determine the scaling factor to use&#10;    # e\.g\. if max_dimension == 640, original_width == 1663, original_height == 1119&#10;    # then max(original_height, original_width) == 1663&#10;    # and then scale_factor = 640 / 1663 = 0\.3848&#10;    scale_factor = max_dimension / max(original_height, original_width)&#10;&#10;    # Calculate the new dimensions&#10;    # e\.g\. in this example new_width == 640&#10;    # and new_height = 1119 * 0\.3848 = 430\.643 -&gt; 430 to the nearest integer&#10;    new_width = int(original_width * scale_factor)&#10;    new_height = int(original_height * scale_factor)&#10;&#10;    # Resize the image to (new_width, new_height)&#10;    # e\.g\. in this example to 640 x 430&#10;    resized_image = cv2\.resize(image, (new_width, new_height), interpolation=cv2\.INTER_AREA)&#10;&#10;    # Create a black canvas with the max dimensions&#10;    # e\.g\. in this example 640 x 640&#10;    black_canvas = np\.zeros((max_dimension, max_dimension, 3), dtype=np\.uint8)&#10;&#10;    # Calculate offsets - how much to the right/down the original image moved to be superimposed on the black canvas&#10;    # e\.g\. 
in this example it only moves down because the width &gt; height&#10;    # so x_offset == 0&#10;    # and y_offset is (640 - 430) / 2 = 105&#10;    x_offset = (max_dimension - new_width) // 2&#10;    y_offset = (max_dimension - new_height) // 2&#10;&#10;    # Center the resized image on the black canvas&#10;    black_canvas[y_offset:y_offset + new_height, x_offset:x_offset + new_width] = resized_image&#10;&#10;    # Process the image to a numpy array&#10;    np_image = black_canvas\.astype(np\.float32) / 255\.0&#10;    np_image = np\.transpose(np_image, (2, 0, 1))  # Convert HWC to CHW format&#10;    np_image = np\.expand_dims(np_image, axis=0)  # Add batch dimension&#10;&#10;    return np_image, black_canvas, scale_factor, (x_offset, y_offset)&#10;&#10;def post_process(outputs: np\.ndarray, scaled_image: np\.ndarray, scale_factor: Tuple[float, float], offsets: Tuple[int, int], confidence_score_threshold: Optional[float] = 0\.8) -&gt; Tuple[pd\.DataFrame, np\.ndarray]:&#10;    # outputs (np\.ndarray): A NumPy array of shape (batch_size, num_samples, num_features), where num_features should be 6\.&#10;    # The array contains bounding box coordinates (BB1, BB2, BB3, BB4), confidence scores, and class IDs\.&#10;&#10;    # resized_image (np\.ndarray): The resized image on which to draw the bounding boxes\.&#10;&#10;    # scale_factor (float): A float representing the scaling factor between the original image and resized image for the YOLO model\.&#10;&#10;    # offsets (Tuple[int, int]): A tuple containing the offsets (x_offset, y_offset)\.&#10;&#10;    # confidence_score_threshold (Optional[float]): An optional confidence score threshold\. If provided, only rows with &#10;    # a confidence score greater than this threshold will be included in the resulting DataFrame\.&#10;    &#10;    &quot;&quot;&quot;&#10;    Tuple[pd\.DataFrame, np\.ndarray]: A tuple where:&#10;        - The first element is a Pandas DataFrame with columns ['batch_id', 'class_id', 'confidence_score', 'TopLeftPointX', 'TopLeftPointY', 'width', 'height']\.&#10;          The DataFrame includes the batch ID for each sample, bounding box coordinates adjusted based on the provided&#10;          scaling factors and offsets, confidence scores, and class IDs\. If a threshold is provided, only rows with a &#10;          confidence score above the threshold are included\.&#10;        - The second element is a NumPy array representing the resized image with bounding boxes drawn\.&#10;    &quot;&quot;&quot;&#10;    &#10;    # Extract the scaling factors and offsets&#10;    x_offset, y_offset = offsets&#10;&#10;    # Reshape outputs so that the batch dimension is added as a column to the 2D matrix&#10;    batch_size, num_samples, num_features = outputs\.shape&#10;    reshaped_data = outputs\.reshape(batch_size * num_samples, num_features)&#10;    batch_ids = np\.repeat(np\.arange(batch_size), num_samples)&#10;&#10;    # Convert to (topleftx, toplefty, width, height)&#10;    # Order here matters (e\.g\. 
don't fix top left until getting width &amp; height)&#10;    reshaped_data[:, 2] = (reshaped_data[:, 2] / scale_factor - reshaped_data[:, 0] / scale_factor) # width&#10;    reshaped_data[:, 3] = (reshaped_data[:, 3] / scale_factor - reshaped_data[:, 1] / scale_factor) # height&#10;    reshaped_data[:, 0] = (reshaped_data[:, 0] - x_offset) / scale_factor  # top_left_x&#10;    reshaped_data[:, 1] = (reshaped_data[:, 1] - y_offset) / scale_factor   # top_left_y&#10;&#10;    # Round coordinates and dimensions to nearest integers&#10;    reshaped_data[:, 0:4] = np\.round(reshaped_data[:, 0:4])&#10;    &#10;    # Create a dataframe from the numpy array&#10;    df = pd\.DataFrame(reshaped_data, columns=['TopLeftPointX', 'TopLeftPointY', 'width', 'height', 'score', 'class_id'])&#10;    &#10;    # Add the batch_id column in case we add batching later&#10;    df['batch_id'] = batch_ids&#10;&#10;    # Convert specific columns to integers&#10;    df['TopLeftPointX'] = df['TopLeftPointX']\.astype(int)&#10;    df['TopLeftPointY'] = df['TopLeftPointY']\.astype(int)&#10;    df['width'] = df['width']\.astype(int)&#10;    df['height'] = df['height']\.astype(int)&#10;&#10;    # Reorder columns&#10;    df = df[['batch_id', 'class_id', 'score', 'TopLeftPointX', 'TopLeftPointY', 'width', 'height']]&#10;&#10;    # Filter based on the threshold only if threshold is not None&#10;    print(&quot;confidence_score_threshold = &quot;,confidence_score_threshold)&#10;    if confidence_score_threshold is not None:&#10;        df = df[df['score'] &gt; confidence_score_threshold]&#10;    return df, scaled_image&#10;&#10;def rm_main(onnx_model_object, image_object, parameters):&#10;&#9;&#10;&#9;# Create a new data frame containing parameters\.&#10;&#9;image_object = image_object\.name&#10;&#9;onnx_model_object = onnx_model_object\.name&#10;&#9;&#10;&#9;############################################# Load Inference Engine #############################################&#10;&#9;&#10;&#9;# Set up the inference session&#10;&#9;session_options = ort\.SessionOptions()&#10;&#9;&#10;&#9;# Create an inference session&#10;&#9;session = ort\.InferenceSession(onnx_model_object, providers=['CPUExecutionProvider'], sess_options=session_options)&#10;&#9;&#10;&#9;# Get the input and output names&#10;&#9;input_name = session\.get_inputs()[0]\.name&#10;&#9;output_name = session\.get_outputs()[0]\.name&#10;&#10;&#9;################################################## Pre-Process ##################################################&#10;&#9;&#10;&#9;# NOTE: The model requires an image that is 640 x 640\. 
Pre-processing rescales / resizes the image to meet this requirement\.&#10;&#9;&#10;&#9;# Get the image dimensions required by the model&#10;&#9;batch, channels, width, height = session\.get_inputs()[0]\.shape&#10;&#9;max_dimension = max(width, height)&#10;&#9;&#10;&#9;# Rescale the image with padding&#10;&#9;np_image, scaled_image, scale_factor, (x_offset, y_offset) = pre_process(image_object=image_object, max_dimension=max_dimension)&#10;&#9;&#10;&#9;################################################### Inference ###################################################&#10;&#9;&#10;&#9;# Run the inference&#10;&#9;outputs = session\.run([output_name], {input_name: np_image})[0]&#10;&#9;&#10;&#9;################################################## Post-Process #################################################&#10;&#9;&#10;&#9;# Convert the scored onnx output numpy array to a pandas dataframe&#10;&#9;df, scaled_image = post_process(outputs=outputs, scaled_image=scaled_image, scale_factor=scale_factor, offsets=(x_offset, y_offset), confidence_score_threshold=0\.8)&#10;&#9;&#10;&#9;# Return the new data frame and pass through the input data\.&#10;&#9;return df"/>
        <parameter key="use_default_python" value="false"/>
        <parameter key="package_manager" value="specific python binaries"/>
        <parameter key="conda_environment" value="ai-edge-torch"/>
        <parameter key="python_binary" value="/altair/python/onnxruntime_v01/onnxruntime_v01/bin/python3"/>
        <parameter key="threshold" value="0.8"/>
      </operator>
      <connect from_op="Get model" from_port="out 1" to_op="Scoring Image" to_port="onnx"/>
      <connect from_op="Get Image (Edge)" from_port="image" to_op="Write Image" to_port="img"/>
      <connect from_op="Write Image" from_port="file" to_op="Scoring Image" to_port="image"/>
      <connect from_op="Scoring Image" from_port="scores" to_port="result 1"/>
      <portSpacing port="source_input 1" spacing="0"/>
      <portSpacing port="sink_result 1" spacing="0"/>
      <portSpacing port="sink_result 2" spacing="0"/>
      <description align="center" color="yellow" colored="false" height="59" resized="true" width="305" x="87" y="45">Altair&amp;#174; AI Edge&amp;#8482; Tutorial Process - Image Inferencing with YOLOv10</description>
    </process>
  </operator>
</process>

image-inference-4.png
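
It is worth noting what the Get model step in the XML above does: it uses Handle Exception to first try opening a cached copy of yolov10b.onnx from the temp directory and, only if that fails, downloads the file and writes it there, so the model is fetched just once. Expressed as plain Python (a sketch of the same idea, not part of the process), the logic is roughly:

import os
import tempfile
import urllib.request

MODEL_URL = "https://altair-ai-edge.s3.amazonaws.com/rapidminer/models/yolov10b.onnx"
model_path = os.path.join(tempfile.gettempdir(), "yolov10b.onnx")

# Equivalent of the Handle Exception branch: reuse the cached file if it exists,
# otherwise download it once and keep it in the temp directory for later runs.
if not os.path.exists(model_path):
    urllib.request.urlretrieve(MODEL_URL, model_path)

print("Model available at", model_path)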

  5. Save this process as score-image-yolo-v10 in the folder ai-edge-processes.

image-inference-5.png

  6. Create a new IoT Connector connection IOObject called iot and save it in your project's Connections folder.

image-inference-6.png

  7. Create a new AI Hub connection IOObject called aihub and save it in your project's Connections folder.

image-inference-7.png

  8. Create a new process called deploy-image-inference in AI Studio and save it in the folder ai-studio-processes.

image-inference-8.png

  9. Drag-and-drop your aihub connection, your iot connection, and a Push Deployment (IoT) operator from the AI Edge Toolbox extension onto your process canvas and wire them as shown. Save your process.

image-inference-9.png

  10. Snapshot your project to synchronize it with AI Hub.

image-inference-10.png

  11. Go to your space in IoT Studio, find your device uid, copy it to your clipboard, and paste it into the parameter called device uid. It will be different from the one shown in the screenshot below.

image-inference-11.png

image-inference-12.png

  12. Fill in the rest of the parameters for the Push Deployment (IoT) operator as shown below.

Note: the entries for the aihub project name and deployment name parameters are completely arbitrary.

Note: pay attention to the syntax of the aihub folder path name parameter - don't forget the / at the end!

image-inference-13.png

  13. Ensure that your result says "ENDPOINT DEPLOYED". This means that the process score-image-yolo-v10 has been successfully pushed down to your Altair® AI Edge™ device.

image-inference-14.png

Note: it may take several tries to complete this step successfully the first time. It requires the correct syntax, two successfully created connection objects, a live AI Hub, and a live IoT Studio space. If you get an error message, follow its instructions. Examining the Log panel in AI Studio can also be helpful.

  14. In your browser, go to IoT Studio → Edge Ops → Fleet Management → Asset Management → <your-device> and click the API Client icon in the upper right.

image-inference-15.png

  15. Download and push the onnxruntime_v01 Python environment down to your Altair® AI Edge™ device (a scripted alternative is sketched after the note below):

     • Method: POST

     • Href: /support/rapidminer/scoring-agents/01/python-environments

     • Body:

{
  "file": "https://altair-ai-edge.s3.amazonaws.com/rapidminer/python-environments/onnxruntime_v01.tar.gz",
  "name": "onnxruntime_v01.tar",
  "auth": {
    "type": "omit"
  }
}

image-inference-16.png

Note: this can take several minutes to complete. You may see one or more error messages appear at the bottom of the screen. Ignore them. Wait until you see a response at the bottom of the screen before proceeding.
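
If you prefer to script this request instead of using the API Client UI, the same call could be made with Python's requests library. This is only a sketch: the base URL and the authentication header are assumptions and will depend on how your IoT Studio space exposes the device API.

import requests

BASE_URL = "https://your-iot-studio-host"           # assumption: your IoT Studio API base URL
HREF = "/support/rapidminer/scoring-agents/01/python-environments"
HEADERS = {"Authorization": "Bearer <your-token>"}   # assumption: adapt to your auth scheme

body = {
    "file": "https://altair-ai-edge.s3.amazonaws.com/rapidminer/python-environments/onnxruntime_v01.tar.gz",
    "name": "onnxruntime_v01.tar",
    "auth": {"type": "omit"},
}

# The environment download can take several minutes, so use a generous timeout
response = requests.post(BASE_URL + HREF, json=body, headers=HEADERS, timeout=600)
print(response.status_code, response.text)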

  16. Create a new process in AI Studio called score-image-on-ai-edge-device.

image-inference-17.png

  17. Build and run the process shown below with your own device uid. The first time it runs, it will take longer than normal because your Altair® AI Edge™ device first has to download the YOLOv10 model. Subsequent runs will be faster.

image-inference-18.png

  18. Two ExampleSets are shown in the Results panel: one with information about your query, and the other with the result of the inferencing. In this case the model found a car (class_id = 2 in the COCO dataset) with a bounding box whose upper-left corner is at coordinates (1095, 1543), a width of 2228 px, and a height of 1046 px. The model's confidence for this detection is 0.926. (A quick way to verify these coordinates visually is sketched below.)

image-inference-19.png

image-inference-20.png
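
To sanity-check the returned coordinates, you can draw the bounding box on the captured image. A minimal sketch with OpenCV, assuming you have saved the captured photo locally; the file name is a placeholder and the box values are copied by hand from the ExampleSet above:

import cv2

# Values taken from the scoring result shown above
top_left_x, top_left_y = 1095, 1543
width, height = 2228, 1046

# Placeholder file name for the captured image
image = cv2.imread("captured_image.jpg")

# Draw the bounding box and save the annotated image
cv2.rectangle(
    image,
    (top_left_x, top_left_y),
    (top_left_x + width, top_left_y + height),
    color=(0, 255, 0),
    thickness=5,
)
cv2.imwrite("captured_image_annotated.jpg", image)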

If there is no object from the COCO dataset classes in your device's field of view, the process will return an empty ExampleSet.

image-inference-21.png