Artificial Intelligence is one of the breakthrough tech in computer science milestones among all their achievements. Image Recognition was a challenging part for the machine to detect objects. In this century, many computing resources and intelligent algorithms make it easy. But the feature will be only for those who have specifically configured machines to detect objects. After the release of Tensorflow Lite on Nov 14th, 2017 which made it easy to develop and deploy Tensorflow models in mobile and embedded devices - in this blog we provide steps to a develop android applications which can detect custom objects using Tensorflow Object Detection API.
Requirements
- Android Studio Installing Android Studio in your System (SDK Version >=27 and NDK Version >=16)
- Tensorflow Installing Tensorflow
- CPU (Intel i7,8GB RAM)or GPU (if you cannot prefer this configuration, try Google Cloud Platform of free $300 credits) to train the model.
- Labelimg (To annotate the image by boundary box)
Before You Get Started
Since the project is full of work with Python codes, libraries and API it’s a good methodology to work in a Python virtual environment and we use pip to install Python package make sure you installed it.
PIP installation:
- To install pip, securely download get-pip.py
- Then run the following:
python get-pip.py
- Check you have correctly installed pip by checking its version:
pip --version
Python Virtual Environment installation:
-
Install virtualenv via pip:
$ pip install virtualenv
-
Test your installation:
$ virtualenv --version
-
Create a virtual environment for a project:
$ virtualenv tensor_android
-
Activating the Environment:
$ cd tensor_android
$ source bin/activate
-
To deactivate:
$ deactivate
If you completed all the steps mentioned above you have to see the ouput like this in your terminal or command prompt:
To know more about python virtual environment feel free to visit this link .
Step 1 - Collect Data:
In this project, we are going to work with custom images so I’m collecting images of Steve jobs and Elon Musk for it. After collecting all the images, annotate or box the object which you have to detect in the image using Labelimg and save both the .jpeg and .xml file of it in the image folder.
After creating a boundary box of an object, we get .xml file of it, it’s called as annotation file which will be used to specify the region where the classifier should focus on and it will be looking like this. The boundary box notation will be xmin,ymin,xmax,ymax.
We successfully collected the required data and annotated it. Now we have to convert the images from xml file to a single csv file, because the file conversion is in this manner .jpeg >.xml > .csv > .record. So we have to convert it, I provided the code to convert the xml’s below.
To save the data file create another data directory in your project file, so its normally easy to organize otherwise save as you wish. But when you create the data directory, create an empty train.csv and test.csv into it. Next step is to convert the csv file to tfrecord file because Tensorflow have many functions when we use our data file in a tfrecord format. I’ve given the code below to convert the .csv to .record or .tfrecord.
That’s great we completed the data processing! Now you need to have two files in your data folder train.record and test.record as your final output of this step.
Second Step - Creating and Training your model:
It’s a tedious process to create a convolutional net, feed your data and train it and also we cannot achieve a good accuracy when we develop the net on our own. So we are going to use Google pre-trained model called ssd_mobilenet_v1_coco. We use the model and config file of it in our project.
Link to download the files:
Create a training folder in your project, move the config file into it. Then create another file in training folder called object_label.pbtxt which defines the labels of the class during testing the model. The object_label should contain:
When you extract the ssd_mobilenet file you get all the pre-trained models. Now we have to make some changes of the config file. But in this config file I already changed it. If you want make changes for your images follow this step.
- This part in the config file describes how many classes we are going to use, since we used Steve and Elon so it’s 2; rest all remains the same.
-
Change some values in this part of config file to reduce the complexity because ssd_mobilenet is trained for 90 classes and it has high configuration values so we need not use that much value so change it.
batch_size:15
num_steps:300
But remember the more you train more you get accuracy - 300 is enough for us to train .
- Make some path changes in this part if you require, otherwise leave it as is for this project.
Great! Now we made all our configuration for the project. We have to download the Tensorflow object detection API ( TensorFlow Object Detection API ) as we need only their object models, I have downloaded and it will be available at this link . Now extract the models zip file and store it in your project folder.
- Installing the model in your system: Navigate to the models directory
$ cd image_android/models
Run, $ python setup.py install
Now you get all the required properties installed to run the API in your system.
- Copy the folder which I selected in the project folder and move it to the object_detection folder inside models folder.
- Then do some quiet steps for configuring protobuf and installing all necessary Library:
Note: You should be in your python virtual environment while executing these command
$ cd “PATH TO THE MODELS FOLDER”
$ sudo apt-get install protobuf-compiler python-pil python-lxml
$ sudo pip install pillow
$ sudo pip install lxml
$ sudo pip install jupyter
$ sudo pip install matplotlib
$ protoc object_detection/protos/*.proto —python_out=.
$ export PYTHONPATH=$PYTHONPATH:pwd
:pwd
/slim
- All set, ready to Train your model.
$ python train.py —logtostderr —train_dir=training/ —pipeline_config_path=training/ssd_mobilenet_v1_pets.config
where, logtostderr - it defines that to store the log data
- After successful training, you can view your model reports in Tensorboard.
From
models/object_detection
, via terminal, you start TensorBoard with:$ tensorboard --logdir='training'
Step 3 - Testing and Exporting the model:
Now we created a model which detects Steve or Elon in the image, but we didn’t see our output Here comes the testing. Before testing we should create an inference graph.
-
Go to
models/object_detection
directory, there is a script that does this for us:export_inference_graph.py
-
Run the code in your terminal:
where,
—trained_checkpoint_prefix is the latest ckpt in the training folder
—output__directory defines the directory where the inference graph should be saved
-
If you get an error about
no module named 'nets'
, go tomodels/
,then you need to re run:$ protoc object_detection/protos/*.proto —python_out=. $ export PYTHONPATH=$PYTHONPATH:
pwd
:pwd
/slim -
Now collect some images for testing, in my case I gathered about 4 images of Elon and Steve and saved in
models/object_detection/test_images
folder and renamed them to image1, image2 etc. iteratively. -
Run the jupyter Notebook,
$ jupyter notebook
-
Navigate to project
models/object_detection
open object_detection_tutorial.ipynb -
Make some changes in it ,the edited one is available in this link
-
Run all the cell ,the final output look like this:
Cool ,at last we created a successful model now our job is to deploy in a android app.
Last Step, Deploying in Android:
Explanations:
Few important pointers that we should know:
- The core of the TensorFlow is written in c++.
- In order to build for Android, we have to use JNI(Java Native Interface) to call the c++ functions like loadModel, getPredictions, etc.
- We will have a .so(shared object) file which is a c++ compiled file and a jar file which will consist of JAVA API that will be calling the native c++. And then, we will be calling the JAVA API to get things done easily.
- So, we need the jar(Java API) and a .so(c++ compiled) file.
- We must have the pre-trained model file and a label file for the classification
Procedures:
-
First clone the tensorflow android repo from this link and store in your project folder:
git clone —recurse-submodules https://github.com/tensorflow/tensorflow.git
-
Get installed Android Studio
-
Download the latest version of the NDK
-
Install Bazel from here . Bazel is the primary build system for TensorFlow.
-
Change the version of SDK and NDK in tensorflow workspace file. The workspace file will be available in the tensorflow directory.
Example: For SDK,
For NDK,
- Create a temp_folder and create a object_label.txt and type :
Unknown Steve Jobs Elon Musk
- Copy the frozen_inference_graph.pb in steve_elon folder and move it to the temp_folder and rename it as steve_elon.pb
- Build the .so file using bazel through this command:
$ bazel build -c opt //tensorflow/contrib/android:libtensorflow_inference.so —crosstool_top=//external:android/crosstool —host_crosstool_top=@bazel_tools//tools/cpp:toolchain —cpu=armeabi-v7a
- The library will be located at:
bazel-bin/tensorflow/contrib/android/libtensorflow_inference.so Move the libtensorflow_inference.so file to the temp_folder
- Build the Java counterpart:
$ bazel build //tensorflow/contrib/android:android_tensorflow_inference_java
- You can find the JAR File at:
bazel-bin/tensorflow/contrib/android/libandroid_tensorflow_inference_java.jar Move the libandroid_tensorflow_inference_java.jar file to the temp_folder
- Now you should these file in your temp_folder:
-
You have collected all your necessary resource files for your android implementation.Now open your Android Studio and Click open the existing project and navigate to :
tensorflow/tensorflow/example/android
and open it.You have the pre-built Tensorflow demo modues applied in the android application . -
Create an assets folder under your app project and move your steve_elon.pb(model file) and object_label.txt (label file) to it.
-
Next step move your libtensorflow_inference.so and libandroid_tensorflow_inference_java.jar into your app project folder
-
Click the libandroid_tensorflow_inference_java.jar and choose “Add As Library”
- Click the libtensorflow_inference.so and choose Link C++ Project with Gradle . Then a CMake dialog box opens give the path of the CMake.txt .Refer the below image
Now the .so file is built with your project. Let’s change some configurations
- Select your build.gradle file and change def nativeBuildSystem:‘none’
- Give your model path and label path to your project.Go to src>DetectorActivity.java in your project and change the path as if in the image below
- Great, thats it! Let’s check if our android app detect Elon or Steve from the image. So Click the Run option .
FINAL OUTPUT:
Finally you must get the result like this. If not check your code or your model.
As soon as this feature hits production, start developing this cool stuff in your mobile. If you’re stuck at any point and need help, comment in the section below and we’ll get back to you. Happy Coding! 🙂
Subscribe to our newsletter
Get the latest updates from our team delivered directly to your inbox.
Related Posts
Using AI to detect Facial Landmarks for improved accuracy
Using AI to detect Facial Landmarks for improved human face recognition accuracy. A complete tutorial on how we used AI to detect facial features of humans.
AI in 2024: What Actually Worked and What’s Coming Next
AI is everywhere, and it’s not just for the big players. Here are some of the most interesting AI projects that worked in 2024.
Skcript's Enterprise AI Manifesto
Our strict guidelines on how we build Enterprise AI products at Skcript.