ear-detection

Finding the distance between the top of the ear and the opening of the ear canal given a reference object.

Initialization

After cloning the repo, make sure to install the following dependencies:

OpenCV for Python:

pip install opencv-python

TensorFlow (includes Keras):

pip install tensorflow

Numpy and Matplotlib:

pip install numpy

pip install matplotlib
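To confirm the installation before running the project, a small check along these lines may help. It is a minimal sketch that is not part of the repo; the function name `find_missing_packages` is hypothetical. It only looks up whether each module can be found and does not import it.

```python
import importlib.util

# Maps pip package names to import names; they differ for OpenCV
# (the opencv-python package is imported as cv2).
REQUIRED_MODULES = {
    "opencv-python": "cv2",
    "tensorflow": "tensorflow",
    "numpy": "numpy",
    "matplotlib": "matplotlib",
}


def find_missing_packages(required=REQUIRED_MODULES):
    """Return the pip package names whose import module cannot be located."""
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = find_missing_packages()
    if missing:
        print("Missing packages. Install with: pip install " + " ".join(missing))
    else:
        print("All dependencies found.")
```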

Input Image Requirements

The image used as the input should meet the following requirements for best results.

It should be cropped to contain just the ear and the reference object. The ear should span at least one-fifth of the image width and at least one-half of the image height. See the following examples for reference.
The reference object should be a distinctive green circle with a diameter of 1 inch. (To change this requirement, read below.)
The resolution of the image should be at least 500px x 500px.
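The size requirements above can be expressed as a quick pre-check. This is a sketch under stated assumptions, not code from the repo: the function name `check_input_image` and the ear bounding box argument are hypothetical, and the box would have to come from your own crop or detection step.

```python
def check_input_image(image_size, ear_box, min_side=500):
    """Return a list of requirement violations (empty if the image passes).

    image_size: (width, height) of the image in pixels.
    ear_box:    (x, y, w, h) bounding box of the ear in pixels (hypothetical input).
    min_side:   minimum width and height in pixels (500 per the requirements).
    """
    width, height = image_size
    _, _, ear_w, ear_h = ear_box
    problems = []

    if width < min_side or height < min_side:
        problems.append(f"resolution below {min_side}px x {min_side}px")
    # Ear must span at least one-fifth of the image width.
    if ear_w * 5 < width:
        problems.append("ear narrower than one-fifth of the image width")
    # Ear must span at least one-half of the image height.
    if ear_h * 2 < height:
        problems.append("ear shorter than one-half of the image height")

    return problems
```

For example, an 800 x 600 image with a 200 x 400 ear box returns an empty list, while a 400 x 600 image with a 50 x 100 ear box reports all three problems.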

Running with Streamlit

To run the project in your browser with Streamlit, follow these steps:

Install Streamlit using the instructions in Streamlit's Installation Guide.

pip install streamlit

Make sure you're in the src/ directory and run

streamlit run st-app.py

Open http://localhost:8501/ in your browser.
Upload a well-cropped image (see the examples in the src/images/input/ directory). That's all!

Running without Streamlit

Copy your input image into the src/images/input/ directory.
Make sure you're in the src/ directory and run

python3 main.py [your-filename]

Model Details

The model finds 55 landmark points around the entire ear, which matches the 110 values of the final dense layer (one x and one y value per landmark). The top of the ear is landmark #0, and the entrance of the ear canal is around landmarks #35 and #36. Model accuracy is calculated by finding the distance between the predicted locations for landmarks #0 (top of ear) and #35 (ear canal) and the expected values.
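To illustrate how the two landmarks turn into a measurement, here is a minimal sketch. It assumes the 110 output values are interleaved as x0, y0, x1, y1, and so on; the source does not state the ordering, so check it against the training labels before relying on this. The function names are hypothetical.

```python
import math

NUM_LANDMARKS = 55
TOP_OF_EAR = 0   # landmark #0
EAR_CANAL = 35   # landmark #35 (the canal entrance is around #35 and #36)


def to_points(flat_output):
    """Convert a flat list of 110 values into 55 (x, y) tuples.

    ASSUMPTION: values are interleaved as [x0, y0, x1, y1, ...].
    """
    if len(flat_output) != 2 * NUM_LANDMARKS:
        raise ValueError(f"expected {2 * NUM_LANDMARKS} values, got {len(flat_output)}")
    return [(flat_output[2 * i], flat_output[2 * i + 1]) for i in range(NUM_LANDMARKS)]


def ear_distance_px(points):
    """Euclidean distance in pixels between the top of the ear and the ear canal."""
    x0, y0 = points[TOP_OF_EAR]
    x1, y1 = points[EAR_CANAL]
    return math.hypot(x1 - x0, y1 - y0)
```

The result is in pixels of whatever image the landmarks refer to; converting it to a physical length requires the scale from the reference object.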

`cnn-model-2.h5`

Model Accuracy: 0.8657

Model: "sequential"
_________________________________________________________________
 Layer (type)                Output Shape              Param #   
=================================================================
 conv2d (Conv2D)             (None, 222, 222, 16)      448       
                                                                 
 conv2d_1 (Conv2D)           (None, 220, 220, 32)      4640      
                                                                 
 max_pooling2d (MaxPooling2D  (None, 110, 110, 32)     0         
 )                                                               
                                                                 
 conv2d_2 (Conv2D)           (None, 108, 108, 64)      18496     
                                                                 
 max_pooling2d_1 (MaxPooling  (None, 54, 54, 64)       0         
 2D)                                                             
                                                                 
 conv2d_3 (Conv2D)           (None, 52, 52, 128)       73856     
                                                                 
 batch_normalization (BatchN  (None, 52, 52, 128)      512       
 ormalization)                                                   
                                                                 
 max_pooling2d_2 (MaxPooling  (None, 26, 26, 128)      0         
 2D)                                                             
                                                                 
 dropout (Dropout)           (None, 26, 26, 128)       0         
                                                                 
 conv2d_4 (Conv2D)           (None, 22, 22, 256)       819456    
                                                                 
 max_pooling2d_3 (MaxPooling  (None, 11, 11, 256)      0         
 2D)                                                             
                                                                 
 conv2d_5 (Conv2D)           (None, 7, 7, 512)         3277312   
                                                                 
 batch_normalization_1 (Batc  (None, 7, 7, 512)        2048      
 hNormalization)                                                 
                                                                 
 max_pooling2d_4 (MaxPooling  (None, 3, 3, 512)        0         
 2D)                                                             
                                                                 
 dropout_1 (Dropout)         (None, 3, 3, 512)         0         
                                                                 
 flatten (Flatten)           (None, 4608)              0         
                                                                 
 dense (Dense)               (None, 1024)              4719616   
                                                                 
 batch_normalization_2 (Batc  (None, 1024)             4096      
 hNormalization)                                                 
                                                                 
 dropout_2 (Dropout)         (None, 1024)              0         
                                                                 
 dense_1 (Dense)             (None, 110)               112750    
                                                                 
=================================================================
Total params: 9,033,230
Trainable params: 9,029,902
Non-trainable params: 3,328
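The parameter counts in the summary can be reproduced by hand, which is a useful sanity check when editing the architecture. The sketch below infers the kernel sizes from the output shapes and parameter counts (3x3 for the first four convolutions, 5x5 for the last two) and assumes a 3-channel input; these are inferences from the summary, not values read from the training script.

```python
def conv2d_params(kernel, in_channels, out_channels):
    """Weights plus one bias per filter for a square-kernel Conv2D layer."""
    return (kernel * kernel * in_channels + 1) * out_channels


def dense_params(in_units, out_units):
    """Weights plus one bias per output unit for a Dense layer."""
    return (in_units + 1) * out_units


def batchnorm_params(channels):
    """Gamma and beta (trainable) plus moving mean and variance (non-trainable)."""
    return 4 * channels


layer_params = [
    conv2d_params(3, 3, 16),      # conv2d
    conv2d_params(3, 16, 32),     # conv2d_1
    conv2d_params(3, 32, 64),     # conv2d_2
    conv2d_params(3, 64, 128),    # conv2d_3
    batchnorm_params(128),        # batch_normalization
    conv2d_params(5, 128, 256),   # conv2d_4
    conv2d_params(5, 256, 512),   # conv2d_5
    batchnorm_params(512),        # batch_normalization_1
    dense_params(3 * 3 * 512, 1024),  # dense (input is the flattened 4608 values)
    batchnorm_params(1024),       # batch_normalization_2
    dense_params(1024, 110),      # dense_1 (55 landmarks x 2 coordinates)
]

total = sum(layer_params)
# Only the moving mean and variance of each BatchNormalization layer are non-trainable.
non_trainable = 2 * (128 + 512 + 1024)
trainable = total - non_trainable

print(total, trainable, non_trainable)
```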

Training a new model

Download the data and labels from here.
If you want to make changes to the model, edit src/model/train-model.py. Remember to update the name of the saved model to make sure pre-trained models are not overwritten.
Change to the src/model directory and run

python3 train-model.py

Update the name of the model used to detect landmarks in the find-landmarks() function in src/utils.py.

Changing the reference object

The reference object is found using the code in the calculate-size-ratio() function in src/utils.py.

To change the color of the reference object, update the HSV color range, represented by the lower and upper variables, using this answer as a reference.

To change the dimensions, update the metric variable.
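When picking a new color range, keep in mind that OpenCV stores 8-bit HSV with hue in 0-179 and saturation and value in 0-255, which differs from the 0-360 degree hue used by most color pickers. The sketch below uses only the standard library to convert an RGB color to that scale and to build a candidate range around it. The helper names and the tolerance values are hypothetical, and it assumes `metric` holds the real-world diameter of the reference object; verify both against calculate-size-ratio() in src/utils.py.

```python
import colorsys


def rgb_to_opencv_hsv(r, g, b):
    """Convert 8-bit RGB to OpenCV's 8-bit HSV scale (H: 0-179, S and V: 0-255)."""
    h, s, v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
    return (round(h * 180) % 180, round(s * 255), round(v * 255))


def hsv_range(center, hue_tolerance=10, s_min=100, v_min=100):
    """Build candidate lower/upper HSV bounds around a center color.

    The tolerance and minimum values are illustrative starting points.
    Hues near 0 or 179 (reds) wrap around and are not handled here.
    """
    hue = center[0]
    lower = (max(hue - hue_tolerance, 0), s_min, v_min)
    upper = (min(hue + hue_tolerance, 179), 255, 255)
    return lower, upper


def units_per_pixel(metric, reference_diameter_px):
    """Real-world units per pixel, given the reference diameter in both scales."""
    return metric / reference_diameter_px


if __name__ == "__main__":
    lower, upper = hsv_range(rgb_to_opencv_hsv(0, 255, 0))  # pure green
    print(lower, upper)
    # A 1 inch circle measured at 200 px wide, and a 350 px ear distance:
    print(350 * units_per_pixel(1.0, 200))
```

With a 1 inch reference circle that appears 200 px wide, each pixel corresponds to 0.005 inch, so a 350 px landmark distance is roughly 1.75 inches.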

Example

Input

Reference Object

Ear Detected and Cropped

Landmarks

References

Datasets

See sheet here.

i-Bug Dataset Cropped to Ears -- used for training the final CNN models, as the images were already annotated with 55 landmarks - download here.
AMI Dataset -- manually annotated subsets 1, 2, and 3 - the landmarks CSV file contains the x and y coordinates of the top of the ear and the entrance to the ear canal - download here.

Haar Cascades for Ear Detection

CNN Model for Landmark Detection

Cintas, C., Quinto-Sánchez, M., Acuña, V., Paschetta, C., de Azevedo, S., Cesar Silva de Cerqueira, C., Ramallo, V., Gallo, C., Poletti, G., Bortolini, M.C., Canizales-Quinteros, S., Rothhammer, F., Bedoya, G., Ruiz-Linares, A., Gonzalez-José, R. and Delrieux, C. (2017), Automatic ear detection and feature extraction using Geometric Morphometrics and convolutional neural networks. IET Biom., 6: 211-223. https://doi.org/10.1049/iet-bmt.2016.0002
https://github.com/kbulutozler/ear-landmark-detection-with-CNN
https://ibug.doc.ic.ac.uk/resources/ibug-ears/
