Introduction

Facial palsy, a condition characterized by neuromuscular dysfunction of the facial region, imposes significant physical and psychological burdens on its sufferers. Affected individuals experience asymmetrical facial expressions, fostering feelings of embarrassment and distress in their social and interpersonal interactions. Compromised ocular muscle function can give rise to widened eye fissures and ocular dryness, severely impairing visual acuity and ocular health1. Furthermore, impairment of the lip muscles results in drooping mouth corners and limited lip mobility, impeding eating, speech, and facial expressiveness and creating challenges in daily life. The detriments caused by facial paralysis extend beyond the physical realm and erode the self-esteem, self-confidence, and mental well-being of patients, underscoring the pressing need for efficacious treatment options2.

With the progression of technology and advancements in the medical field, the repertoire of treatments available for facial palsy has expanded significantly. Clinical practice has witnessed the application of various modalities, including medication, physical therapy, and rehabilitation training, aimed at augmenting patients’ facial functionality. Among these therapeutic approaches, acupuncture, an age-old treatment modality, possesses distinctive merits. Through the stimulation of specific acupuncture points, this traditional practice harmonizes the circulation of vital energy (qi) and blood throughout the body, fostering the recovery and functional amelioration of facial muscles3. Acupuncture is renowned for its safety and absence of adverse effects, while also affording individualized treatment options that can be tailored to the specific circumstances of each patient. As research and implementation of acupuncture persist, its significance in the management of facial paralysis has been progressively underscored4. Functioning as an integrative modality within the realm of medical treatment, acupuncture engenders favorable therapeutic outcomes for individuals afflicted with facial palsy by modulating facial muscle tone, facilitating blood circulation, and enhancing nerve functionality. Clinical investigations and practical application have substantiated acupuncture’s capacity to alleviate symptoms and discomfort, enhance facial muscle functionality, and restore facial expression symmetry, thereby elevating patients’ overall quality of life5.

With the swift progression of artificial intelligence, its ubiquity in the domain of medicine is witnessing an exponential surge. The robust data processing capabilities and pattern recognition prowess inherent to artificial intelligence technology equip it to furnish enhanced precision and efficacy in the realm of disease diagnosis and treatment. In the realm of facial palsy research, artificial intelligence assumes a pivotal role6. By virtue of its capacity to analyze and process facial images, artificial intelligence technology facilitates precise identification and examination of facial attributes in patients afflicted with facial palsy. Leveraging deep learning algorithms and computer vision techniques, automated detection and analysis of facial expressions, ocular muscles, and lip muscles become feasible, thereby enabling quantitative evaluation of the patient’s facial functionality and tracking of condition alterations. This provision of crucial reference information empowers physicians and acupuncturists to craft and fine-tune treatment plans with precision7. Furthermore, artificial intelligence can be harnessed for the development of therapeutic assistance systems tailored to facial palsy. By erecting models and algorithms for facial attribute recognition, in conjunction with real-time facial image acquisition technology, an intelligent system can be devised to continually monitor real-time alterations in patients’ facial attributes and proffer acupuncture treatment recommendations based on these changes. Such an advancement would not only heighten the personalization and precision of treatment but also alleviate the workload of acupuncturists, thereby augmenting treatment outcomes and bolstering patient contentment.

Therefore, to meet the demand for intelligent acupuncture treatment using current artificial intelligence and deep learning technology, this paper proposes a facial feature study for the acupuncture treatment of Bell's facial palsy, with the following contributions:

  1. The facial palsy patients were classified into three categories, light, middle, and severe, according to their facial features;

  2. Facial feature detection and classification of facial palsy condition levels were achieved using an improved SSD network, with an average precision of 87.9% across the three levels;

  3. The refinement of the treatment time for patients at different levels according to the model classification results illustrates the effectiveness of acupuncture treatment for facial palsy.

The rest of the paper is organized as follows: “Related works” section introduces related works on facial feature extraction using traditional methods and deep learning methods; related works on facial paralysis are also given in this section. In “Establishing facial feature recognition model based on improved SSD algorithm for facial palsy patients” section, the framework for facial paralysis classification is established; “Experiment result and analysis” section gives the experimental results and analysis; “Discussion” section presents the discussion, and the Conclusion is presented at the end.

Related works

Study on facial features of faces based on traditional features

Traditional methodologies primarily employ geometric and appearance features for feature extraction. The general procedure encompasses face detection, facial feature point localization (face alignment), extraction of expression features, and subsequent classification. In terms of feature design and extraction, two main categories of features are employed: geometric features and appearance features. Geometric features encompass the shapes of the eyebrows, eyes, nostrils, and mouth, as well as the relative positioning of feature points such as the eyes and mouth. Appearance features, on the other hand, include facial furrows, wrinkles, bulges, and similar attributes. Regarding key facial feature point detection and localization, face key point models can be categorized into 2D and 3D based on the number of feature points. 3D face feature points typically number in the thousands and find common usage in industrial settings, whereas 2D face key point models typically consist of fewer than a thousand feature points. Statistical modeling methods for 2D key feature points include the active shape model (ASM) and the active appearance model (AAM)8. Furthermore, direct regression-based techniques also exhibit robust performance.
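As an illustrative sketch (not part of the original study), the following Python snippet shows a regression-based 2D key point approach using dlib's ensemble-of-regression-trees shape predictor to localize 68 facial landmarks; the image file name and the pre-trained predictor file are placeholders.

```python
# Minimal sketch: 68-point 2D facial landmark localization with dlib's
# regression-tree shape predictor (an example of a regression-based method).
# "face.jpg" and the predictor file name are illustrative placeholders.
import cv2
import dlib

detector = dlib.get_frontal_face_detector()
predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")

img = cv2.imread("face.jpg")
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

for rect in detector(gray, 1):                      # detect faces
    shape = predictor(gray, rect)                   # regress 68 landmarks
    points = [(shape.part(i).x, shape.part(i).y) for i in range(68)]
    for (x, y) in points:                           # draw landmarks
        cv2.circle(img, (x, y), 2, (0, 255, 0), -1)

cv2.imwrite("landmarks.jpg", img)
```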

Establishing facial feature recognition model based on improved SSD algorithm for facial palsy patients

Facial palsy model classification and data creation

Bell's facial palsy is an acute ipsilateral facial nerve paralysis with an unknown cause. It is often attributed to facial nerve edema or demyelination resulting from viral infection or exposure to cold temperatures. Clinical manifestations primarily include incomplete eyelid closure (difficulty in raising and closing the eyebrows, inability to frown) and distorted corners of the mouth (air leakage from the cheek). Bell's facial palsy is usually unilateral, causing sudden weakening of the facial muscles on one side. In most cases, this weakness is temporary and improves significantly within a few weeks. People with Bell's palsy may experience sagging on one side of the face, a unilateral smile, and difficulty in closing the affected eye. Appropriate treatment is crucial for reducing long-term complications, while improper treatment can lead to facial muscle spasms and exacerbate complications such as crocodile tears32. To grade the facial features of individuals undergoing acupuncture treatment for Bell's facial palsy, different levels are assigned based on the severity of the condition and the observed features. In this study, the condition of facial palsy patients was classified into three categories: light, middle, and severe, following recommendations and instructions provided by medical experts. Table 1 presents the main features associated with each level.

Table 1 The clinical feature level of the facial paralysis.

The division of facial palsy features into different levels shows that the changes in expressions and facial features are quite pronounced, so the relevant features can be extracted well by deep learning methods. This completes the classification of the corresponding levels and thereby enables the subsequent classification analysis and personalized treatment design.

A grading evaluation model for patients with facial palsy based on the improved SSD

SSD (Single Shot MultiBox Detector) is an algorithm for target detection whose network structure and features make it advantageous in terms of real-time performance and accuracy. The SSD algorithm uses a convolutional neural network (CNN) based multi-scale feature extraction strategy that combines different levels of feature maps to detect targets33. The network structure of SSD consists of two parts: the base network and the multi-scale feature layers. The base network usually uses a popular convolutional neural network, such as VGG16 or ResNet, for extracting high-level semantic features of images. The multi-scale feature layers, on the other hand, are used to detect targets of different sizes. The structure of the network is shown in Fig. 1.

Figure 1

The structure of the SSD.

SSD has three main characteristics. First, multi-scale feature fusion: SSD introduces multiple feature layers in the network that carry different scales and semantic information; by fusing these layers, SSD can detect targets at different scales, improving the accuracy and robustness of detection. Second, multi-scale anchor boxes: SSD performs target detection by placing anchor boxes of different scales and aspect ratios on each feature layer, so that targets of various sizes and shapes can be covered and detected effectively. Third, decoding of detection results: SSD performs classification and bounding box regression on each anchor box through convolutional and prediction layers34. The classification layer determines whether a target is present in the anchor box and assigns it to a category, while the bounding box regression layer adjusts the position and size of the anchor box to fit the target more accurately. Accordingly, the SSD network can be divided into three parts: multi-scale anchor box generation, bounding box regression, and loss function training. For multi-scale anchor box generation, given a feature layer with its corresponding input image size and stride, the position and size of each anchor box can be calculated with the following equations:

$$box_{width} = base_{width} \times \sqrt{aspect_{ratio}}$$
(1)
$$box_{height} = base_{height} / \sqrt{aspect_{ratio}}$$
(2)
$$center_{x} = (i + 0.5) \times stride$$
(3)
$$center_{y} = (j + 0.5) \times stride$$
(4)
$$anchorbox = [center_{x}, center_{y}, box_{width}, box_{height}]$$
(5)

where \(i\) and \(j\) represent the position indexes on the feature map, \(base_{width}\) and \(base_{height}\) are the width and height of the reference anchor box, and \(aspect_{ratio}\) is the aspect ratio. Then the bounding box regression is performed:

$$predictedbox_{x} = offset_{x} \times anchorbox_{width} + anchorbox_{center\,x}$$
(6)
$$predictedbox_{y} = offset_{y} \times anchorbox_{height} + anchorbox_{center\,y}$$
(7)
$$predictedbox_{width} = \exp(offset_{width}) \times anchorbox_{width}$$
(8)
$$predictedbox_{height} = \exp(offset_{height}) \times anchorbox_{height}$$
(9)

where \(offset_{x}\), \(offset_{y}\), \(offset_{width}\), and \(offset_{height}\) are the bounding box offsets predicted by the network, and \(anchorbox_{center\,x}\), \(anchorbox_{center\,y}\), \(anchorbox_{width}\), and \(anchorbox_{height}\) are the center coordinates, width, and height of the anchor box.
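For clarity, a minimal Python sketch of Eqs. (1)–(9) is given below; the feature-map size, stride, base sizes, and aspect ratios are illustrative values rather than the exact configuration used in this paper.

```python
import numpy as np

def generate_anchors(feat_h, feat_w, stride, base_w, base_h, aspect_ratios):
    """Eqs. (1)-(5): anchor boxes (cx, cy, w, h) for one feature layer."""
    anchors = []
    for j in range(feat_h):
        for i in range(feat_w):
            cx = (i + 0.5) * stride            # Eq. (3)
            cy = (j + 0.5) * stride            # Eq. (4)
            for ar in aspect_ratios:
                w = base_w * np.sqrt(ar)       # Eq. (1)
                h = base_h / np.sqrt(ar)       # Eq. (2)
                anchors.append([cx, cy, w, h]) # Eq. (5)
    return np.array(anchors)

def decode_boxes(anchors, offsets):
    """Eqs. (6)-(9): apply predicted offsets to the anchor boxes."""
    cx = offsets[:, 0] * anchors[:, 2] + anchors[:, 0]  # Eq. (6)
    cy = offsets[:, 1] * anchors[:, 3] + anchors[:, 1]  # Eq. (7)
    w = np.exp(offsets[:, 2]) * anchors[:, 2]           # Eq. (8)
    h = np.exp(offsets[:, 3]) * anchors[:, 3]           # Eq. (9)
    return np.stack([cx, cy, w, h], axis=1)

# Example: a 10x10 feature map with stride 30 on a 300x300 input image.
anchors = generate_anchors(10, 10, 30, base_w=60, base_h=60,
                           aspect_ratios=[1.0, 2.0, 0.5])
offsets = np.zeros((anchors.shape[0], 4))    # zero offsets leave anchors unchanged
decoded = decode_boxes(anchors, offsets)
```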

SSD uses a multi-task loss function that combines classification loss and bounding box regression loss to train the network. The classification loss uses the cross-entropy loss as shown in Eq. (10), the bounding box regression loss uses the smoothed L1 loss as shown in Eq. (11), and the total loss is computed as in Eq. (12).

$$L_{cls} = -\sum \left( y_{true}^{cls} \log(y_{pred}^{cls}) + (1 - y_{true}^{cls}) \log(1 - y_{pred}^{cls}) \right)$$
(10)
$$L_{reg} = \sum \left( y_{true}^{reg} \cdot SmoothL1(y_{pred}^{reg} - y_{true}^{reg}) \right)$$
(11)
$$L = \alpha L_{cls} + \beta L_{reg}$$
(12)
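A simplified PyTorch sketch of the multi-task loss in Eqs. (10)–(12) follows; here the ground-truth regression term is treated as a positive-anchor indicator, and the tensors, batch size, and the weights alpha and beta are placeholders rather than the paper's actual training settings.

```python
import torch
import torch.nn.functional as F

def ssd_multitask_loss(pred_cls, true_cls, pred_reg, true_reg, pos_mask,
                       alpha=1.0, beta=1.0):
    """Illustrative version of Eqs. (10)-(12): cross-entropy classification
    loss plus smooth L1 regression loss over matched (positive) anchors,
    combined with weights alpha and beta."""
    # Eq. (10): binary cross-entropy between predicted and true class labels
    loss_cls = F.binary_cross_entropy(pred_cls, true_cls)
    # Eq. (11): smooth L1 loss on box offsets, weighted by the positive-anchor
    # indicator (interpreting the y_true_reg factor as a matching mask)
    per_anchor = F.smooth_l1_loss(pred_reg, true_reg, reduction="none").sum(dim=1)
    loss_reg = (pos_mask.float() * per_anchor).sum() / pos_mask.float().sum().clamp(min=1)
    # Eq. (12): weighted total loss
    return alpha * loss_cls + beta * loss_reg

# Example with random tensors standing in for network outputs.
pred_cls = torch.rand(8, 4)                      # predicted class probabilities
true_cls = torch.randint(0, 2, (8, 4)).float()   # one-hot ground-truth labels
pred_reg = torch.randn(8, 4)                     # predicted box offsets
true_reg = torch.randn(8, 4)                     # target box offsets
pos_mask = torch.tensor([1, 1, 0, 1, 0, 0, 1, 0])
loss = ssd_multitask_loss(pred_cls, true_cls, pred_reg, true_reg, pos_mask)
```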

In the original SSD (Single Shot MultiBox Detector), the VGG network is modified so that multiple layers, including the final pooling layer, are extracted as feature layers. Default detection boxes of various scales are used to detect targets of different sizes, and each feature layer has a predefined number and size of target boxes. The target class is determined by propagating the data of each default detection box through the fully connected layer, and the best detection candidates are selected based on target accuracy and the overlap rate of the rectangular boxes. In this study, the backbone is further improved by incorporating the MobileNetV3 network. The computational workload of the neural network lies primarily in the feature extraction phase, particularly in the VGG network. MobileNetV3 reduces the computational requirements during feature extraction, leading to a smaller model and higher speed, which makes it well-suited for mobile devices in terms of both time overhead and model size. It has been demonstrated that reducing the number of systematic feature extractions significantly reduces both the time and computational overhead35. Leveraging the excellent performance of MobileNetV3 on mobile devices, this paper replaces the VGG backbone network of SSD with MobileNetV3. A pooling layer operation follows, reducing the size of the feature maps generated by the preceding layers of the network in a layered manner. Replacing the VGG-16 component depicted in Fig. 1 in this way effectively reduces model complexity and enhances the model's applicability. The algorithm framework obtained by improving the SSD network is illustrated in Fig. 2.
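As a rough approximation of this architectural change (not the authors' exact implementation), torchvision's off-the-shelf SSDLite detector with a MobileNetV3-Large backbone can be instantiated as sketched below, with the number of classes set to the three severity levels plus background.

```python
import torch
from torchvision.models.detection import ssdlite320_mobilenet_v3_large

# Sketch only: an SSD-style detector with a MobileNetV3-Large backbone,
# configured for 3 facial palsy levels plus a background class. This
# approximates the VGG -> MobileNetV3 backbone replacement described above
# but is not the authors' exact network.
model = ssdlite320_mobilenet_v3_large(weights=None, num_classes=4)
model.eval()

# Dummy forward pass on one RGB image; the detector returns boxes, labels,
# and scores for each image in the input list.
dummy = [torch.rand(3, 320, 320)]
with torch.no_grad():
    detections = model(dummy)
print(detections[0]["boxes"].shape, detections[0]["labels"].shape)
```

In practice the input resolution, anchor configuration, and detection head would need to be matched to the network described above before training on the facial palsy data.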

Figure 2

The framework for the facial paralysis classification.

Experiment result and analysis

In this study, the Facial Paralysis Image Database (FPID)36, a publicly available facial paralysis image dataset, was used. The database was released to serve as a resource for facial palsy education and research. To demonstrate its utility, the relationship between the level of facial function and the perceived emotional expression was originally characterized using a machine learning-based algorithm. The database is composed of 480 high-resolution images from 60 participants: 10 healthy subjects and 50 patients with different paralysis levels. Based on the facial paralysis classification levels described previously and the improved SSD facial feature recognition algorithm, the patient images in this dataset were collected and analyzed in the experiments.
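Since the exact annotation format is not reproduced here, the following sketch merely illustrates one hypothetical way to organize the images by severity level and split them for training and validation; the directory layout, file extension, and 80/20 ratio are assumptions, not the FPID format.

```python
import random
from pathlib import Path

# Hypothetical layout: one sub-folder per severity level (light/middle/severe).
root = Path("fpid_levels")
samples = [(p, label) for label in ["light", "middle", "severe"]
           for p in sorted((root / label).glob("*.jpg"))]

random.seed(42)
random.shuffle(samples)
split = int(0.8 * len(samples))          # assumed 80/20 train/validation split
train_set, val_set = samples[:split], samples[split:]
print(f"{len(train_set)} training images, {len(val_set)} validation images")
```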

The model training and classification result

The model training is performed using the facial paralysis classification framework shown in Fig. 2, and the variation of loss function and model precision throughout the training process is shown in Fig. 3.

Figure 3

The training loss and precision of the proposed model.

Figure 3 illustrates the gradual change of the loss function and model precision as the number of iterations increases. It can be observed that the model's precision fluctuates after a certain number of training iterations; however, the final precision remains relatively stable, reaching an average precision of 87.9%. The detailed classification results for the different facial palsy classes were compared and analyzed using metrics such as Precision, Recall, and F1-score, and are presented in Table 2 and Fig. 4.
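Per-class Precision, Recall, and F1-score of the kind reported in Table 2 can be computed from the predicted and true level labels, for example with scikit-learn; the label lists below are placeholders rather than the actual model outputs.

```python
from sklearn.metrics import classification_report

# Placeholder label lists; in practice these come from the trained model's
# predictions on the validation images.
y_true = ["light", "middle", "severe", "middle", "severe", "light"]
y_pred = ["light", "severe", "severe", "middle", "severe", "light"]

print(classification_report(y_true, y_pred,
                            labels=["light", "middle", "severe"], digits=3))
```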

Table 2 The classification result for the facial paralysis level.
Figure 4

The classification result for the facial paralysis level.

According to the recognition results for the three categories of facial palsy patients shown in Fig. 4, the recognition precision is lowest for the middle category and highest for the severe category. This is because the middle category is subject to certain deviations in the actual classification process, which is often carried out by exclusion, so its recognition performance tends to be poorer.

Methods comparison for the facial feature extraction

After the model training and result analysis were completed, comparison experiments were conducted to better illustrate the usefulness of the improved SSD network. The unimproved SSD and a traditional CNN method were selected for comparison, where the CNN model and structure are similar to the SSD network structure; the results obtained are shown in Fig. 5.

Figure 5

The facial paralysis level classification using different methods.

According to the data in Fig. 5, the proposed method has the highest recognition rate for patients at all three levels of facial palsy. From the curve changes, it is also easy to see that all three methods recognize the severe category best among the light, middle, and severe conditions, so further enhancement of the balance and generalization ability of the model is needed in future research.

The classification analysis among patients with different levels

After completing the training of the model and the construction of the framework, we further analyzed the classification results; for the data used, the overall confusion matrix is shown in Fig. 6.
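A confusion matrix such as the one in Fig. 6 can likewise be derived from the predicted and true labels; the label lists in the sketch below are placeholders.

```python
from sklearn.metrics import confusion_matrix

levels = ["light", "middle", "severe"]
# Placeholder labels; rows are true levels, columns are predicted levels.
y_true = ["light", "middle", "severe", "middle", "severe", "light"]
y_pred = ["light", "severe", "severe", "middle", "severe", "light"]
cm = confusion_matrix(y_true, y_pred, labels=levels)
for level, row in zip(levels, cm):
    print(level, row)
```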

Figure 6

The confusion matrix for the different levels.

In Fig. 6, it can be seen that there are more misclassifications for the MIDDLE class, so further research is required on how to balance the model performance across levels in future studies. After completing the analysis of the model data, we compiled statistics on the treatment duration of the patients in the dataset, and the results are shown in Fig. 7.

Figure 7

The ratio distribution of treatment time for patients with facial paralysis.

We conducted a statistical analysis based on the treatment data of the relevant patients in the database. They were all treated using the standard treatment plan of the hospital where the dataset creator was located, and their facial paralysis was alleviated to varying degrees. In Fig. 7, it can be seen that as the treatment time increases, the severity level of patients gradually decreases, while the proportion of light cases is lowest among patients who received only one quarter of treatment. The results in Fig. 7 demonstrate to some extent that acupuncture and moxibustion change the facial features of patients and have a positive clinical treatment effect.
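The proportion distribution in Fig. 7 can be reproduced from tabular treatment records, as sketched below; the column names, quarter-based grouping, and sample values are assumptions about the record format rather than the dataset's actual schema.

```python
import pandas as pd

# Hypothetical treatment records; "quarters_treated" and "level" are assumed
# column names, not fields of the original dataset.
records = pd.DataFrame({
    "quarters_treated": [1, 1, 2, 2, 3, 3, 3, 4, 4, 4],
    "level": ["severe", "middle", "middle", "light", "light",
              "light", "middle", "light", "light", "middle"],
})

# Share of each severity level within every treatment-duration group,
# analogous to the ratio distribution plotted in Fig. 7.
ratios = (records.groupby("quarters_treated")["level"]
                 .value_counts(normalize=True)
                 .unstack(fill_value=0))
print(ratios)
```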

Discussion

Facial palsy is a neurological disorder that profoundly impacts the facial expression muscles and significantly affects patients' quality of life and social interactions. While acupuncture has traditionally been used as a treatment for facial palsy, its effectiveness is challenged by the complexity of the condition. Fortunately, the rapid progress of artificial intelligence technology offers new possibilities for facial palsy research and treatment. In this study, we focused on leveraging an improved SSD network for facial feature recognition in patients with Bell's facial palsy. By enhancing the conventional CNN and SSD models and incorporating the efficient MobileNetV3 architecture, the improved SSD network demonstrated notable advantages in facial feature recognition.

First, the improved SSD exhibited superior performance in terms of recognition rate. The integration of MobileNetV3 into the network structure enhanced the accuracy of the model, enabling more precise identification of the facial features associated with facial palsy. Second, the improved SSD is superior in speed: after the improvement, its running time was reduced by more than 20% compared to before. By integrating MobileNetV3, the improved network structure is lighter, reducing the computational burden and increasing processing speed; the design of MobileNetV3 focuses on performance on mobile and embedded devices, further accelerating recognition by optimizing the convolution operations. The improved SSD therefore balances accuracy and computing speed effectively, which is crucial for real-time clinical applications: fast and accurate recognition of facial features helps acupuncturists assess patients' conditions and monitor treatment progress.

Overall, the improved SSD network presented in this study offers promising advancements in facial feature recognition for patients with facial palsy. Its enhanced accuracy and speed make it a valuable tool in clinical settings, facilitating improved patient assessment and treatment monitoring.

The application of artificial intelligence in the future treatment of facial palsy has a broad prospect and great potential. First, AI can provide more accurate and efficient support for the diagnosis and treatment of facial palsy. By using big data and machine learning algorithms, more accurate facial feature recognition models can be established to assist doctors and acupuncturists to make more accurate diagnosis and develop personalized treatment plans. Secondly, AI can provide real-time monitoring and feedback during the treatment of facial palsy. By using intelligent facial image acquisition devices and AI algorithms, it can track the changes of patients' facial features in real time, evaluate the treatment effect and adjust the treatment plan in time. This will greatly improve the accuracy and personalization of treatment, while reducing the workload of the acupuncturist. In addition, AI can also promote the development of intelligent and personalized facial palsy treatment. By combining AI with technologies such as virtual reality and augmented reality, intelligent systems for facial palsy rehabilitation training can be developed. These systems can provide personalized rehabilitation training plans based on the patient's specific situation and guide the patient through training with real-time monitoring and feedback. This can not only increase patient participation and motivation, but also improve the rehabilitation effect and accelerate the rehabilitation process. Through continuous exploration and innovation, artificial intelligence will bring more accurate, personalized and effective solutions for facial palsy treatment.

Conclusion

This study applies the improved SSD algorithm to facial feature recognition in patients receiving acupuncture treatment for Bell's facial palsy, so as to realize intelligent grading evaluation of the patients' condition. In this paper, the grading of light, middle, and severe patients is completed based on the existing facial palsy data, and the recognition accuracy and computational efficiency of the model are improved by modifying the existing SSD network. The experimental results show that by introducing MobileNetV3 to replace VGG, the prediction accuracy of the model can be greatly improved, and the average recognition rate for the three categories of patients reaches 87.9%, higher than the 80.3% achieved by the unimproved method. Meanwhile, this paper analyzes the application prospects of AI methods within this field, illustrates the usability of deep learning methods, and provides new ideas for the future development of smart medicine.

However, there are also some limitations of the study. The research was conducted with a small dataset and focused on only three levels of classification. To further enhance the reliability and generalizability of the model, future research should expand the dataset, ensure data privacy and security, and improve the robustness of the algorithm.