Research on Micro Expression Emotion Recognition Algorithm Based on Improved YOLOV8 and ResNet
DOI:
https://doi.org/10.54691/fycvf690Keywords:
Micro-expression Recognition; YOLOv8; ResNet; Attention Mechanism; Two-stage Model.Abstract
With the rapid advancement of artificial intelligence, micro-expression recognition has demonstrated significant application value in fields such as criminal investigation and lie detection. However, the extremely short duration of micro-expressions and the scarcity of annotated data pose substantial challenges to accurate recognition. This study proposes a two-stage micro-expression recognition method based on an improved YOLOv8 and ResNet framework. In the first stage, a YOLOv8 model enhanced with StarNet multi-scale feature fusion and the CBAM attention mechanism is employed to achieve high-precision face detection. In the second stage, the detected static facial frames are fed into a ResNet50 network embedded with an SE module to strengthen channel-wise feature representation, thereby enabling micro-expression classification. For experimental validation, the improved models were trained and tested on the WiderFace and RAF-DB datasets. Results indicate that the improved YOLOv8 achieved an mAP of 0.693 on WiderFace, representing an increase of 0.109 over the baseline, while the SE-ResNet50 attained an F1-score of 0.7548 on RAF-DB, with an improvement of 0.2. These findings confirm the effectiveness of the proposed approach in both face detection and micro-expression classification. Moreover, this study provides a feasible solution to address the challenges of capturing micro-expressions and data insufficiency, while also laying the groundwork for future research on end-to-end joint modeling and lightweight network optimization.
Downloads
References
[1] Saeed U. Facial micro-expressions as a soft biometric for person recognition[J]. Pattern Recognition Letters, 2021, 143: 95-103.
[2] Huang XH, Zhao GY, Hong XP, et al. Spontaneous facial micro-expression analysis using spatiotemporal completed local quantized patterns[J]. Neurocomputing, 2016, 175: 564-578.
[3] T. Pfister, X. Li, G. Zhao, et al. Recognising Spontaneous Facial Micro-Expressions[C]. 2011 International Conference on Computer Vision. IEEE, 2011, 1449-1456.
[4] Woo S, Park J, Lee J Y, et al. Cbam: Convolutional block attention module[C]//Proceedings of the European conference on computer vision (ECCV). 2.
[5] Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7132-7141.
[6] Liu W, Chen C, Wong K Y K, et al. Star-net: a spatial attention residue network for scene text recognition[C]//BMVC. 2016, 2: 7.
[7] Yang S, Luo P, Loy C C, et al. Wider face: A face detection benchmark[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 5525-5533.
[8] Li S, Deng W, Du J P. Reliable crowdsourcing and deep locality-preserving learning for expression recognition in the wild[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2852-2861.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Scientific Journal of Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.






