Facial expression recognition has become an increasingly important area of research in recent years. Neural network-based methods have made substantial progress on recognition tasks, winning competitions organized by various data science communities and achieving high performance on many datasets. Researchers have used a variety of regularization methods to combat over-fitting, reduce training time, and improve the generalization of their models. In this paper, we apply the Haar Cascade classifier to crop faces and focus on the region of interest, and we hypothesize that this attains fast convergence without analyzing the whole image. We also apply label smoothing and analyze its effect on the CK+, KDEF, and RAF databases. The ResNet model is employed as an example of a neural network model. Label smoothing improved recognition accuracy by up to 0.5% on the CK+ and KDEF databases. Although applying the Haar Cascade decreased the achieved accuracy on the KDEF and RAF databases by a small margin, fast convergence of the model was observed.
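For readers unfamiliar with label smoothing, the following is a minimal sketch of the standard formulation, in which the one-hot target is mixed with a uniform distribution over the classes. It is not the paper's implementation: the function names, the smoothing factor of 0.1, the choice of seven expression classes, and the example logits are all assumptions made for illustration.

```python
import math

def smooth_labels(label, num_classes, epsilon=0.1):
    """Mix a one-hot target with a uniform distribution.

    The true class receives (1 - epsilon) + epsilon / K and every
    other class receives epsilon / K, where K is num_classes.
    """
    uniform = epsilon / num_classes
    return [(1.0 - epsilon) + uniform if k == label else uniform
            for k in range(num_classes)]

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(target, logits):
    """Cross-entropy between a target distribution and softmax(logits)."""
    probs = softmax(logits)
    return -sum(t * math.log(p) for t, p in zip(target, probs))

# Hypothetical example: 7 expression classes, true class index 2.
logits = [0.1, 0.2, 3.0, 0.0, -1.0, 0.5, 0.3]
hard_target = smooth_labels(label=2, num_classes=7, epsilon=0.0)
soft_target = smooth_labels(label=2, num_classes=7, epsilon=0.1)

hard_loss = cross_entropy(hard_target, logits)
soft_loss = cross_entropy(soft_target, logits)
```

With these inputs the smoothed loss is larger than the hard-label loss, because the smoothed target assigns some probability mass to the classes the model predicts as unlikely. That penalty on over-confident predictions is the regularizing effect of the technique.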
