Face recognition has emerged as one of the most prominent applications of image analysis and under-standing,gaining considerable attention in recent years.This growing interest is driven by two key factors:its extensi...Face recognition has emerged as one of the most prominent applications of image analysis and under-standing,gaining considerable attention in recent years.This growing interest is driven by two key factors:its extensive applications in law enforcement and the commercial domain,and the rapid advancement of practical technologies.Despite the significant advancements,modern recognition algorithms still struggle in real-world conditions such as varying lighting conditions,occlusion,and diverse facial postures.In such scenarios,human perception is still well above the capabilities of present technology.Using the systematic mapping study,this paper presents an in-depth review of face detection algorithms and face recognition algorithms,presenting a detailed survey of advancements made between 2015 and 2024.We analyze key methodologies,highlighting their strengths and restrictions in the application context.Additionally,we examine various datasets used for face detection/recognition datasets focusing on the task-specific applications,size,diversity,and complexity.By analyzing these algorithms and datasets,this survey works as a valuable resource for researchers,identifying the research gap in the field of face detection and recognition and outlining potential directions for future research.展开更多
为了解决传统人脸识别数据集构建中手工标注繁琐、低分辨率图像影响标注准确率等问题的问题,提出一种基于监控视频数据的特定人群人脸数据集自动化构建方法(constructing a facial dataset for a targeted group,CFD-TG)。该方法利用相...为了解决传统人脸识别数据集构建中手工标注繁琐、低分辨率图像影响标注准确率等问题的问题,提出一种基于监控视频数据的特定人群人脸数据集自动化构建方法(constructing a facial dataset for a targeted group,CFD-TG)。该方法利用相邻帧的人脸偏移量和相似度进行分组,并融合标准库进行分组标注和数据增强。实验结果表明,该方法所构数据集的调整兰德系数(ARI)与标准化互信息(NMI)比使用人脸聚类方法分别高出0.189、0.08;并将其在人脸识别模型FaceNet、ArcFace与AdaFace上进行了验证,基于特定人群人脸数据集的微调模型相较与原预训练模型识别准确率分别提升了0.4431、0.5912、0.1288。展开更多
An exhaustive study has been conducted on face videos from YouTube video dataset for real time face recognition using the features from deep learning architectures and also the information set features. Our objective ...An exhaustive study has been conducted on face videos from YouTube video dataset for real time face recognition using the features from deep learning architectures and also the information set features. Our objective is to cash in on a plethora of deep learning architectures and information set features. The deep learning architectures dig in features from several layers of convolution and max-pooling layers though a placement of these layers is architecture dependent. On the other hand, the information set features depend on the entropy function for the generation of features. A comparative study of deep learning and information set features is made using the well-known classifiers in addition to developing Constrained Hanman Transform (CHT) and Weighted Hanman Transform (WHT) classifiers. It is demonstrated that information set features and deep learning features have comparable performance. However, sigmoid-based information set features using the new classifiers are found to outperform MobileNet features.展开更多
针对目前人工识别羊个体疼痛过程中存在的经验要求高、识别准确率低、消耗成本高、延误疾病治疗等问题,引入当前主流图像分类网络VGGNet(Visual geometry group network)对有疼痛和无疼痛的羊脸表情进行识别,提出一种基于改进VGGNet的...针对目前人工识别羊个体疼痛过程中存在的经验要求高、识别准确率低、消耗成本高、延误疾病治疗等问题,引入当前主流图像分类网络VGGNet(Visual geometry group network)对有疼痛和无疼痛的羊脸表情进行识别,提出一种基于改进VGGNet的羊脸痛苦表情识别算法,改进后的网络为STVGGNet(Spatial transformer visual geometry group network)。该算法将空间变换网络引入VGGNet,通过空间变换网络增强对羊脸痛苦表情特征区域的关注程度,提高对羊脸痛苦表情的识别准确率。本文对原有的羊脸表情数据集进行了扩充,新增887幅羊脸表情图像。但是新的数据集图像数量仍然较少,所以本文利用ImageNet数据集进行迁移学习,微调后用来自动分类有痛苦和无痛苦的羊脸表情。对羊面部表情数据集的实验结果表明,使用STVGGNet实现的最佳训练准确率为99.95%,最佳验证准确率为96.06%,分别比VGGNet高0.15、0.99个百分点。因此,本文采用的模型在羊脸痛苦表情识别中有非常好的识别效果并且具有较强的鲁棒性,为畜牧业中羊的疾病检测智能化发展提供了技术支撑。展开更多
文摘Face recognition has emerged as one of the most prominent applications of image analysis and under-standing,gaining considerable attention in recent years.This growing interest is driven by two key factors:its extensive applications in law enforcement and the commercial domain,and the rapid advancement of practical technologies.Despite the significant advancements,modern recognition algorithms still struggle in real-world conditions such as varying lighting conditions,occlusion,and diverse facial postures.In such scenarios,human perception is still well above the capabilities of present technology.Using the systematic mapping study,this paper presents an in-depth review of face detection algorithms and face recognition algorithms,presenting a detailed survey of advancements made between 2015 and 2024.We analyze key methodologies,highlighting their strengths and restrictions in the application context.Additionally,we examine various datasets used for face detection/recognition datasets focusing on the task-specific applications,size,diversity,and complexity.By analyzing these algorithms and datasets,this survey works as a valuable resource for researchers,identifying the research gap in the field of face detection and recognition and outlining potential directions for future research.
文摘为了解决传统人脸识别数据集构建中手工标注繁琐、低分辨率图像影响标注准确率等问题的问题,提出一种基于监控视频数据的特定人群人脸数据集自动化构建方法(constructing a facial dataset for a targeted group,CFD-TG)。该方法利用相邻帧的人脸偏移量和相似度进行分组,并融合标准库进行分组标注和数据增强。实验结果表明,该方法所构数据集的调整兰德系数(ARI)与标准化互信息(NMI)比使用人脸聚类方法分别高出0.189、0.08;并将其在人脸识别模型FaceNet、ArcFace与AdaFace上进行了验证,基于特定人群人脸数据集的微调模型相较与原预训练模型识别准确率分别提升了0.4431、0.5912、0.1288。
文摘An exhaustive study has been conducted on face videos from YouTube video dataset for real time face recognition using the features from deep learning architectures and also the information set features. Our objective is to cash in on a plethora of deep learning architectures and information set features. The deep learning architectures dig in features from several layers of convolution and max-pooling layers though a placement of these layers is architecture dependent. On the other hand, the information set features depend on the entropy function for the generation of features. A comparative study of deep learning and information set features is made using the well-known classifiers in addition to developing Constrained Hanman Transform (CHT) and Weighted Hanman Transform (WHT) classifiers. It is demonstrated that information set features and deep learning features have comparable performance. However, sigmoid-based information set features using the new classifiers are found to outperform MobileNet features.
文摘针对目前人工识别羊个体疼痛过程中存在的经验要求高、识别准确率低、消耗成本高、延误疾病治疗等问题,引入当前主流图像分类网络VGGNet(Visual geometry group network)对有疼痛和无疼痛的羊脸表情进行识别,提出一种基于改进VGGNet的羊脸痛苦表情识别算法,改进后的网络为STVGGNet(Spatial transformer visual geometry group network)。该算法将空间变换网络引入VGGNet,通过空间变换网络增强对羊脸痛苦表情特征区域的关注程度,提高对羊脸痛苦表情的识别准确率。本文对原有的羊脸表情数据集进行了扩充,新增887幅羊脸表情图像。但是新的数据集图像数量仍然较少,所以本文利用ImageNet数据集进行迁移学习,微调后用来自动分类有痛苦和无痛苦的羊脸表情。对羊面部表情数据集的实验结果表明,使用STVGGNet实现的最佳训练准确率为99.95%,最佳验证准确率为96.06%,分别比VGGNet高0.15、0.99个百分点。因此,本文采用的模型在羊脸痛苦表情识别中有非常好的识别效果并且具有较强的鲁棒性,为畜牧业中羊的疾病检测智能化发展提供了技术支撑。