Background: Whole-slide images (WSIs) are foundational for artificial intelligence in tumor diagnosis, treatment planning, and prognosis prediction. Efficient management of WSI labels is crucial for clinical digitalization; however, manual or semiautomatic methods limit scalability. Enhancing automatic pathological label recognition is critical to advancing digital pathology, improving efficiency, and driving precision oncology.
Methods: We developed AutoLDP, a method for automatic labeling of digital pathology, which identifies the textual information used for labeling slides. The method includes four steps: identifying text position using the CRAFT model, recognizing text content using the ParSeq model, identifying slice type using the ConvNext classifier, and combining the relevant information to generate a new name. The naming format is divided into four parts: pathology ID, wax block ID, staining type, and slice type. We validated the method on two validation sets, using accuracy and processing time as the metrics.
Results: The AutoLDP system was 20 times faster than manual labeling. With data stored on solid-state drives, the CRAFT + ParSeq combination processed the most files per minute of all methods: 136.95 in validation set 1 and 170.95 in validation set 2. We compared the proposed model with several commonly used text detection and recognition models, including ABinet, CRNN, TRBA, and Vitstr. We achieved an accuracy of 97.60% in just 87.62 s on validation set 1 (200 cases), which was significantly better than that of the other models. In addition, the accuracy reached 96.98% on validation set 2 (13,667 cases), confirming the generalization ability of the model.
Conclusion: In this study, we proposed AutoLDP, a new model that automates the extraction and recognition of key information from WSIs, enabling standardized naming and significantly improving labeling efficiency. This innovation supports the digital transformation of pathology and advances precision medicine.
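The four steps and the four-part naming format described in the Methods can be sketched as follows. This is only an illustrative outline, not the authors' implementation: the class and function names, the underscore separator, the example field values, and the callables standing in for the CRAFT, ParSeq, and ConvNext stages are all assumptions, since the abstract does not give the code or the exact naming syntax.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Sequence, Tuple


# All names here are hypothetical; the abstract does not publish an API.
@dataclass(frozen=True)
class SlideLabel:
    pathology_id: str
    wax_block_id: str
    staining_type: str
    slice_type: str

    def file_name(self, sep: str = "_") -> str:
        # Field order follows the abstract; the separator is an assumption.
        parts = [self.pathology_id, self.wax_block_id,
                 self.staining_type, self.slice_type]
        return sep.join(p.strip() for p in parts)


def label_slide(
    image: Any,
    detect: Callable[[Any], Sequence[Any]],          # stand-in for text detection
    recognize: Callable[[Any, Any], str],            # stand-in for text recognition
    classify: Callable[[Any], str],                  # stand-in for slice-type classifier
    parse: Callable[[List[str]], Tuple[str, str, str]],  # assumed text-to-field rule
) -> SlideLabel:
    regions = detect(image)                          # step 1: locate text
    texts = [recognize(image, r) for r in regions]   # step 2: read text
    slice_type = classify(image)                     # step 3: slice type
    pathology_id, wax_block_id, staining_type = parse(texts)
    # step 4: combine the fields into one standardized label
    return SlideLabel(pathology_id, wax_block_id, staining_type, slice_type)


if __name__ == "__main__":
    # Stub stages with made-up values, only to show the data flow.
    fake_texts = ["P0001", "A1", "HE"]
    label = label_slide(
        image=None,
        detect=lambda img: [0, 1, 2],
        recognize=lambda img, region: fake_texts[region],
        classify=lambda img: "FS",
        parse=lambda texts: (texts[0], texts[1], texts[2]),
    )
    print(label.file_name())
```

Passing the stages in as callables keeps the sketch independent of any particular model library; real detection, recognition, and classification models would be wrapped to match these signatures.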
Funding: Supported by the National Natural Science Foundation of China (Nos. 82202267, 82202095, 82372042, 82272084); the Regional Innovation and Development Joint Fund of the National Natural Science Foundation of China (Nos. U22A20345, U23A20478); the National Science Fund for Distinguished Young Scholars of China (No. 81925023); the Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application (No. 2022B1212010011); the Guangxi Natural Science Foundation (No. 2024GXNSFFA010014); the Science and Technology Projects in Guangzhou (No. 2024A04J4977); the Guangdong Basic and Applied Basic Research Foundation (No. 2023A1515011339); and the High-level Hospital Construction Project (No. DFJHBF202105).