Microstructural classification is typically done manually by human experts,which gives rise to uncertainties due to subjectivity and reduces the overall efficiency.A high-throughput characterization is proposed based ...Microstructural classification is typically done manually by human experts,which gives rise to uncertainties due to subjectivity and reduces the overall efficiency.A high-throughput characterization is proposed based on deep learning,rapid acquisition technology,and mathematical statistics for the recognition,segmentation,and quantification of microstructure in weathering steel.The segmentation results showed that this method was accurate and efficient,and the segmentation of inclusions and pearlite phase achieved accuracy of 89.95%and 90.86%,respectively.The time required for batch processing by MIPAR software involving thresholding segmentation,morphological processing,and small area deletion was 1.05 s for a single image.By comparison,our system required only 0.102 s,which is ten times faster than the commercial software.The quantification results were extracted from large volumes of sequential image data(150 mm^(2),62,216 images,1024×1024 pixels),which ensure comprehensive statistics.Microstructure information,such as three-dimensional density distribution and the frequency of the minimum spatial distance of inclusions on the sample surface of 150 mm^(2),were quantified by extracting the coordinates and sizes of individual features.A refined characterization method for two-dimensional structures and spatial information that is unattainable when performing manually or with software is provided.That will be useful for understanding properties or behaviors of weathering steel,and reducing the resort to physical testing.展开更多
Based on successive multiple-step isothermal crystallization and self-nucleation annealing methods, a novel semi-quantitative method for the characterization of segment distribution in linear low density polyethylene ...Based on successive multiple-step isothermal crystallization and self-nucleation annealing methods, a novel semi-quantitative method for the characterization of segment distribution in linear low density polyethylene (LLDPE) was established by treating the thermal analysis data using the Gibbs-Thomson equation. The method was used to describe the segment distribution of Ziegler-Natta catalyzed LLDPE (Z-N LLDPE), metallocene catalyzed LLDPE (m-LLDPE) and two commercial LLDPEs with wide molecular weight distribution. The differences of the results obtained from the two thermally treated samples were compared. The results of segment distribution of the polymers were discussed according to their microstructure data and were compared with their characteristics. It can be deduced from the results that this characterization method is effective to characterize the sequence structure of the branched ethylene copolymers.展开更多
Gliomas are the most aggressive brain tumors caused by the abnormal growth of brain tissues.The life expectancy of patients diagnosed with gliomas decreases exponentially.Most gliomas are diagnosed in later stages,res...Gliomas are the most aggressive brain tumors caused by the abnormal growth of brain tissues.The life expectancy of patients diagnosed with gliomas decreases exponentially.Most gliomas are diagnosed in later stages,resulting in imminent death.On average,patients do not survive 14 months after diagnosis.The only way to minimize the impact of this inevitable disease is through early diagnosis.The Magnetic Resonance Imaging(MRI)scans,because of their better tissue contrast,are most frequently used to assess the brain tissues.The manual classification of MRI scans takes a reasonable amount of time to classify brain tumors.Besides this,dealing with MRI scans manually is also cumbersome,thus affects the classification accuracy.To eradicate this problem,researchers have come up with automatic and semiautomatic methods that help in the automation of brain tumor classification task.Although,many techniques have been devised to address this issue,the existing methods still struggle to characterize the enhancing region.This is because of low variance in enhancing region which give poor contrast in MRI scans.In this study,we propose a novel deep learning based method consisting of a series of steps,namely:data pre-processing,patch extraction,patch pre-processing,and a deep learning model with tuned hyper-parameters to classify all types of gliomas with a focus on enhancing region.Our trained model achieved better results for all glioma classes including the enhancing region.The improved performance of our technique can be attributed to several factors.Firstly,the non-local mean filter in the pre-processing step,improved the image detail while removing irrelevant noise.Secondly,the architecture we employ can capture the non-linearity of all classes including the enhancing region.Overall,the segmentation scores achieved on the Dice Similarity Coefficient(DSC)metric for normal,necrosis,edema,enhancing and non-enhancing tumor classes are 0.95,0.97,0.91,0.93,0.95;respectively.展开更多
Vehicle license plate (VLP) character segmentation is an important part of the vehicle license plate recognition system (VLPRS).This paper proposes a least square method (LSM) to treat horizontal tilt and vertical til...Vehicle license plate (VLP) character segmentation is an important part of the vehicle license plate recognition system (VLPRS).This paper proposes a least square method (LSM) to treat horizontal tilt and vertical tilt in VLP images.Auxiliary lines are added into the image (or the tilt-corrected image) to make the separated parts of each Chinese character to be an interconnected region.The noise regions will be eliminated after two fusing images are merged according to the minimum principle of gray values. Then,the characters are segmented by projection method (PM) and the final character images are obtained.The experimental results show that this method features fast processing and good performance in segmentation.展开更多
An image segmentation algorithm of the restrained fuzzy Kohonen clustering network (RFKCN) based on high- dimension fuzzy character is proposed. The algorithm includes two steps. The first step is the fuzzification ...An image segmentation algorithm of the restrained fuzzy Kohonen clustering network (RFKCN) based on high- dimension fuzzy character is proposed. The algorithm includes two steps. The first step is the fuzzification of pixels in which two redundant images are built by fuzzy mean value and fuzzy median value. The second step is to construct a three-dimensional (3-D) feature vector of redundant images and their original images and cluster the feature vector through RFKCN, to realize image seg- mentation. The proposed algorithm fully takes into account not only gray distribution information of pixels, but also relevant information and fuzzy information among neighboring pixels in constructing 3- D character space. Based on the combination of competitiveness, redundancy and complementary of the information, the proposed algorithm improves the accuracy of clustering. Theoretical anal- yses and experimental results demonstrate that the proposed algorithm has a good segmentation performance.展开更多
In this paper, a kind of practical image segmentation algorithm for segment characters from car license plate is presented, based on morphology and labeling. First by morphological operation, noise in the binary image...In this paper, a kind of practical image segmentation algorithm for segment characters from car license plate is presented, based on morphology and labeling. First by morphological operation, noise in the binary image of license plate can be greatly decreased. Then, by labeling, each connected pixel component is given a unique label. Finally, by the known data of license plate, each character is extracted correctly. The advantage of this method is that it can deal with plates with different sizes and connected characters plates, and inclined plates. The experiment results show that it is an effective way to extract characters from the license plate, and can be put into practical use.展开更多
This paper presents a methodology for off-line handwritten Chinese character recognition based on mergence of consecutive segments of adaptive duration. The handwritten Chinese character string is partitioned into a s...This paper presents a methodology for off-line handwritten Chinese character recognition based on mergence of consecutive segments of adaptive duration. The handwritten Chinese character string is partitioned into a sequence of consecutive segments, which are combined to implement dissimilarity evaluation within a sliding window whose durations are determined adaptively by the integration of shapes and context of evaluations. The average stroke width is estimated for the handwritten Chinese character string, and a set of candidate character segmentation boundaries is found by using the integration of pixel and stroke features. The final decisions on segmentation and recognition are made under minimal arithmetical mean dissimilarities. Experiments proved that the proposed approach of adaptive duration outperforms the method of fixed duration, and is very effective for the recognition of overlapped, broken, touched, loosely configured Chinese characters.展开更多
A new approach to extract and segment characters in natural scenes was proposed in this paper. First, a set of intrinsic features were calculated based on connected components (CCs) extracted by a non-linear Nilblack ...A new approach to extract and segment characters in natural scenes was proposed in this paper. First, a set of intrinsic features were calculated based on connected components (CCs) extracted by a non-linear Nilblack algorithm. Then, feature propagation was conducted for feature enhancement, under the constraint of the layout relations. Next, candidate CCs were fed into classifiers with the enhanced feature vector. At last, a model-based hierarchical merging (MHM) procedure was presented to obtain understandable characters. The proposed merging algorithm utilized the constraint of text lines for specific languages and dynamically merges CCs into characters. The whole algorithm was evaluated at both pixel level and character level, experimental results showed that the proposed method is effective in detecting scene characters with significant geometric variations, uneven illumination, extremely low contrast and cluttered background.展开更多
The stroke segments:' are proposed to be used as the basic features for handwritten Chinese character recognition. In this way, it is possible to overcome the difFiculties of unstable stroke information caused by ...The stroke segments:' are proposed to be used as the basic features for handwritten Chinese character recognition. In this way, it is possible to overcome the difFiculties of unstable stroke information caused by stroke Joinings. The techniques of data pre-processing and stroke segment extraction have been described. In extracting stroke segment, not only the characteristics of the stroke itself, but also its absolute positions as well as relative positions with other strokes in the character have been taken into account.The primitive features for recognition were extracted under these comprehensive considerations.展开更多
This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The go...This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The goal is to contribute to the preservation and understanding of historical texts,showcasing the potential of modern deep learning methods in archaeological research.Our research culminates in several key findings and scientific contributions.We comprehensively compare the performance of YOLOv8 and Roboflow 3.0 in the context of Palmyrene character segmentation—this comparative analysis mainly focuses on the strengths and weaknesses of each algorithm in this context.We also created and annotated an extensive dataset of Palmyrene inscriptions,a crucial resource for further research in the field.The dataset serves for training and evaluating the segmentation models.We employ comparative evaluation metrics to quantitatively assess the segmentation results,ensuring the reliability and reproducibility of our findings and we present custom visualization tools for predicted segmentation masks.Our study advances the state of the art in semi-automatic reading of Palmyrene inscriptions and establishes a benchmark for future research.The availability of the Palmyrene dataset and the insights into algorithm performance contribute to the broader understanding of historical text analysis.展开更多
The fatigue resistance of casting polyurethane(CPU)is crucial in various sectors,such as construction,healthcare,and the automotive industry.Despite its importance,no studies have reported on the fatigue threshold of ...The fatigue resistance of casting polyurethane(CPU)is crucial in various sectors,such as construction,healthcare,and the automotive industry.Despite its importance,no studies have reported on the fatigue threshold of CPU.This study employed an advanced Intrinsic Strength Analyzer(ISA)to evaluate the fatigue threshold of CPUs,systematically exploring the effects of three types of isocyanates(PPDI,NDI,TDI)that contribute to hard segment structures based on the cutting method.Employing multiple advanced characterization techniques(XRD,TEM,DSC,AFM),the results indicate that PPDI-based polyurethane exhibits the highest fatigue threshold(182.89 J/m^(2))due to a highest phase separation and a densely packed spherulitic structure,although the hydrogen bonding degree is the lowest(48.3%).Conversely,NDI-based polyurethane,despite having the high hydrogen bonding degree(53.6%),exhibits moderate fatigue performance(122.52 J/m^(2)),likely due to a more scattered microstructure.TDI-based polyurethane,with the highest hydrogen bonding degree(59.1%)but absence of spherulitic structure,shows the lowest fatigue threshold(46.43 J/m^(2)).Compared to common rubbers(NR,NBR,EPDM,BR),the superior fatigue performance of CPU is attributed to its well-organized microstructure,polyurethane possesses a higher fatigue threshold due to its high phase separation degree and orderly and dense spherulitic structure which enhances energy dissipation and reduces crack propagation.展开更多
A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chin...A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chinese character learning model uses the semanties of loeal context and global context to learn the representation of Chinese characters. Then, Chinese word segmentation model is built by a neural network, while the segmentation model is trained with the eharaeter representations as its input features. Finally, experimental results show that Chinese charaeter representations can effectively learn the semantic information. Characters with similar semantics cluster together in the visualize space. Moreover, the proposed Chinese word segmentation model also achieves a pretty good improvement on precision, recall and f-measure.展开更多
The purpose of this paper is to propose a new multi stage algorithm for the recognition of isolated characters. It was similar work done before using only the center of gravity (This paper is extended version of “A f...The purpose of this paper is to propose a new multi stage algorithm for the recognition of isolated characters. It was similar work done before using only the center of gravity (This paper is extended version of “A fast recognition system for isolated printed characters using center of gravity”, LAP LAMBERT Academic Publishing 2011, ISBN: 978-38465-0002-6), but here we add using principal axis in order to make the algorithm rotation invariant. In my previous work which is published in LAP LAMBERT, I face a big problem that when the character is rotated I can’t recognize the character. So this adds constrain on the document to be well oriented but here I use the principal axis in order to unify the orientation of the character set and the characters in the scanned document. The algorithm can be applied for any isolated character such as Latin, Chinese, Japanese, and Arabic characters but it has been applied in this paper for Arabic characters. The approach uses normalized and isolated characters of the same size and extracts an image signature based on the center of gravity of the character after making the character principal axis vertical, and then the system compares these values to a set of signatures for typical characters of the set. The system then provides the closeness of match to all other characters in the set.展开更多
ESA is an unsupervised approach to word segmentation previously proposed by Wang, which is an iterative process consisting of three phases: Evaluation, Selection and Adjustment. In this article, we propose Ex ESA, the...ESA is an unsupervised approach to word segmentation previously proposed by Wang, which is an iterative process consisting of three phases: Evaluation, Selection and Adjustment. In this article, we propose Ex ESA, the extension of ESA. In Ex ESA, the original approach is extended to a 2-pass process and the ratio of different word lengths is introduced as the third type of information combined with cohesion and separation. A maximum strategy is adopted to determine the best segmentation of a character sequence in the phrase of Selection. Besides, in Adjustment, Ex ESA re-evaluates separation information and individual information to overcome the overestimation frequencies. Additionally, a smoothing algorithm is applied to alleviate sparseness. The experiment results show that Ex ESA can further improve the performance and is time-saving by properly utilizing more information from un-annotated corpora. Moreover, the parameters of Ex ESA can be predicted by a set of empirical formulae or combined with the minimum description length principle.展开更多
The segmentation of individual words into characters is a vital process in handwritten character recognition systems. In this paper, a novel approach is proposed to segment handwritten Arabic text (words). We consider...The segmentation of individual words into characters is a vital process in handwritten character recognition systems. In this paper, a novel approach is proposed to segment handwritten Arabic text (words). We consider the “Naskh” font style. The segmentation algorithm employs seven agents in order to detect regions where segmentation is illegal. Feature points (end points) are extracted from the remaining regions of the word-image. Initially, the middle of every two successive end points is considered as a candidate segmentation point based on a set of rules. The experimental results are very promising as we achieved a success rate of 86%.展开更多
Segmenting Arabic handwritings had been one of the subjects of research in the field of Arabic character recognition for more than 25 years. The majority of reported segmentation techniques share a critical shortcomin...Segmenting Arabic handwritings had been one of the subjects of research in the field of Arabic character recognition for more than 25 years. The majority of reported segmentation techniques share a critical shortcoming, which is over-segmentation. The aim of segmentation is to produce the letters (segments) of a handwritten word. When a resulting letter (segment) is made of more than one piece (stroke) instead of one, this is called over-segmentation. Our objective is to overcome this problem by using an Artificial Neural Networks (ANN) to verify the resulting segment. We propose a set of heuristic-based rules to assemble strokes in order to report the precise segmented letters. Preprocessing phases that include normalization and feature extraction are required as a prerequisite step for the ANN system for recognition and verification. In our previous work [1], we did achieve a segmentation success rate of 86% but without recognition. In this work, our experimental results confirmed a segmentation success rate of no less than 95%.展开更多
Due to the diversity of climate and environment in China,the frequent occurrence of extreme rainfall events has brought great challenges to flood prevention.Water level measurement is one of the important research top...Due to the diversity of climate and environment in China,the frequent occurrence of extreme rainfall events has brought great challenges to flood prevention.Water level measurement is one of the important research topics of flood prevention.Recently,the image‐based water level recognition method has become an important part of water level measurement research due to its advantages in easy installation,low cost,and zero need of manual reading.However,there are two mainly shortcomings of the existing imagebased water level recognition methods:(1)severely affected by light intensity and(2)low accuracy of water level recognition for stained water gauges.To solve these two problems,this paper proposes a water level recognition method in consideration of complex scenarios.This method first uses a semantic segmentation convolutional neural network to extract the water gauge mask,and then uses the YOLOv5 object detection network to extract the letter“E”on the water gauge.Based on the character sequence inspection strategy,the algorithm dynamically compensates for the missed detection of characters of stained water gauges.Through a large number of experiments,the proposed water level measurement method has good robustness in complex scenarios,meeting the needs of flash flood defense.展开更多
基金supported by the National Key Research and Development Program of China(No.2017YFB0702303).
文摘Microstructural classification is typically done manually by human experts,which gives rise to uncertainties due to subjectivity and reduces the overall efficiency.A high-throughput characterization is proposed based on deep learning,rapid acquisition technology,and mathematical statistics for the recognition,segmentation,and quantification of microstructure in weathering steel.The segmentation results showed that this method was accurate and efficient,and the segmentation of inclusions and pearlite phase achieved accuracy of 89.95%and 90.86%,respectively.The time required for batch processing by MIPAR software involving thresholding segmentation,morphological processing,and small area deletion was 1.05 s for a single image.By comparison,our system required only 0.102 s,which is ten times faster than the commercial software.The quantification results were extracted from large volumes of sequential image data(150 mm^(2),62,216 images,1024×1024 pixels),which ensure comprehensive statistics.Microstructure information,such as three-dimensional density distribution and the frequency of the minimum spatial distance of inclusions on the sample surface of 150 mm^(2),were quantified by extracting the coordinates and sizes of individual features.A refined characterization method for two-dimensional structures and spatial information that is unattainable when performing manually or with software is provided.That will be useful for understanding properties or behaviors of weathering steel,and reducing the resort to physical testing.
基金This work was supported by the Science Foundations of State Key Laboratory of Polymer Physics and Chemisny, Chinese Academy of Sciences (00-B-15) and National Natural Science Foundation of China (No. B040504).
文摘Based on successive multiple-step isothermal crystallization and self-nucleation annealing methods, a novel semi-quantitative method for the characterization of segment distribution in linear low density polyethylene (LLDPE) was established by treating the thermal analysis data using the Gibbs-Thomson equation. The method was used to describe the segment distribution of Ziegler-Natta catalyzed LLDPE (Z-N LLDPE), metallocene catalyzed LLDPE (m-LLDPE) and two commercial LLDPEs with wide molecular weight distribution. The differences of the results obtained from the two thermally treated samples were compared. The results of segment distribution of the polymers were discussed according to their microstructure data and were compared with their characteristics. It can be deduced from the results that this characterization method is effective to characterize the sequence structure of the branched ethylene copolymers.
文摘Gliomas are the most aggressive brain tumors caused by the abnormal growth of brain tissues.The life expectancy of patients diagnosed with gliomas decreases exponentially.Most gliomas are diagnosed in later stages,resulting in imminent death.On average,patients do not survive 14 months after diagnosis.The only way to minimize the impact of this inevitable disease is through early diagnosis.The Magnetic Resonance Imaging(MRI)scans,because of their better tissue contrast,are most frequently used to assess the brain tissues.The manual classification of MRI scans takes a reasonable amount of time to classify brain tumors.Besides this,dealing with MRI scans manually is also cumbersome,thus affects the classification accuracy.To eradicate this problem,researchers have come up with automatic and semiautomatic methods that help in the automation of brain tumor classification task.Although,many techniques have been devised to address this issue,the existing methods still struggle to characterize the enhancing region.This is because of low variance in enhancing region which give poor contrast in MRI scans.In this study,we propose a novel deep learning based method consisting of a series of steps,namely:data pre-processing,patch extraction,patch pre-processing,and a deep learning model with tuned hyper-parameters to classify all types of gliomas with a focus on enhancing region.Our trained model achieved better results for all glioma classes including the enhancing region.The improved performance of our technique can be attributed to several factors.Firstly,the non-local mean filter in the pre-processing step,improved the image detail while removing irrelevant noise.Secondly,the architecture we employ can capture the non-linearity of all classes including the enhancing region.Overall,the segmentation scores achieved on the Dice Similarity Coefficient(DSC)metric for normal,necrosis,edema,enhancing and non-enhancing tumor classes are 0.95,0.97,0.91,0.93,0.95;respectively.
基金Scientific Research Fund of Hunan Province,PRC (No.07JJ6141)Scientific Research Fund of Hunan Provincial Education Department,PRC (No.05C720).
文摘Vehicle license plate (VLP) character segmentation is an important part of the vehicle license plate recognition system (VLPRS).This paper proposes a least square method (LSM) to treat horizontal tilt and vertical tilt in VLP images.Auxiliary lines are added into the image (or the tilt-corrected image) to make the separated parts of each Chinese character to be an interconnected region.The noise regions will be eliminated after two fusing images are merged according to the minimum principle of gray values. Then,the characters are segmented by projection method (PM) and the final character images are obtained.The experimental results show that this method features fast processing and good performance in segmentation.
基金supported by the National Natural Science Foundation of China(61073106)the Aerospace Science and Technology Innovation Fund(CASC201105)
文摘An image segmentation algorithm of the restrained fuzzy Kohonen clustering network (RFKCN) based on high- dimension fuzzy character is proposed. The algorithm includes two steps. The first step is the fuzzification of pixels in which two redundant images are built by fuzzy mean value and fuzzy median value. The second step is to construct a three-dimensional (3-D) feature vector of redundant images and their original images and cluster the feature vector through RFKCN, to realize image seg- mentation. The proposed algorithm fully takes into account not only gray distribution information of pixels, but also relevant information and fuzzy information among neighboring pixels in constructing 3- D character space. Based on the combination of competitiveness, redundancy and complementary of the information, the proposed algorithm improves the accuracy of clustering. Theoretical anal- yses and experimental results demonstrate that the proposed algorithm has a good segmentation performance.
文摘In this paper, a kind of practical image segmentation algorithm for segment characters from car license plate is presented, based on morphology and labeling. First by morphological operation, noise in the binary image of license plate can be greatly decreased. Then, by labeling, each connected pixel component is given a unique label. Finally, by the known data of license plate, each character is extracted correctly. The advantage of this method is that it can deal with plates with different sizes and connected characters plates, and inclined plates. The experiment results show that it is an effective way to extract characters from the license plate, and can be put into practical use.
文摘This paper presents a methodology for off-line handwritten Chinese character recognition based on mergence of consecutive segments of adaptive duration. The handwritten Chinese character string is partitioned into a sequence of consecutive segments, which are combined to implement dissimilarity evaluation within a sliding window whose durations are determined adaptively by the integration of shapes and context of evaluations. The average stroke width is estimated for the handwritten Chinese character string, and a set of candidate character segmentation boundaries is found by using the integration of pixel and stroke features. The final decisions on segmentation and recognition are made under minimal arithmetical mean dissimilarities. Experiments proved that the proposed approach of adaptive duration outperforms the method of fixed duration, and is very effective for the recognition of overlapped, broken, touched, loosely configured Chinese characters.
文摘A new approach to extract and segment characters in natural scenes was proposed in this paper. First, a set of intrinsic features were calculated based on connected components (CCs) extracted by a non-linear Nilblack algorithm. Then, feature propagation was conducted for feature enhancement, under the constraint of the layout relations. Next, candidate CCs were fed into classifiers with the enhanced feature vector. At last, a model-based hierarchical merging (MHM) procedure was presented to obtain understandable characters. The proposed merging algorithm utilized the constraint of text lines for specific languages and dynamically merges CCs into characters. The whole algorithm was evaluated at both pixel level and character level, experimental results showed that the proposed method is effective in detecting scene characters with significant geometric variations, uneven illumination, extremely low contrast and cluttered background.
文摘The stroke segments:' are proposed to be used as the basic features for handwritten Chinese character recognition. In this way, it is possible to overcome the difFiculties of unstable stroke information caused by stroke Joinings. The techniques of data pre-processing and stroke segment extraction have been described. In extracting stroke segment, not only the characteristics of the stroke itself, but also its absolute positions as well as relative positions with other strokes in the character have been taken into account.The primitive features for recognition were extracted under these comprehensive considerations.
基金The results and knowledge included herein have been obtained owing to support from the following institutional grant.Internal grant agency of the Faculty of Economics and Management,Czech University of Life Sciences Prague,Grant No.2023A0004-“Text Segmentation Methods of Historical Alphabets in OCR Development”.https://iga.pef.czu.cz/.Funds were granted to T.Novák,A.Hamplová,O.Svojše,and A.Veselýfrom the author team.
文摘This study presents a single-class and multi-class instance segmentation approach applied to ancient Palmyrene inscriptions,employing two state-of-the-art deep learning algorithms,namely YOLOv8 and Roboflow 3.0.The goal is to contribute to the preservation and understanding of historical texts,showcasing the potential of modern deep learning methods in archaeological research.Our research culminates in several key findings and scientific contributions.We comprehensively compare the performance of YOLOv8 and Roboflow 3.0 in the context of Palmyrene character segmentation—this comparative analysis mainly focuses on the strengths and weaknesses of each algorithm in this context.We also created and annotated an extensive dataset of Palmyrene inscriptions,a crucial resource for further research in the field.The dataset serves for training and evaluating the segmentation models.We employ comparative evaluation metrics to quantitatively assess the segmentation results,ensuring the reliability and reproducibility of our findings and we present custom visualization tools for predicted segmentation masks.Our study advances the state of the art in semi-automatic reading of Palmyrene inscriptions and establishes a benchmark for future research.The availability of the Palmyrene dataset and the insights into algorithm performance contribute to the broader understanding of historical text analysis.
基金financially supported by the National Natural Science Foundation of China(No.52473228).
文摘The fatigue resistance of casting polyurethane(CPU)is crucial in various sectors,such as construction,healthcare,and the automotive industry.Despite its importance,no studies have reported on the fatigue threshold of CPU.This study employed an advanced Intrinsic Strength Analyzer(ISA)to evaluate the fatigue threshold of CPUs,systematically exploring the effects of three types of isocyanates(PPDI,NDI,TDI)that contribute to hard segment structures based on the cutting method.Employing multiple advanced characterization techniques(XRD,TEM,DSC,AFM),the results indicate that PPDI-based polyurethane exhibits the highest fatigue threshold(182.89 J/m^(2))due to a highest phase separation and a densely packed spherulitic structure,although the hydrogen bonding degree is the lowest(48.3%).Conversely,NDI-based polyurethane,despite having the high hydrogen bonding degree(53.6%),exhibits moderate fatigue performance(122.52 J/m^(2)),likely due to a more scattered microstructure.TDI-based polyurethane,with the highest hydrogen bonding degree(59.1%)but absence of spherulitic structure,shows the lowest fatigue threshold(46.43 J/m^(2)).Compared to common rubbers(NR,NBR,EPDM,BR),the superior fatigue performance of CPU is attributed to its well-organized microstructure,polyurethane possesses a higher fatigue threshold due to its high phase separation degree and orderly and dense spherulitic structure which enhances energy dissipation and reduces crack propagation.
基金Supported by the National Natural Science Foundation of China(No.61303179,U1135005,61175020)
文摘A local and global context representation learning model for Chinese characters is designed and a Chinese word segmentation method based on character representations is proposed in this paper. First, the proposed Chinese character learning model uses the semanties of loeal context and global context to learn the representation of Chinese characters. Then, Chinese word segmentation model is built by a neural network, while the segmentation model is trained with the eharaeter representations as its input features. Finally, experimental results show that Chinese charaeter representations can effectively learn the semantic information. Characters with similar semantics cluster together in the visualize space. Moreover, the proposed Chinese word segmentation model also achieves a pretty good improvement on precision, recall and f-measure.
文摘The purpose of this paper is to propose a new multi stage algorithm for the recognition of isolated characters. It was similar work done before using only the center of gravity (This paper is extended version of “A fast recognition system for isolated printed characters using center of gravity”, LAP LAMBERT Academic Publishing 2011, ISBN: 978-38465-0002-6), but here we add using principal axis in order to make the algorithm rotation invariant. In my previous work which is published in LAP LAMBERT, I face a big problem that when the character is rotated I can’t recognize the character. So this adds constrain on the document to be well oriented but here I use the principal axis in order to unify the orientation of the character set and the characters in the scanned document. The algorithm can be applied for any isolated character such as Latin, Chinese, Japanese, and Arabic characters but it has been applied in this paper for Arabic characters. The approach uses normalized and isolated characters of the same size and extracts an image signature based on the center of gravity of the character after making the character principal axis vertical, and then the system compares these values to a set of signatures for typical characters of the set. The system then provides the closeness of match to all other characters in the set.
基金supported in part by National Science Foundation of China under Grants No. 61303105 and 61402304the Humanity & Social Science general project of Ministry of Education under Grants No.14YJAZH046+2 种基金the Beijing Natural Science Foundation under Grants No. 4154065the Beijing Educational Committee Science and Technology Development Planned under Grants No.KM201410028017Beijing Key Disciplines of Computer Application Technology
文摘ESA is an unsupervised approach to word segmentation previously proposed by Wang, which is an iterative process consisting of three phases: Evaluation, Selection and Adjustment. In this article, we propose Ex ESA, the extension of ESA. In Ex ESA, the original approach is extended to a 2-pass process and the ratio of different word lengths is introduced as the third type of information combined with cohesion and separation. A maximum strategy is adopted to determine the best segmentation of a character sequence in the phrase of Selection. Besides, in Adjustment, Ex ESA re-evaluates separation information and individual information to overcome the overestimation frequencies. Additionally, a smoothing algorithm is applied to alleviate sparseness. The experiment results show that Ex ESA can further improve the performance and is time-saving by properly utilizing more information from un-annotated corpora. Moreover, the parameters of Ex ESA can be predicted by a set of empirical formulae or combined with the minimum description length principle.
文摘The segmentation of individual words into characters is a vital process in handwritten character recognition systems. In this paper, a novel approach is proposed to segment handwritten Arabic text (words). We consider the “Naskh” font style. The segmentation algorithm employs seven agents in order to detect regions where segmentation is illegal. Feature points (end points) are extracted from the remaining regions of the word-image. Initially, the middle of every two successive end points is considered as a candidate segmentation point based on a set of rules. The experimental results are very promising as we achieved a success rate of 86%.
文摘Segmenting Arabic handwritings had been one of the subjects of research in the field of Arabic character recognition for more than 25 years. The majority of reported segmentation techniques share a critical shortcoming, which is over-segmentation. The aim of segmentation is to produce the letters (segments) of a handwritten word. When a resulting letter (segment) is made of more than one piece (stroke) instead of one, this is called over-segmentation. Our objective is to overcome this problem by using an Artificial Neural Networks (ANN) to verify the resulting segment. We propose a set of heuristic-based rules to assemble strokes in order to report the precise segmented letters. Preprocessing phases that include normalization and feature extraction are required as a prerequisite step for the ANN system for recognition and verification. In our previous work [1], we did achieve a segmentation success rate of 86% but without recognition. In this work, our experimental results confirmed a segmentation success rate of no less than 95%.
基金National Key Research and Development Program of China,Grant/Award Number:2023YFC3006700。
文摘Due to the diversity of climate and environment in China,the frequent occurrence of extreme rainfall events has brought great challenges to flood prevention.Water level measurement is one of the important research topics of flood prevention.Recently,the image‐based water level recognition method has become an important part of water level measurement research due to its advantages in easy installation,low cost,and zero need of manual reading.However,there are two mainly shortcomings of the existing imagebased water level recognition methods:(1)severely affected by light intensity and(2)low accuracy of water level recognition for stained water gauges.To solve these two problems,this paper proposes a water level recognition method in consideration of complex scenarios.This method first uses a semantic segmentation convolutional neural network to extract the water gauge mask,and then uses the YOLOv5 object detection network to extract the letter“E”on the water gauge.Based on the character sequence inspection strategy,the algorithm dynamically compensates for the missed detection of characters of stained water gauges.Through a large number of experiments,the proposed water level measurement method has good robustness in complex scenarios,meeting the needs of flash flood defense.