National Population Health Data Center(NPHDC)is one of China's 20 national-level science data centers,jointly designated by the Ministry of Science and Technology and the Ministry of Finance.Operated by the Chines...National Population Health Data Center(NPHDC)is one of China's 20 national-level science data centers,jointly designated by the Ministry of Science and Technology and the Ministry of Finance.Operated by the Chinese Academy of Medical Sciences under the oversight of the National Health Commission,NPHDC adheres to national regulations including the Scientific Data Management Measures and the National Science and Technology Infrastructure Service Platform Management Measures,and is committed to collecting,integrating,managing,and sharing biomedical and health data through openaccess platform,fostering open sharing and engaging in international cooperation.展开更多
The recent pandemic crisis has highlighted the importance of the availability and management of health data to respond quickly and effectively to health emergencies, while respecting the fundamental rights of every in...The recent pandemic crisis has highlighted the importance of the availability and management of health data to respond quickly and effectively to health emergencies, while respecting the fundamental rights of every individual. In this context, it is essential to find a balance between the protection of privacy and the safeguarding of public health, using tools that guarantee transparency and consent to the processing of data by the population. This work, starting from a pilot investigation conducted in the Polyclinic of Bari as part of the Horizon Europe Seeds project entitled “Multidisciplinary analysis of technological tracing models of contagion: the protection of rights in the management of health data”, has the objective of promoting greater patient awareness regarding the processing of their health data and the protection of privacy. The methodology used the PHICAT (Personal Health Information Competence Assessment Tool) as a tool and, through the administration of a questionnaire, the aim was to evaluate the patients’ ability to express their consent to the release and processing of health data. The results that emerged were analyzed in relation to the 4 domains in which the process is divided which allows evaluating the patients’ ability to express a conscious choice and, also, in relation to the socio-demographic and clinical characteristics of the patients themselves. This study can contribute to understanding patients’ ability to give their consent and improve information regarding the management of health data by increasing confidence in granting the use of their data for research and clinical management.展开更多
To achieve the high availability of health data in erasure-coded cloud storage systems,the data update performance in erasure coding should be continuously optimized.However,the data update performance is often bottle...To achieve the high availability of health data in erasure-coded cloud storage systems,the data update performance in erasure coding should be continuously optimized.However,the data update performance is often bottlenecked by the constrained cross-rack bandwidth.Various techniques have been proposed in the literature to improve network bandwidth efficiency,including delta transmission,relay,and batch update.These techniques were largely proposed individually previously,and in this work,we seek to use them jointly.To mitigate the cross-rack update traffic,we propose DXR-DU which builds on four valuable techniques:(i)delta transmission,(ii)XOR-based data update,(iii)relay,and(iv)batch update.Meanwhile,we offer two selective update approaches:1)data-deltabased update,and 2)parity-delta-based update.The proposed DXR-DU is evaluated via trace-driven local testbed experiments.Comprehensive experiments show that DXR-DU can significantly improve data update throughput while mitigating the cross-rack update traffic.展开更多
Due to the development of technology in medicine,millions of health-related data such as scanning the images are generated.It is a great challenge to store the data and handle a massive volume of data.Healthcare data ...Due to the development of technology in medicine,millions of health-related data such as scanning the images are generated.It is a great challenge to store the data and handle a massive volume of data.Healthcare data is stored in the cloud-fog storage environments.This cloud-Fog based health model allows the users to get health-related data from different sources,and duplicated informa-tion is also available in the background.Therefore,it requires an additional sto-rage area,increase in data acquisition time,and insecure data replication in the environment.This paper is proposed to eliminate the de-duplication data using a window size chunking algorithm with a biased sampling-based bloomfilter and provide the health data security using the Advanced Signature-Based Encryp-tion(ASE)algorithm in the Fog-Cloud Environment(WCA-BF+ASE).This WCA-BF+ASE eliminates the duplicate copy of the data and minimizes its sto-rage space and maintenance cost.The data is also stored in an efficient and in a highly secured manner.The security level in the cloud storage environment Win-dows Chunking Algorithm(WSCA)has got 86.5%,two thresholds two divisors(TTTD)80%,Ordinal in Python(ORD)84.4%,Boom Filter(BF)82%,and the proposed work has got better security storage of 97%.And also,after applying the de-duplication process,the proposed method WCA-BF+ASE has required only less storage space for variousfile sizes of 10 KB for 200,400 MB has taken only 22 KB,and 600 MB has required 35 KB,800 MB has consumed only 38 KB,1000 MB has taken 40 KB of storage spaces.展开更多
The COVID-19 pandemic has exposed vulnerabilities within our healthcare structures. Healthcare facilities are often faced with staff shortages and work overloads, which can have an impact on the collection of health d...The COVID-19 pandemic has exposed vulnerabilities within our healthcare structures. Healthcare facilities are often faced with staff shortages and work overloads, which can have an impact on the collection of health data and constants essential for early diagnosis. In order to minimize the risk of error and optimize data collection, we have developed a robot incorporating artificial intelligence. This robot has been designed to automate and collect health data and constants in a contactless way, while at the same time verifying the conditions for correct measurements, such as the absence of hats and shoes. Furthermore, this health information needs to be transmitted to services for processing. Thus, this article addresses the aspect of reception and collection of health data and constants through various modules: for taking height, temperature and weight, as well as the module for entering patient identification data. The article also deals with orientation, presenting a module for selecting the patient’s destination department. This data is then routed via a wireless network and an application integrated into the doctors’ tablets. This application will enable efficient queue management by classifying patients according to their order of arrival. The system’s infrastructure is easily deployable, taking advantage of the healthcare facility’s local wireless network, and includes encryption mechanisms to reinforce the security of data circulating over the network. In short, this innovative system will offer an autonomous, contactless method for collecting vital constants such as size, mass, and temperature. What’s more, it will facilitate the flow of data, including identification information, across a network, simplifying the implementation of this solution within healthcare facilities.展开更多
Data space,as an innovative data management and sharing model,is emerging in the medical and health sectors.This study expounds on the conceptual connotation of data space and delineates its key technologies,including...Data space,as an innovative data management and sharing model,is emerging in the medical and health sectors.This study expounds on the conceptual connotation of data space and delineates its key technologies,including distributed data storage,standardization and interoperability of data sharing,data security and privacy protection,data analysis and mining,and data space assessment.By analyzing the real-world cases of data spaces within medicine and health,this study compares the similarities and differences across various dimensions such as purpose,architecture,data interoperability,and privacy protection.Meanwhile,data spaces in these fields are challenged by the limited computing resources,the complexities of data integration,and the need for optimized algorithms.Additionally,legal and ethical issues such as unclear data ownership,undefined usage rights,risks associated with privacy protection need to be addressed.The study notes organizational and management difficulties,calling for enhancements in governance framework,data sharing mechanisms,and value assessment systems.In the future,technological innovation,sound regulations,and optimized management will help the development of the medical and health data space.These developments will enable the secure and efficient utilization of data,propelling the medical industry into an era characterized by precision,intelligence,and personalization.展开更多
Background Data on biosimilar use in pediatric inflammatory bowel diseases(IBD)are scarce compared to the status of studies in adults,resulting in limitations in its treatment.We compared effectiveness and safety of b...Background Data on biosimilar use in pediatric inflammatory bowel diseases(IBD)are scarce compared to the status of studies in adults,resulting in limitations in its treatment.We compared effectiveness and safety of biosimilars versus originators in this population.Methods We used data from the French National Health Data System to identify children(less than 18 years old at treatment initiation)initiating treatment with a biosimilar or the originator infliximab or adalimumab for Crohn’s disease(CD)or ulcerative colitis(UC),from first biosimilar launch(January 2015 and October 2018,respectively)to 31 December 2022.Patients’follow-up went until 30 June 2023.We compared the risks of treatment failure and overnight hospitalization in biosimilar versus originator new users using inverse harzard ratio(HR)of probability of treatment weighted Cox regressions(IPTW).Results We included 5870 patients(infliximab:n=3491;adalimumab:n=2379)in the study.Biosimilars represented,respectively,76.0%(n=2652)and 29.0%(n=691)of infliximab and adalimumab initiations.CD represented 70.9%(n=2476)and 69.0%(n=1642)of infliximab and adalimumab initiations.Biosimilar use was not associated with increased risks of treatment failure[IPTW HR(95%confidence interval,CI):infliximab 0.92(0.78–1.09)in CD,0.98(0.76–1.27)in UC;adalimumab 0.98(0.85–1.14)in CD,1.01(0.82–1.24)in UC].Occurrence of all-cause hospitalization was not different between exposure groups[IPTW HR(95%CI):infliximab 0.96(0.78–1.18);adalimumab 1.03(0.80–1.33)].No difference in occurrence of serious infections,mainly gastro-intestinal or dermatological,was found.Conclusion We provide reassuring results on the use,effectiveness and safety of biosimilars in a large unselected pediatric population suffering from IBD.展开更多
Real-time health data monitoring is pivotal for bolstering road services’safety,intelligence,and efficiency within the Internet of Health Things(IoHT)framework.Yet,delays in data retrieval can markedly hinder the eff...Real-time health data monitoring is pivotal for bolstering road services’safety,intelligence,and efficiency within the Internet of Health Things(IoHT)framework.Yet,delays in data retrieval can markedly hinder the efficacy of big data awareness detection systems.We advocate for a collaborative caching approach involving edge devices and cloud networks to combat this.This strategy is devised to streamline the data retrieval path,subsequently diminishing network strain.Crafting an adept cache processing scheme poses its own set of challenges,especially given the transient nature of monitoring data and the imperative for swift data transmission,intertwined with resource allocation tactics.This paper unveils a novel mobile healthcare solution that harnesses the power of our collaborative caching approach,facilitating nuanced health monitoring via edge devices.The system capitalizes on cloud computing for intricate health data analytics,especially in pinpointing health anomalies.Given the dynamic locational shifts and possible connection disruptions,we have architected a hierarchical detection system,particularly during crises.This system caches data efficiently and incorporates a detection utility to assess data freshness and potential lag in response times.Furthermore,we introduce the Cache-Assisted Real-Time Detection(CARD)model,crafted to optimize utility.Addressing the inherent complexity of the NP-hard CARD model,we have championed a greedy algorithm as a solution.Simulations reveal that our collaborative caching technique markedly elevates the Cache Hit Ratio(CHR)and data freshness,outshining its contemporaneous benchmark algorithms.The empirical results underscore the strength and efficiency of our innovative IoHT-based health monitoring solution.To encapsulate,this paper tackles the nuances of real-time health data monitoring in the IoHT landscape,presenting a joint edge-cloud caching strategy paired with a hierarchical detection system.Our methodology yields enhanced cache efficiency and data freshness.The corroborative numerical data accentuates the feasibility and relevance of our model,casting a beacon for the future trajectory of real-time health data monitoring systems.展开更多
This study presents a comprehensive and secure architectural framework for the Internet of Medical Things(IoMT),integrating the foundational principles of the Confidentiality,Integrity,and Availability(CIA)triad along...This study presents a comprehensive and secure architectural framework for the Internet of Medical Things(IoMT),integrating the foundational principles of the Confidentiality,Integrity,and Availability(CIA)triad along with authentication mechanisms.Leveraging advanced Machine Learning(ML)and Deep Learning(DL)techniques,the proposed system is designed to safeguard Patient-Generated Health Data(PGHD)across interconnected medical devices.Given the increasing complexity and scale of cyber threats in IoMT environments,the integration of Intrusion Detection and Prevention Systems(IDPS)with intelligent analytics is critical.Our methodology employs both standalone and hybrid ML&DL models to automate threat detection and enable real-time analysis,while ensuring rapid and accurate responses to a diverse array of attacks.Emphasis is placed on systematic model evaluation using detection metrics such as accuracy,False Alarm Rate(FAR),and False Discovery Rate(FDR),with performance validation through cross-validation and statistical significance testing.Experimental results based on the Edge-IIoTset dataset demonstrate the superior performance of ensemble-based ML models such as Extreme Gradient Boosting(XGB)and hybrid DL models such as Convolutional Neural Networks with Autoencoders(CNN+AE),which achieved detection accuracies of 96%and 98%,respectively,with notably low FARs.These findings underscore the effectiveness of combining traditional security principles with advanced AI-driven methodologies to ensure secure,resilient,and trustworthy healthcare systems within the IoMT ecosystem.展开更多
调查和分析元数据标准在健康科学数据中的应用现状,有助于为我国健康科学数据描述中元数据标准的选择、健康科学数据平台的建设提供参考。通过网络调研法对科学数据仓储注册系统(registry of research data repositories,re3data)中的...调查和分析元数据标准在健康科学数据中的应用现状,有助于为我国健康科学数据描述中元数据标准的选择、健康科学数据平台的建设提供参考。通过网络调研法对科学数据仓储注册系统(registry of research data repositories,re3data)中的健康科学数据管理平台进行调研,梳理所应用的元数据标准,分析典型元数据标准在平台中的应用情况,并归纳其在健康科学数据描述中的适用性。re3data中各健康科学数据平台共使用14种元数据标准,其中DC、DataCite、DDI、仓储自建元数据标准的使用最为广泛,多数平台组合使用多种元数据标准。各类元数据标准可分为通用型、社会科学型、自建型3类,分别适用于描述健康科学数据通用属性、社会科学研究产生的健康科学数据、特色和专业性强及政府开放的健康科学数据。展开更多
Blockchain is commonly considered a potentialdisruptive technology. Moreover, the healthcareindustry has experienced rapid growth in the adoption ofhealth information technology, such as electronic healthrecords and e...Blockchain is commonly considered a potentialdisruptive technology. Moreover, the healthcareindustry has experienced rapid growth in the adoption ofhealth information technology, such as electronic healthrecords and electronic medical records. To guarantee dataprivacy and data security as well as to harness the value ofhealth data, the concept of Health Data Bank (HDB) isproposed. In this study, HDB is defined as an integratedhealth data service institution, which bears no “ownership”of health data and operates health data under the principalagentmodel. This study first comprehensively reviews themain characters of blockchain and identifies the blockchain-based healthcare industry projects and startups in theareas of health insurance, pharmacy, and medical treatment.Then, we analyze the fundamental principles ofHDB and point out four challenges faced by HDB’ssustainable development: (1) privacy protection andinteroperability of health data;(2) data rights;(3) healthdata supervision;(4) and willingness to share health data.We also analyze the important benefits of blockchainadoption in HDB. Furthermore, three application scenariosincluding distributed storage of health data, smart-contractbasedhealthcare service mode, and consensus-algorithmbasedincentive policy are proposed to shed light on HDBbasedhealthcare service mode. In the end, this study offersinsights into potential research directions and challenges.展开更多
OBJECTIVE: To apply data mining methods to research on the state of sub-mental health among residents in eight provinces and cities in China and to mine latent knowledge about many conditions through data mining and a...OBJECTIVE: To apply data mining methods to research on the state of sub-mental health among residents in eight provinces and cities in China and to mine latent knowledge about many conditions through data mining and analysis of data on 3970 sub-mentally healthy individuals selected from 13385 relevant question naires.METHODS: The strategic tree algorithm was used to identify the main mani festations of the state of sub-mental health. The backpropogation artificial neural network was used to analyze the main mani festations of sub-healthy mental states of three different degrees. A sub-mental health evaluation model was then established to achieve predictive evaluationresults.RESULTS: Using classifications from the Scale of Chinese Sub-healthy State, the main manifestations of sub-mental health selected using the strate gictree were F1101(Do you lack peace of mind?),F1102(Are you easily nervous when something comes up?), and F1002(Do you often sigh?). The relative intensity of manifestations of sub-mental health was highest for F1101, followed by F1102,and then F1002. Through study of the neural network, better differentiation could be made between moderate and severe and between mild and severe states of sub-mental health. The differentiation between mild and moderate sub-mental health states was less apparent. Additionally, the sub-mental health state evaluation model, which could be used to predict states of sub-mental health of different individuals, was established using F1101, F1102, F1002, and the mental self-assessment totals core.CONCLUSION: The main manifestations of the state of sub-mental health can be discovered using data mining methods to research and analyze the latent laws and knowledge hidden in research evidence on the state of sub-mental health. The state of sub-mental health of different individuals can be rapidly predicted using the model established here.This can provide a basis for assessment and intervention for sub-mental health. It can also replace the relatively outdated approaches to research on sub-health in the technical era of information and digitization by combining the study of states of sub-mental health with information techniques and by further quantifying the relevant information.展开更多
Big data and associated analytics have the potential to revolutionize healthcare through the tools and techniques they offer to manage and exploit the large volumes of heterogeneous data being collected in the healthc...Big data and associated analytics have the potential to revolutionize healthcare through the tools and techniques they offer to manage and exploit the large volumes of heterogeneous data being collected in the healthcare domain. The strict security and privacy constraints on this data, however, pose a major obstacle to the successful use of these tools and techniques. The paper first describes the security challenges associated with big data analytics in healthcare research from a unique perspective based on the big data analytics pipeline. The paper then examines the use of data safe havens as an approach to addressing the security challenges and argues for the approach by providing a detailed introduction to the security mechanisms implemented in a novel data safe haven. The CIMVHR Data Safe Haven (CDSH) was developed to support research into the health and well-being of Canadian military, Veterans, and their families. The CDSH is shown to overcome the security challenges presented in the different stages of the big data analytics pipeline.展开更多
In conjunction with association rules for data mining, the connections between testing indices and strong and weak association rules were determined, and new derivative rules were obtained by further reasoning. Associ...In conjunction with association rules for data mining, the connections between testing indices and strong and weak association rules were determined, and new derivative rules were obtained by further reasoning. Association rules were used to analyze correlation and check consistency between indices. This study shows that the judgment obtained by weak association rules or non-association rules is more accurate and more credible than that obtained by strong association rules. When the testing grades of two indices in the weak association rules are inconsistent, the testing grades of indices are more likely to be erroneous, and the mistakes are often caused by human factors. Clustering data mining technology was used to analyze the reliability of a diagnosis, or to perform health diagnosis directly. Analysis showed that the clustering results are related to the indices selected, and that if the indices selected are more significant, the characteristics of clustering results are also more significant, and the analysis or diagnosis is more credible. The indices and diagnosis analysis function produced by this study provide a necessary theoretical foundation and new ideas for the development of hydraulic metal structure health diagnosis technology.展开更多
Structural health monitoring (SHM) is a multi-discipline field that involves the automatic sensing of structural loads and response by means of a large number of sensors and instruments, followed by a diagnosis of the...Structural health monitoring (SHM) is a multi-discipline field that involves the automatic sensing of structural loads and response by means of a large number of sensors and instruments, followed by a diagnosis of the structural health based on the collected data. Because an SHM system implemented into a structure automatically senses, evaluates, and warns about structural conditions in real time, massive data are a significant feature of SHM. The techniques related to massive data are referred to as data science and engineering, and include acquisition techniques, transition techniques, management techniques, and processing and mining algorithms for massive data. This paper provides a brief review of the state of the art of data science and engineering in SHM as investigated by these authors, and covers the compressive sampling-based data-acquisition algorithm, the anomaly data diagnosis approach using a deep learning algorithm, crack identification approaches using computer vision techniques, and condition assessment approaches for bridges using machine learning algorithms. Future trends are discussed in the conclusion.展开更多
Guangzhou is the capital and largest city(land area:7287 km2)of Guangdong province in South China.The air quality in Guangzhou typically worsens in November due to unfavorable meteorological conditions for pollutan...Guangzhou is the capital and largest city(land area:7287 km2)of Guangdong province in South China.The air quality in Guangzhou typically worsens in November due to unfavorable meteorological conditions for pollutant dispersion.During the Guangzhou Asian Games in November 2010,the Guangzhou government carried out a number of emission control measures that significantly improved the air quality.In this paper,we estimated the acute health outcome changes related to the air quality improvement during the 2010 Guangzhou Asian Games using a next-generation,fully-integrated assessment system for air quality and health benefits.This advanced system generates air quality data by fusing model and monitoring data instead of using monitoring data alone,which provides more reliable results.The air quality estimates retain the spatial distribution of model results while calibrating the value with observations.The results show that the mean PM2.5concentration in November 2010 decreased by 3.5μg/m^3 compared to that in 2009 due to the emission control measures.From the analysis,we estimate that the air quality improvement avoided 106 premature deaths,1869 cases of hospital admission,and 20,026 cases of outpatient visits.The overall cost benefit of the improved air quality is estimated to be 165 million CNY,with the avoided premature death contributing 90%of this figure.The research demonstrates that Ben MAP-CE is capable of assessing the health and cost benefits of air pollution control for sound policy making.展开更多
Objective: Challenges remain in current practices of colorectal cancer(CRC) screening, such as low compliance,low specificities and expensive cost. This study aimed to identify high-risk groups for CRC from the genera...Objective: Challenges remain in current practices of colorectal cancer(CRC) screening, such as low compliance,low specificities and expensive cost. This study aimed to identify high-risk groups for CRC from the general population using regular health examination data.Methods: The study population consist of more than 7,000 CRC cases and more than 140,000 controls. Using regular health examination data, a model detecting CRC cases was derived by the classification and regression trees(CART) algorithm. Receiver operating characteristic(ROC) curve was applied to evaluate the performance of models. The robustness and generalization of the CART model were validated by independent datasets. In addition, the effectiveness of CART-based screening was compared with stool-based screening.Results: After data quality control, 4,647 CRC cases and 133,898 controls free of colorectal neoplasms were used for downstream analysis. The final CART model based on four biomarkers(age, albumin, hematocrit and percent lymphocytes) was constructed. In the test set, the area under ROC curve(AUC) of the CART model was 0.88 [95%confidence interval(95% CI), 0.87-0.90] for detecting CRC. At the cutoff yielding 99.0% specificity, this model’s sensitivity was 62.2%(95% CI, 58.1%-66.2%), thereby achieving a 63-fold enrichment of CRC cases. We validated the robustness of the method across subsets of test set with diverse CRC incidences, aging rates, genders ratio, distributions of tumor stages and locations, and data sources. Importantly, CART-based screening had the higher positive predictive value(1.6%) than fecal immunochemical test(0.3%).Conclusions: As an alternative approach for the early detection of CRC, this study provides a low-cost method using regular health examination data to identify high-risk individuals for CRC for further examinations. The approach can promote early detection of CRC especially in developing countries such as China, where annual health examination is popular but regular CRC-specific screening is rare.展开更多
This study utilized Data Envelopment Analysis (DEA) in assessing the efficiency of health center in tuberculosis (TB) treatment. Assessing the efficiency of health center treating TB is a vital and sensitive topic, be...This study utilized Data Envelopment Analysis (DEA) in assessing the efficiency of health center in tuberculosis (TB) treatment. Assessing the efficiency of health center treating TB is a vital and sensitive topic, because there is a cumulative amount of public funds devoted to healthcare. In this research, a DEA model has been correlated to evaluate and assess the efficiency of 17 health centers. The researchers selected the health budget and the number of health workers as input variables likewise, the number of people served, number of TB patients served, and TB patients treated (%) as output variables. Based on the result of the study, only five (5) health centers out of seventeen (17) have 100% efficiencies throughout the 2 years period. It is recommended that other health centers should learn from their efficient peers recognized by the DEA model so as to increase the overall performance of the healthcare system. Likewise, health centers should integrate Health Information Technology to deliver healthier care for their patients.展开更多
We assess the relative efficiency of health systems of 35 countries in sub-Saharan Africa using Data Envelopment Analysis. This method allows us to evaluate the ability of each country to transform its sanitary “inp...We assess the relative efficiency of health systems of 35 countries in sub-Saharan Africa using Data Envelopment Analysis. This method allows us to evaluate the ability of each country to transform its sanitary “inputs” into health “outputs”. Our results show that, on average, the health systems of these countries have an efficiency score between 72% and 84% of their maximum level. We also note that education and density of population are factors that affect the efficiency of the health system in these countries.展开更多
Background: High data quality provides correct and up-to-date information which is critical to ensure, not only for the maintenance of health care at an optimal level, but also for the provision of high-quality clinic...Background: High data quality provides correct and up-to-date information which is critical to ensure, not only for the maintenance of health care at an optimal level, but also for the provision of high-quality clinical care, continuing health care, clinical and health service research, and planning and management of health systems. For the attainment of achievable improvements in the health sector, good data is core. Aim/Objective: To assess the level of knowledge and practices of Community Health Nurses on data quality in the Ho municipality, Ghana. Methods: A descriptive cross-sectional study was employed for the study, using a standard Likert scale questionnaire. A census was used to collect 77 Community Health Nurses’ information. The statistical software, Epi-Data 3.1 was used to enter the data and exported to STATA 12.0 for the analyses. Chi-square and logistic analyses were performed to establish associations between categorical variables and a p-value of less than 0.05 at 95% significance interval was considered statistically significant. Results: Out of the 77 Community Health Nurses studied, 49 (63.64%) had good knowledge on data accuracy, 51 (66.23%) out of the 77 Community Health Nurses studied had poor knowledge on data completeness, and 64 (83.12%) had poor knowledge on data timeliness out of the 77 studied. Also, 16 (20.78%) and 33 (42.86%) of the 77 Community Health Nurses responded there was no designated staff for data quality review and no feedback from the health directorate respectively. Out of the 16 health facilities studied for data quality practices, half (8, 50.00%) had missing values on copies of their previous months’ report forms. More so, 10 (62.50%) had no reminders (monthly data submission itineraries) at the facility level. Conclusion: Overall, the general level of knowledge of Community Health Nurses on data quality was poor and their practices for improving data quality at the facility level were woefully inadequate. Therefore, Community Health Nurses need to be given on-job training and proper education on data quality and its dimensions. Also, the health directorate should intensify its continuous supportive supervisory visits at all facilities and feedback should be given to the Community Health Nurses on the data submitted.展开更多
文摘National Population Health Data Center(NPHDC)is one of China's 20 national-level science data centers,jointly designated by the Ministry of Science and Technology and the Ministry of Finance.Operated by the Chinese Academy of Medical Sciences under the oversight of the National Health Commission,NPHDC adheres to national regulations including the Scientific Data Management Measures and the National Science and Technology Infrastructure Service Platform Management Measures,and is committed to collecting,integrating,managing,and sharing biomedical and health data through openaccess platform,fostering open sharing and engaging in international cooperation.
文摘The recent pandemic crisis has highlighted the importance of the availability and management of health data to respond quickly and effectively to health emergencies, while respecting the fundamental rights of every individual. In this context, it is essential to find a balance between the protection of privacy and the safeguarding of public health, using tools that guarantee transparency and consent to the processing of data by the population. This work, starting from a pilot investigation conducted in the Polyclinic of Bari as part of the Horizon Europe Seeds project entitled “Multidisciplinary analysis of technological tracing models of contagion: the protection of rights in the management of health data”, has the objective of promoting greater patient awareness regarding the processing of their health data and the protection of privacy. The methodology used the PHICAT (Personal Health Information Competence Assessment Tool) as a tool and, through the administration of a questionnaire, the aim was to evaluate the patients’ ability to express their consent to the release and processing of health data. The results that emerged were analyzed in relation to the 4 domains in which the process is divided which allows evaluating the patients’ ability to express a conscious choice and, also, in relation to the socio-demographic and clinical characteristics of the patients themselves. This study can contribute to understanding patients’ ability to give their consent and improve information regarding the management of health data by increasing confidence in granting the use of their data for research and clinical management.
基金supported by Major Special Project of Sichuan Science and Technology Department(2020YFG0460)Central University Project of China(ZYGX2020ZB020,ZYGX2020ZB019).
文摘To achieve the high availability of health data in erasure-coded cloud storage systems,the data update performance in erasure coding should be continuously optimized.However,the data update performance is often bottlenecked by the constrained cross-rack bandwidth.Various techniques have been proposed in the literature to improve network bandwidth efficiency,including delta transmission,relay,and batch update.These techniques were largely proposed individually previously,and in this work,we seek to use them jointly.To mitigate the cross-rack update traffic,we propose DXR-DU which builds on four valuable techniques:(i)delta transmission,(ii)XOR-based data update,(iii)relay,and(iv)batch update.Meanwhile,we offer two selective update approaches:1)data-deltabased update,and 2)parity-delta-based update.The proposed DXR-DU is evaluated via trace-driven local testbed experiments.Comprehensive experiments show that DXR-DU can significantly improve data update throughput while mitigating the cross-rack update traffic.
文摘Due to the development of technology in medicine,millions of health-related data such as scanning the images are generated.It is a great challenge to store the data and handle a massive volume of data.Healthcare data is stored in the cloud-fog storage environments.This cloud-Fog based health model allows the users to get health-related data from different sources,and duplicated informa-tion is also available in the background.Therefore,it requires an additional sto-rage area,increase in data acquisition time,and insecure data replication in the environment.This paper is proposed to eliminate the de-duplication data using a window size chunking algorithm with a biased sampling-based bloomfilter and provide the health data security using the Advanced Signature-Based Encryp-tion(ASE)algorithm in the Fog-Cloud Environment(WCA-BF+ASE).This WCA-BF+ASE eliminates the duplicate copy of the data and minimizes its sto-rage space and maintenance cost.The data is also stored in an efficient and in a highly secured manner.The security level in the cloud storage environment Win-dows Chunking Algorithm(WSCA)has got 86.5%,two thresholds two divisors(TTTD)80%,Ordinal in Python(ORD)84.4%,Boom Filter(BF)82%,and the proposed work has got better security storage of 97%.And also,after applying the de-duplication process,the proposed method WCA-BF+ASE has required only less storage space for variousfile sizes of 10 KB for 200,400 MB has taken only 22 KB,and 600 MB has required 35 KB,800 MB has consumed only 38 KB,1000 MB has taken 40 KB of storage spaces.
文摘The COVID-19 pandemic has exposed vulnerabilities within our healthcare structures. Healthcare facilities are often faced with staff shortages and work overloads, which can have an impact on the collection of health data and constants essential for early diagnosis. In order to minimize the risk of error and optimize data collection, we have developed a robot incorporating artificial intelligence. This robot has been designed to automate and collect health data and constants in a contactless way, while at the same time verifying the conditions for correct measurements, such as the absence of hats and shoes. Furthermore, this health information needs to be transmitted to services for processing. Thus, this article addresses the aspect of reception and collection of health data and constants through various modules: for taking height, temperature and weight, as well as the module for entering patient identification data. The article also deals with orientation, presenting a module for selecting the patient’s destination department. This data is then routed via a wireless network and an application integrated into the doctors’ tablets. This application will enable efficient queue management by classifying patients according to their order of arrival. The system’s infrastructure is easily deployable, taking advantage of the healthcare facility’s local wireless network, and includes encryption mechanisms to reinforce the security of data circulating over the network. In short, this innovative system will offer an autonomous, contactless method for collecting vital constants such as size, mass, and temperature. What’s more, it will facilitate the flow of data, including identification information, across a network, simplifying the implementation of this solution within healthcare facilities.
文摘Data space,as an innovative data management and sharing model,is emerging in the medical and health sectors.This study expounds on the conceptual connotation of data space and delineates its key technologies,including distributed data storage,standardization and interoperability of data sharing,data security and privacy protection,data analysis and mining,and data space assessment.By analyzing the real-world cases of data spaces within medicine and health,this study compares the similarities and differences across various dimensions such as purpose,architecture,data interoperability,and privacy protection.Meanwhile,data spaces in these fields are challenged by the limited computing resources,the complexities of data integration,and the need for optimized algorithms.Additionally,legal and ethical issues such as unclear data ownership,undefined usage rights,risks associated with privacy protection need to be addressed.The study notes organizational and management difficulties,calling for enhancements in governance framework,data sharing mechanisms,and value assessment systems.In the future,technological innovation,sound regulations,and optimized management will help the development of the medical and health data space.These developments will enable the secure and efficient utilization of data,propelling the medical industry into an era characterized by precision,intelligence,and personalization.
文摘Background Data on biosimilar use in pediatric inflammatory bowel diseases(IBD)are scarce compared to the status of studies in adults,resulting in limitations in its treatment.We compared effectiveness and safety of biosimilars versus originators in this population.Methods We used data from the French National Health Data System to identify children(less than 18 years old at treatment initiation)initiating treatment with a biosimilar or the originator infliximab or adalimumab for Crohn’s disease(CD)or ulcerative colitis(UC),from first biosimilar launch(January 2015 and October 2018,respectively)to 31 December 2022.Patients’follow-up went until 30 June 2023.We compared the risks of treatment failure and overnight hospitalization in biosimilar versus originator new users using inverse harzard ratio(HR)of probability of treatment weighted Cox regressions(IPTW).Results We included 5870 patients(infliximab:n=3491;adalimumab:n=2379)in the study.Biosimilars represented,respectively,76.0%(n=2652)and 29.0%(n=691)of infliximab and adalimumab initiations.CD represented 70.9%(n=2476)and 69.0%(n=1642)of infliximab and adalimumab initiations.Biosimilar use was not associated with increased risks of treatment failure[IPTW HR(95%confidence interval,CI):infliximab 0.92(0.78–1.09)in CD,0.98(0.76–1.27)in UC;adalimumab 0.98(0.85–1.14)in CD,1.01(0.82–1.24)in UC].Occurrence of all-cause hospitalization was not different between exposure groups[IPTW HR(95%CI):infliximab 0.96(0.78–1.18);adalimumab 1.03(0.80–1.33)].No difference in occurrence of serious infections,mainly gastro-intestinal or dermatological,was found.Conclusion We provide reassuring results on the use,effectiveness and safety of biosimilars in a large unselected pediatric population suffering from IBD.
基金supported by National Natural Science Foundation of China(NSFC)under Grant Number T2350710232.
文摘Real-time health data monitoring is pivotal for bolstering road services’safety,intelligence,and efficiency within the Internet of Health Things(IoHT)framework.Yet,delays in data retrieval can markedly hinder the efficacy of big data awareness detection systems.We advocate for a collaborative caching approach involving edge devices and cloud networks to combat this.This strategy is devised to streamline the data retrieval path,subsequently diminishing network strain.Crafting an adept cache processing scheme poses its own set of challenges,especially given the transient nature of monitoring data and the imperative for swift data transmission,intertwined with resource allocation tactics.This paper unveils a novel mobile healthcare solution that harnesses the power of our collaborative caching approach,facilitating nuanced health monitoring via edge devices.The system capitalizes on cloud computing for intricate health data analytics,especially in pinpointing health anomalies.Given the dynamic locational shifts and possible connection disruptions,we have architected a hierarchical detection system,particularly during crises.This system caches data efficiently and incorporates a detection utility to assess data freshness and potential lag in response times.Furthermore,we introduce the Cache-Assisted Real-Time Detection(CARD)model,crafted to optimize utility.Addressing the inherent complexity of the NP-hard CARD model,we have championed a greedy algorithm as a solution.Simulations reveal that our collaborative caching technique markedly elevates the Cache Hit Ratio(CHR)and data freshness,outshining its contemporaneous benchmark algorithms.The empirical results underscore the strength and efficiency of our innovative IoHT-based health monitoring solution.To encapsulate,this paper tackles the nuances of real-time health data monitoring in the IoHT landscape,presenting a joint edge-cloud caching strategy paired with a hierarchical detection system.Our methodology yields enhanced cache efficiency and data freshness.The corroborative numerical data accentuates the feasibility and relevance of our model,casting a beacon for the future trajectory of real-time health data monitoring systems.
基金funded by the Deanship of Graduate Studies and Scientific Research at Jouf University under Grant Number(DGSSR-2023-02-02516).
文摘This study presents a comprehensive and secure architectural framework for the Internet of Medical Things(IoMT),integrating the foundational principles of the Confidentiality,Integrity,and Availability(CIA)triad along with authentication mechanisms.Leveraging advanced Machine Learning(ML)and Deep Learning(DL)techniques,the proposed system is designed to safeguard Patient-Generated Health Data(PGHD)across interconnected medical devices.Given the increasing complexity and scale of cyber threats in IoMT environments,the integration of Intrusion Detection and Prevention Systems(IDPS)with intelligent analytics is critical.Our methodology employs both standalone and hybrid ML&DL models to automate threat detection and enable real-time analysis,while ensuring rapid and accurate responses to a diverse array of attacks.Emphasis is placed on systematic model evaluation using detection metrics such as accuracy,False Alarm Rate(FAR),and False Discovery Rate(FDR),with performance validation through cross-validation and statistical significance testing.Experimental results based on the Edge-IIoTset dataset demonstrate the superior performance of ensemble-based ML models such as Extreme Gradient Boosting(XGB)and hybrid DL models such as Convolutional Neural Networks with Autoencoders(CNN+AE),which achieved detection accuracies of 96%and 98%,respectively,with notably low FARs.These findings underscore the effectiveness of combining traditional security principles with advanced AI-driven methodologies to ensure secure,resilient,and trustworthy healthcare systems within the IoMT ecosystem.
文摘调查和分析元数据标准在健康科学数据中的应用现状,有助于为我国健康科学数据描述中元数据标准的选择、健康科学数据平台的建设提供参考。通过网络调研法对科学数据仓储注册系统(registry of research data repositories,re3data)中的健康科学数据管理平台进行调研,梳理所应用的元数据标准,分析典型元数据标准在平台中的应用情况,并归纳其在健康科学数据描述中的适用性。re3data中各健康科学数据平台共使用14种元数据标准,其中DC、DataCite、DDI、仓储自建元数据标准的使用最为广泛,多数平台组合使用多种元数据标准。各类元数据标准可分为通用型、社会科学型、自建型3类,分别适用于描述健康科学数据通用属性、社会科学研究产生的健康科学数据、特色和专业性强及政府开放的健康科学数据。
基金the National Natural Science Foundation of China(Grant No.71671039).
文摘Blockchain is commonly considered a potentialdisruptive technology. Moreover, the healthcareindustry has experienced rapid growth in the adoption ofhealth information technology, such as electronic healthrecords and electronic medical records. To guarantee dataprivacy and data security as well as to harness the value ofhealth data, the concept of Health Data Bank (HDB) isproposed. In this study, HDB is defined as an integratedhealth data service institution, which bears no “ownership”of health data and operates health data under the principalagentmodel. This study first comprehensively reviews themain characters of blockchain and identifies the blockchain-based healthcare industry projects and startups in theareas of health insurance, pharmacy, and medical treatment.Then, we analyze the fundamental principles ofHDB and point out four challenges faced by HDB’ssustainable development: (1) privacy protection andinteroperability of health data;(2) data rights;(3) healthdata supervision;(4) and willingness to share health data.We also analyze the important benefits of blockchainadoption in HDB. Furthermore, three application scenariosincluding distributed storage of health data, smart-contractbasedhealthcare service mode, and consensus-algorithmbasedincentive policy are proposed to shed light on HDBbasedhealthcare service mode. In the end, this study offersinsights into potential research directions and challenges.
基金Supported by Chinese"Disease"Sub-health Medicine Research and Intervention of the Eleventh Five-Year Science and Technology Support Project of China(No.2006BAI13B01)Financial Support Case Studies of Traditional Chinese Medicine Treatment of Disease and Health Management Ideas of Shanghai Health Bureau(No.2010227)+2 种基金Scientific Innovation Research Funds of Shanghai Municipal Education Commission(No.14YZ061)Teacher Academic Community Fund of Shanghai University of Traditional Chinese Medicine(No.2013JXG03)Chinese Culture and Its Core Value System Modernization Transformation of the National Social Science Funds(No.12AZD094)
文摘OBJECTIVE: To apply data mining methods to research on the state of sub-mental health among residents in eight provinces and cities in China and to mine latent knowledge about many conditions through data mining and analysis of data on 3970 sub-mentally healthy individuals selected from 13385 relevant question naires.METHODS: The strategic tree algorithm was used to identify the main mani festations of the state of sub-mental health. The backpropogation artificial neural network was used to analyze the main mani festations of sub-healthy mental states of three different degrees. A sub-mental health evaluation model was then established to achieve predictive evaluationresults.RESULTS: Using classifications from the Scale of Chinese Sub-healthy State, the main manifestations of sub-mental health selected using the strate gictree were F1101(Do you lack peace of mind?),F1102(Are you easily nervous when something comes up?), and F1002(Do you often sigh?). The relative intensity of manifestations of sub-mental health was highest for F1101, followed by F1102,and then F1002. Through study of the neural network, better differentiation could be made between moderate and severe and between mild and severe states of sub-mental health. The differentiation between mild and moderate sub-mental health states was less apparent. Additionally, the sub-mental health state evaluation model, which could be used to predict states of sub-mental health of different individuals, was established using F1101, F1102, F1002, and the mental self-assessment totals core.CONCLUSION: The main manifestations of the state of sub-mental health can be discovered using data mining methods to research and analyze the latent laws and knowledge hidden in research evidence on the state of sub-mental health. The state of sub-mental health of different individuals can be rapidly predicted using the model established here.This can provide a basis for assessment and intervention for sub-mental health. It can also replace the relatively outdated approaches to research on sub-health in the technical era of information and digitization by combining the study of states of sub-mental health with information techniques and by further quantifying the relevant information.
文摘Big data and associated analytics have the potential to revolutionize healthcare through the tools and techniques they offer to manage and exploit the large volumes of heterogeneous data being collected in the healthcare domain. The strict security and privacy constraints on this data, however, pose a major obstacle to the successful use of these tools and techniques. The paper first describes the security challenges associated with big data analytics in healthcare research from a unique perspective based on the big data analytics pipeline. The paper then examines the use of data safe havens as an approach to addressing the security challenges and argues for the approach by providing a detailed introduction to the security mechanisms implemented in a novel data safe haven. The CIMVHR Data Safe Haven (CDSH) was developed to support research into the health and well-being of Canadian military, Veterans, and their families. The CDSH is shown to overcome the security challenges presented in the different stages of the big data analytics pipeline.
基金supported by the Key Program of the National Natural Science Foundation of China(Grant No.50539010)the Special Fund for Public Welfare Industry of the Ministry of Water Resources of China(Grant No.200801019)
文摘In conjunction with association rules for data mining, the connections between testing indices and strong and weak association rules were determined, and new derivative rules were obtained by further reasoning. Association rules were used to analyze correlation and check consistency between indices. This study shows that the judgment obtained by weak association rules or non-association rules is more accurate and more credible than that obtained by strong association rules. When the testing grades of two indices in the weak association rules are inconsistent, the testing grades of indices are more likely to be erroneous, and the mistakes are often caused by human factors. Clustering data mining technology was used to analyze the reliability of a diagnosis, or to perform health diagnosis directly. Analysis showed that the clustering results are related to the indices selected, and that if the indices selected are more significant, the characteristics of clustering results are also more significant, and the analysis or diagnosis is more credible. The indices and diagnosis analysis function produced by this study provide a necessary theoretical foundation and new ideas for the development of hydraulic metal structure health diagnosis technology.
基金the National Natural Science Foundation of China (51638007, 51478149, 51678203,and 51678204).
文摘Structural health monitoring (SHM) is a multi-discipline field that involves the automatic sensing of structural loads and response by means of a large number of sensors and instruments, followed by a diagnosis of the structural health based on the collected data. Because an SHM system implemented into a structure automatically senses, evaluates, and warns about structural conditions in real time, massive data are a significant feature of SHM. The techniques related to massive data are referred to as data science and engineering, and include acquisition techniques, transition techniques, management techniques, and processing and mining algorithms for massive data. This paper provides a brief review of the state of the art of data science and engineering in SHM as investigated by these authors, and covers the compressive sampling-based data-acquisition algorithm, the anomaly data diagnosis approach using a deep learning algorithm, crack identification approaches using computer vision techniques, and condition assessment approaches for bridges using machine learning algorithms. Future trends are discussed in the conclusion.
基金provided by the US Environmental Protection Agency(No.5-312-0212979-51786L)the Guangzhou EnvironmentalProtection Bureau(No.x2hj B2150020)+3 种基金the project of an integrated modeling and filed observational verification on the deposition of typical industrial point-source mercury emissions in the Pearl River Deltsupported by the funding of the Guangdong Provincial Key Laboratory of Atmospheric Environment and Pollution Control(No.2011A060901011)the project of Atmospheric Haze Collaboration Control Technology Design from the Chinese Academy of Sciences(No.XDB05030400)the National Environmental Protection Public Welfare Industry Targeted Research Foundation of China(No.201409019)
文摘Guangzhou is the capital and largest city(land area:7287 km2)of Guangdong province in South China.The air quality in Guangzhou typically worsens in November due to unfavorable meteorological conditions for pollutant dispersion.During the Guangzhou Asian Games in November 2010,the Guangzhou government carried out a number of emission control measures that significantly improved the air quality.In this paper,we estimated the acute health outcome changes related to the air quality improvement during the 2010 Guangzhou Asian Games using a next-generation,fully-integrated assessment system for air quality and health benefits.This advanced system generates air quality data by fusing model and monitoring data instead of using monitoring data alone,which provides more reliable results.The air quality estimates retain the spatial distribution of model results while calibrating the value with observations.The results show that the mean PM2.5concentration in November 2010 decreased by 3.5μg/m^3 compared to that in 2009 due to the emission control measures.From the analysis,we estimate that the air quality improvement avoided 106 premature deaths,1869 cases of hospital admission,and 20,026 cases of outpatient visits.The overall cost benefit of the improved air quality is estimated to be 165 million CNY,with the avoided premature death contributing 90%of this figure.The research demonstrates that Ben MAP-CE is capable of assessing the health and cost benefits of air pollution control for sound policy making.
基金supported by funding from Beijing Municipal Science & Technology Commission, Clinical Application and Development of Capital Characteristic (No. Z161100000516003)National Natural Science Foundation of China (No. 31871266)
文摘Objective: Challenges remain in current practices of colorectal cancer(CRC) screening, such as low compliance,low specificities and expensive cost. This study aimed to identify high-risk groups for CRC from the general population using regular health examination data.Methods: The study population consist of more than 7,000 CRC cases and more than 140,000 controls. Using regular health examination data, a model detecting CRC cases was derived by the classification and regression trees(CART) algorithm. Receiver operating characteristic(ROC) curve was applied to evaluate the performance of models. The robustness and generalization of the CART model were validated by independent datasets. In addition, the effectiveness of CART-based screening was compared with stool-based screening.Results: After data quality control, 4,647 CRC cases and 133,898 controls free of colorectal neoplasms were used for downstream analysis. The final CART model based on four biomarkers(age, albumin, hematocrit and percent lymphocytes) was constructed. In the test set, the area under ROC curve(AUC) of the CART model was 0.88 [95%confidence interval(95% CI), 0.87-0.90] for detecting CRC. At the cutoff yielding 99.0% specificity, this model’s sensitivity was 62.2%(95% CI, 58.1%-66.2%), thereby achieving a 63-fold enrichment of CRC cases. We validated the robustness of the method across subsets of test set with diverse CRC incidences, aging rates, genders ratio, distributions of tumor stages and locations, and data sources. Importantly, CART-based screening had the higher positive predictive value(1.6%) than fecal immunochemical test(0.3%).Conclusions: As an alternative approach for the early detection of CRC, this study provides a low-cost method using regular health examination data to identify high-risk individuals for CRC for further examinations. The approach can promote early detection of CRC especially in developing countries such as China, where annual health examination is popular but regular CRC-specific screening is rare.
文摘This study utilized Data Envelopment Analysis (DEA) in assessing the efficiency of health center in tuberculosis (TB) treatment. Assessing the efficiency of health center treating TB is a vital and sensitive topic, because there is a cumulative amount of public funds devoted to healthcare. In this research, a DEA model has been correlated to evaluate and assess the efficiency of 17 health centers. The researchers selected the health budget and the number of health workers as input variables likewise, the number of people served, number of TB patients served, and TB patients treated (%) as output variables. Based on the result of the study, only five (5) health centers out of seventeen (17) have 100% efficiencies throughout the 2 years period. It is recommended that other health centers should learn from their efficient peers recognized by the DEA model so as to increase the overall performance of the healthcare system. Likewise, health centers should integrate Health Information Technology to deliver healthier care for their patients.
文摘We assess the relative efficiency of health systems of 35 countries in sub-Saharan Africa using Data Envelopment Analysis. This method allows us to evaluate the ability of each country to transform its sanitary “inputs” into health “outputs”. Our results show that, on average, the health systems of these countries have an efficiency score between 72% and 84% of their maximum level. We also note that education and density of population are factors that affect the efficiency of the health system in these countries.
文摘Background: High data quality provides correct and up-to-date information which is critical to ensure, not only for the maintenance of health care at an optimal level, but also for the provision of high-quality clinical care, continuing health care, clinical and health service research, and planning and management of health systems. For the attainment of achievable improvements in the health sector, good data is core. Aim/Objective: To assess the level of knowledge and practices of Community Health Nurses on data quality in the Ho municipality, Ghana. Methods: A descriptive cross-sectional study was employed for the study, using a standard Likert scale questionnaire. A census was used to collect 77 Community Health Nurses’ information. The statistical software, Epi-Data 3.1 was used to enter the data and exported to STATA 12.0 for the analyses. Chi-square and logistic analyses were performed to establish associations between categorical variables and a p-value of less than 0.05 at 95% significance interval was considered statistically significant. Results: Out of the 77 Community Health Nurses studied, 49 (63.64%) had good knowledge on data accuracy, 51 (66.23%) out of the 77 Community Health Nurses studied had poor knowledge on data completeness, and 64 (83.12%) had poor knowledge on data timeliness out of the 77 studied. Also, 16 (20.78%) and 33 (42.86%) of the 77 Community Health Nurses responded there was no designated staff for data quality review and no feedback from the health directorate respectively. Out of the 16 health facilities studied for data quality practices, half (8, 50.00%) had missing values on copies of their previous months’ report forms. More so, 10 (62.50%) had no reminders (monthly data submission itineraries) at the facility level. Conclusion: Overall, the general level of knowledge of Community Health Nurses on data quality was poor and their practices for improving data quality at the facility level were woefully inadequate. Therefore, Community Health Nurses need to be given on-job training and proper education on data quality and its dimensions. Also, the health directorate should intensify its continuous supportive supervisory visits at all facilities and feedback should be given to the Community Health Nurses on the data submitted.