In the neutral hydrogen(H I)galaxy survey,a significant challenge is to identify and extract the H I galaxy signal from the observational data contaminated by radio frequency interference(RFI).For a drift-scan survey,...In the neutral hydrogen(H I)galaxy survey,a significant challenge is to identify and extract the H I galaxy signal from the observational data contaminated by radio frequency interference(RFI).For a drift-scan survey,or more generally a survey of a spatially continuous region,in the time-ordered spectral data,the H I galaxies and RFI all appear as regions that extend an area in the time-frequency waterfall plot,so the extraction of the H I galaxies and RFI from such data can be regarded as an image segmentation problem,and machine-learning methods can be applied to solve such problems.In this study,we develop a method to effectively detect and extract signals of H I galaxies based on a Mask R-CNN network combined with the PointRend method.By simulating FAST-observed galaxy signals and potential RFI impact,we created a realistic data set for the training and testing of our neural network.We compared five different architectures and selected the best-performing one.This architecture successfully performs instance segmentation of H I galaxy signals in the RFI-contaminated time-ordered data,achieving a precision of 98.64%and a recall of 93.59%.展开更多
基金support by the National SKA Program of ChinaNo.2022SKA0110100+1 种基金the CAS Interdisciplinary Innovation Team(JCTD-2019-05)the science research grants from the China Manned Space Project with No.CMS-CSST-2021-B01。
文摘In the neutral hydrogen(H I)galaxy survey,a significant challenge is to identify and extract the H I galaxy signal from the observational data contaminated by radio frequency interference(RFI).For a drift-scan survey,or more generally a survey of a spatially continuous region,in the time-ordered spectral data,the H I galaxies and RFI all appear as regions that extend an area in the time-frequency waterfall plot,so the extraction of the H I galaxies and RFI from such data can be regarded as an image segmentation problem,and machine-learning methods can be applied to solve such problems.In this study,we develop a method to effectively detect and extract signals of H I galaxies based on a Mask R-CNN network combined with the PointRend method.By simulating FAST-observed galaxy signals and potential RFI impact,we created a realistic data set for the training and testing of our neural network.We compared five different architectures and selected the best-performing one.This architecture successfully performs instance segmentation of H I galaxy signals in the RFI-contaminated time-ordered data,achieving a precision of 98.64%and a recall of 93.59%.