Although the diploid nature has been observed for over 50 years, phasing the diploid is still a laborious task. The speed and throughput of next generation sequencing have largely increased in the past decades. Howeve...Although the diploid nature has been observed for over 50 years, phasing the diploid is still a laborious task. The speed and throughput of next generation sequencing have largely increased in the past decades. However, the short read-length remains one of the biggest challenges of haplotype analysis. For instance, reads as short as 150 bp span no more than one variant in most cases. Numerous experimental technologies have been developed to overcome this challenge. Distance, complexity and accuracy of the linkages obtained are the main factors to evaluate the efficiency of whole genome haplotyping methods. Here, we review these experimental technologies, evaluating their efficiency in linkages obtaining and system complexity. The technologies are organized into four categories based on its strategy: (i) chromosomes separation, (ii) dilution pools, (iii) crosslinking and proximity ligation, (ix) long-read technologies. Within each category, several subsections are listed to classify each technology. Innovative experimental strategies are expected to have high-quality performance, low cost and be labor-saving, which will be largely desired in the future.展开更多
基金ACKNOWLEDGEMENTS This work was supported by the National Basic Research Program of China (No. 2012CB316501), and the National Natural Science Foundation of China (Nos. 61227803 and 61571121).
文摘Although the diploid nature has been observed for over 50 years, phasing the diploid is still a laborious task. The speed and throughput of next generation sequencing have largely increased in the past decades. However, the short read-length remains one of the biggest challenges of haplotype analysis. For instance, reads as short as 150 bp span no more than one variant in most cases. Numerous experimental technologies have been developed to overcome this challenge. Distance, complexity and accuracy of the linkages obtained are the main factors to evaluate the efficiency of whole genome haplotyping methods. Here, we review these experimental technologies, evaluating their efficiency in linkages obtaining and system complexity. The technologies are organized into four categories based on its strategy: (i) chromosomes separation, (ii) dilution pools, (iii) crosslinking and proximity ligation, (ix) long-read technologies. Within each category, several subsections are listed to classify each technology. Innovative experimental strategies are expected to have high-quality performance, low cost and be labor-saving, which will be largely desired in the future.