摘要
对河豚基因组微卫星分布特征进行研究,在约365Mb基因组序列中共找到49647个微卫星序列,平均每隔7351个碱基就有1个。微卫星序列长度主要分布在20~60个碱基的长度范围内。全部微卫星序列重复区长度(2802374bp)占整个基因组的比例为0.77%。两碱基重复类型数目最多,为35339个,占总数的71.30%;其次是四碱基类型,6837个,占13.77%;再次分别是三碱基类型3548个,占7.1496。五碱基类型1856个,占3.73%,六碱基类型1605个,占3.23%,单碱基类型最少,共402个。占0.80%。按降序排列,出现最多的前20个微卫星重复类别为:CA,TG,GT,AC,GA,CT,TC,TA.AG,AT,TCCA,ATCC,TCTG,GATG,ATCT,CATC,TGGA,ATGG,GATA,GAG,占全部微卫星序列的75.21%。在河豚基因组中未发现CC微卫星序列的存在。与其它生物基因组中微卫星分布特征相比,河豚基因组具有更高的简洁性。本研究将为研究河豚微卫星标记、研究其群体遗传多样性,进行不同基因组的比较研究提供基础。
As the smallest among vertebrates, the genome of Takifugu rubripes is of great significance in a wide range of studies. By screening the whole genome sequence of Takifugu rubripes available in GenBank, a total of 49 647 simple sequence repeats (SSRs) are identified and characterized. One SSR is found every 7 351b p and the total length of all SSRs accounts for 0.77 % of the whole genome, 365Mb in length. The most frequent six SSRs are di-, tetra-, tri-, penta , hexa-and mono-nucleotide types, accounting for 71. 30%, 13.77 %, 7.14 %, 3.73 %, 3.23 % and 0.80 % of the total respectively. The most frequently occurring microsatellites are CA, TG, GT, AC, GA, CT, TC, TA, AG, AT, TCCA, ATCC, TCTG, GATG, ATCT, CATC, TGGA, ATGG, GATA and GAG in decreasing order, representing 75.21% of the total. The copy numbers of different repeat sequence types are mainly between 20 and 60. The polymorphism of these SSRs will certainly facilitate the genetic diversity analysis of Takifugu rubripes and the comparative studies of diverse genomes.
出处
《中国海洋大学学报(自然科学版)》
CAS
CSCD
北大核心
2006年第2期249-254,272,共7页
Periodical of Ocean University of China
基金
山东省科技兴海项目(001164504)
国家科技攻关计划项目(2001BA804A14)资助