摘要
分析启动子区域内调控元件是阐明基因转录起始机制的重要前提.利用从PlanPromDB数据库下载的植物Pol-ⅡTATA和TATA-less启动子数据,深入分析了两类启动子GC偏好、位点结构保守性、序列碱基组分、保守模体分布、TATAbox位点分布及关联位点保守性等特点,统计出两类植物启动子许多特有的序列组分和结构规律,这些规律对进一步揭示植物Pol-Ⅱ启动子的转录调控机制有一定的帮助.通过构建能够同时考虑位点保守性和关联性的位点关联性权重矩阵扫描模型(PCWM),利用相应打分函数(Score)对两类启动子进行区分,得到了较好结果,说明PCWM的预测性能要优于单碱基的位点权重矩阵(PWM).
Analysis of regular elements in promoter region is the base for elucidating the mechanism of gene transcription initiation. The TATA and the TATA-less promoters of plant RNA polymerase Ⅱ gene are chosen from the PlanPromDB. The GC bias, position structure conservation, nucleotide content and conservative motifs of sequences, position distribution of TATA box and conservation of correlation position are analyzed. Many specific regulars for the two types of promoters are found. These features can offer some help for revealing the transcription regulation of plant gene. A new prediction algorithm based on position-correlation weight matrix (PCWM) is proposed. The better discrimination results for two sort plant promoters are obtained by using score function. It is confirmed that the performance of position-correlation weight matrix (PCWM) is superior to single-base position weight matrix (PWM).
出处
《生物化学与生物物理进展》
SCIE
CAS
CSCD
北大核心
2009年第7期863-871,共9页
Progress In Biochemistry and Biophysics
基金
国家自然科学基金资助项目(30560039)~~