Funding: Supported by grants from the National Natural Science Foundation of China (Nos. 61571223 and 61171191).
Abstract: In eukaryotic cells, translation initiation requires recruiting the ribosome to a specific mRNA, a step that generally depends on the 5' cap structure. However, translation can also be initiated in a cap-independent manner through a cis-regulatory element termed the internal ribosome entry site (IRES). The first experimentally validated IRES was reported in poliovirus (Pelletier and Sonenberg, 1988). Eukaryotic cellular mRNAs were subsequently shown to contain IRES elements as well.
Funding: Supported by the National Natural Science Foundation of China (11771393, 11632015) and the Natural Science Foundation of Zhejiang Province, China (LZ14A010002).
Abstract: K-mers can be used to describe biological sequences, and the k-mer distribution is a tool for solving sequence analysis problems in bioinformatics. A k-mer vector can represent the k-mer distribution of a biological sequence. Problems such as similarity calculation or sequence assembly can then be described in the k-mer vector space. This view helps identify new features of long-standing sequence-based problems in bioinformatics and supports the development of new algorithms using concepts and methods from linear space theory. In this study, we defined the k-mer vector space for generalized biological sequences and explained the meaning of the corresponding vector operations in the biological context. We presented the vector/matrix form of several common sequence-based problems, including read quantification, sequence assembly, and pattern detection. The advantages and disadvantages of this formulation are discussed. We also implemented a tool for the sequence assembly problem based on the concepts of k-mer vector methods, which shows the practicability and convenience of this algorithm design strategy.
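The idea of representing a sequence by its k-mer vector, and of expressing similarity calculation in that vector space, can be illustrated with a minimal Python sketch. This is not the authors' tool or their exact definition: the function names `kmer_vector` and `cosine_similarity`, the DNA alphabet "ACGT", the lexicographic ordering of coordinates, and the choice of cosine similarity as the measure are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
import math


def kmer_vector(seq, k, alphabet="ACGT"):
    """Count every overlapping k-mer in seq and return the counts as a vector.

    Coordinates follow the lexicographic order of all len(alphabet)**k
    possible k-mers (an illustrative convention, not taken from the paper).
    """
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return [counts["".join(p)] for p in product(alphabet, repeat=k)]


def cosine_similarity(u, v):
    """Cosine of the angle between two k-mer vectors; 0.0 if either is zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    # Two hypothetical example sequences, compared through their 2-mer vectors.
    x = kmer_vector("ACGTACGT", 2)
    y = kmer_vector("ACGTTTGT", 2)
    print(cosine_similarity(x, y))
```

With this representation the vector has one coordinate per possible k-mer, so its length grows as 4^k for DNA; a sparse mapping such as the intermediate `Counter` would be the more practical choice for larger k.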