In a recent decade, many DNA sequencing projects are developed on cells, plants and animals over the world into huge DNA databases. Researchers notice that mammalian genomes encoding thousands of large noncoding RNAs ...In a recent decade, many DNA sequencing projects are developed on cells, plants and animals over the world into huge DNA databases. Researchers notice that mammalian genomes encoding thousands of large noncoding RNAs (lncRNAs), interact with chromatin regulatory complexes, and are thought to play a role in localizing these complexes to target loci across the genome. It is a challenge target using higher dimensional tools to organize various complex interactive properties as visual maps. In this paper, a Pseudo DNA Variant MapPDVM is proposed following Cellular Automata to represent multiple maps that use four Meta symbols as well as DNA or RNA representations. The system architecture of key components and the core mechanism on the PDVM are described. Key modules, equations and their I/O parameters are discussed. Applying the PDVM, two sets of real DNA sequences from both the sample human (noncoding DNA) and corn (coding DNA) genomes are collected in comparison with two sets of pseudo DNA sequences generated by a stream cipher HC-256 under different modes to show their intrinsic properties in higher levels of similar relationships among relevant DNA sequences on 2D maps. Sample 2D maps are listed and their characteristics are illustrated under a controllable environment. Various distributions can be observed on both noncoding and coding conditions from their symmetric properties on 2D maps.展开更多
Recent work revealed that, in the genomes of polyploid wheat, there exists a class of low_copy and chromosome_specific sequences that are labile upon polyploid formation. This class of sequences was proposed to play ...Recent work revealed that, in the genomes of polyploid wheat, there exists a class of low_copy and chromosome_specific sequences that are labile upon polyploid formation. This class of sequences was proposed to play a critical role in the stabilization and establishment of nascent plant polyploids as new species. To further study this issue, five wheat chromosome 7B_specific sequences, isolated from common wheat (Triticum aestivum L.) by chromosome microdissection, were characterized. The sequences were studied by genomic Southern hybridizations on a collection of polyploid wheats and their diploid progenitors. Four sequences hybridized to all polyploid species, but at the diploid level to only species closely related to the B_genome of polyploid wheat. This indicates that these sequences originated with the divergence of the diploid species, and was then vertically transmitted to polyploids. One sequence hybridized to all species at both the diploid and polyploid levels, suggesting its elimination after the polyploid wheat formation. The hybridization of this sequence to two synthetic polyploid wheats indicated that sequence elimination is a rapid event and probably related to methylation status of the sequence. Based on the above results, we suggest that selective changes of low_copy sequences occur rapidly after polyploid formation, which may contribute to the differentiation of chromosomes in newly formed allopolyploid wheats.展开更多
The material T240_6 derived from SC 2 young embryo of the combination CA9211/RW15 (6D/6V alien substitution) was telosomic substitution line of 6VS identified by GISH (genomic in situ hybridization) analysis. The 6V...The material T240_6 derived from SC 2 young embryo of the combination CA9211/RW15 (6D/6V alien substitution) was telosomic substitution line of 6VS identified by GISH (genomic in situ hybridization) analysis. The 6VS was microdissected with a needle and transferred into a 0.5 mL Ep tube. In the 'single tube', all the subsequence steps were conducted. After two round of LA (Linker adaptor)_PCR amplification, the size of PCR bands ranged from 100 to 3 000 bp, with predominate bands 600-1 500 bp. The products were confirmed by Southern blotting analysis using Haynaldia villosa (L.) Schur. genomic DNA labeled with 32 P as probe. The PCR products were purified and ligated into clone vector-pGEM_T easy vector. Then, the plasmids were transformed into competence E. coli JM109 with cool CaCl 2. It was estimated that there were more than 17 000 white clones in the library. The size of insert fragments distributed from 100-1 500 bp, with average of 600 bp. Using H. villosa genomic DNA as probe, dot blotting results showed that 37% clones displayed strong and medium positive signals, and 63% clones had faint or no signals. It is demonstrated that there were about 37% repeat sequence clones and 67% single/unique sequence clones in the library. Eight H. villosa_specific clones were screened from the library, and two clones pHVMK22 and pHVMK134 were used for RFLP analysis and sequencing. Both of them were H. villosa specific clones. The pHVMK22 was a unique sequence clone, and the pHVMK134 was a repeat sequence clone. When the pHVMK22 was used as a probe for Southern hybridization, all the powdery mildew resistance materials showed a special band of 2 kb, while all the susceptible ones not. The pHVMK22 may be applied to detect the existence of Pm21.展开更多
Pse-in-One 2.0 is a package of web-servers evolved from Pse-in-One (Liu, B., Liu, F., Wang, X., Chen, J. Fang, L. & Chou, K.C. Nucleic Acids Research, 2015, 43:W65-W71). In order to make it more flexible and compr...Pse-in-One 2.0 is a package of web-servers evolved from Pse-in-One (Liu, B., Liu, F., Wang, X., Chen, J. Fang, L. & Chou, K.C. Nucleic Acids Research, 2015, 43:W65-W71). In order to make it more flexible and comprehensive as suggested by many users, the updated package has incorporated 23 new pseudo component modes as well as a series of new feature analysis approaches. It is available at http://bioinformatics.hitsz.edu.cn/Pse-in-One2.0/. Moreover, to maximize the convenience of users, provided is also the stand-alone version called “Pse-in-One-Analysis”, by which users can significantly speed up the analysis of massive sequences.展开更多
The microsatellite-enriched library was constructed using magnetic bead hybridization selection method, and the mi-crosatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and ...The microsatellite-enriched library was constructed using magnetic bead hybridization selection method, and the mi-crosatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and fifty white colonies were screened using PCR-based technique, and 84 clones were identified to potentially contain microsatellite repeat motif. The 84 clones were sequenced, and 42 microsatellites and 4 minisatellites with a minimum of five repeats were found (13.1% of white colonies screened). Besides the motif of CA contained in the oligoprobe, we also found other 16 types of microsatellite repeats including a dinucleotide repeat, two tetranucleotide repeats, twelve pentanucleotide repeats and a hexanucleotide repeat. According to Weber (1990), the microsatellite sequences obtained could be categorized structurally into perfect repeats (73.3%), imperfect repeats (13.3%), and compound repeats (13.4%). Among the microsatellite repeats, relatively short arrays (< 20 repeats) were most abundant, accounting for 75.0%. The largest length of microsatellites was 48 repeats, and the average number of repeats was 13.4. The data on the composition and length distribution of microsatellites obtained in the present study can be useful for choosing the repeat motifs for microsatellite isolation in other abalone species.展开更多
Conogethes punctiferalis(Guenée)(Lepidoptera: Crambidae) was originally considered as one species with fruit-feeding type(FFT) and pinaceae-feeding type(PFT), but it has subsequently been divided into tw...Conogethes punctiferalis(Guenée)(Lepidoptera: Crambidae) was originally considered as one species with fruit-feeding type(FFT) and pinaceae-feeding type(PFT), but it has subsequently been divided into two different species of Conogethes punctiferalis and Conogethes pinicolalis. The relationship between the two species was investigated by phylogenetic reconstruction using maximum-likelihood(ML) parameter estimations. The phylogenetic tree and network were constructed based upon sequence data from concatenation of three genes of mitochondrial cytochrome c oxidase subunits I, II and cytochrome b which were derived from 118 samples of C. punctiferalis and 24 samples of C. pinicolalis. The phylogenetic tree and network showed that conspecific sequences were clustering together despite intraspecific variability. Here we report the results of a combined analysis of mitochondrial DNA sequences from three genes and morphological data representing powerful evidence that C. pinicolalisand C. punctiferalis are significantly different.展开更多
Chaos game representation (CGR) is an iterative mapping technique that processes sequences of units, such as nucleotides in a DNA sequence or amino acids in a protein, in order to determine the coordinates of their ...Chaos game representation (CGR) is an iterative mapping technique that processes sequences of units, such as nucleotides in a DNA sequence or amino acids in a protein, in order to determine the coordinates of their positions in a continuous space. This distribution of positions has two features: one is unique, and the other is source sequence that can be recovered from the coordinates so that the distance between positions may serve as a measure of similarity between the corresponding sequences. A CGR-walk model is proposed based on CGR coordinates for the DNA sequences. The CGR coordinates are converted into a time series, and a long-memory ARFIMA (p, d, q) model, where ARFIMA stands for autoregressive fractionally integrated moving average, is introduced into the DNA sequence analysis. This model is applied to simulating real CGR-walk sequence data of ten genomic sequences. Remarkably long-range correlations are uncovered in the data, and the results from these models are reasonably fitted with those from the ARFIMA (p, d, q) model.展开更多
The phylogeny of Ptychostomum was first spacer (ITS) region of the nuclear ribosomal (nr) DNA DNA rps4 sequences. Maximum parsimony, maximum undertaken based on analysis of the internal transcribed and by combinin...The phylogeny of Ptychostomum was first spacer (ITS) region of the nuclear ribosomal (nr) DNA DNA rps4 sequences. Maximum parsimony, maximum undertaken based on analysis of the internal transcribed and by combining data from nrDNA ITS and chloroplast likelihood, and Bayesian analyses all support the conclusion that the reinstated genus Ptychostomum is not monophyletic. Ptychostomum funkii (Schwagr.) J. R. Spence (≡ Bryum funkii Schwaigr.) is placed within a clade containing the type species of Bryum, B. argenteum Hedw. The remaining members of Ptychostomum investigated in the present study constitute another well-supported clade. The results are congruent with previous molecular analyses. On the basis of phylogenetic evidence, we agree with transferring B. amblyodon Mull. Hal. (≡ B. inclinatum (Brid.) Turton≡ Bryum archangelicum Bruch & Schimp.), Bryum lonchocaulon Mull. Hal., Bryum pallescens Schleich. ex Schwaigr., and Bryum pallens Sw. to Ptychostomum.展开更多
Stream cipher, DNA cryptography and DNA analysis are the most important R&D fields in both Cryptography and Bioinformatics. HC-256 is an emerged scheme as the new generation of stream ciphers for advanced network ...Stream cipher, DNA cryptography and DNA analysis are the most important R&D fields in both Cryptography and Bioinformatics. HC-256 is an emerged scheme as the new generation of stream ciphers for advanced network security. From a random sequencing viewpoint, both sequences of HC-256 and real DNA data may have intrinsic pseudo-random properties respectively. In a recent decade, many DNA sequencing projects are developed on cells, plants and animals over the world into huge DNA databases. Researchers notice that mammalian genomes encode thousands of large noncoding RNAs (lncRNAs), interact with chromatin regulatory complexes, and are thought to play a role in localizing these complexes to target loci across the genome. It is a challenge target using higher dimensional visualization tools to organize various complex interactive properties as visual maps. The Variant Map System (VMS) as an emerging scheme is systematically proposed in this paper to apply multiple maps that used four Meta symbols as same as DNA or RNA representations. System architecture of key components and core mechanism on the VMS are described. Key modules, equations and their I/O parameters are discussed. Applying the VM System, two sets of real DNA sequences from both sample human (noncoding DNA) and corn (coding DNA) genomes are collected in comparison with pseudo DNA sequences generated by HC-256 to show their intrinsic properties in higher levels of similar relationships among relevant DNA sequences on 2D maps. Sample 2D maps are listed and their characteristics are illustrated under controllable environment. Visual results are briefly analyzed to explore their intrinsic properties on selected genome sequences.展开更多
Random sequences play an important role in wider security applications, such as mobile communication and network security. Due to DNA sequences owning natural randomness, in order to show DNA’s characteristics intuit...Random sequences play an important role in wider security applications, such as mobile communication and network security. Due to DNA sequences owning natural randomness, in order to show DNA’s characteristics intuitively, this paper proposes a method to present DNA sequences as conjugate maps. The method includes two core models: measuring models to transfer DNA data into measurements, and visual models to test random sequences as distribution maps to show DNA’s characteristics. The spatial relations between sample DNA and CA random sequences are illustrated and compared in the end. The results show that the distribution of DNA sequences and CA random sequences has significant differences and similarities. It can provide a reference for in-depth visualization study of DNA sequences on their measurement maps.展开更多
N^(6)-methyladenine(6mA)is a prevalent DNA modification and is involved in a wide range of human diseases.Previous studies have indicated that 6mA is enriched in mitochondrial DNA(mtDNA)of mammals.By employing an evol...N^(6)-methyladenine(6mA)is a prevalent DNA modification and is involved in a wide range of human diseases.Previous studies have indicated that 6mA is enriched in mitochondrial DNA(mtDNA)of mammals.By employing an evolved adenine deaminase,we developed a deaminase-mediated sequencing(DM-seq)method that could achieve genome-wide mapping of 6mA in mammalian mtDNA at single-base resolution.In this study,we used an engineered adenine deaminase,known as TadA8e protein,to map 6mA in mtDNA of hepatocellular carcinoma(HCC)by DM-seq.Through high-throughput sequencing,we identified sixteen 6mA sites in both HCC and adjacent normal tissue mtDNA.The results revealed an increased overall 6mA level in mtDNA associated with HCC.Furthermore,an elevation in 6mA level was observed alongside a decrease in the m RNA levels of the corresponding genes,indicating that increased6mA level hindered transcription processes related to these genes.These findings demonstrate that 6mA in mtDNA is correlated with HCC and provide evidence supporting the inhibitory effect of elevated 6mA level on subsequent transcriptional activity.This research illuminates the intricate relationship between 6mA modification and transcriptional regulation in the context of HCC,offering valuable insights into the role of 6mA modification in HCC pathogenesis.展开更多
To understand the gene-based biological processes in-depth,the single-molecule real-time sequencing has drawn increasing attention with promoted by the Human Genome Project.Herein,a set of newly designed canonical flu...To understand the gene-based biological processes in-depth,the single-molecule real-time sequencing has drawn increasing attention with promoted by the Human Genome Project.Herein,a set of newly designed canonical fluorescent bases(A_(y),tC,G_(b),T_(p))are proposed for four-color DNA sequencing.These quasi-intrinsic probes are derived from the fluorophore replacement and ring expansion on natural bases,which still keep the pyrimidine or purine underlying skeleton and Watson–Crick hydrogen bonding face to allow minimal perturbation to the native DNA duplex.More importantly,these nucleobase analogues possess red-shifted absorption and efficient photoluminescence due to the enhancedπ-conjugation in character.Meanwhile,the four analogues could generate distinct emission wavelength(Δλ~50 nm)for real-time sequencing.To assess the biological employment of the proposed biosensors,the effects of base pairing and linking deoxyribose are also considered.展开更多
The characterization of long-range correlations and fractal properties of DNA sequences has proved to be adifficult though rewarding task mainly due to the mosaic character of DNA consisting of many patches of various...The characterization of long-range correlations and fractal properties of DNA sequences has proved to be adifficult though rewarding task mainly due to the mosaic character of DNA consisting of many patches of various lengthswith different nucleotide constitutions.In this paper we investigate statistical correlations among different positions in DNAsequences using the two-dimensional DNA walk.The root-mean-square fluctuation F(l)is described by a power law.Theautocorrelation function C(l),which is used to measure the linear dependence and periodicity,exists a power law ofC(l)-l^(-μ).We also calculate the mean-square distance<R^2(l)>along the DNA chain,and it may be expressed as<R^2(l)>-l^(?)with 2>γ>1.Our investigations can provide some insights into long-range correlations in DNA sequences.展开更多
Active Motif Finder (AMF) is a novel algorithmic tool, designed based on mutations in DNA sequences. Tools available at present for finding motifs are based on matching a given motif in the query sequence. AMF descr...Active Motif Finder (AMF) is a novel algorithmic tool, designed based on mutations in DNA sequences. Tools available at present for finding motifs are based on matching a given motif in the query sequence. AMF describes a new algorithm that identifies the occurrences of patterns which possess all kinds of mutations like insertion, deletion and mismatch. The algorithm is mainly based on the Alignment Score Matrix (ASM) computation by com paring input motif with full length sequence. Much of the effort in bioinformatics is directed to identify these motifs in the sequences of newly discovered genes. The proposed bio-tool serves as an open resource for analysis and useful for studying polymorphisms in DNA sequences. AMF can be searched via a user-friendly interface. This tool is intended to serve the scientific community working in the areas of chemical and structural biology, and is freely available to all users, at http://www.sastra.edu/scbt/amf/.展开更多
This paper investigates the existence of low-dimensional deterministic chaos in the AT and GC skew profiles of DNA sequences. It has taken DNA sequences from eight organisms as samples. The skew profiles are analysed ...This paper investigates the existence of low-dimensional deterministic chaos in the AT and GC skew profiles of DNA sequences. It has taken DNA sequences from eight organisms as samples. The skew profiles are analysed using continuous wavelet transform and then nonlinear time series methods. The invariant measures of correlation dimension and the largest Lyapunov exponent are calculated. It is demonstrated that the AT and GC skew profiles of these DNA sequences all exhibit low dimensional chaotic behaviour. It suggests that chaotic properties may be ubiquitous in the DNA sequences of all organisms.展开更多
This paper presents a model to describe alternating current (AC) conductivity of DNA sequences, in which DNA is considered as a one-dimensional (1D) disordered system, and electrons transport via hopping between l...This paper presents a model to describe alternating current (AC) conductivity of DNA sequences, in which DNA is considered as a one-dimensional (1D) disordered system, and electrons transport via hopping between localized states. It finds that AC conductivity in DNA sequences increases as the frequency of the external electric field rises, and it takes the form of σac(ω) - ω2 ln^2(1/ω). Also AC conductivity of DNA sequences increases with the increase of temperature, this phenomenon presents characteristics of weak temperature-dependence. Meanwhile, the AC conductivity in an offdiagonally correlated case is much larger than that in the uncorrelated case of the Anderson limit in low temperatures, which indicates that the off-diagonal correlations in DNA sequences have a great effect on the AC conductivity, while at high temperature the off-diagonal correlations no longer play a vital role in electric transport. In addition, the proportion of nucleotide pairs p also plays an important role in AC electron transport of DNA sequences. For p 〈 0.5, the conductivity of DNA sequence decreases with the increase of p, while for p ≥ 0.5, the conductivity increases with the increase of p.展开更多
The univalent from the meiosis-metaphase spreads of F1 (Z2× wheat variety Wan7107) wasidentified to be Agropyrum intermedium 2Ai-2 chromosome by GISH. The 2Ai-2 chromosomes weremicroisolated and collected. After ...The univalent from the meiosis-metaphase spreads of F1 (Z2× wheat variety Wan7107) wasidentified to be Agropyrum intermedium 2Ai-2 chromosome by GISH. The 2Ai-2 chromosomes weremicroisolated and collected. After two rounds of PCR amplification, the PCR products wereranged from 150-3 000 bp,with predominant fragments at about 200-2 000 bp. Using Ag.intermedium genomic DNA as a probe, Southern blotting analysis confirmed the products originatedfrom Ag. intermedium genome. The products were purified, ligated to pUC18 and then transformedinto competence E.coli DH5αto produce a 2Ai-2 chromosome DNA library. The microcloningexperiments produced approximately 5 ×105 clones, the size range of the cloned inserts was 200-1 500 bp, with an average of 580 bp. Using Ag.intermedium genomic DNA as a probe, dot blottingresults showed that 56% clones are unique/low copy sequences, 44% are repetitive sequences inthe library. Four Ag. intermedium clones were screened from the library by RFLP, and threeclones(Mag065, Mag088, Mag139)belong to low/single sequences, one clone(Mag104)was repetitivesequence, and GISH results indicated that Mag104 was Ag.intermedium species-specific repetitiveDNA sequence.展开更多
Japanese Angelica Root prepared from Angelica acutiloba var. acutiloba and A. acutiloba var. sugiyamae, known in Japan as “Toki” and “Hokkai Toki”, is an important crude drug used in Kampo medicine (traditional Ja...Japanese Angelica Root prepared from Angelica acutiloba var. acutiloba and A. acutiloba var. sugiyamae, known in Japan as “Toki” and “Hokkai Toki”, is an important crude drug used in Kampo medicine (traditional Japanese medicine). However, since these Angelica varieties have recently outcrossed with each other, it is unclear whether Japanese Angelica Root sold for use in Kampo medicine is a pure variety. Here, we describe DNA sequence polymorphisms that can be used to distinguish between A. acutiloba var. acutiloba and A. acutiloba var. sugiyamae. In our analyses, differences in the trnK region of chloroplast DNA distinguished among some A. acutiloba varieties and related species, but not between A. acutiloba var. acutiloba and A. acutiloba var. iwatensis. One geographical strain of A. acutiloba var. acutiloba and A. acutiloba var. sugiyamae showed identical sequences in three regions of chloroplast DNA, but differences in the internal transcribed spacer region of nuclear ribosomal DNA. One strain of A. acutiloba var. iwatensis and A. acutiloba var. sugiyamae had identical sequences in all of the chloroplast and nuclear ribosomal DNA regions examined. These findings show that A. acutiloba var. acutiloba has hybridized with A. acutiloba var. sugiyamae and that the “Hokkai Toki” variety resulted from outcrossing with A. acutiloba var. iwatensis. Molecular authentication based on analyses of chloroplast and nuclear ribosomal DNA sequences of A. acutiloba and related species is an efficient method to authenticate Japanese Angelica Root at the variety level. Therefore, these analyses can determine whether a product is derived from A. acutiloba var. acutiloba or A. acutiloba var. sugiyamae.展开更多
文摘In a recent decade, many DNA sequencing projects are developed on cells, plants and animals over the world into huge DNA databases. Researchers notice that mammalian genomes encoding thousands of large noncoding RNAs (lncRNAs), interact with chromatin regulatory complexes, and are thought to play a role in localizing these complexes to target loci across the genome. It is a challenge target using higher dimensional tools to organize various complex interactive properties as visual maps. In this paper, a Pseudo DNA Variant MapPDVM is proposed following Cellular Automata to represent multiple maps that use four Meta symbols as well as DNA or RNA representations. The system architecture of key components and the core mechanism on the PDVM are described. Key modules, equations and their I/O parameters are discussed. Applying the PDVM, two sets of real DNA sequences from both the sample human (noncoding DNA) and corn (coding DNA) genomes are collected in comparison with two sets of pseudo DNA sequences generated by a stream cipher HC-256 under different modes to show their intrinsic properties in higher levels of similar relationships among relevant DNA sequences on 2D maps. Sample 2D maps are listed and their characteristics are illustrated under a controllable environment. Various distributions can be observed on both noncoding and coding conditions from their symmetric properties on 2D maps.
文摘Recent work revealed that, in the genomes of polyploid wheat, there exists a class of low_copy and chromosome_specific sequences that are labile upon polyploid formation. This class of sequences was proposed to play a critical role in the stabilization and establishment of nascent plant polyploids as new species. To further study this issue, five wheat chromosome 7B_specific sequences, isolated from common wheat (Triticum aestivum L.) by chromosome microdissection, were characterized. The sequences were studied by genomic Southern hybridizations on a collection of polyploid wheats and their diploid progenitors. Four sequences hybridized to all polyploid species, but at the diploid level to only species closely related to the B_genome of polyploid wheat. This indicates that these sequences originated with the divergence of the diploid species, and was then vertically transmitted to polyploids. One sequence hybridized to all species at both the diploid and polyploid levels, suggesting its elimination after the polyploid wheat formation. The hybridization of this sequence to two synthetic polyploid wheats indicated that sequence elimination is a rapid event and probably related to methylation status of the sequence. Based on the above results, we suggest that selective changes of low_copy sequences occur rapidly after polyploid formation, which may contribute to the differentiation of chromosomes in newly formed allopolyploid wheats.
基金国家"8 6 3"计划资助项目 (Z 17 0 4 0 1) 国家转基因植物研究与产业化资助项目 (J0 0 A 0 0 2 )~~
文摘The material T240_6 derived from SC 2 young embryo of the combination CA9211/RW15 (6D/6V alien substitution) was telosomic substitution line of 6VS identified by GISH (genomic in situ hybridization) analysis. The 6VS was microdissected with a needle and transferred into a 0.5 mL Ep tube. In the 'single tube', all the subsequence steps were conducted. After two round of LA (Linker adaptor)_PCR amplification, the size of PCR bands ranged from 100 to 3 000 bp, with predominate bands 600-1 500 bp. The products were confirmed by Southern blotting analysis using Haynaldia villosa (L.) Schur. genomic DNA labeled with 32 P as probe. The PCR products were purified and ligated into clone vector-pGEM_T easy vector. Then, the plasmids were transformed into competence E. coli JM109 with cool CaCl 2. It was estimated that there were more than 17 000 white clones in the library. The size of insert fragments distributed from 100-1 500 bp, with average of 600 bp. Using H. villosa genomic DNA as probe, dot blotting results showed that 37% clones displayed strong and medium positive signals, and 63% clones had faint or no signals. It is demonstrated that there were about 37% repeat sequence clones and 67% single/unique sequence clones in the library. Eight H. villosa_specific clones were screened from the library, and two clones pHVMK22 and pHVMK134 were used for RFLP analysis and sequencing. Both of them were H. villosa specific clones. The pHVMK22 was a unique sequence clone, and the pHVMK134 was a repeat sequence clone. When the pHVMK22 was used as a probe for Southern hybridization, all the powdery mildew resistance materials showed a special band of 2 kb, while all the susceptible ones not. The pHVMK22 may be applied to detect the existence of Pm21.
文摘Pse-in-One 2.0 is a package of web-servers evolved from Pse-in-One (Liu, B., Liu, F., Wang, X., Chen, J. Fang, L. & Chou, K.C. Nucleic Acids Research, 2015, 43:W65-W71). In order to make it more flexible and comprehensive as suggested by many users, the updated package has incorporated 23 new pseudo component modes as well as a series of new feature analysis approaches. It is available at http://bioinformatics.hitsz.edu.cn/Pse-in-One2.0/. Moreover, to maximize the convenience of users, provided is also the stand-alone version called “Pse-in-One-Analysis”, by which users can significantly speed up the analysis of massive sequences.
文摘The microsatellite-enriched library was constructed using magnetic bead hybridization selection method, and the mi-crosatellite DNA sequences were analyzed in Pacific abalone Haliotis discus hannai. Three hundred and fifty white colonies were screened using PCR-based technique, and 84 clones were identified to potentially contain microsatellite repeat motif. The 84 clones were sequenced, and 42 microsatellites and 4 minisatellites with a minimum of five repeats were found (13.1% of white colonies screened). Besides the motif of CA contained in the oligoprobe, we also found other 16 types of microsatellite repeats including a dinucleotide repeat, two tetranucleotide repeats, twelve pentanucleotide repeats and a hexanucleotide repeat. According to Weber (1990), the microsatellite sequences obtained could be categorized structurally into perfect repeats (73.3%), imperfect repeats (13.3%), and compound repeats (13.4%). Among the microsatellite repeats, relatively short arrays (< 20 repeats) were most abundant, accounting for 75.0%. The largest length of microsatellites was 48 repeats, and the average number of repeats was 13.4. The data on the composition and length distribution of microsatellites obtained in the present study can be useful for choosing the repeat motifs for microsatellite isolation in other abalone species.
基金supported by China Agriculture Research System(CARS-02)Beijing Municipal Sci-Tech Program(Z111100056811009)
文摘Conogethes punctiferalis(Guenée)(Lepidoptera: Crambidae) was originally considered as one species with fruit-feeding type(FFT) and pinaceae-feeding type(PFT), but it has subsequently been divided into two different species of Conogethes punctiferalis and Conogethes pinicolalis. The relationship between the two species was investigated by phylogenetic reconstruction using maximum-likelihood(ML) parameter estimations. The phylogenetic tree and network were constructed based upon sequence data from concatenation of three genes of mitochondrial cytochrome c oxidase subunits I, II and cytochrome b which were derived from 118 samples of C. punctiferalis and 24 samples of C. pinicolalis. The phylogenetic tree and network showed that conspecific sequences were clustering together despite intraspecific variability. Here we report the results of a combined analysis of mitochondrial DNA sequences from three genes and morphological data representing powerful evidence that C. pinicolalisand C. punctiferalis are significantly different.
基金Project supported by the National Natural Science Foundation of China (Grant No 60575038)the Natural Science Foundation of Jiangnan University,China (Grant No 20070365)
文摘Chaos game representation (CGR) is an iterative mapping technique that processes sequences of units, such as nucleotides in a DNA sequence or amino acids in a protein, in order to determine the coordinates of their positions in a continuous space. This distribution of positions has two features: one is unique, and the other is source sequence that can be recovered from the coordinates so that the distance between positions may serve as a measure of similarity between the corresponding sequences. A CGR-walk model is proposed based on CGR coordinates for the DNA sequences. The CGR coordinates are converted into a time series, and a long-memory ARFIMA (p, d, q) model, where ARFIMA stands for autoregressive fractionally integrated moving average, is introduced into the DNA sequence analysis. This model is applied to simulating real CGR-walk sequence data of ten genomic sequences. Remarkably long-range correlations are uncovered in the data, and the results from these models are reasonably fitted with those from the ARFIMA (p, d, q) model.
基金supported by the National Natural Science Foundation of China(grantno.30670152)the National Infrastructure of Natural Resources for Science and Technology(grant no.2005DKA21403)the Natural Science Foundation of Hebei Province,China(no.C2008000158)
文摘The phylogeny of Ptychostomum was first spacer (ITS) region of the nuclear ribosomal (nr) DNA DNA rps4 sequences. Maximum parsimony, maximum undertaken based on analysis of the internal transcribed and by combining data from nrDNA ITS and chloroplast likelihood, and Bayesian analyses all support the conclusion that the reinstated genus Ptychostomum is not monophyletic. Ptychostomum funkii (Schwagr.) J. R. Spence (≡ Bryum funkii Schwaigr.) is placed within a clade containing the type species of Bryum, B. argenteum Hedw. The remaining members of Ptychostomum investigated in the present study constitute another well-supported clade. The results are congruent with previous molecular analyses. On the basis of phylogenetic evidence, we agree with transferring B. amblyodon Mull. Hal. (≡ B. inclinatum (Brid.) Turton≡ Bryum archangelicum Bruch & Schimp.), Bryum lonchocaulon Mull. Hal., Bryum pallescens Schleich. ex Schwaigr., and Bryum pallens Sw. to Ptychostomum.
文摘Stream cipher, DNA cryptography and DNA analysis are the most important R&D fields in both Cryptography and Bioinformatics. HC-256 is an emerged scheme as the new generation of stream ciphers for advanced network security. From a random sequencing viewpoint, both sequences of HC-256 and real DNA data may have intrinsic pseudo-random properties respectively. In a recent decade, many DNA sequencing projects are developed on cells, plants and animals over the world into huge DNA databases. Researchers notice that mammalian genomes encode thousands of large noncoding RNAs (lncRNAs), interact with chromatin regulatory complexes, and are thought to play a role in localizing these complexes to target loci across the genome. It is a challenge target using higher dimensional visualization tools to organize various complex interactive properties as visual maps. The Variant Map System (VMS) as an emerging scheme is systematically proposed in this paper to apply multiple maps that used four Meta symbols as same as DNA or RNA representations. System architecture of key components and core mechanism on the VMS are described. Key modules, equations and their I/O parameters are discussed. Applying the VM System, two sets of real DNA sequences from both sample human (noncoding DNA) and corn (coding DNA) genomes are collected in comparison with pseudo DNA sequences generated by HC-256 to show their intrinsic properties in higher levels of similar relationships among relevant DNA sequences on 2D maps. Sample 2D maps are listed and their characteristics are illustrated under controllable environment. Visual results are briefly analyzed to explore their intrinsic properties on selected genome sequences.
文摘Random sequences play an important role in wider security applications, such as mobile communication and network security. Due to DNA sequences owning natural randomness, in order to show DNA’s characteristics intuitively, this paper proposes a method to present DNA sequences as conjugate maps. The method includes two core models: measuring models to transfer DNA data into measurements, and visual models to test random sequences as distribution maps to show DNA’s characteristics. The spatial relations between sample DNA and CA random sequences are illustrated and compared in the end. The results show that the distribution of DNA sequences and CA random sequences has significant differences and similarities. It can provide a reference for in-depth visualization study of DNA sequences on their measurement maps.
基金supported by the National Natural Science Foundation of China(No.22277093)the Key Research and Development Project of Hubei Province(No.2023BCB094)。
文摘N^(6)-methyladenine(6mA)is a prevalent DNA modification and is involved in a wide range of human diseases.Previous studies have indicated that 6mA is enriched in mitochondrial DNA(mtDNA)of mammals.By employing an evolved adenine deaminase,we developed a deaminase-mediated sequencing(DM-seq)method that could achieve genome-wide mapping of 6mA in mammalian mtDNA at single-base resolution.In this study,we used an engineered adenine deaminase,known as TadA8e protein,to map 6mA in mtDNA of hepatocellular carcinoma(HCC)by DM-seq.Through high-throughput sequencing,we identified sixteen 6mA sites in both HCC and adjacent normal tissue mtDNA.The results revealed an increased overall 6mA level in mtDNA associated with HCC.Furthermore,an elevation in 6mA level was observed alongside a decrease in the m RNA levels of the corresponding genes,indicating that increased6mA level hindered transcription processes related to these genes.These findings demonstrate that 6mA in mtDNA is correlated with HCC and provide evidence supporting the inhibitory effect of elevated 6mA level on subsequent transcriptional activity.This research illuminates the intricate relationship between 6mA modification and transcriptional regulation in the context of HCC,offering valuable insights into the role of 6mA modification in HCC pathogenesis.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.11804195,11847224,11674198,and 12274265)the Natural Science Foundation of Shandong Province,China(Grant Nos.ZR2018BA034 and ZR2022MA006)。
文摘To understand the gene-based biological processes in-depth,the single-molecule real-time sequencing has drawn increasing attention with promoted by the Human Genome Project.Herein,a set of newly designed canonical fluorescent bases(A_(y),tC,G_(b),T_(p))are proposed for four-color DNA sequencing.These quasi-intrinsic probes are derived from the fluorophore replacement and ring expansion on natural bases,which still keep the pyrimidine or purine underlying skeleton and Watson–Crick hydrogen bonding face to allow minimal perturbation to the native DNA duplex.More importantly,these nucleobase analogues possess red-shifted absorption and efficient photoluminescence due to the enhancedπ-conjugation in character.Meanwhile,the four analogues could generate distinct emission wavelength(Δλ~50 nm)for real-time sequencing.To assess the biological employment of the proposed biosensors,the effects of base pairing and linking deoxyribose are also considered.
基金This work was financially support by the National Natural Science Foundation of China(Nos.29874012,20174036,20274040)Natural Science Foundation of Zhejiang Province(No.10102).
文摘The characterization of long-range correlations and fractal properties of DNA sequences has proved to be adifficult though rewarding task mainly due to the mosaic character of DNA consisting of many patches of various lengthswith different nucleotide constitutions.In this paper we investigate statistical correlations among different positions in DNAsequences using the two-dimensional DNA walk.The root-mean-square fluctuation F(l)is described by a power law.Theautocorrelation function C(l),which is used to measure the linear dependence and periodicity,exists a power law ofC(l)-l^(-μ).We also calculate the mean-square distance<R^2(l)>along the DNA chain,and it may be expressed as<R^2(l)>-l^(?)with 2>γ>1.Our investigations can provide some insights into long-range correlations in DNA sequences.
文摘Active Motif Finder (AMF) is a novel algorithmic tool, designed based on mutations in DNA sequences. Tools available at present for finding motifs are based on matching a given motif in the query sequence. AMF describes a new algorithm that identifies the occurrences of patterns which possess all kinds of mutations like insertion, deletion and mismatch. The algorithm is mainly based on the Alignment Score Matrix (ASM) computation by com paring input motif with full length sequence. Much of the effort in bioinformatics is directed to identify these motifs in the sequences of newly discovered genes. The proposed bio-tool serves as an open resource for analysis and useful for studying polymorphisms in DNA sequences. AMF can be searched via a user-friendly interface. This tool is intended to serve the scientific community working in the areas of chemical and structural biology, and is freely available to all users, at http://www.sastra.edu/scbt/amf/.
基金supported in part by the National Natural Science Foundation of China (Grant No.60774088)the Specialized Research Fund for the Doctoral Program of Higher Education of China (Grant No.20090031110029)the Foundation of the Application Base and Frontier Technology Research Project of Tianjin (Grant No.08JCZDJC21900)
文摘This paper investigates the existence of low-dimensional deterministic chaos in the AT and GC skew profiles of DNA sequences. It has taken DNA sequences from eight organisms as samples. The skew profiles are analysed using continuous wavelet transform and then nonlinear time series methods. The invariant measures of correlation dimension and the largest Lyapunov exponent are calculated. It is demonstrated that the AT and GC skew profiles of these DNA sequences all exhibit low dimensional chaotic behaviour. It suggests that chaotic properties may be ubiquitous in the DNA sequences of all organisms.
基金supported by the Doctoral Program Foundation of Institutions of Higher Education,China (Grant No 20070533075)
文摘This paper presents a model to describe alternating current (AC) conductivity of DNA sequences, in which DNA is considered as a one-dimensional (1D) disordered system, and electrons transport via hopping between localized states. It finds that AC conductivity in DNA sequences increases as the frequency of the external electric field rises, and it takes the form of σac(ω) - ω2 ln^2(1/ω). Also AC conductivity of DNA sequences increases with the increase of temperature, this phenomenon presents characteristics of weak temperature-dependence. Meanwhile, the AC conductivity in an offdiagonally correlated case is much larger than that in the uncorrelated case of the Anderson limit in low temperatures, which indicates that the off-diagonal correlations in DNA sequences have a great effect on the AC conductivity, while at high temperature the off-diagonal correlations no longer play a vital role in electric transport. In addition, the proportion of nucleotide pairs p also plays an important role in AC electron transport of DNA sequences. For p 〈 0.5, the conductivity of DNA sequence decreases with the increase of p, while for p ≥ 0.5, the conductivity increases with the increase of p.
基金supported by National High-Tech R&D(863)ProgramNational Natural Science Foundation of China(101-04-03-03-97).
文摘The univalent from the meiosis-metaphase spreads of F1 (Z2× wheat variety Wan7107) wasidentified to be Agropyrum intermedium 2Ai-2 chromosome by GISH. The 2Ai-2 chromosomes weremicroisolated and collected. After two rounds of PCR amplification, the PCR products wereranged from 150-3 000 bp,with predominant fragments at about 200-2 000 bp. Using Ag.intermedium genomic DNA as a probe, Southern blotting analysis confirmed the products originatedfrom Ag. intermedium genome. The products were purified, ligated to pUC18 and then transformedinto competence E.coli DH5αto produce a 2Ai-2 chromosome DNA library. The microcloningexperiments produced approximately 5 ×105 clones, the size range of the cloned inserts was 200-1 500 bp, with an average of 580 bp. Using Ag.intermedium genomic DNA as a probe, dot blottingresults showed that 56% clones are unique/low copy sequences, 44% are repetitive sequences inthe library. Four Ag. intermedium clones were screened from the library by RFLP, and threeclones(Mag065, Mag088, Mag139)belong to low/single sequences, one clone(Mag104)was repetitivesequence, and GISH results indicated that Mag104 was Ag.intermedium species-specific repetitiveDNA sequence.
文摘Japanese Angelica Root prepared from Angelica acutiloba var. acutiloba and A. acutiloba var. sugiyamae, known in Japan as “Toki” and “Hokkai Toki”, is an important crude drug used in Kampo medicine (traditional Japanese medicine). However, since these Angelica varieties have recently outcrossed with each other, it is unclear whether Japanese Angelica Root sold for use in Kampo medicine is a pure variety. Here, we describe DNA sequence polymorphisms that can be used to distinguish between A. acutiloba var. acutiloba and A. acutiloba var. sugiyamae. In our analyses, differences in the trnK region of chloroplast DNA distinguished among some A. acutiloba varieties and related species, but not between A. acutiloba var. acutiloba and A. acutiloba var. iwatensis. One geographical strain of A. acutiloba var. acutiloba and A. acutiloba var. sugiyamae showed identical sequences in three regions of chloroplast DNA, but differences in the internal transcribed spacer region of nuclear ribosomal DNA. One strain of A. acutiloba var. iwatensis and A. acutiloba var. sugiyamae had identical sequences in all of the chloroplast and nuclear ribosomal DNA regions examined. These findings show that A. acutiloba var. acutiloba has hybridized with A. acutiloba var. sugiyamae and that the “Hokkai Toki” variety resulted from outcrossing with A. acutiloba var. iwatensis. Molecular authentication based on analyses of chloroplast and nuclear ribosomal DNA sequences of A. acutiloba and related species is an efficient method to authenticate Japanese Angelica Root at the variety level. Therefore, these analyses can determine whether a product is derived from A. acutiloba var. acutiloba or A. acutiloba var. sugiyamae.