A computer based statistical tool to analyze the correlation among DNA sequences

dc.contributor.authorJayarathna, P. G. S. S.
dc.contributor.authorSooriyapathirana, S. D. S. S.
dc.contributor.authorYapa, R. D.
dc.date.accessioned2024-08-07T11:58:51Z
dc.date.available2024-08-07T11:58:51Z
dc.date.issued2013-07-04
dc.description.abstractDNA is the molecule of life, and DNA sequence analysis is key to answering many biological questions. In bioinformatics, statistical techniques such as frequency distributions, alignment algorithms, hypothesis testing, and clustering are used to analyze the correlation among DNA sequences. The most basic analyses of DNA sequences compare their lengths, GC content, AT/GC ratio, repetition of short sub-sequences, and restriction sites. Pie charts and frequency tables can be used to analyze the nucleotide distribution among DNA sequences. Sequence alignment is one of the most important steps in DNA sequence analysis because it identifies regions of similarity between sequences, which reflect functional, structural, or evolutionary relationships among them. Because alignment algorithms such as Smith-Waterman are very time consuming, the BLAST algorithm can be used as a time-efficient alternative: it addresses the same fundamental problem while emphasizing speed over sensitivity. Cluster analysis is also widely used in DNA sequence analysis. Analyzing DNA with different statistical techniques requires several statistical tools and demands considerable expertise in statistics. Therefore, an attempt was made to design a user-friendly, computer-based statistical tool that analyzes one or more DNA sequences using different statistical approaches and performs sequence alignment efficiently. The DNA Sequence Analysis Tool (DSAT) was developed and implemented using the VB.NET programming language in Microsoft Visual Studio 8. The MSChart and MSChartVisualStudioAddOn tools were used to display the tool's graphical output. An analysis can be conducted under five options: Nucleotide Distribution Analysis, Basic Analysis (GC content, AT/GC ratio and repetitions), Multiple Analysis, Pairwise Analysis and Cluster Analysis.
DSAT combines several statistical techniques in one application and aligns DNA sequences quickly. Biologists and students with limited statistical knowledge can use this tool to quickly obtain more detailed information about the correlation among DNA sequences.
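The basic measures named in the abstract (nucleotide distribution, GC content, AT/GC ratio) and a Smith-Waterman local alignment score can be illustrated with a short sketch. This is not DSAT's code: DSAT was written in VB.NET and its source is not reproduced here. The function names, the scoring values (match = 2, mismatch = -1, gap = -1) and the linear gap penalty are all assumptions chosen for illustration.

```python
from collections import Counter


def nucleotide_counts(seq: str) -> dict:
    """Count A, C, G and T in a sequence (case-insensitive)."""
    counts = Counter(seq.upper())
    return {base: counts.get(base, 0) for base in "ACGT"}


def gc_content(seq: str) -> float:
    """Fraction of A/C/G/T bases that are G or C; other symbols are ignored."""
    c = nucleotide_counts(seq)
    total = sum(c.values())
    return (c["G"] + c["C"]) / total if total else 0.0


def at_gc_ratio(seq: str) -> float:
    """Ratio of (A + T) to (G + C)."""
    c = nucleotide_counts(seq)
    gc = c["G"] + c["C"]
    if gc == 0:
        raise ValueError("sequence contains no G or C bases")
    return (c["A"] + c["T"]) / gc


def smith_waterman_score(a: str, b: str,
                         match: int = 2, mismatch: int = -1, gap: int = -1) -> int:
    """Best local alignment score with a linear gap penalty.

    The scoring parameters are illustrative assumptions, not DSAT's settings.
    Only two rows of the dynamic-programming matrix are kept in memory.
    """
    a, b = a.upper(), b.upper()
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

The quadratic cost of filling this matrix for every pair of sequences is the reason the abstract points to BLAST as a faster, less sensitive alternative.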
dc.identifier.citationPeradeniya University Research Sessions PURSE - 2012, Book of Abstracts, University of Peradeniya, Sri Lanka, Vol. 17, July 4, 2012, p. 208
dc.identifier.isbn9789555891646
dc.identifier.issn13914111
dc.identifier.urihttps://ir.lib.pdn.ac.lk/handle/20.500.14444/497
dc.language.isoen_US
dc.publisherThe University of Peradeniya
dc.subjectStatistics and computer science
dc.subjectMolecular biology and biotechnology
dc.subjectDNA
dc.subjectDNA sequence analysis
dc.titleA computer based statistical tool to analyze the correlation among DNA sequences
dc.typeArticle