Fast and Accurate Identification of Short Tandem Repeats (STRs) Using Hash Function in DNA Sequences
S. Jawahar1, P. Sumathi2

1S. Jawahar, Research Scholar, PG & Research, Department of Computer Science, Government Arts College, Coimbatore (Tamil Nadu), India.
2Dr. P. Sumathi, Assistant Professor, PG & Research, Department of Computer Science, Government Arts College, Coimbatore (Tamil Nadu), India.
Manuscript received on 14 December 2018 | Revised Manuscript received on 26 December 2018 | Manuscript Published on 24 January 2019 | PP: 262-266 | Volume-7 Issue-4S2 December 2018 | Retrieval Number: Es2065017519/19©BEIESP
Open Access | Editorial and Publishing Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: The main challenge in bioinformatics is the size and complexity of input datasets. Tandem repeats detection is important function in biology and medicine for phylogenic studies and diagnosing various diseases. Short Tandem Repeats (STRs) plays an important role in human genetic disease and for various regulatory mechanism and evolution. The mutation rate is higher in STR which leads to more biological research in this area. In our study at least two adjacent nucleotide patterns are considered as tandem repeats. The Short Tandem Repeats (STRs) is identified and investigated for diseases related mutation in human. The proposed algorithm Short Tandem Repeat using Hashing (STRH) uses hash table for fast storing and easy retrieval of values. The hash function generally hashes a longer string into much shorter string with fixed length. The analysis of STRH is made using five genes, HUMTH01, CSF1 Receptor, FIBRA, TPOX and VWF. Mostly the STRs in the five genes are tetranucleotide and contains perfect tandem repeat. The proposed STRH algorithm identifies more number of tandem repeats than the traditional algorithms.
Keywords: Bioinformatics, Tandem Repeat, Hash Function, Short Tandem Repeat (STRs), Tetra Nucleotide.
Scope of the Article: Software Defined Networking and Network Function Virtualization