abstract 1

9
HUMAN DISEASE DETECTION ARCHITECTURE USING DIGITAL IMAGE PROCESSING AND ARTIFICIAL NEURAL NETWORKS GUIDED BY: M.JENATH STUDENTS: G.R.PRIYADHARSHINI V.SUSITHRA R.NAFSIN M.THARAGESHWARI Abstract: With the increase in probability of diseases due to environment and gene misspellings there is always a preventive notion among people. Disease prediction has been difficult because of change in DNA over time. A novel architecture is proposed to find the probability of diseases which affect the humans due to gene misspellings and change in DNA over time. In the proposed architecture DNA microarray technology is used to extract genes and using digital image processing the image of genes are characterized .The artificial neural network calculates the mutated value and helps to find the probability of disease to be affected . INTRODUCTION : Bio-informatics is a rapidly growing area of computer science, which melds both biological data and computer calculations. It is essential to use the genomic information in understanding human diseases and in the identification of new molecular target for drug discovery. Each of the trillions of cells in the human body contains 46 chromosomes packed tightly into the region called nucleus. Half of the chromosomes in the nucleus come from mother and half from father. Each chromosome is a

Upload: abvaja

Post on 12-Jan-2016

217 views

Category:

Documents


1 download

DESCRIPTION

n

TRANSCRIPT

Page 1: Abstract 1

HUMAN DISEASE DETECTION ARCHITECTURE USING DIGITAL IMAGE PROCESSING AND ARTIFICIAL NEURAL

NETWORKSGUIDED BY: M.JENATH

STUDENTS: G.R.PRIYADHARSHINIV.SUSITHRA

R.NAFSINM.THARAGESHWARI

Abstract: With the increase in probability of diseases due to environment and gene misspellings there is always a preventive notion among people. Disease prediction has been difficult because of change in DNA over time. A novel architecture is proposed to find the probability of diseases which affect the humans due to gene misspellings and change in DNA over time. In the proposed architecture DNA microarray technology is used to extract genes and using digital image processing the image of genes are characterized .The artificial neural network calculates the mutated value and helps to find the probability of disease to be affected .

INTRODUCTION :

Bio-informatics is a rapidly growing area of computer science, which melds both biological data and computer calculations. It is essential to use the genomic information in understanding human diseases and in the

identification of new molecular target for drug discovery. Each of the trillions of cells in the human body contains 46 chromosomes packed tightly into the region called nucleus. Half of the chromosomes in the nucleus come from mother and half from father. Each chromosome is a long, tightly coiled molecule called DNA [1,2,4,5] or deoxyribonucleic acid. DNA is made up of chemical building blocks like A, C, T and G. The entire length of a DNA strand consists of the above four blocks in different combinations. The entire DNA in all the chromosomes makes up the human genome. The DNA in the genome [4] is organized into units called genes. There may be as many as 30,000 genes in the genome; they are the instruction manual for making all the proteins in the body. These proteins are the physical stuff that makes up our hair, skin, heart and blood. They also control chemical reactions that regulate blood sugar and heart rate, and control how food or medicine is metabolized in the body. The way the genes “spelled” makes the difference. Misspelling in a gene can cause disease. From the normal sequence of the human genome, researchers can compare the DNA sequence [2,4] of normal and diseased

Page 2: Abstract 1

people. If there are differences in the spelling of certain genes between the two groups, it’s possible that the disease is caused by misspelling in that gene. Scientist has identified about 6000 diseases, such as Huntington disease and cystic fibrosis that are directly caused by misspellings or physical problems in single genes. But the genetic contribution to common conditions – such as diabetes and heart disease – is a part of a larger puzzle that could include diet, lifestyle, environment, and even other genes. For many of these common conditions, genetic misspellings probably makes a small contribution to disease relative to other factors, or work in concert with them to cause illness. New biological reports that are published recently says that the DNA expression changes over time.

PROBLEM DEFINITION :

A Novel prototype architecture is designed, for the disease prediction for all age groups including infants based on the combination of parent DNA using Artificial Neural Networks (ANN), which is depicted in fig.2. The proposed architecture integrates Microarray Technology, Digital Image Processing, and Disease probing and Artificial Neural Networks.

A. DNA Microarray Data Analysis DNA microarray is used to measure the expression levels of large numbers of genes. The blood or tissue samples from Bio- informatics wet lab are collected and processed through Microarray analysis. The output of this analysis will be a gene expression image,

from the DNA sequence of the given parent pair that are stored in a database in a file format .TIFF or .JPEG.

B. Gene Extraction The gene expression is the most fundamental level in which the genotype gives rise to the phenotype. Genotype contains all the hereditary information of an individual, even if genes are not expressed. DNA, susceptible to the diseases are its examples. Phenotype is an observable trait that contains expressed genes. Hair color and weight are the notable examples of phenotype. So, with the gene expression image obtained from the Microarray Analysis, the highlighted mutated genes are characterized.

C. Image Analysis of Characterized Gene The morphological adaptive image enhancement and alignment methods are applied to the extracted images. These methods are adaptive to the local characteristics of the image such as noise, background signal, or presence of edges. The waveform analysis is used to determine the location of each band that represents one nucleotide in the sequence from which the image array of the mutated part can be extracted.

D. Disease Probing As the relationship between genetic mutations and the predisposition to certain diseases are known; it becomes more important to screen for single nucleotide polymorphisms (SNP) in entire populations, using low cost and fast methods. A draw back in this technology is the enormous time consumption in the design of probes. To overcome this computational solution is implemented to automate the process of probe design for the

Page 3: Abstract 1

combination of the mutated image array in high accuracy.

E. Artificial Neural Network System The array obtained from the disease probing algorithm is analyzed and input its mutated values to previously trained network, depicted in fig.1. With the help of disease prediction algorithm, the probability of disease can be listed to infants and the comparison results will be made with support vector machines. In ANN normally only one hidden layer is used. In order to find the prediction of disease for all age group people another hidden layer is included to provide weighted values for the

environments, food habits and stress levels in the people.

Fig 1:

CONCLUSION:

The novel architecture for predicting diseases for humans of all age groups caused due to misspelling of genes and change in DNA over time. The architecture is implemented in MATLAB. The proposed system integrates Digital Image Processing, DNA microarray and Artificial Neural Networks. The expression of genes is obtained and processed to find the mutations. Further change in DNA and effect of surroundings, stress levels and food habits of the people which leads to many diseases like hypertension, diabetes are also found using this architecture.

Page 4: Abstract 1

REFERENCES

[1] Sathish Kumar.S., and N. Duraipandian., "An Effective Identification of Species from DNA Sequence: A Classification Technique by Integrating DM and ANN.", International Journal of Advanced Computer Science and Applications, Vol. 3, No.8, 2012.

[2] Khan, Omniyah Gul M., et al., "DNA base-calling using artificial neural networks.", Biomedical Engineering (MECBME), 2011 1st Middle East Conference on. IEEE, 2011.

[3] Kim, Kyung-Joong, and Sung-Bae Cho., "Evolving artificial neural networks for DNA microarray analysis.", Evolutionary Computation, 2003. CEC'03. The 2003 Congress on. Vol. 4. IEEE, 2003.

[4] Fitch, J. Patrick, and Bahrad Sokhansanj., "Genomic engineering: moving beyond DNA sequence to function.", Proceedings of the IEEE 2000.

[5] Elhadi, Gamal F., R. M. Farouk, and Abdalhakeem T. Issa., "Protein sequence for clustering DNA based on Artificial Neural Networks.", 2012.

[6] Tong, Dong Ling, and Amanda C. Schierz., "Hybrid genetic algorithm- neural network: Feature extraction for unpreprocessed microarray data.", Artificial intelligence in medicine, 2011.

[7] Molla, Michael, et al., "Using machine learning to design and interpret gene-

expression microarrays.", AI Magazine, 2004.

[8] Möröy, Tarik., "DNA Microarrays in Medicine: Can the Promises Be Kept?.", Journal of Biomedicine and Biotechnology, 2002.

[9] Hewett, Rattikorn, and Phongphun Kijsanayothin. "Tumor classification ranking from microarray data.", BMC genomics, 2008.

[10] Zhao, Xin, and Leo WK Cheung., "Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data.", BMC bioinformatics, 2007.

[11] Beiko, Robert G., and Robert L. Charlebois., "GANN: genetic algorithm neural networks for the detection of conserved combinations of features in DNA.", BMC bioinformatics, 2005

. [12] Lee, Chien-Pang, et al., "Gene selection and sample classification on microarray data based on adaptive genetic algorithm/k-nearest neighbor method.", Expert Systems with Applications, 2011.

[13] Chintanu KumarSarmah and Sandhya Samarasinghe, ” Microarray gene expression: A study of between-platform association of Affymetrix and cDNA arrays”, Computers in Biology and Medicine, Elsevier, 2011.

Page 5: Abstract 1

[14] Fatma El-Zahraa Labib, Islam Fouad, Mai Mabrouk, Amr Sharawy, ”An Efficient Fully Automated Method for Gridding Microarray Images”, American Journal of Biomedical Engineering, 2012.

.[15] Nagaraja, J., et al., "Application Of Mathematical Morphology For The Enhancement Of Microarray Images.", International Journal of Advances in Engineering & Technology, 2011.

[16] Kadam, A. B., R. R. Manza, and K. V. Kale., "A Novel approach for Microarray Spot Segmentation & Detection using four

shaped Mathematical Morphology.", Advances in Computational Research 2012.

[17] Manjunath.S.S, Shreenidhi.B.S, Nagaraja.J, Pradeep.B.S, ”Morphological Spot Detection and Analysis for Microarray Images”, International Journal of Innovative Technology and Exploring Engineering,Vol.2, April 2013.

[18] Kakumani, Arunakumari, Kaustubha A. Mendhurwar, and Rajasekhar Kakumani, "Microarray Image Denoising using Independent Component Analysis.", International Journal of Computer Applications, 2010

Page 6: Abstract 1

Fig 2: Proposed architecture for detecting the probability of diseases

Micro Array

Analysis

GENE EXTRACTION

GENE EXTRACTION

ANALYSIS OF DNA wrt SURROUNDINGS AND FOOD HABITS

IMAGE ANALYSIS

IMAGE ANALYSIS

DISEASE PROBING

ARTIFICIAL NEURAL

NETWORK

Male DNA / DNA /

Female DNA

Gene Expression Image of Y

Study of the Characteristics

of Extracted Gene

Image Array

Image Array of Mutated Value and Change in

DNA Array

Study of the Characteristics

of Extracted Gene

Probability of Diseases

Gene Expression Image of X

Mutated Values Input