Salama, M.,
Data Mining for Medical Informatics,
, Cairo, Cairo Unv, 2012.
AbstractThe work presented in this thesis investigates the nature of real-life data, mainly in the medical field, and the problems in handling such nature by the conventional data mining techniques. Accordingly, a set of alternative techniques are proposed in this thesis to handle the medical data in the three stages of data mining process. In the first stage which is preprocessing, a proposed technique named as interval-based feature evaluation technique that depends on a hypothesis that the decrease of the overlapped interval of values for every class label leads to increase the importance of such attribute. Such technique handles the difficulty of dealing with continuous data attributes without the need of applying discretization of the input and it is proved by comparing the results of the proposed technique to other attribute evaluation and selection techniques. Also in the preprocessing stage, the negative effect of normalization algorithm before applying the conventional PCA has been investigated and how the avoidance of such algorithm enhances the resulted classification accuracy. Finally in the preprocessing stage, an experimental analysis introduces the ability of rough set methodology to successfully classify data without the need of applying feature reduction technique. It shows that the overall classification accuracy offered by the employed rough set approach is high compared with other machine learning techniques including Support Vector Machine, Hidden Naive Bayesian network, Bayesian network and other techniques.
In the machine learning stage, frequent pattern-based classification technique is proposed; it depends on the detection of variation of attributes among objects of the same class. The preprocessing of the data like standardization, normalization, discretization or feature reduction is not required in this technique which enhances the performance in time and keeps the original data without being distorted. Another contribution has been proposed in the machine learning stage including the support vector machine and fuzzy c-mean clustering techniques; this contribution is about the enhancement of the Euclidean space calculations through applying the fuzzy logic in such calculations. This enhancement has used chimerge feature evaluation techniques in applying fuzzification on the level of features. A comparison is applied on these enhanced techniques to the other classical data mining techniques and the results shows that classical models suffers from low classification accuracy due to the dependence of un-existed presumption.
Finally, in the visualization stage, a proposed technique is presented to visualize the continuous data using Formal Concept Analysis that is better than the complications resulted from the scaling algorithms.
Salama, M., A. E. Hassanien, and Adel Alimi,
"Formal concept analysis approach for comparison between mutagenicity and carcinogenicity in Cheminformatics. ",
13th IEEE International Conference on Hybrid Intelligent Systems |(HIS13) Tunisia, 4-6 Dec. pp. 268-273, 2013, Tunisia, , 4-6 Dec, 2013.
Sahlol, A. T., C. Y. Suen, H. M. Zawbaa, A. E. Hassanien, and M. A. Fattah,
"Bio-inspired BAT optimization algorithm for handwritten Arabic characters recognition",
Evolutionary Computation (CEC), 2016 IEEE Congress on: IEEE, pp. 1749–1756, 2016.
Abstractn/a
Sahlol, A. T., A. A. Ewees, A. M. H.;, and A. E. Hassanien,
"Training feedforward neural networks using Sine-Cosine algorithm to improve the prediction of liver enzymes on fish farmed on nano-selenite",
12th International Computer Engineering Conference (ICENCO),, Cairo, 28-29 Dec, 2016.
AbstractAnalytical prediction of oxidative stress biomarkers in ecosystem provides an expressive result for many stressors. These oxidative stress biomarkers including superoxide dismutase, glutathione peroxidase and catalase activity in fish liver tissue were analyzed within feeding different levels of selenium nanoparticles. Se-nanoparticles represent a salient defense mechanism in oxidative stress within certain limits; however, stress can be engendered from toxic levels of these nanoparticles. For instance, prediction of the level of pollution and/or stressors was elucidated to be improved with different levels of selenium nanoparticles using the bio-inspired Sine-Cosine algorithm (SCA). In this paper, we improved the prediction accuracy of liver enzymes of fish fed by nano-selenite by developing a neural network model based on SCA, that can train and update the weights and the biases of the network until reaching the optimum value. The performance of the proposed model is better and achieved more efficient than other models.
and ella S. Udhaya Kumar, H. Hannah Inbarani, A. T. A. A. H.,
"Identification of Heart Valve Disease using Bijective, Soft sets Theory ",
International Journal of Rough Sets and Data Analysis, vol. 1, issue 2, pp. , 1(2), 1-13, 2014.
Abstract Major complication of heart valve diseases is congestive heart valve failure. The heart is of essential significance to human beings. Auscultation with a stethoscope is considered as one of the techniques used in the analysis of heart diseases. Heart auscultation is a difficult task to determine the heart condition and requires some superior training of medical doctors. Therefore, the use of computerized techniques in the diagnosis of heart sounds may help the doctors in a clinical environment. Hence, in this study computer-aided heart sound diagnosis is performed to give support to doctors in decision making. In this study, a novel hybrid Rough-Bijective soft set is developed for the classification of heart valve diseases. A rough set (Quick Reduct) based feature selection technique is applied before classification for increasing the classification accuracy. The experimental results demonstrate that the overall classification accuracy offered by the employed Improved Bijective soft set approach (IBISOCLASS) provides higher accuracy compared with other classification techniques including hybrid Rough-Bijective soft set (RBISOCLASS), Bijective soft set (BISOCLASS), Decision table (DT), Naïve Bayes (NB) and J48.