Heba, E., M. Salama, A. E. Hassanien, and T. - H. Kim,
"Bi-Layer Behavioral-Based Feature Selection Approach for Network Intrusion Classification",
Security Technology - International Conference, SecTech 2011, pp.195-203, Jeju Island, Korea, December 8-10,, 2011.
AbstractTo satisfy the ever growing need for effective screening and diagnostic tests, medical practitioners have turned their attention to high resolution, high throughput methods. One approach is to use mass spectrometry based methods for disease diagnosis. Effective diagnosis is achieved by classifying the mass spectra as belonging to healthy or diseased individuals. Unfortunately, the high resolution mass spectrometry data contains a large degree of noisy, redundant and irrelevant information, making accurate classification difficult. To overcome these obstacles, feature extraction methods are used to select or create small sets of relevant features. This paper compares existing feature selection methods to a novel wrapper-based feature selection and centroid-based classification method. A key contribution is the exposition of different feature extraction techniques, which encompass dimensionality reduction and feature selection methods. The experiments, on two cancer data sets, indicate that feature selection algorithms tend to both reduce data dimensionality and increase classification accuracy, while the dimensionality reduction techniques sacrifice performance as a result of lowering the number of features. In order to evaluate the dimensionality reduction and feature selection techniques, we use a simple classifier, thereby making the approach tractable. In relation to previous research, the proposed algorithm is very competitive in terms of (i) classification accuracy, (ii) size of feature sets, (iii) usage of computational resources during both training and classification phases.
Salama, M. A., and A. E. Hassanien,
"Binarization and Validation in Formal Concept Analysis",
International Journal of Systems Biology and Biomedical Technologies, vol. 1, issue 4, pp. 17-28, 2012.
AbstractRepresentation and visualization of continuous data using the Formal Concept Analysis (FCA) became an
important requirement in real-life fields. Application of formal concept analysis (FCA) model on numerical
data, a scaling or Discretization / binarization procedures should be applied as preprocessing stage. The
Scaling procedure increases the complexity of computation of the FCA, while the binarization process leads to a distortion in the internal structure of the input data set. The proposed approach uses a binarization procedure prior to applying FCA model, and then applies a validation process to the generated lattice to measure or ensure its degree of accuracy. The introduced approach is based on the evaluation of each attribute according to the objects of its extent set. To prove the validity of the introduced approach, the technique is applied on two data sets in the medical field which are the Indian Diabetes and the Breast Cancer data sets. Both data sets show the generation of a valid lattice.
Sahlol, A. T., C. Y. Suen, H. M. Zawbaa, A. E. Hassanien, and M. A. Fattah,
"Bio-inspired BAT optimization algorithm for handwritten Arabic characters recognition",
Evolutionary Computation (CEC), 2016 IEEE Congress on: IEEE, pp. 1749–1756, 2016.
Abstractn/a
Elharir, E., N. El-Bendary, and A. E. Hassanien,
"Bio-inspired optimization for feature set dimensionality reduction",
3rd International Conference on Advances in Computational Tools for Engineering Applications (ACTEA),, Beirut, Lebanon, 13-15 July , 2016.
AbstractIn this paper, two novel bio-inspired optimization algorithms; namely Dragonfly Algorithm (DA) and Grey Wolf Optimizer (GWO), have been applied for fulfilling the goal of feature set dimensional reduction. The proposed classification system has been tested via solving the problem of Electromyography (EMG) signal classification with optimal features subset selection. The obtained experimental results showed that the GWO based Support Vector Machines (SVM) classification algorithm has achieved an accuracy of 93.22% using 31% of the total extracted features. It also outperformed both the typical SVM algorithm, with no feature set optimization, and the DA based optimized feature set SVM classification, for the tested EMG dataset.