COST EFFECTIVE APPROACH ON FEATURE SELECTION USING GENETIC ALGORITHMS AND FUZZY LOGIC FOR DIABETES DIAGNOSIS

Published on Mar 1, 2011 in International Journal of Soft Computing
DOI: 10.5121/IJSC.2011.2101
E. P. Ephzibah
Abstract
A way to enhance the performance of a model that combines genetic algorithms and fuzzy logic for feature selection and classification is proposed. Early, low-cost diagnosis of any disease is preferable, and diabetes is one such disease. Diabetes has become the fourth leading cause of death in developed countries, and there is substantial evidence that it is reaching epidemic proportions in many developing and newly industrialized nations. In medical diagnosis, patterns consist of observable symptoms along with the results of diagnostic tests, and these tests have various associated costs and risks. In the automated design of pattern classifiers, the proposed system addresses the feature subset selection problem: the task of identifying and selecting a useful subset of pattern-representing features from a larger set of features. Using a fuzzy rule-based classification system, the proposed system is reported to improve classification accuracy.
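The cost-aware feature subset selection described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the encoding, fitness function, or GA parameters, so everything below is an assumption. Each candidate subset is a bit mask over the features, fitness is classification accuracy minus a penalty proportional to the total cost of the selected tests, and a plain nearest-centroid classifier stands in for the fuzzy rule-based classifier. The function names (`make_toy_data`, `fitness`, `select_features`), the cost values, and the penalty weight are hypothetical, and the data is synthetic.

```python
import random


def make_toy_data(n_samples=60, seed=1):
    """Synthetic two-class data (not patient data): feature 0 is informative, 1-3 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        X.append([4.0 * label + rng.gauss(0, 0.5)] + [rng.gauss(0, 1) for _ in range(3)])
        y.append(label)
    return X, y


def nearest_centroid_accuracy(X, y, mask):
    """Training-set accuracy of a nearest-centroid classifier on the selected features.

    Stand-in for the fuzzy rule-based classifier named in the abstract.
    """
    idx = [j for j, bit in enumerate(mask) if bit]
    if not idx:
        return 0.0  # no features selected, nothing to classify with
    centroids = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        centroids[label] = [sum(r[j] for r in rows) / len(rows) for j in idx]
    correct = 0
    for x, t in zip(X, y):
        pred = min(
            centroids,
            key=lambda c: sum((x[j] - m) ** 2 for j, m in zip(idx, centroids[c])),
        )
        correct += pred == t
    return correct / len(y)


def fitness(mask, X, y, costs, penalty):
    """Accuracy minus a penalty for the share of total test cost the mask uses."""
    used = sum(c for c, bit in zip(costs, mask) if bit)
    return nearest_centroid_accuracy(X, y, mask) - penalty * used / sum(costs)


def select_features(X, y, costs, penalty=0.2, pop_size=20, generations=30,
                    p_mut=0.1, seed=0):
    """Simple generational GA: elitism, tournament selection, one-point crossover."""
    rng = random.Random(seed)
    n = len(costs)

    def score(mask):
        return fitness(mask, X, y, costs, penalty)

    # Start from the "run every test" baseline plus random masks.
    pop = [[1] * n] + [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size - 1)]
    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        nxt = [pop[0][:], pop[1][:]]  # elitism: carry over the two best masks
        while len(nxt) < pop_size:
            a = max(rng.sample(pop, 3), key=score)  # tournament of size 3
            b = max(rng.sample(pop, 3), key=score)
            cut = rng.randrange(1, n)  # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ (rng.random() < p_mut) for bit in child]  # bit-flip mutation
            nxt.append(child)
        pop = nxt
    best = max(pop, key=score)
    return best, score(best)


if __name__ == "__main__":
    X, y = make_toy_data()
    costs = [1.0, 5.0, 5.0, 5.0]  # hypothetical per-test costs
    mask, value = select_features(X, y, costs)
    print("selected feature mask:", mask, "fitness:", round(value, 4))
```

Because the two best masks are carried over unchanged each generation and the initial population contains the all-features mask, the returned subset never scores worse than running every test. Raising `penalty` shifts the trade-off toward cheaper subsets at the expense of accuracy.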