Performance Evaluation of Support Vector Regression for Intrusion Detection
M. Govindarajan and R. M. Chandrasekaran
Department of Computer Science and Engineering, Annamalai University, Annamalai Nagar, Tamil Nadu, India
Data Mining is the use of algorithms to extract the information and patterns derived by the knowledge discovery in databases process. Classification maps data into predefined groups or classes. It is often referred to as supervised learning because the classes are determined before examining the data. In many data mining applications that address classification problems, feature and model selection are considered as key tasks. That is, appropriate input features of the classifier must be selected from a given set of possible features and structure parameters of the classifier must be adapted with respect to these features and a given data set. This paper describes feature selection and model selection simultaneously for Support Vector Regression (SVR). In order to reduce the optimization effort, various techniques are integrated that accelerate and improve the classifier significantly: hybrid SVR, comparative cross validation. The feasibility and the benefits of the proposed approach are demonstrated by means of data mining problem: intrusion detection in computer networks. It is shown that, compared to earlier SVR technique, the run time is reduced by up to 0.07s and 0.26s while error rates are lowered by up to 0.01% and 1.84% for normal and abnormal behavior respectively. The algorithm is independent of specific applications so that many ideas and solutions can be transferred to other classifier paradigms.
Keywords: Data mining, Ensemble method, Error rate, Run time, Support vector regression.
Back to TOC