Received 12 April 2016; accepted 2 May 2016; published 25 August 2016
Throughout the world, all industries, hospitals, and educational institutions utilize electric power. Typically, electric power is their primary power source. If there is a shortage in this primary power, all of these entities would collapse. This situation must be avoided, and this problem can be solved by electric energy demand forecasting. Power plants are the major source of this primary power generation. Different energy sources, including thermal, nuclear, wind, and hydrological are installed to meet the energy demand required in the future. But the successful installation of power plants is a long-term process, and after installation the capacity of the power plant cannot be increased. This leads to the necessity of forecasting as the initial process in power system engineering. Demand forecasting plays a vital role in electricity generation. In recent years, the state of Tamil nadu in India is facing irregular power supply to all its districts due to a shortage of power. This problem is the consequence of a lack of forecasting methodologies. Therefore, this research may be helpful to avoid this critical situation. Power distribution is a cumbersome task due to the increase in the consumption of electric power by the increase in the usage of electronic equipment and by the modernizations and the new entry of industries. These changes induced the need to find the way of improvising the power distribution through forecasting the power need and looking for the ways to acquire them. For forecasting the power need, it is not eventually distributed throughout the period, but it varies seasonally. Season have a higher impact in the electric power consumption in the domestic market. The domestic sector electricity consumption varies with respect to rural and urban segments and climatic seasonal variations.
In this paper, seasonal electricity demand forecasting is developed to predict the electricity demand by integrating with the WEKA time series forecasting. Demand forecasting is the process of predicting the needed electric energy in advance. Many prediction and forecasting methodologies were developed in the electricity domain. The main intention of this work is to forecast the electricity demand by using with seasonal data approach. At first, the electric energy monthly consumption data is collected from TNEB office. Time series forecasting is done with Seasonal data of the seasons such as Summer, Winter and Rainy with the WEKA Learning Algorithms. The accuracy parameters are used to measure the performance of the learning algorithms. The learning algorithm Support vector machine is shows the better accuracy compared with other learning algorithms. Finally, the future electricity demand is forecasted for the years from 2016 to 2018 with the help of support vector machine learning algorithm.
The rest of the paper is organized as follows: Section 2 illustrates some of the existing works related to electricity demand forecasting. Section 3 gives the detailed description of the overall flow of the proposed electricity demand forecasting model. Section 4 presents the results evaluation and comparison results of the proposed system. Finally, this paper is concluded and the future work to be carried out is stated in Section 5.
2. Related Works
Bresfelean, V.P.  uses WEKA environment to predict the student’s behavior using Decision tree algorithm. Zhifei Chen  developed on new approach in time series analysis to find the pattern sequences. They used k-means clustering techniques to grouping and labeling the samples from a data set the clustering the dataset. Mean Error Relative (MER) and Mean Absolute Error (MAE) is analyzed for several energy time series. The univariate time series analysis   is commonly used to forecasting monthly electric energy consumption in Eastern Saudi Arabia and Lebanon.
Hong  developed on forecasting model which includes support vector regression (SVR) model, chaotic immune algorithm and seasonal adjustment mechanism to forecast monthly electric loads. Ghosh  analyzed on Multivariate Time-Series Analysis to forecasting for traffic flow. Existing methods for short-term load forecasting include general regression analysis and time series methods  -  . Saigal S and Mehrotra D  compare the performance of four Models namely Multiple Regression in Excel, Multiple Linear Regression of Dedicated Time Series Analysis in WEKA, Vector Autoregressive Model in R and Neural Network Model using Neural Works Predict. All the models are compared on the basis of the forecasting errors generated by them. Suchao Jake Parkpoom  suggested regression analysis on Residential and Commercial consumption that predict that how the changes of climate will affect Thailand’s short-term and long-term electricity demand and to capture the influence of temperature on daily and seasonal demand.
3. Overall Flow of the Seasonal Based Electricity Demand Forecasting Model
Season based electricity demand forecasting model is designed to forecast the seasonal based electricity power consumption. Figure 1 shows the overall flow of the proposed electricity demand forecasting model. Initially
Figure 1. Overall flow of the proposed electricity demand forecasting model.
the Monthly Electric Consumption Data of Domestic Category is collected from TNEB. The power consumption by the Domestic category is highly influenced by the seasons. So here prediction is highly recommended for the Seasonal forecasting. This data is split into three dataset based on the seasons. Here the three seasons such as Summer, Winter and Rainy are considered for the forecasting. This model is implemented with WEKA Time series forecasting. In WEKA packet manager, Time Series Forecasting package is available. The WEKA learning algorithms such as Multilayer Perceptron, Support Vector Machine, Linear Regression, and Gaussian Pro- cess are capable of predicting the numeric quantity. They are used for this comparative study for forecasting electricity consumption based on seasonal data.
The monthly electric power consumption of the domestic category of Madurai District Data is used as the sample to deploy the forecasting. This Dataset is collected from Tamil Nadu Electricity Board (TNEB)  for the period from 2008 to 2016. A season is a division of the year marked by changes in weather, ecology and hours of daylight. Since Tamilnadu is a hot region, it has three seasons; the monsoon season, the summer season, and winter season. The collected data was pruned into three datasets based on seasons. The seasons are identified by the months are tabulated in the Table 1. The months such as November, December and January are decidedly Winter season, the months such as February, March, April, May and June are decidedly Summer season and the months such as July, August, September, October are decidedly Rainy season. Figures 2-4 shows the data series of power consumption of the three seasons. In these figures, the X axis represents periods of months and Y axis represents power consumption is measured in Megawatt per Hour. It is shortly denoted as Mwh.
3.2. Data Preprocessing
This research used data set  about electric power consumption in one household that has a sampling rate in one minute over a long period of time from the years 2006 to 2010. We used R and Rstudio for building the model. The first step is to read the text file (as show in Figure 2) with the following R commands:
Table 1. Seasons and their period in months.
Figure 2. Power consumption data series in summer season.
Figure 3. Power consumption data series in winter season.
Figure 4. Power consumption data series in monsoon season.
The missing value may reduce the accuracy of forecast. So the missing value is filled with the methods such as mean, median or previous value. In this dataset, the missing value is filled with the previous value in the time series.
Waikato Environment for Knowledge Analysis-version 3.7.6 (Weka 3.7.6) tool  has used for Time series forecasting. A package called “Time series forecasting environment” is used are performing the predication process. It has the both the command-line and GUI user interfaces. From the menu bar, the Tool menu has the Packet Manager. It has the Time series forecasting package. Select Time series forecasting package and install it.
There are two configurations in WEKA Forecasting. They are named “Basic configuration” and “Advanced configuration”. In Basic configuration, the important parameters such as timestamp, periodicity and number of units to be forecasted are available. A timestamp is a sequence of information which identifies an event, and usually it is represented as date and time of day, sometimes accurate to a small fraction of a second. Since the used datasets are seasonal data, the time stamp is set as “Use Artificial Time Index”. This is used only when we are adjusting for trends via a real or artificial time stamp. That means that it will increment the artificial time value with time stamp. In Advanced configuration, the Base learner is available; it has configured parameters specific to the learning algorithm selected. Here the WEKA learning algorithms such as Multilayer Perceptron  , Support Vector Machine  , Linear Regression  , and Gaussian Process  are used for implementation. These algorithms are capable of predicting the numeric quantity. The input data set is available in Attri- bute-Relation File Format (ARFF).
The following shows the structure of ARFF file.
@relation 'Time Series Data'
@attribute Period date 'MMM-yyyy'
@attribute Electricity_Consumption numeric
4. Result and Discussion
This section explains about the results and discussions, including the comparative analysis of the accuracy parameters such as Mean absolute error, Root mean squared error, and Direction accuracy of the WEKA learning algorithms available. WEKA learning algorithms identify the patterns of the each seasonal data. Once these patterns are identified, then we can carry over forecasting. Finally, the forecasted power consumption is computed with the help of WEKA learning algorithms. The accuracy parameters such as Mean absolute error, Root mean squared error, and Direction accuracy are calculated. These comparison results shows that support vector machine algorithm gives less error with more direction accuracy compared to other WEKA learning algorithm.
4.1. Accuracy Parameters
4.1.1. Mean Absolute Error
Figure 5 shows the comparison graph of the Mean Absolute Error of each WEKA learning algorithm. Mean Absolute Error is a basic accuracy parameter which calculates the average magnitude of the errors of the forecasting results. It gives the number differences between the actual and the forecasted values. In statistics, the mean absolute error (MAE) is a quantity used to measure how close forecasts are to the eventual outcomes. The mean absolute error is given by
where is the prediction and the true value.
4.1.2. Root Mean Squared Error (RMSE)
Figure 6 shows the comparison graph of the Root Mean Squared Error (RMSE) of each WEKA learning algorithm. The RMSE is the difference between forecast and corresponding observed values are each squared and then averaged over the sample. Finally, the square root of the average is taken. The mean absolute error is given by
In the case of a set of n values the RMS
4.1.3. Direction Accuracy (DA)
Figure 7 shows the comparison graph of the Direction Accuracy (DA) of each WEKA learning algorithm. DA is a measure of predictive accuracy of a forecasting method in statistics. It compares the forecast direction (upward or downward) to the actual realized direction. It is defined by the following formula
where At, At is the actual value at time t and Ft is the forecast value at time t. Variable N represents number of forecasting points. The function sign is sign function.
Figure 5. Comparison of mean absolute error between WEKA learning algorithms.
Figure 6. Comparison of root mean squared error between WEKA learning algorithms.
Figure 7. Comparison of direction accuracy between WEKA learning algorithms.
4.2. Future Prediction for Power Consumption Using Seasonal Data
The main aim of forecasting is to predict the future value of some time series. Seasonal based electricity demand forecasting is important for managing activities of consumers. Generally, seasonal based demand forecasting is based on the regularities in time series related to the changes in seasons. In this work, we use a real time monthly electric power consumption of the domestic category of Madurai District Data is used as the sample to deploy the forecasting. This Dataset is collected from Tamil Nadu Electricity Board (TNEB)  for the period from 2008 to 2016 to develop the forecast system by using the WEKA. Typically, the electricity demands are lump-filled and vary greatly with the seasons. Here, the demand model is integrated with the month and their corresponding power consumption.
From the analysis of the seasonal dataset, the result shows that, the support vector machine learning algorithm produces low Mean Absolute Error with increased Direction Accuracy. Thus the upcoming power consumption of three seasons is forecasted are as shown in the Figures 8-10.
The purpose of this research was to find a suitable model to forecast the seasonal based electric power consumption in a domestic category. Power layoff is the important problem faced by the government, industries, business people and the residents during very peak summer, winter and in monsoon season. This shortage is because of the failure in the needed power forecasting. Using the WEKA Time series forecasting package, the Learning Algorithms such as Multilayer Perceptron, Support Vector Machine, Linear Regression, and Gaussian Process are applied to the electric power consumption from December 2006 to November 2010. The time series forecasting is used to identify the patterns hidden in the data set. Once these patterns are identified using WEKA learning algorithms, and then we can carry over forecasting. The accuracy parameters such as Mean Absolute Error and Direction Accuracy of the learning algorithms are calculated. The best forecasting method is chosen
Figure 8. Future predictions of power consumption in summer season.
Figure 9. Future predictions of power consumption in winter season.
Figure 10. Future predictions of power consumption in monsoon season.
by considering the low Mean Absolute Error and high Direction accuracy. Hence, the Support Vector Machine is found to be the best technique for the electricity demand prediction for seasonal based dataset. The upcoming power consumption of three seasons is forecasted for three years using support vector machine learning algorithm. These seasonal forecasts provide information about how the electric energy demand is likely to be for three years into the future. These forecasts will provide alerts to government and will helpful in making decisions for generating power through seasonal based power plants.