Renewable capacity has expanded worldwide over the past years. In 2019, the additions took the renewable share of global power capacity to 34.7%. In Brazil, wind energy accounted for 9% of total system capacity in 2019. This growth is explained by countries' efforts to make their electricity matrices cleaner, shifting from fossil fuel plants to renewable sources, and by the lower cost of renewable technologies compared with previous years.
Despite the benefits of cleaner, low-cost energy, renewable generation poses important challenges for system operation because of its natural intermittency and seasonality. Studies show that a high penetration of renewable sources in the power system requires system flexibility mechanisms such as controllable units, ancillary services, market design changes and storage services.
Another issue related to renewable sources is the financial risk in their cash flow, which may discourage new investors. This risk arises in periods when renewable generation falls short of the contracted selling volumes, leading to involuntary exposure to the short-term market.
One alternative to mitigate this issue is to build portfolios of power plants with different seasonal generation patterns, so that the complementarity between the plants can be used for financial risk management.
Several works demonstrate that the complementarity resulting from the geographical or technological diversification of renewable generation can mitigate generation risk and improve financial results from a risk-return perspective.
Therefore, decisions on new investments in renewable generation, which involve uncertain variables such as generation and spot price, require an appropriate risk analysis model that represents stochastic behavior. Such a model can be obtained by applying stochastic programming techniques with risk metrics and risk-aversion approaches in the formulation. The result is a risk-return analysis in which the decision weighs the expected return against the risk through a parameter that represents the risk-aversion profile of the decision maker.
A methodology to represent the stochasticity of wind generation in the medium-term planning of the Brazilian system can be seen in Mummey et al. (2017). That study uses the wind time series reconstruction methodology presented by Witzler et al. (2016).
In this context, this work seeks the best allocation of financial resources for the optimal selection of a wind power plant portfolio. It proposes a long-term wind series reconstruction methodology for generating wind energy scenarios, improving on the methodology presented in Witzler et al. (2016), and a risk-averse stochastic optimization model to define the optimal wind power plant selection.
This paper is organized as follows. Section 2 details the methodology for long-term wind series reconstruction and applies it to estimate the generation scenarios of five sites in Brazil, using the Vortex and NCEP/NCAR mesoscale data sets for these locations. Section 3 describes the risk-averse stochastic optimization model, which defines the optimal portfolio of wind power plants considering the wind complementarity of the sites and the budget constraint. An application case is presented in that section, investigating the effect of risk aversion on the decision under different CAPEX assumptions. Finally, Section 4 presents the main conclusions of the paper.
2. Long-Term Wind Series Treatment
For wind energy investment analysis using stochastic programming models, it is essential to work with long-term scenarios of wind generation to guarantee the quality, reliability and representativeness of the results. For this reason, data processing and series characterization activities are incorporated into the time series reconstruction methodologies (wind speed and wind generation) for long-term scenarios.
This work proposes improvements to the methodology presented in Witzler et al. (2016), which aims at reconstructing wind time series for long-term analysis. The innovation lies in the modeling and data analysis processes: the equations and procedures of the methodology are mapped onto the data model of Pandas, a library written in Python that provides robust tools for time-series data analysis. For more details about data analysis, see .
2.1. Methodology for the Reconstruction of Wind Time Series
The methodology proposed in this work aims at the reconstruction of long-term wind generation series. To this end, it also includes the basic processing of wind speed series derived from mesoscale data.
The main challenge of the reconstruction process is applying the methodology developed by Witzler et al. (2016) to extend a shorter time series (1 - 30 years) into a longer one (>60 years). The extended data set is then used to create scenarios with associated probabilities of occurrence, preserving the statistical parameters of the reference series.
The methodology of wind time series reconstruction (speed and generation) can be divided into three main steps: 1) selection and validation of time series; 2) reconstruction of the daily series based on the characteristics of the medium-term reference series (e.g., Vortex); 3) daily generation estimation based on the reconstructed series.
The steps are presented in detail below and summarized in the flowcharts of Figure 1, Figure 2 and Figure 3. In the example, two mesoscale long-term historical wind speed time series are used: NCAR (National Center for Atmospheric Research) and Vortex (Weather Research & Forecasting Model).
Figure 1. Selection and validation of time series.
Figure 2. Reconstruction of the daily series based on the Vortex series characteristics.
Figure 3. Daily generation based on the reconstructed series.
STEP I: Selection and Validation of Time Series
Step I aims to select and validate the time series to be used in the reconstruction process. Figure 1 shows the procedures applied in this step, using NCAR Series (1948-2016) and Vortex Series (1982-2016) as an example.
It is important to note that NCAR and Vortex are mesoscale long-term historical wind speed time series with different horizons and time resolutions. The NCAR series has a 68-year horizon with data integrated every 6 hours, while the Vortex series has a 32-year horizon with data integrated every hour.
The practical difficulties of aligning and combining these data sets are overcome with the tools available in the Pandas library, e.g. the resample, merge and group-by methods.
In Step I, these methods are applied to validate the wind speed time series, supporting the following procedures:
1) Calculation of the average daily speeds for both series, NCAR and Vortex;
2) Transformation of the NCAR and Vortex series into the same analysis period (start-end);
3) Validation of all series with cross-correlation greater than 0.8;
4) In case of validation, proceed to Step II. Otherwise, other series (data sources) are evaluated and the procedures are repeated.
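The Step I procedures can be sketched with Pandas as follows. This is a minimal illustration, not the authors' code: the function name `validate_series` is hypothetical, and the zero-lag Pearson correlation of the daily means is assumed as a stand-in for the cross-correlation criterion, whose exact definition is not given in the text.

```python
import pandas as pd


def validate_series(ncar_6h, vortex_1h, threshold=0.8):
    """Align two wind speed series on daily means and test their correlation."""
    # 1) Daily average speed of each series (6-hourly NCAR, hourly Vortex)
    ncar_daily = ncar_6h.resample("D").mean()
    vortex_daily = vortex_1h.resample("D").mean()
    # 2) Keep only the days present in both series (common analysis period)
    daily = pd.concat(
        {"ncar": ncar_daily, "vortex": vortex_daily}, axis=1, join="inner"
    ).dropna()
    # 3) Zero-lag Pearson correlation of the daily means
    #    (assumed proxy for the cross-correlation criterion)
    corr = daily["ncar"].corr(daily["vortex"])
    # 4) The pair is validated only if the correlation exceeds the threshold
    return bool(corr > threshold), float(corr), daily
```

Both inputs are expected to be `pandas.Series` of wind speed indexed by timestamp. When the first returned value is `False`, another data source would be evaluated, as described in procedure 4.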
STEP II: Reconstruction of the daily series based on the Vortex series characteristics
Step II focuses on the daily series reconstruction process based on the statistical characteristics of the base series (Vortex). Figure 2 illustrates the flowchart with the main procedures.
The procedures adopted in Step II can be described as follows:
1) Vertical extrapolation of the base series (Vortex) to the hub height of the wind turbine;
2) Vortex statistical analysis from the hourly speed, calculating the average speed and the monthly standard deviations of the series;
3) Vertical extrapolation of the reference series (NCAR) based on the calculation of the power law exponent (n), considering a statistical adjustment based on speeds at different heights of the base series;
4) NCAR statistical analysis using the extrapolated reference series, calculating the average speed and the daily variability (distance between the daily speed and the long-term average speed);
5) Reconstruction of the daily series from the daily variability of the NCAR series (1948-2016) and the average speed of the Vortex series (1982-2014), yielding the series for the entire 1948-2016 horizon.
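Procedures 4 and 5 can be illustrated with a short Pandas sketch. The text does not state how the daily variability and the average speed are combined, so the additive form below is an assumption, as is the use of a single long-term mean for each series; `reconstruct_daily` is a hypothetical name.

```python
import pandas as pd


def reconstruct_daily(ncar_daily, vortex_daily):
    """Carry the level of the base series over the horizon of the reference series."""
    # Daily variability of the reference series: deviation of each daily
    # speed from its long-term mean (additive form is an assumption)
    variability = ncar_daily - ncar_daily.mean()
    # Re-centre that variability on the long-term mean speed of the base series
    return vortex_daily.mean() + variability
```

The result spans the full index of the reference series while its mean equals the mean of the base series. An additive form can produce negative speeds for very calm days, so a real implementation would need to handle that case.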
Equation (1) presents the required calculation for vertical extrapolation of the reference series (NCAR) and Equation (2) provides the power law exponent (n) adjusted as proposed in  and adopted to feed the Pandas data model.
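Equations (1) and (2) are not reproduced in this text. As a reference, the standard wind profile power law and the exponent obtained from speeds at two heights are given below. The symbols v, h_1 and h_2 are assumed notation, and the statistical adjustment mentioned above is not represented.

```latex
% Eq. (1), assumed form: extrapolation from height h_1 to hub height h_2
v(h_2) = v(h_1) \left( \frac{h_2}{h_1} \right)^{n}

% Eq. (2), assumed form: exponent from speeds of the base series at two heights
n = \frac{\ln\left( v(h_2) / v(h_1) \right)}{\ln\left( h_2 / h_1 \right)}
```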
STEP III: Daily generation based on the reconstructed series
Step III focuses on estimating the daily reconstructed series generation (Figure 3). The procedures applied in this Step are:
1) Weibull distribution (daily): for each day, the reconstructed daily wind speed and the monthly standard deviation are used to obtain the shape and scale parameters of the Weibull distribution, which define the associated distribution curve;
2) Daily generation: the Weibull distribution curve for the wind speed is applied to the selected wind turbine power curve.
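A minimal sketch of Step III is shown below. Several elements are assumptions rather than the authors' method: the shape parameter uses the common empirical approximation k ≈ (σ/μ)^-1.086, the power curve is a hypothetical idealized 2 MW turbine, and the integration is a simple midpoint rule.

```python
import math


def weibull_params(mean_speed, std_speed):
    """Shape k and scale c from the mean and standard deviation of wind speed."""
    k = (std_speed / mean_speed) ** -1.086      # empirical approximation (assumed)
    c = mean_speed / math.gamma(1.0 + 1.0 / k)  # exact relation between mean and c
    return k, c


def weibull_pdf(v, k, c):
    """Weibull probability density at wind speed v (m/s)."""
    if v <= 0.0:
        return 0.0
    return (k / c) * (v / c) ** (k - 1.0) * math.exp(-((v / c) ** k))


def power_curve(v, rated_mw=2.0, cut_in=3.0, rated_speed=12.0, cut_out=25.0):
    """Hypothetical idealized power curve (MW), cubic between cut-in and rated."""
    if v < cut_in or v > cut_out:
        return 0.0
    if v >= rated_speed:
        return rated_mw
    return rated_mw * (v ** 3 - cut_in ** 3) / (rated_speed ** 3 - cut_in ** 3)


def expected_daily_energy_mwh(mean_speed, std_speed, dv=0.05, v_max=40.0):
    """Daily energy: power curve weighted by the daily Weibull distribution."""
    k, c = weibull_params(mean_speed, std_speed)
    steps = int(v_max / dv)
    mean_power = sum(
        power_curve((i + 0.5) * dv) * weibull_pdf((i + 0.5) * dv, k, c) * dv
        for i in range(steps)
    )
    return 24.0 * mean_power
```

Calling `expected_daily_energy_mwh` with the reconstructed daily mean speed and the standard deviation of the corresponding month gives one point of the daily generation series. In practice the manufacturer's tabulated power curve would replace `power_curve`.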
2.2. Characterization of the Reconstituted Wind Data Series
The wind series reconstruction methodology was applied to 5 locations of interest, as shown in Table 1. These locations, selected by state, represent the wind characteristics of their regions: the Northeastern coast (CE and RN), the Northeastern inland (PI and BA) and the South region (RS). Wind generation in the South region of Brazil is characterized by lower intensity, lower annual seasonality and higher direction variability, while the Northeast is characterized by higher intensity, higher annual seasonality and lower direction variability.
Table 2 presents the characterization of these series, as well as the values of the Exponent (n) of the Power Law used for the vertical extrapolation of each series and the monthly correlations.
2.3. Reconstructed Wind Time Series
The wind time series for each location analyzed in this study are shown in Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8. For each location there are two graphs: the first shows the raw paired data, without treatment, and the second shows the result of the methodology, with the wind speed plotted as daily averages. Figure 9 compares the generation results of the five locations considered in this work.
Table 1. Wind power plants locations.
a. Coordinates in World Geodetic System 1984 (WGS84); b. Codes represent WPP in their related Federal State location; c. CF: Capacity Factor.
Table 2. Data sources.
a. Wind turbine hub height; b. (n): exponent of the wind profile power law relationship.
Figure 4. Methodology applied to WPP-BA.
Figure 5. Methodology applied to WPP-PI.
Figure 6. Methodology applied to WPP-RN.
Figure 7. Methodology applied to WPP-RS.
Figure 8. Methodology applied to WPP-CE.
Figure 9. Comparison of reconstructed wind generation.
The results indicate great variability of wind speed between sites, which directly influences the degree of generation complementarity. It is not possible to define a global standard behavior, as there are different wind generation patterns. Nevertheless, sites such as WPP-CE and WPP-RN show similar behavior although located in different places. These locations share the same Northeast coastal wind characteristics; however, they face different infrastructure restrictions that are reflected in investment costs.
3. Financial Resource Allocation for Wind Power Plants Portfolio Selection
This work presents a new business model formulation and its application to wind power plant portfolio selection. The business model uses the concept of optimal resource allocation: given a budget cap and a set of investment options in wind power plants, it defines the plant portfolio that maximizes the financial results of trading the energy produced by the whole set of plants, considering both the financial risk and the investment return. In this model, the long-term wind time series provided by the reconstruction methodology are used as scenarios of energy generation.
3.1. Model Overview
The selection of portfolios composed purely of wind power plants (WPP) can be understood as the problem of finding the optimal allocation of the available financial resources for investment in one or more plants, so as to obtain financial results (risk x return) better than those that could be obtained by fully allocating the resources to a single wind project.
To carry out this kind of analysis, a risk-averse stochastic optimization model was applied, in which the decision variables are 1) the optimal composition of the wind portfolio and 2) the volume to be allocated to the portfolio selling contract.
The objective function considers the financial risk and investment return, weighted by a parameter that represents the risk aversion profile of the decision maker. The financial risk is measured by the Conditional Value-at-Risk (CVaR) metric, as defined by .
In Equation (3), the objective function is defined as the maximization of the convex combination of the Expected Revenue and the CVaR (risk metric), weighted by a risk-aversion parameter ρ. In this function, the first expression inside the brackets computes the Expected Revenue, while the second represents the main equation of the CVaR. Note that for a totally risk-averse agent ρ is equal to 100% and the decision is taken by accounting only for the financial risk (CVaR). In the opposite condition, a totally risk-neutral agent, ρ is null and the decision is taken by the Expected Revenue alone. Intermediate values of ρ represent risk-aversion profiles that weight both the Expected Revenue and the CVaR in the decision.
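The weighting between Expected Revenue and CVaR can be illustrated for a single period with a discrete set of revenue scenarios. The sketch below is an empirical calculation under stated assumptions, not the linear formulation used in the optimization model: the function names are hypothetical, probabilities are assumed to sum to 1, and CVaR is taken as the mean revenue of the worst α fraction of scenarios.

```python
def expected_value(revenues, probs):
    """Probability-weighted mean revenue."""
    return sum(p * r for r, p in zip(revenues, probs))


def var_cvar(revenues, probs, alpha=0.05):
    """VaR and CVaR of the worst alpha fraction of revenue scenarios."""
    cum, tail, var = 0.0, 0.0, None
    for rev, p in sorted(zip(revenues, probs)):  # worst scenarios first
        take = min(p, alpha - cum)  # probability mass still needed in the tail
        tail += take * rev
        cum += take
        var = rev
        if cum >= alpha - 1e-12:
            break
    return var, tail / alpha


def risk_weighted_objective(revenues, probs, rho, alpha=0.05):
    """(1 - rho) * Expected Revenue + rho * CVaR, with rho between 0 and 1."""
    _, cvar = var_cvar(revenues, probs, alpha)
    return (1.0 - rho) * expected_value(revenues, probs) + rho * cvar
```

With `rho=0.0` the function returns the Expected Revenue, with `rho=1.0` it returns the CVaR, and intermediate values blend the two, mirroring the risk-aversion profiles described above.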
In the presented equation, r is the return rate and each scenario s, belonging to a set of S scenarios, has an associated probability. An auxiliary variable is defined for each time t in the set of T months of the planning horizon; its value corresponds to the Value-at-Risk at the confidence level, assumed to be 0.05 in this work. A positive auxiliary variable is used to compute the CVaR at time t and scenario s. Computing the CVaR requires the additional constraint defined as Equation (4):
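Equations (3) and (4) are not reproduced in this text. A form consistent with the description, following the usual linear representation of CVaR, is sketched below. All symbols other than r, ρ, s, S, t and T are assumed notation: p_s is the scenario probability, R_{t,s} the portfolio revenue, z_t the Value-at-Risk auxiliary variable, δ_{t,s} the positive auxiliary variable and α the confidence level. The discounting by r is also an assumption.

```latex
% Eq. (3), assumed form
\max \; \sum_{t \in T} \frac{1}{(1+r)^{t}}
\left[ (1-\rho) \sum_{s \in S} p_s \, R_{t,s}
+ \rho \left( z_t - \frac{1}{\alpha} \sum_{s \in S} p_s \, \delta_{t,s} \right) \right]

% Eq. (4), assumed form
\delta_{t,s} \ge z_t - R_{t,s}, \qquad \delta_{t,s} \ge 0
\qquad \forall \, t \in T, \; s \in S
```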
The Portfolio Revenue at time t and scenario s is obtained as the sum of the Variable Revenue and the Fixed Revenue minus the expenditure cost, as in Equation (5):
As shown in Equation (6), the Variable Revenue represents the exposure risk in the energy spot market: to meet the selling contract, the differences between the contracted energy and the actual generation are settled at the spot price.
The terms of Equation (6) are: the generation of each WPP k in the portfolio of K WPP at time t and scenario s; the energy committed in the selling contract, expressed in average MW (average MW = MWh / number of hours); the spot price at time t and scenario s; and the number of hours in each period t.
The Fixed Revenue, coming from the selling contracts, is computed by multiplying the energy committed in the selling contract by its nominal price at time t, as indicated in Equation (7). Since the model seeks the optimal volume to be allocated to a single selling contract for the portfolio, the committed energy is a decision variable.
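The revenue composition described for Equations (5) to (7) can be sketched for one period and one scenario as follows. The function and argument names are hypothetical, and the inclusion of the number of hours in the Fixed Revenue is an assumption made to convert average MW into MWh.

```python
def portfolio_revenue(gen_avg_mw, contract_avg_mw, spot_price, contract_price,
                      hours, monthly_cost):
    """Revenue (R$) of one period and one scenario.

    gen_avg_mw      generation of each WPP in the portfolio (average MW)
    contract_avg_mw energy committed in the selling contract (average MW)
    spot_price      spot price (R$/MWh)
    contract_price  nominal contract price (R$/MWh)
    hours           number of hours in the period
    monthly_cost    expenditure cost of the period (R$)
    """
    total_gen = sum(gen_avg_mw)
    # Variable Revenue: surplus or deficit versus the contract, settled at spot
    variable = (total_gen - contract_avg_mw) * spot_price * hours
    # Fixed Revenue: contracted energy at the contract price
    fixed = contract_avg_mw * contract_price * hours
    return variable + fixed - monthly_cost
```

When generation falls short of the contract, the Variable Revenue becomes negative, and it grows more negative as the spot price rises. This is the involuntary exposure that the risk metric is meant to capture.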
In Equation (8), the total capital expenditure allocated in the portfolio is defined as the sum, over the WPP in the portfolio, of the unitary investment of each WPP multiplied by its corresponding installed capacity. Note that the installed capacity is a decision variable that defines the participation of each WPP in the portfolio.
The unitary monthly investment of each WPP is expressed as the capital expenditure per unit of installed capacity, converted into uniform monthly payments over the planning horizon. It is obtained by dividing the Annual Equivalent Cost (AEC) by 12 (the number of months in a year), as in Equation (9).
The AEC parameter is a function of the interest rate (r = 9% per year), the power plant lifetime (n = 25 years) and the CAPEX (Capital Expenditure, per MW installed). With this approach, the financial costs are uniformly distributed along the project lifetime.
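The conversion of CAPEX into uniform payments can be sketched as below, assuming that the AEC follows the standard capital recovery factor, AEC = CAPEX · r(1+r)^n / ((1+r)^n − 1). The text does not give the exact expression, so this form and the function names are assumptions.

```python
def annual_equivalent_cost(capex, rate, lifetime_years):
    """Uniform annual payment equivalent to an upfront CAPEX."""
    growth = (1.0 + rate) ** lifetime_years
    crf = rate * growth / (growth - 1.0)  # capital recovery factor
    return capex * crf


def monthly_unit_investment(capex_per_mw, rate=0.09, lifetime_years=25):
    """Unitary monthly investment: AEC divided by the 12 months of a year."""
    return annual_equivalent_cost(capex_per_mw, rate, lifetime_years) / 12.0
```

Under these assumptions, with r = 9% per year and n = 25 years the annual payment is roughly 10% of the CAPEX, and `monthly_unit_investment(4_000_000.0)` gives the monthly cost per MW for a CAPEX of 4 million R$/MW.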
Together with the equations above, Equation (10) is a constraint stating that the total capital invested must be less than or equal to the available budget, defined as a model input assumption.
Another important assumption is that there is no leverage in the selling operation, meaning that the energy sold must not exceed the maximum amount of energy the portfolio is allowed to sell.
Under the Brazilian regulatory rules, the maximum amount a power plant can sell is defined by its Firm Energy Certificate (FEC). For each type of source, there is a specific rule to calculate it. In the case of wind energy, the FEC is calculated based on the 90th percentile (P90) criterion of the annual generation estimated for the wind power plant, which considers three years of wind measurement, among other technical details, according to the energy certification issued by certification companies. Therefore, the non-leverage restriction is written as Equation (11) below:
Note that the FEC of each WPP depends on its own installed capacity; this variable is therefore written as the product of the installed capacity decision variable and a per-unit FEC of each WPP.
The generation scenarios of each WPP can be described as a function of the installed capacity decision variable and the per-unit generation, as shown in Equation (12):
It is important to highlight that the unitary investment, the FEC and the generation are expressed per unit of MW, assuming that the behavior of these parameters can be described by linear functions for the purpose of modeling simplification.
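The constraints described for Equations (8) and (10) to (12) are not reproduced in this text. A form consistent with the description is collected below. All symbols are assumed notation: x_k is the installed capacity of WPP k (decision variable), c_k its unitary monthly investment, B the budget, Q the energy committed in the selling contract, fec_k the per-unit FEC and ĝ_{k,t,s} the per-unit generation. The budget constraint is written in terms of total CAPEX, which matches the 150 MW limit of the case study.

```latex
% Eq. (8), assumed form: capital expenditure allocated in the portfolio
C = \sum_{k=1}^{K} c_k \, x_k

% Eq. (10), assumed form: budget constraint
\sum_{k=1}^{K} \mathrm{CAPEX}_k \, x_k \le B

% Eq. (11), assumed form: non-leverage constraint
Q \le \sum_{k=1}^{K} \mathrm{fec}_k \, x_k

% Eq. (12), assumed form: generation scenarios
g_{k,t,s} = \hat{g}_{k,t,s} \, x_k
```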
The spot price scenarios were obtained from Newave, a multi-stage stochastic optimization model that is the official model used for operating the Brazilian power system. This system has centralized dispatch, and its energy price is formed through models that emulate the operation of the system. For more details, see .
3.2. Case Study
The case studies aim to analyze the portfolio selection considering the five wind power plants, supported by the proposed optimization model and using the generation scenarios created by the reconstruction methodology. The five wind power plants are those previously studied in this work: Caetité-BA, Aracati-CE, Parnaíba-PI, Macau-RN, Coxilha Negra-RS.
We simulated two cases with different CAPEX hypotheses: 1) a single CAPEX amount for all WPP and 2) a different CAPEX for each WPP, based on historical data of Public Energy Auctions.
In each case we consider three risk-aversion levels (0%, 50%, 100%) and run four portfolio configurations. As a research assumption, in each simulation round the highest-performing WPP is excluded in order to investigate the attractiveness of the remaining, lower-performing ones. Thus, four sequential simulations were carried out, with 5, 4, 3 and 2 WPP in the portfolio configuration.
3.2.1. Case (i): Portfolio Selection—Same CAPEX Value for All WPP
In the first case, considering the same single CAPEX value of 4 million R$/MW for all WPP, the goal was to analyze the competitiveness of wind farms under the same investment conditions, to emphasize their performance in relation to the commercialization of the energy produced by the portfolio and the complementarity of generation between the WPP.
The investment budget is assumed to be R$ 600 million (see footnote 1), which at 4 million R$/MW allows a portfolio of up to 150 MW. The assumed price for the selling contract is 140.00 R$/MWh.
Table 3 presents the results obtained for a risk aversion of 0%, that is, when the decision considers only the Expected Revenue. In this case, there is no portfolio diversification: in all combinations, the full 150.00 MW, and therefore the entire budget, is allocated to the single WPP with the highest capacity factor.
Although the objective function considers only the expected revenue in this case, the CVaR values are reported as a reference for the risk that is not being accounted for under this risk-aversion condition. Another important observation is the large difference between the financial return of Caetité (higher capacity factor) and that of Parnaíba (lower capacity factor).
The next simulation was performed under a risk-aversion of 50%, where Expected Revenue and CVaR are equally weighted in the objective function.
Table 4 presents the financial results achieved. Diversification is observed only in the combination of the two WPP with the lowest capacity factors, Parnaíba (CF = 44%) and Coxilha Negra (CF = 44%).
As can be seen, although both WPP have the same capacity factor, the allocation was higher in the first (102.59 MW) than in the second (47.41 MW). This is because the generation risk of WPP-Parnaíba is lower than that of WPP-Coxilha Negra. Therefore, when the risk (CVaR) is accounted for in the objective function, it is better to allocate more capital to WPP-Parnaíba.
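The effect of the risk-aversion parameter on the allocation between two plants can be illustrated with a toy grid search. This is a deliberately simplified sketch and not the authors' optimization model: it uses one period, equiprobable scenarios, a fixed contract volume, and omits the investment cost, which is constant when both plants have the same CAPEX and the full budget is used. All names and numbers are hypothetical.

```python
import math


def cvar_equiprobable(values, alpha=0.05):
    """Mean of the worst ceil(alpha * n) values of equiprobable scenarios."""
    n_tail = max(1, math.ceil(alpha * len(values)))
    worst = sorted(values)[:n_tail]
    return sum(worst) / n_tail


def objective(x_a, x_b, gen_a, gen_b, spot, rho, contract_mw,
              price=140.0, hours=730.0, alpha=0.05):
    """(1 - rho) * expected revenue + rho * CVaR for capacities x_a, x_b (MW)."""
    revenues = [
        (x_a * ga + x_b * gb - contract_mw) * pi * hours
        + contract_mw * price * hours
        for ga, gb, pi in zip(gen_a, gen_b, spot)
    ]
    expected = sum(revenues) / len(revenues)
    return (1.0 - rho) * expected + rho * cvar_equiprobable(revenues, alpha)


def best_split(total_mw, gen_a, gen_b, spot, rho, contract_mw):
    """Grid search in 1 MW steps; returns the MW allocated to plant A."""
    return max(
        range(total_mw + 1),
        key=lambda x_a: objective(
            x_a, total_mw - x_a, gen_a, gen_b, spot, rho, contract_mw
        ),
    )
```

Here `gen_a` and `gen_b` are per-unit generation scenarios (capacity factors). With fully complementary plants, for example `[0.30, 0.58]` and `[0.58, 0.30]`, a totally risk-averse search splits the capacity evenly. With a stable plant and a more volatile one of higher mean, the risk-neutral search puts everything in the volatile plant and the risk-averse search puts everything in the stable one.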
Table 5 presents the results under a risk aversion of 100%, where only the CVaR is accounted for in the objective function. Note that there is more diversification, with portfolios composed of combinations of 3 WPP and 2 WPP. Compared with the results of the previous risk-aversion simulations, the expected revenue decreases in exchange for an increase in the CVaR value (the higher the CVaR value, the lower the risk, as the CVaR value becomes closer to the expected revenue value).
Table 3. Case (i) - Portfolio Composition (MW): Risk-aversion 0%.
Table 4. Case (i) - Portfolio Selection (MW): Risk-aversion 50%.
In all simulations, the volume allocated to the selling contract was between 85% and 100% of the total FEC of the portfolio. This pattern reflects the influence of the P90 criterion in the calculation of the FEC of wind farms, which significantly reduces the amount of energy that wind power plants can commercialize in Brazil. This aspect is not detailed in this study. For more details, see .
3.2.2. Case (ii): Portfolio Selection—Different CAPEX
The second case assumes a different unit CAPEX value for each WPP. The CAPEX is based on the historical data of Public Energy Auctions in Brazil, and the unit value is the historical average investment in the Federal State where each WPP is located, as shown in Table 6. Thus, it approximately reflects the cost differences between locations, given their economic particularities for developing this type of power plant.
For simulation purposes, an investment budget of R$ 600 million and a selling contract price of 140.00 R$/MWh were assumed.
In the risk-neutral (0% risk-aversion) simulation, there is no diversification. The only change is that in Case (ii) WPP-Aracati becomes more valuable than WPP-Caetité, as the former has a lower CAPEX than the latter. Table 7 presents the results for all combinations, showing the full budget allocated to a single WPP in each one.
Considering a risk aversion of 50% (Table 8), there is diversification between Macau and Caetité in the 4 WPP combination because of the differences in their CAPEX. In the remaining combinations there is no diversification, as the selection includes only the WPP with the greatest attractiveness in terms of risk and return.
Table 5. Case (i) - Portfolio Composition (MW): Risk-aversion 100%.
Table 6. Case (ii) - WPP CAPEX.
Under a risk aversion of 100%, where only the CVaR is accounted for, the risk profile of the WPP leads to diversification in all combinations, as can be seen in Table 9. The optimal portfolio compositions are those that provide a higher CVaR (lower risk), since the expected revenue is not considered in the decision.
The results show that there is a trade-off in defining the portfolio selection, associated with the risk-aversion profile of the decision maker. This trade-off should be considered together with the questions related to the generation profile, trading and investment of each wind power plant.
Table 7. Case (ii) - Portfolio Selection (MW): Risk-aversion 0%.
Table 8. Case (ii) - Portfolio Selection (MW): Risk-aversion 50%.
Table 9. Case (ii) - Portfolio Composition (MW): Risk-aversion 100%.
4. Conclusions
Wind energy investment analysis using stochastic programming models requires long-term scenarios of wind generation to guarantee the quality, reliability and representativeness of the results. For this reason, the adopted methodology includes a time series reconstruction technique to support the formulation of long-term wind generation scenarios.
The selection of a portfolio composed purely of wind farms can be translated into a business model in which the investor seeks the optimal allocation of the financial resources available for investment in one or more plants, so as to obtain financial results better than those that could be obtained by fully allocating the resources to a single wind project.
This problem was solved by applying a risk-averse stochastic optimization model which, given an investment budget cap, determines the optimal portfolio formed by the adequate proportion of each candidate wind farm.
The case studies accounted for the generation profile, the FEC and the installed capacity of each plant in the portfolio selection, in addition to the effect of the investment cost of each one.
Furthermore, the results show the model performance in terms of capital allocation for wind power plant portfolio selection under distinct boundary conditions, and emphasize that the diversification of the portfolio changes with the assumed risk-aversion profile of the investor.
Although applied to the Brazilian case, this model can be customized for any location worldwide.
We gratefully acknowledge financial support from Afluente Geração de Energia Elétrica S/A (a Contour Global PLC company) through the Research and Development Project PD-05162-0001/2018 (ANEEL Code).
1. Financial values are expressed in Brazilian currency Real (R$), where R$ 1000.00 ≈ €180.00 ≈ US$ 190.00, according to the April 2020 quotation. R$ 600 million ≈ €108 million ≈ US$ 114 million; 140.00 R$/MWh ≈ 25.2 €/MWh ≈ 26.6 US$/MWh.
 IRENA (2020) Renewable Capacity Statistics 2020. International Renewable Energy Agency (IRENA), Abu Dhabi.
 EPE (2019) Decade Energy Plan 2029. EPE, Rio de Janeiro.
 Verdejo, H., Escudero, W., Kliemann, W., Awerdin, A., Becker, C. and Vargas, L. (2016) Impact of Wind Power Generation on a Large-Scale Power System Using Stochastic Linear Stability. Applied Mathematical Modelling, 40, 7977-7987.
 Mars, P., O’Sullivan, A. and Keppo, I. (2020) Estimating the Impact of Variable Renewable Energy on Base-Load Cycling in the GB Power System. Energy, 195, Article ID: 117041.
 Milligan, M., Kirby, B., Acker, T., Ahlstrom, M., Frew, B., Goggin, M., et al. (2015) Review and Status of Wind Integration and Transmission in the United States: Key Issues and Lessons Learned. NREL.
 Bagatini, M., Benevit, M.G., Beluco, A. and Risso, A. (2017) Complementarity in Time between Hydro, Wind and Solar Energy Resources in the State of Rio Grande do Sul, in Southern Brazil. Energy and Power Engineering, 9, 515-526.
 Cantao, M.P., Bessa, M.R., Bettega, R., Detzel, D.H.M. and Lima, J.M. (2017) Evaluation of Hydro-Wind Complementarity in the Brazilian Territory by Means of Correlation Maps. Renewable Energy, 101, 1215-1225.
 Castro, R. and Crispim, J. (2018) Variability and Correlation of Renewable Energy Sources in the Portuguese Electrical System. Energy for Sustainable Development, 42, 64-76.
 Vinel, A. and Mortaz, E. (2019) Optimal Pooling of Renewable Energy Sources with a Risk-Averse Approach: Implications for US Energy Portfolio. Energy Policy, 132, 928-939.
 Ramos, D.S., Camargo, L.A.S., Guarnier, E. and Witzler, L.T. (2013) Minimizing Market Risk by Trading Hydro-Wind Portfolio: A Complementarity Approach. 10th International Conference on the European Energy Market (EEM), Stockholm, 27-31 May 2013, 1-8.
 Conejo, A.J., Carrión, M. and Morales, J. (2010) Decision Making under Uncertainty in Electricity Markets. International Series in Operations Research & Management Science 153, Springer Science + Business Media, Berlin, 540.
 Shapiro, A., Tekaya, W., Costa, J.P. and Soares, M.P. (2013) Risk Neutral and Risk Averse Stochastic Dual Dynamic Programming Method. European Journal of Operational Research, 224, 375-391.
 Mummey, J.F.C., Ramos, D.S., Sauer, I.L. and Yeh, W.G. (2017) Important Issues and Results When Considering the Stochastic Representation of Wind Power Plants in a Generation Optimization Model: An Application to the Large Brazilian Interconnected Power System. Energy and Power Engineering, 11, 320-332.
 Witzler, L.T., Ramos, D.S., Camargo, L.A.S. and Guarnier, E. (2016) Reconstruction of Wind Generation Historical Series Aiming at the Analysis of Energy Complementarity: Methodology and Applications. 13th International Conference on the European Energy Market (EEM), Porto, 6-9 June 2016, 1-6.
 Tekaya, W. (2013) Risk Neutral and Risk Averse Approaches to Multistage Stochastic Programming with Applications to Hydrothermal Operation Planning Problems. PhD Thesis, School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta.
 Camargo, L.A.S. (2015) Commercialization and Investment Strategy, with Emphasis on Renewable Energy, Supported by Specialized Optimization Models for Stochastic Risk X Return Assessment. PhD Thesis, Polytechnic School, University of São Paulo, São Paulo. (In Portuguese)
 CCEE (2020) The Chamber of Commercialization of Electric Energy.