At present, there are a large number of two-phase flow systems in many fields, such as electricity, petroleum, chemical industry, metallurgy and other national economic production. In these systems, due to the complex flow mechanism of two-phase flow, randomly large, the use of traditional detection technology is difficult to be used for observation or measurement methods. And according to flow pattern change criterion or flow chart to determine the flow mode with the flow parameters is also difficult  . So electrical capacitance tomography (ECT) developed gradually since the 1980s.
ECT is an imaging technique, which can measure dielectric constant changes and material distribution in pipes or tubes  . It uses differential medium has different dielectric constant, and by measuring the capacitance values between the electrodes placed on the surface of the tube and the spatial distribution of the dielectric constant in the tube is calculated, combined with the sensitive field, the distribution of the working medium in the pipeline is further deduced. Because ECT can change the position and number of sensor, produce different imaging results, and it does not affect the motion of the fluid, fast, cheap, so it is considered to be the future development prospects of imaging technology   .
In the era of big data, big data can be summarized as four characteristics, data volume, velocity, variety, veracity. Data mining is a technique that looks for regularities and rules from a large number of data by analyzing each data, there are three main steps―data preparation, regular search and regular representation. Data preparation is to select the required data from the relevant data source and integrate it into a data set for data mining. The regular search is to find out the rules contained in the data set in some way. The regular representation indicates that the rules are expressed as much as possible in a user-understandable manner (such as visualization). Capacitance data mining process is: first of all data acquisition and storage, through the simulation software to get the capacitance value of the filter into a data set when the micro element at high dielectric constant; and then the data set for processing and analysis, such as the use of linear and non-linear statistical analysis of data and according to certain rules of the data set classification, and analyzing the relationships between data and data categories; then the classification of data after data mining, explore and find internal connection between data   ; finally, the results will be imaging, and easy to analyze and use.
Statistics is a comprehensive discipline, which by searching, collating, analyzing data and other means to achieve the nature of the measured object, and even predict the future of the object  . The regression equation is a mathematical expression that reflects the regression relationship of a variable (dependent variable) to another or a set of variables (independent variable) from the sample data by regression analysis. Regression linear equations are used more often, we can use the least squares method to find the regression line equation to get the regression linear equation  .
The statistical method used in this study is correlation analysis. Correlation analysis has been widely used in many fields because of its ability to quickly and efficiently discover the interrelationships between various parameters. Correlation analysis refers to the analysis of two or more variables that are correlated to measure the degree of correlation between the two variables  . The scope and areas covered by the correction cover almost every aspect of the world we see, and there are great differences in the definition of correlation in different disciplines  . Conventional ECT method is built on the electromagnetical theory, which encounters difficulties when the problem becomes nonlinear. It is therefore worthwhile to investigate this problem from another angle by exploring other possible relationships between the images and the measured capacitance data. In this study, it is our intension to investigate the correlation of the permittivity distributions with capacitance measurements.
2. The Theoretical Basis
2.1. Principles of Electrical Capacitance Tomography
In this paper, a square ECT sensor with 8 electrodes is used  . The grid is divided into 32 × 32, as shown in Figure 1. Usually, a circle ECT sensor is popular, as a result of this paper uses divided into 32 × 32, and the edge of the circle ECT sensor can not be divided into the same small square, it is not easy to find the correlation. The dielectric constant of the working medium is, and the rest is. As shown in Figure 1(b), the blue micro element and the working medium do not coincide, we can know that the correlation is small, the red micro element and the working medium coincide, so the correlation is large.
By the definition of the sensitive field:
where is the sensitivity of the e-th test micro element between the i electrode and the j electrode; is the capacitance values when the dielectric constant of the e-th element is and the and the dielectric constant in the other micro element is, and respectively are the capacitance values when the pipeline filled with dielectric constant is and; is a correction factor related to the area of the e-th unit in the pipeline.
It can be seen from the Equation (1) that there is a certain linear relationship between the and the normalized capacitance value.
Figure 1. Square ECT sensor with 8 electrodes.
Conventionally, the simulation method is used to obtain the sensitive field. We usually obtain sensitive field by auto Maxwell software. By calculating the internal potential distribution of the pipeline under each excitation voltage, directly using the point multiplication can generally the sensitive field. The specific formula is:
where is the electric field distribution inside of the pipeline when the electrode is excited (where the voltage applied to the i electrode is constant), is the area of the pixel in (x, y).
After obtaining the sensitive field by simulation method, by formula (1) we can obtain the capacitance value when the dielectric constant is in the e-th element and the dielectric constant in the other micro element are   .
2.2. Principles of Correlation Coefficient
There are two variables in a linear regression, in which x is an observable and controllable common variable, which is often called an independent variable or controllable variable, and y is a random variable, which is often called a dependent variable or a response variable. There is a significant linear correlation between x and y by scattering or calculating the correlation coefficient, that is, the relationship between x and y is as follows   :
It is usually considered that and suppose has no relationship with x. Substituting the observed data into Equation (4), and note that the sample is a simple random sample, we obtain:
Because (G is the pixel gray value), and the Equation (1) can be written as Equation (6).
we can simplify Equation (6) as
The values of a and b (that is and AG) are required to be minimized by:
The method of minimizing the sum of squares of deviations is called the least squares method. The sum of squares of the deviations is transformed into:
which reflects the degree of correlation between the two variables. AG is in the range of −1 to 1. Since A is constant, R and G are identically distributed.
The new sensitive field matrix constructed by R as weights is:
2.3. Principles of Improve Landweber Algorithm
The iterative process of the Landweber iteration  :
The iterative process of the Landweber iteration based on the correlation matrix generated in this study is:
where is iteration step, in simulation, P is the correlation matrix, and is the maximum eigenvalue of.
3.1. Image Reconstruction by Simulation
As shown in Figure 2, from left to right (that is from (a) to (c)) for three kinds of work conditons, from top to bottom (that is from (1) to (4)) are the original picture, the image map of Landweber algorithm, the image map of correlation algorithm, the image map of improved Landweber algorithm based on correlation, the number of iteration is 150.
Various image maps are obtained can be seen from the Figure 2. In Figure 2(a) Landweber algorithm can separate two square working medium, although the correlation algorithm can not completely separate two square working medium, it can clearly figure out the shape of square. In Figure 2(b) two algorithms can not be imaged correctly, but can only make approximate positions. In Figure 2(c) the sensitive position of Landweber is not obvious, and the working medium is not imaged at all. In the correlation algorithm, the intensity of the two adjacent working medium is enhanced. The improved Landweber algorithm based on correlation has better imaging effect. In Figure 2(4)-(a), the size of the working medium and the position imaging results are better. In Figure 2(4)-(b) the location and number of working mediums can be clearly represented.
Figure 2. The image of working medium.
3.2. The Sensitivity Analysis by Simulation
We know that due to the soft field characteristics of the sensitive field, the sensitive field will be affected when the working medium is added into the pipeline, which will lead to unsatisfactory imaging results. The results show that the method used in this paper is using the correlation between capacitance and sensitive field, make full use of the soft field characteristic, we can reconstruct the sensitive field matrix, and the reconstructed sensitive field can be regarded as the sensitive field after putting into the working medium.
It can be seen from Figure 3, from top to bottom (that is from (1) to (4), of these, (1) is the original sensitive field, (2) to (4) correspond to the case in Figure 2 from (a) to (b)) are the sensitive field between electrodes, from left to right (that is from (a) to (b)) for the sensitive field between the electrodes 1 and 2, electrodes 1 and 6, electrodes 2 and 6. The sensitive field of the working medium
Figure 3. The image of the sensitive field between electrodes.
part is strengthened, so that the imaging effect is better.
3.3. Image Error and Correlation Coefficient
According to the two standards of the image error and image correlation coefficient, a further comparison is made  :
where is the real permittivity distribution, is the reconstructed permittivity distribution, and are the mean values of and.
Figure 4 is the image error, and Figure 5 is the correlation coefficient. In the figure, 1 represents Landweber algorithm, 2 represents correlation algorithm, and 3 to 7 represent improved Landweber algorithm based on correlation, the number of iteration is 10, 30, 50, 80, 100. As can be seen from the figure, the image error of the correlation algorithm and the improved Landweber significantly
Figure 4. Image error.
Figure 5. Correlation coefficient.
decreased, the correlation coefficient significantly increased. With the increase of the number of iterations, the value of image correlation coefficient tends to be gradual.
This paper presents an algorithm based on data analysis and correlation coefficient. The algorithm firstly uses the inverse method to obtain the capacitance value when a certain element is a high dielectric constant working medium, which is simpler and more effective than using the cyclic acquisition method. Comparing with the original sensitive field, it can be found that the reconstructed sensitive field is strengthened in the presence of the working medium, so the imaging effect becomes better. By comparing the image with the Landweber iterative algorithm, it can be seen that the correlation coefficient method has the advantage of reducing the size of the image shape. Improved Landweber algorithm based on correlation can improve image quality better, the accuracy of the method of this study is high and the error is small. It is more accurate to determine the location of multiple working fluids. It can be seen from the image that the study in the paper is prone to deformation in the diagonal direction, and the future research work should focus on how to modify the image at the diagonal. And the correlation coefficient method can image the working medium with high dielectric constant, how to make the dielectric constant of the higher working medium imaging better is a key research direction.