The driving force of spontaneous processes in nature is the tendency to minimize the available energy and maximize the entropy. In this sense, life follows the basic laws of thermodynamics: the living process continuously “burns” the incoming “nutrition”. Only the energy pump of the incoming solar energy makes the difference: it creates the original gradients, which spontaneous processes later break down into further inhomogeneities.
The life process diminishes the working energy of sunlight by increasing the overall entropy of the environment. It lowers the electron energy by oxidation, producing outgoing (waste) final “products”. The gradual loss of electron energy of the “nutrition” molecules is the energy that sustains life. Simply put, the living process is a dissipative entropy producer. As the Nobel laureate physiologist A. Szent-Györgyi states, “Life is nothing but an electron looking for a place to rest”.
Living objects are open systems embedded in various environments; they adapt to the surrounding conditions, form self-organized structures, and drive evolution. The complexity approach has become a useful tool for describing nature. Self-organization appears in various scientific problems and explains multiple structural and dynamical challenges in biology; it is observed in a broad range of research, from gene-regulatory networks, through cells, to the general evolution of living objects.
Invariance under magnification (scale invariance, when magnifying up or down shows similar structures) is a form of self-similarity and a typical consequence of self-organizing processes. It has given rise to a new discipline, fractal physiology, in which stochastic processes replace deterministic actions, so predictions of the distant future always contain random, unpredictable elements.
Random, stationary, stochastic self-organizing processes form dynamic behaviors and define a spatiotemporal fractal structure, which is self-similar in both space and time. This spatiotemporal fractal structure is a fingerprint of self-organization and is especially characteristic of living matter. The basal metabolism, as the energy consumption of living objects, has a central role in systems biology and describes biosystems by definite properties. According to systems biology:
· life is complexly organized in a wide range of magnification and different levels of interactions,
· life is self-regulated with various feedback processes,
· the living systems are open, dissipative objects with multilevel interactions with the environment,
· the activity of life processes involves intensive cross-talk between the different levels of its organization,
· the specific forms and properties depend on the environment in a complex way.
These points are important for the universality of life and for its dynamic fluctuations and scaling; as characteristics of life, they could also be used in diagnosis. Dynamical interactions show spatiotemporal fluctuations, which also have scaling behavior. The time-fluctuation of homeostasis is the so-called pink noise.
The above complex biological processes connect to biological allometry, scaling, and non-equilibrium, non-linear thermodynamics. A special self-similarity characterizes mass allometry by universal scaling; it appears in a large category of living structures and processes and rigorously optimizes the metabolic power in a universal frame. Scaling is a simple power function (like $f(x) = a x^{b}$), where a and b are constants; therefore, the form of $f(x)$ remains the same under any magnification of x. This scaling condition characterizes biomaterials, which are indeed scaled universally over a very wide range of magnifications, from the subcellular energy consumption of mitochondria and respiratory complexes to the largest animals, with the scaling exponent α = 3/4. The fingerprint of complexity can be found in various fields of biology, showing unified principles of self-organization. Note that mitochondria probably have a key role in this complex behavior of living objects, because the scaling exponent of non-mitochondrial respiration is lower (α = 2/3), characterizing the simple surface-to-volume ratio in these processes; the robust category of living systems, however, is scaled in the complex manner. Different conditions modify the power function, forming various universality classes by self-similarity.
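The scale invariance of a power function can be checked numerically. The following is a minimal sketch, assuming the scaling form $f(x) = a x^{b}$; the function names and the numerical values of a and m are illustrative choices, not taken from the article.

```python
def power_law(x, a, b):
    """Scaling function f(x) = a * x**b."""
    return a * x ** b


def magnification_ratio(x, m, a, b):
    """Ratio f(m*x) / f(x); for a power law it equals m**b for every x."""
    return power_law(m * x, a, b) / power_law(x, a, b)


# Illustrative values only: a and m are arbitrary, b = 3/4 is the allometric exponent.
a, b, m = 2.0, 0.75, 10.0
ratios = [magnification_ratio(x, m, a, b) for x in (0.1, 1.0, 50.0)]
```

The ratio does not depend on x, which is what "the form remains the same under magnification" means in practice: magnifying x by m only rescales the function by the constant factor $m^{b}$.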
Self-organized processes are widely investigated in solid-state reactions (precipitation, phase transitions, aggregation, nucleation, growth, etc.). The theory of phase transition involving simultaneous random nucleation and growth was pioneered by Kolmogorov, Johnson and Mehl, and Avrami. It is called the Johnson-Mehl-Avrami-Kolmogorov (JMAK) model and was later revised by others. It describes the kinetics of phase transformation when nucleation is spatially random. The JMAK theory and one of its formulations, the Avrami function (AF), were introduced for solids and have since served as mathematical models of different biological processes, and even of DNA replication. Experimental data support a certain universality of the Avrami equation in describing real processes, which could make it a useful tool for further research. It is generally useful for studying processes whose specific system parameters are unknown, similarly to the critical phenomena of physical laws near a phase transition.
The AF (A(t)) in its most applicable forms:
$$A(t) = 1 - e^{-K t^{n}} = 1 - e^{-(t/t_0)^{n}}$$
where t is the elapsed time of the process and $K = t_0^{-n}$ depends linearly on the nucleation rate and on the third power of the growth rate. The so-called “Avrami constant” (n) is n = 4 in the simple model, so in solids it was originally considered an integer. Interestingly, the AF is connected to the spatial fractal dimension; in that case n is not necessarily an integer and depends strongly on the process being described. The fractal dimension and the power law of self-similarity are tightly connected. Experimental data show that the progression of many reactions in biology also follows the AF A(t) with various non-integer characteristic constants. It has been observed universally in processes from a wide range of structural and dynamical situations of living systems.
The non-equilibrium thermodynamic formalism can be applied to the self-organized system of malignancy in space and time. Cancer breaks the network of normal cells: the cooperative harmony of the tissue changes to non-cooperative competition, forming a new complex structure non-linearly, far from thermodynamic equilibrium. Cancer can be described as a dynamical phase transition from the healthy to the cancerous state, in clear analogy with phase transitions in lifeless nature. Starting from an avascular state and forming a dormant microscopic cluster, the tumor goes on to develop new angiogenetic formations through the epithelial-mesenchymal transition of cells, induced by bio-electromagnetic forces. The tumor leaves the dormant state by an allometric transformation, and the previously almost undetectable phase becomes traceable. An Avrami-like function of time describes its development. This idea was used to show the validity of the Avrami description and was extended to metastases by studying the transition of tumorous clusters from the avascular to the vascular phase, which underlies the dissemination of malignant cells. Metastases develop by a first-order phase transition of cells from non-cancerous to metastatic. The development of this new phase needs a great amount of energy, which dissipates in the system and produces entropy at a high rate.
The general transport structure of the tissues (the blood-vessel network) forms fractals by allometric scaling, including the angiogenetic processes in tumor formation. In oncological applications, the available metabolic transport and the fractal dimension of the angiogenetic network determine the average survival of a tumor. The average survival of the tumor cells shortens as the fractal dimension of the transport network grows, and it is modified by the alimentation of the tumor. Tumor growth follows the universal law of scaling, which can be used in cancer research.
The dynamics of cancer evolution produce various phases of the growing structures owing to genetic instability, leading to phase transitions. Tumor development operates near the threshold of a phase transition, destabilizing the actual structure, making it highly heterogeneous, and producing a large variety of random mutations that probe for the optimal conditions of further proliferation. This development is based on competition, a “fight” for individual survival. The optimal strategy is well known in game theory, where a mixed strategy, driven by random variation, forms a Nash equilibrium in the non-cooperative game. This situation is typical of topological phase transitions, where cooperation emerges despite the selfish, non-cooperative behavior of the individual participating cells.
Our objective in this article is to find a parametric description of overall survival that fits the self-organized processes and can reveal the information inherent in survival measurements of cancer patients.
Most survival analyses in medical evaluations use the Kaplan-Meier (KM) non-parametric estimator, which is designed for incomplete observations. KM is useful for examining the probability of lifetime and the effectiveness of the chosen treatment in lethal diseases such as cancer. The computed probability of surviving a definite point of time $t_i$ is:
$$p_i = \frac{n_i - d_i}{n_i} = 1 - \frac{d_i}{n_i}$$
The KM estimator is defined by multiplying these successive probabilities over all earlier points of time, which gives the final estimate:
$$\hat{S}(t) = \prod_{t_i \le t} \left(1 - \frac{d_i}{n_i}\right)$$
where $d_i$ is the number of deaths at the time $t_i$; $t_i$ is a time when at least one death happened in the examined cohort; and $n_i$ is the number of individuals known to survive (not censored, still in the study) at time $t_i$. Modifications have been proposed for the tails: a pessimistic, short-tailed approach and an optimistic, fat-tailed one, which differ in the survival at the end of the trial.
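The product-limit construction described above can be sketched in a few lines. This is a minimal illustration, assuming right-censored data given as one time and one event flag per patient; the function name `kaplan_meier` and the toy data are hypothetical and are not part of the study.

```python
import numpy as np


def kaplan_meier(times, events):
    """Product-limit estimate of survival at each distinct death time.

    times  : follow-up time of each individual
    events : True/1 if death was observed, False/0 if censored
    """
    times = np.asarray(times, dtype=float)
    events = np.asarray(events, dtype=bool)
    death_times = np.unique(times[events])  # times with at least one death
    survival = []
    s = 1.0
    for t_i in death_times:
        n_i = np.sum(times >= t_i)              # individuals still at risk
        d_i = np.sum((times == t_i) & events)   # deaths at t_i
        s *= 1.0 - d_i / n_i                    # multiply successive probabilities
        survival.append(s)
    return death_times, np.array(survival)


# Toy data (hypothetical): six patients, two of them censored.
t_km, s_km = kaplan_meier([1, 2, 2, 3, 4, 5], [1, 1, 0, 1, 0, 1])
```

Censored patients never contribute a death, but they are counted in the risk set up to their censoring time, which is how the estimator handles incomplete observations.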
The best way to mine the data would be to parametrize the non-parametric KM survival plot. Describing survival curves by a parametric distribution function is a long-standing effort that allows the information from the measured dataset to be optimized. For a correct parametrization, we have to review the scientific facts that can guide the search for the optimal one. The most important available result is a parametric solution connected to spatiotemporal self-organization and the self-similarity of the developed structures.
The parametrization of survival measures that we use builds on the universality of life and considers its self-organized self-similarity. The progression of life involves non-linear and non-equilibrium thermodynamic consequences, including the fractal description and processes similar to the phase transitions of non-living systems. For calculating the survival time, let T be the stochastic variable (lifetime) defined on the set of individuals. The lifetime distribution function is the probability of the lifetime being less than or equal to t, namely
$$F(t) = P(T \le t)$$
Thus, the survival probability distribution (survival function) can be defined as the probability of the lifetime T being longer than t, which can be expressed in the form of
$$S(t) = P(T > t) = 1 - F(t)$$
The density function of the lifetime distribution is the probability density
$$f(t) = \frac{dF(t)}{dt} = -\frac{dS(t)}{dt}$$
therefore, the average lifetime is:
$$\langle T \rangle = \int_0^{\infty} t\, f(t)\, dt$$
Let $h(t)\,dt$ be the probability that, given survival up to time t, death occurs within the interval $(t, t + dt)$; h(t) is the “hazard function” or “death rate”. Therefore:
$$h(t) = \frac{f(t)}{S(t)} = -\frac{d \ln S(t)}{dt}$$
Its cumulative form is
$$H(t) = \int_0^{t} h(\tau)\, d\tau = -\ln S(t)$$
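The link between the death rate and the survival function can be illustrated numerically. The sketch below assumes the standard relation in which the survival equals the exponential of the negative cumulative hazard; the helper name `survival_from_hazard`, the trapezoidal integration and the constant-hazard example are illustrative choices.

```python
import numpy as np


def survival_from_hazard(hazard, t):
    """Cumulative hazard and survival on the time grid t.

    hazard : callable returning the death rate h(t) for an array of times
    t      : increasing time grid starting at 0
    """
    t = np.asarray(t, dtype=float)
    h = hazard(t)
    # Trapezoidal cumulative integral of h(t) from 0 to each grid point.
    increments = 0.5 * (h[1:] + h[:-1]) * np.diff(t)
    cumulative = np.concatenate(([0.0], np.cumsum(increments)))
    return cumulative, np.exp(-cumulative)


# Example: a constant death rate gives a simple exponential survival.
grid = np.linspace(0.0, 10.0, 101)
H_const, S_const = survival_from_hazard(lambda t: np.full_like(t, 0.1), grid)
```

Any non-negative death rate can be passed in the same way, so the same helper can be reused to inspect the survival curve implied by a candidate hazard function.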
Biological systems are strictly self-organized. Self-organization and the consequent self-similarity of living structures are inherent properties of living objects and could be the basis of a proper parametrization of survival.
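Taking self-similarity into consideration, the death rate (failure rate in (8)) must be a self-similar function of time, mirrored by a scaling like:
$$h(t) = a\,t^{\beta}$$
Its self-similarity is obvious because magnification by m gives the same function:
$$h(mt) = a\,(mt)^{\beta} = m^{\beta}\, h(t)$$
The survival probability distribution function from (9) and (10) is:
$$S(t) = \exp\left(-\int_0^{t} h(\tau)\, d\tau\right) = \exp\left(-\frac{a}{\beta + 1}\, t^{\beta + 1}\right)$$
With $n = \beta + 1$ and $t_0 = (n/a)^{1/n}$, the self-similar death rate (hazard function) is:
$$h(t) = \frac{n}{t_0}\left(\frac{t}{t_0}\right)^{n-1}$$
Substituting (14) into the survival (13), we get:
$$S(t) = e^{-(t/t_0)^{n}}$$
which has two parameters for one curve: $t_0$ is the scale parameter, the natural scale of the variation of the time function, and n is the shape parameter. Consequently, the lifetime distribution function $F(t)$, by (3) and (4), is the well-known AF (A(t)), or the cumulative form of the two-parametric Weibull distribution (W(t)):
$$F(t) = 1 - S(t) = 1 - e^{-(t/t_0)^{n}}$$
with the additional conditions $n > 0$, $t_0 > 0$, when $t \ge 0$. The inverse function, which gives the time t at which the survival reaches a given probability p, is:
$$t = t_0 \left(-\ln p\right)^{1/n}$$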
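The resulting two-parameter functions are easy to evaluate directly. The following sketch assumes the survival form $e^{-(t/t_0)^{n}}$ with scale parameter `t0` and shape parameter `n`; the function names and the numerical values are hypothetical and chosen only for illustration.

```python
import numpy as np


def weibull_survival(t, t0, n):
    """Survival probability exp(-(t/t0)**n)."""
    return np.exp(-(np.asarray(t, dtype=float) / t0) ** n)


def weibull_hazard(t, t0, n):
    """Self-similar death rate (n/t0) * (t/t0)**(n-1)."""
    t = np.asarray(t, dtype=float)
    return (n / t0) * (t / t0) ** (n - 1)


def weibull_time_at_survival(p, t0, n):
    """Inverse function: time at which the survival equals p (0 < p < 1)."""
    return t0 * (-np.log(p)) ** (1.0 / n)


# Illustrative parameters (assumed, not fitted to any data).
t0_demo, n_demo = 30.0, 1.5
t_half = weibull_time_at_survival(0.5, t0_demo, n_demo)  # the median
```

Evaluating the survival at the scale parameter always returns 1/e, whatever the shape parameter is, which makes that point a convenient reference when reading a curve.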
Various parameters characterize the WF independently of the time of development. The shape parameter of the WF is usually $n > 1$, which gives a sigmoid curve whose form is also that of a psychometric function. In cases when $n \le 1$, the survival has no inflection: it is a simple exponential function at n = 1, and its initial decrease becomes more rapid as n decreases.
The cumulative Weibull distribution (Weibull function, WF) is highly universal and represents all the features described in the introduction above. The formal identity of the WF with the AF of the JMAK model inherently involves the phase-transition approach, and its mechanism follows the tumor kinetics.
The AF and WF have long been used to describe survival and reliability. Weibull’s statistics was originally developed to describe the fracture of brittle materials and to calculate the probability of damage-free survival of a given material. It can be derived from geometric scale invariance (fractal organized structures) by physical principles, as in mechanical mills. It is frequently applied in the study of mechanical fatigue and failure.
The fit of the WF to the non-parametric KM is completely rigorous when a strictly homogeneous cohort of equivalent individuals is investigated and followed until decease or censoring. This grouping selection apparently limits the applicability of the WF. The parametrization of aging and natural death has no such grouping selection; it relates to every human being and their survival. Epidemiological studies in gerontology refer to the Gompertz distribution. The Gompertz function (GF) is a function of time. When G(t) represents the number of individuals at time t, together with the number of subjects at the start of the counting time, the GF is:
The parameters a and b are positive; a is connected to the growth, while b is connected to the displacement in the variable t. The GF is also a two-parameter function, like the WF with its shape (n) and scale parameters.
Historically, the WF was first used to characterize the aging of non-living components and machinery (reliability), while the GF was initially developed for the aging of living objects. As statistical methods developed, both the Weibull and the Gompertz distributions soon came to be applied to the description of tumor development and cancer death. The comparison of the two distributions shows that the best fit of GF is ( , ) and the best fit of WF is ( , ); ( , ); where SE is the standard error of the regression estimate minimizing the sum of squared differences between the measured and estimated data pairs. Owing to their applicability, the Gompertz and Weibull distributions are both commonly used in biological and engineering reliability investigations.
The study of the Gompertzian distribution for tumors supports the hypothesis that the fractal structure weakens and finally disappears as the tumor grows. In general, tumor growth follows a universality that favors the WF. The clear fit of allometric scaling to the fractal structure of the tumor describes not only tumor growth but also validates allometry in the growth of axillary lymph node involvement in breast cancer. Consequently, we choose the WF to model the KM plot of overall survival.
The Gompertz distribution can be obtained by reducing the generalized exponential Weibull distribution, which is formulated in a more general form so that both distributions derive from a single one; it has been applied to survival data with good results.
The GF does not satisfy the self-similarity condition (formulated in (11)); therefore, it is not in harmony with the self-organizing biological dynamics that characterize harmonized biological development. This might be the reason why the WF better describes the intrinsic causes of age-related mortality (following homeostasis in the healthy aging process), while the Gompertz distribution reflects the extrinsic factors. Owing to the self-similarity of the WF, we expect that the self-organized biological development of tumors, which arise intrinsically in the healthy environment from which they derive, favors the WF for an accurate description of the KM plot in malignant diseases. The primary importance of the Weibull distribution is further supported by the fact that it is derived from the ontogenic law and so is directly connected to the self-organized structure of living matter. Self-similarity, the basic fingerprint of self-organization, is not valid for the Gompertz distribution. The “mystery” of the Gompertz function is probably the equilibrium between predictable and unpredictable (chaotic) dynamics. In contrast to the exponential origin of the GF, the self-similar (power-function) origin of the WF suggests a parallel with the opposing pictures of fractal-like organizations and general scale-free (small-world) large networks (exponential function). Despite the structural preference for the WF, the GF also fits well to allometry, represented by a power function, as shown in the development of rats. Although the WF fits the growth function of the general ontogenic model very well for the rat data ( , ), the fit of the GF gives the same result ( , ) for the same allometric curve. The difference is negligible in this regime of development. For animals with larger masses the difference is also not significant; it is subtle and only slightly favors the WF as the best regression fit to the allometric scaling result in the available data.
(The best WF and GF fits to allometry for cow are ( , ) and ( , ), respectively.)
The WF has been applied successfully to living processes as the psychometric function, which describes sensing processes well in connection with the Weber-Fechner law and established psychometry. Lifetime estimations are frequently approached with the WF, and the WF is also used successfully for clustering gene expression.
The WF describes the non-parametric KM plot with appropriate accuracy in gerontology. A mathematical link between the natural death rate, aging, and complexity is a fundamental tool of lifetime estimation; it uses a time-dependent shape factor to describe natural death at the end of life. Cancer death has also been described by a WF with a time-dependent shape factor, using the similarity between the fracture survival of brittle materials and the specific survival characteristics of a cohort of cancer patients. In this model the shape factor depends linearly on time and gives a surprisingly accurate fit to the data from cancer registries.
Owing to their self-similar behavior, fractals can be used for modeling cancer, and a KM survival plot that is split significantly by fractal dimension demonstrates the prognostic value of fractal analysis. Consequently, various images in oncology can be evaluated by their fractal structure, and these images can also be characterized by the Weibull distribution.
Owing to this self-similarity, the parametric distribution generally fits the KM plot well, and so it is used successfully in oncology. Approximating the survival curve with the parametric WF is a standard approach for evaluating clinical trial data and is established both theoretically and practically. In a comparison of various parametric fits to the KM survival plot, the WF was the most accurate. The model was used to analyze the prognostic factors of the survival of cancer patients, and it was confirmed in a large retrospective analysis of 746 gastric cancer cases.
In summary, self-organization and self-similarity are universal laws fingerprinted in the fractal description, and they can be described by the cumulative Weibull distribution. This universality of the WF is applied to parametrize the KM plot. Owing to the universality, the WF parametric regression fits the KM plot with sufficient accuracy and so determines the KM curve by two parameters (the scale parameter and n). The regression could be improved considerably by smoothing the KM with the hazard data (patients at risk). Other improvements of the bivariate fit are also available, but for simplicity we use the original WF fit to the KM, insisting on showing its roots in the universality of the WF in survival investigations. Further smoothing and corrections are additions to this clearly established basis, needed because of the deviations in real cases.
The WF is characterized by four special points: the value at $t = t_0$, the mean, the median, and the inflection point. The median, the mean, and the mode (the maximum of the probability density function, which is an inflection point of the cumulative curve) can be calculated from the parametric formulas (see Figure 1):
$$t_{\mathrm{median}} = t_0 \left(\ln 2\right)^{1/n}, \qquad t_{\mathrm{mean}} = t_0\, \Gamma\!\left(1 + \frac{1}{n}\right), \qquad t_{\mathrm{mode}} = t_0 \left(\frac{n-1}{n}\right)^{1/n} \quad (n > 1)$$
The corresponding survival probabilities when $t_0 = 1$ and n = 2 are 0.5, 0.607 and 0.456 for the median, mode and mean, respectively. The cumulative probability at $t = t_0$ is $1 - 1/e \approx 0.632$, independent of the value of n. All the noteworthy points are proportional to $t_0$, so $t_0$ is the natural unit of elapsed time, in which the single parameter n defines the function. The hazard function (9) is constant when n = 1 (or β = 0, which means the parameter has no effect on the hazard); it increases when n > 1 (the event becomes more likely to occur with time) and decreases when n < 1 (the event becomes less likely to occur with time). The limit $n \to 0$ is a step function at t = 0, while $n \to \infty$ is a step function at $t = t_0$ (Figure 2).
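The noteworthy points can be computed directly from the two parameters. The sketch below assumes the standard Weibull expressions for the median, mean and mode with scale `t0` and shape `n`; the function names are illustrative.

```python
import math


def weibull_points(t0, n):
    """Median, mean and mode (times) of a Weibull distribution."""
    median = t0 * math.log(2.0) ** (1.0 / n)
    mean = t0 * math.gamma(1.0 + 1.0 / n)
    # The mode (inflection of the cumulative curve) exists inside t > 0 only for n > 1.
    mode = t0 * ((n - 1.0) / n) ** (1.0 / n) if n > 1.0 else 0.0
    return {"median": median, "mean": mean, "mode": mode}


def survival(t, t0, n):
    """Survival probability exp(-(t/t0)**n)."""
    return math.exp(-((t / t0) ** n))


# Reproduce the survival values quoted in the text for t0 = 1, n = 2.
points = weibull_points(1.0, 2.0)
probabilities = {name: survival(t, 1.0, 2.0) for name, t in points.items()}
```

Because all three points are proportional to the scale parameter, the survival probabilities at these points depend on n only.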
Figure 1. The noteworthy points of the Avrami-Weibull function when $t_0 = 1$ and n = 2. The reference point of the Avrami-Weibull function is the value 1/e ≈ 0.37, where $t = t_0$. The inflection point marks the mode of the distribution, which is its most probable value. When $t_0$ is chosen, it will be the unit of the elapsed time.
Figure 2. The limits of (a) the survival and (b) the hazard (H(t)) functions.
The various parameter-pairs of WF are shown in Figure 3.
The inflection point of the WF (cumulative Weibull distribution) is the mode of the probability density function, i.e., its most likely value. At the inflection of the survival WF, the death rate of the cohort reaches its maximum; beyond this point the decline of the curve slows as time elapses.
The result can be calculated numerically or obtained graphically (Figure 4). This makes it possible to generate the Weibull fit to the Kaplan-Meier plot routinely from its median and mean values.
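As an alternative to the graphical solution, the two parameters can be recovered numerically. The sketch below is one possible approach, not the authors' procedure: it assumes the standard Weibull median and mean expressions, requires the mean to exceed the median, and uses bisection on the shape parameter within an assumed bracket (0.2 to 3.4) where the mean-to-median ratio decreases monotonically. All names are hypothetical.

```python
import math


def mean_to_median_ratio(n):
    """Gamma(1 + 1/n) / (ln 2)**(1/n); independent of the scale parameter."""
    return math.gamma(1.0 + 1.0 / n) / math.log(2.0) ** (1.0 / n)


def weibull_from_median_mean(median, mean, n_low=0.2, n_high=3.4, tol=1e-12):
    """Return (t0, n) of the Weibull curve with the given median and mean."""
    target = mean / median
    if not (mean_to_median_ratio(n_high) < target < mean_to_median_ratio(n_low)):
        raise ValueError("mean/median ratio is outside the assumed bracket")
    low, high = n_low, n_high
    while high - low > tol:
        mid = 0.5 * (low + high)
        if mean_to_median_ratio(mid) > target:
            low = mid    # ratio still too large: the shape parameter must grow
        else:
            high = mid
    n = 0.5 * (low + high)
    t0 = median / math.log(2.0) ** (1.0 / n)
    return t0, n


# The example of Figure 4: median 100 and mean 130.
t0_fit, n_fit = weibull_from_median_mean(100.0, 130.0)
```

The ratio of mean to median depends on the shape parameter alone, so the shape is found first and the scale follows from the median in a single step.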
Figure 3. WF with various parameters: (a) changing scale parameter at constant shape parameter n = 1; (b) at n = 3.
Figure 4. Graphical solution of reestablishing the Weibull parametric survival curve from the median and mean values. (a) example: and are the time curves from median and mean expressions, respectively. Their common point (crossing) gives the and n parameters of the WF, which—in this case is when the mean is 130 and median is 100, the and ; therefore, it looks like this: . (b) a few solutions to show the trend of the graphical results.
The values of the particular points vs. n are shown in Figure 5. The mode changes rapidly in the interval 1 < n < 2, so it is difficult to read accurately; therefore, the median and mean are proposed for reestablishing the entire WF. However, above a certain value of n, the values of the mode, mean and median are practically identical, so the WF could be characterized by a single parameter. Increasing n further does not change the situation significantly, so in virtually every such case the WF may be approximated with one parameter only.
In conclusion, the parametric regression of the KM plot is universally determined by two parameters, the shape parameter (n) and the scale parameter of the WF, owing to the basic behaviors of living processes: their self-organization and self-similarity, which are characterized well by their spatiotemporal fractal structure. When clinicians describe the main information of KM survival curves, they take the median survival as the significant parameter characterizing the actual survival result. This is, in fact, an automatic characterization of the non-parametric estimate by a single parameter. However, the median alone cannot characterize the long tail of the KM plot; it ignores the history of the patients in the remaining second half of the cohort, which could be essential for measuring the “cured” fraction. Studying the median alone disregards the real measurable success at the end of the study. To correct this “mistake”, the average (mean) of the KM non-parametric distribution is considered. The mean is affected more by the “tail” of the distribution, so it gives a more accurate idea of the cure rate. The median carries more of the information about how rapidly patients are lost, while the mean carries more of the information about the duration of the effect in the high-success patients (Figure 6).
Figure 5. Characteristics by shape parameter. (a) function vs. n; (b) derivatives vs. n.
Sometimes the inflection of the KM plot is studied too, since the death rate in the study is highest at that point. All three are important for characterization, but only two of them are independent; the third can be calculated from the chosen two. The distribution curve must therefore be characterized by at least two parameters.
Two of the three noteworthy points (median, mean, inflection) of the KM plot may parametrize the non-parametric plot. Measuring or estimating these characteristic points (mainly the median and mean) is the standard way of comparing KM plots and is usually accepted as the result of the actual study. These points really characterize the non-parametric distribution and make parametrization possible; in fact, this is a “hidden” parametrization of the KM plot by the WF.
A simple Weibull fit can be made on the KM plot from its derivative at the reference point $t = t_0$, which is proportional to −n. (The derivative there is exactly $-n/(e\,t_0)$.) Therefore, the parametric evaluation can be checked well at the $t = t_0$ point, and the complete parametrization can be established approximately from the value at this point and the value of its slope (Figure 7).
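The regression can be simplified to a linear one by a double-logarithmic approach. For the survival form of the WF, $W(t) = e^{-(t/t_0)^{n}}$:
$$\ln\left[-\ln W(t)\right] = n \ln t - n \ln t_0$$
The regression is shown in Figure 8. Note that this approach is less precise than the direct function fit, because the double logarithm suppresses the accuracy in a real KM fit.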
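The double-logarithmic linearization can be sketched as an ordinary least-squares line fit. This is a minimal illustration, assuming survival values strictly between 0 and 1 and the survival form of the WF; the function name `fit_weibull_loglog` and the synthetic data are hypothetical.

```python
import numpy as np


def fit_weibull_loglog(t, s):
    """Estimate (t0, n) from survival values by a straight-line fit.

    Uses ln(-ln s) = n*ln(t) - n*ln(t0); points with s outside (0, 1)
    or t <= 0 are dropped because the double logarithm is undefined there.
    """
    t = np.asarray(t, dtype=float)
    s = np.asarray(s, dtype=float)
    keep = (t > 0) & (s > 0) & (s < 1)
    x = np.log(t[keep])
    y = np.log(-np.log(s[keep]))
    slope, intercept = np.polyfit(x, y, 1)
    n = slope
    t0 = np.exp(-intercept / slope)
    return t0, n


# Synthetic, noise-free survival curve (assumed parameters for illustration).
t_syn = np.linspace(1.0, 60.0, 30)
s_syn = np.exp(-(t_syn / 28.0) ** 1.5)
t0_est, n_est = fit_weibull_loglog(t_syn, s_syn)
```

On real KM data the steps near the tail carry few patients but receive the same weight in this line fit as the early points, which is one reason the direct function fit is usually preferred.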
Figure 6. The mean and median change according to the n and scale parameters. A parabola, which connects the two parameters at different medians and means, fits rather well.
Figure 7. A quick check of the parameters of the Weibull fit to KM on a real process (n = 1.5).
Figure 8. Logarithmic determination of the Weibull parameters (n = 1.5). (a) original WF, (b) $\ln[-\ln W(t)]$ vs. $\ln t$.
However, the obvious deviation of the regressions from the measured OS is in the tail of the KM plot, which neither function follows. The universal WF idea offers a regression fit to the KM plot for a group of patients who have all had an event or been censored by the end of the study. This is, of course, an idealization in real trials. We consider any chosen cohort inhomogeneous because of the huge variability of living conditions. A homogeneous group of identical individuals can never be selected. It is possible, however, to divide the cohort into subgroups of very similar patients and fit a WF to each independently, while the measured KM is, of course, the sum of the results of all the subgroups. With M subgroups in the complete cohort of N patients, the i-th group containing $N_i$ patients, the WF for the actual measured non-parametric KM will be:
$$W(t) = \sum_{i=1}^{M} \frac{N_i}{N}\, e^{-(t/t_{0i})^{n_i}}, \qquad \sum_{i=1}^{M} N_i = N$$
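The subgroup composition can be written as a weighted sum of individual Weibull curves. The sketch below assumes that each subgroup has its own scale and shape parameters and is weighted by its share of the cohort; the function name and the subgroup values are hypothetical.

```python
import numpy as np


def mixture_survival(t, group_sizes, t0s, ns):
    """Cohort survival as the size-weighted sum of subgroup Weibull curves."""
    t = np.asarray(t, dtype=float)
    weights = np.asarray(group_sizes, dtype=float)
    weights = weights / weights.sum()  # N_i / N
    total = np.zeros_like(t)
    for w, t0, n in zip(weights, t0s, ns):
        total += w * np.exp(-(t / t0) ** n)
    return total


# Hypothetical cohort: 60 fast-progressing and 40 slow-progressing patients.
t_grid = np.array([0.0, 10.0, 50.0])
s_mix = mixture_survival(t_grid, [60, 40], [12.0, 80.0], [1.5, 1.2])
```

A sum of Weibull curves is in general not itself a Weibull curve, which is why an inhomogeneous cohort can deviate from a single two-parameter fit, most visibly in the tail.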
Even when extra care is taken to have a homogeneous cohort, the time limit of the study itself forms a group of patients who had no event (and were not censored). These “remaining” patients either benefit most from the treatment or were in a definitely different condition when they were selected into the cohort. We call this group the “remained group” (RG), owing to the lack of proof of complete recovery; it is nevertheless sometimes regarded (incorrectly) as a cured fraction (according to the endpoints of the study). In a rigorous approach, the disease-free survival (DFS) has to be compared with a matched healthy control group, and the cure rate must be decided on this comparison. An alternative way to determine the group of “cured” patients and the associated “cure” time is to find when the hazard rate of the studied group matches the hazard in the general population. When it matches, we may talk about a real cure rate, which does not mean that an event cannot happen for reasons independent of the investigated disease.
The KM curve in an RG situation obviously does not fit the strict WF, which must decrease to zero cumulative probability. When a fraction of the individuals remains, the KM plot can be approximated with reasonable accuracy by the weighted sum of two WFs. In the RG fraction, the time parameter is longer than in the fraction of patients who had an event or were censored.
In this case, the time parameter of the long-survival WF is practically infinite compared with the time length of the study, so that WF tends to one:
$$\lim_{t_0' \to \infty} e^{-(t/t_0')^{n'}} = 1$$
The correction by the surviving fraction of the patients is then constant. Denoting the constant correction by c, the plot is composed as:
$$W_c(t) = c + (1 - c)\, e^{-(t/t_0)^{n}}$$
Varying c gives different fitting functions (Figure 9).
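The effect of a constant remaining fraction can be sketched as follows. The example assumes the composed form in which a fraction c of the cohort stays at survival one during the study and the rest follows a single Weibull curve; the function name and parameter values are illustrative only.

```python
import numpy as np


def survival_with_remaining_group(t, c, t0, n):
    """Weibull survival lifted by a constant remaining fraction c (0 <= c < 1)."""
    t = np.asarray(t, dtype=float)
    return c + (1.0 - c) * np.exp(-(t / t0) ** n)


# Hypothetical parameters: the plateau level equals c at long times.
t_rg = np.array([0.0, 28.0, 1.0e4])
curves = {c: survival_with_remaining_group(t_rg, c, 28.0, 1.5) for c in (0.0, 0.1, 0.3)}
```

The curve starts at one for every c and levels off at c, so the height of the plateau of a KM plot gives a first estimate of the remaining fraction.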
Figure 9. The WF with various concentrations c of the patients in the RG group.
The curative effect of the treatment can be characterized through the WF fit to the non-parametric KM survival by using the Shannon entropy. Entropy measures the information carried by the probability density function (pdf, $f(t)$) behind the WF; it measures the probability of the realization of an event or censoring. The quantity of information is $-\ln f(t)$, which is realized with probability $f(t)\,dt$, so the complete information from the system is the classical Shannon entropy:
$$S_{\mathrm{Sh}} = -\int_0^{\infty} f(t) \ln f(t)\, dt$$
Higher entropy means less information (more uncertainty). An event with a lower probability of occurring carries more information than a frequent one. This entropy characterizes the expectation of a random variable, and in this sense it is a direct analog of the entropy definition in physics (statistical thermodynamics). When the informational entropy decreases (its change is negative), the probability distribution departs from the uniform distribution and concentrates on some of the data.
In physics, entropy usually grows as the system approaches equilibrium, while for a pdf an increase of entropy indicates a loss of information: the average rate of information produced by the stochastic source of the data decreases.
The Shannon entropy (28) measures the diversity of the probability density function (pdf) behind the WF (in fact the derivative of the WF). It is a sum of the n-dependent and t0-dependent parts:
where γ is the Euler-Mascheroni constant (γ ≈ 0.5772). The special points of this entropy function are:
The entropy (diversity) grows monotonically with t0 in a logarithmic way, while with n it grows rapidly, reaching its maximum at (when ); from that point it decreases, reaching zero at n = 4.223 (when ), and builds information beyond that point (decreasing further), so the step-function character of the WF (a definite step) starts to dominate. Dividing the entropy into a shape-dependent and a scale (time)-dependent part makes it possible to define the role of these parameters. While the scale (time) parameter increases the Shannon entropy monotonically, the shape parameter (n) decreases the entropy after a maximum at , showing an increasing amount of information about the death (decreasing information about being alive) of the participants in the cohort. A growing shape factor n definitely worsens the survival above the value , while the growth of the scale (time) factor gives longer survival expectations.
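The special points can be checked numerically. The sketch below assumes the standard differential entropy of a two-parameter Weibull distribution, H(n, t0) = γ(1 - 1/n) + ln(t0/n) + 1, split into a shape part γ(1 - 1/n) - ln n + 1 and a scale part ln t0. This textbook form is an assumption here, since equation (28) is not reproduced; it is consistent with the zero at n = 4.223 quoted above when the scale part vanishes (t0 = 1). All function names are hypothetical.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def shape_part(n):
    """Shape (n) dependent part of the assumed Weibull entropy."""
    return EULER_GAMMA * (1.0 - 1.0 / n) - math.log(n) + 1.0


def scale_part(t0):
    """Scale (t0) dependent part: grows logarithmically with t0."""
    return math.log(t0)


def weibull_entropy(n, t0):
    """Assumed Shannon (differential) entropy of the Weibull pdf."""
    return shape_part(n) + scale_part(t0)


def bisect_root(f, lo, hi, tol=1e-12):
    """Plain bisection; assumes f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # d(shape_part)/dn = gamma/n**2 - 1/n vanishes at n = gamma (the maximum).
    print("maximum of the shape part at n =", EULER_GAMMA)
    print("value at the maximum:", shape_part(EULER_GAMMA))
    print("zero of the shape part at n =", bisect_root(shape_part, 1.0, 10.0))
```

Under this assumed form the shape part peaks at n = γ and changes sign close to n = 4.223, while the scale part simply shifts the whole curve by ln t0.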
The Shannon entropy can be calculated in real time t ( ) and also relative to time, meaning that the time is measured in units ( ), estimating the self-time. A higher entropy value means a higher uncertainty of death (therefore, a lower certainty of being alive). We expect the Shannon entropy of the parametric probability distribution function to grow in cases of better treatment results.
To demonstrate the parametrization, we use a large number of patients (1180 individuals) with various tumors treated by numerous standard therapies, but having one thing in common: they were treated with complementary modulated electro-hyperthermia (mEHT) when the standard treatments failed to deliver the desired results (Figure 10).
Using the approximate parametrization by the evaluation of this KM plot with the slope in , we get and . median ≈ 28, Figure 11.
The fit of a single parametric WF curve to the KM plot is shown in Figure 12. The single WF fits with an acceptable accuracy; the largest deviation is less than 0.007 (0.7%).
Figure 10. The overall survival KM plot of a large number of patients (Pts. = 1180; various advanced solid malignancies, treated by complementary modulated electro-hyperthermia (mEHT)). The KM plot also contains very long (25 y) survival. (a) With censored cases; (b) without censoring (for clarity).
Figure 11. Rough determination of the parameters of the Weibull fit to the KM plot. Real process on a KM of Pts. = 1180 patients suffering from various malignant diseases (Figure 10). The obtained parameters are: and , hence . Control: median ≈ 28, which is approximately correct. (The principle of the process is shown in the insert of the figure.)
Figure 12. WF fit (dashed line) to the KM (solid line) overall survival plot of Pts. = 1180 patients (Figure 10). (a) Regression by deviation minimum: SE = 1.6544; r2 = 0.9850; n = 1.043; t0 = 42.28; ; (b) regression by correlation maximum: ; , , ; .
Note that it makes a difference whether we fit by minimizing the deviation (SE) of the curves, or by maximizing the square of the Pearson correlation (r2) (where the bracket means the mean of the variable). The difference is due to the different meaning of the two fits. Minimizing SE minimizes the difference between the curves, while maximizing r2 minimizes the shape difference (maximizes the similarity) of the curves. A comparison by Shannon entropy shows about 6% more certainty (less uncertainty) in the regression that minimizes SE than in the one that maximizes r2. In the following, unless noted otherwise, we use the minimal-SE regression.
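The two fitting criteria can be contrasted with a small Python sketch. The exact definition of SE used for the fits is not reproduced here, so a root-mean-square deviation is used as a stand-in (an assumption); the squared Pearson correlation is computed from deviations about the means. Function names and the sample values are hypothetical.

```python
import math


def rms_deviation(observed, fitted):
    """Root-mean-square difference of two curves sampled on the same time grid."""
    squares = [(o - f) ** 2 for o, f in zip(observed, fitted)]
    return math.sqrt(sum(squares) / len(squares))


def pearson_r2(observed, fitted):
    """Squared Pearson correlation of two curves sampled on the same time grid."""
    mean_o = sum(observed) / len(observed)
    mean_f = sum(fitted) / len(fitted)
    cov = sum((o - mean_o) * (f - mean_f) for o, f in zip(observed, fitted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_f = sum((f - mean_f) ** 2 for f in fitted)
    return cov * cov / (var_o * var_f)


if __name__ == "__main__":
    # Made-up survival values; the "fit" is the same curve shifted down by 0.05.
    km = [1.0, 0.8, 0.5, 0.2, 0.05]
    shifted = [s - 0.05 for s in km]
    print("RMS deviation:", rms_deviation(km, shifted))
    print("r squared:", pearson_r2(km, shifted))
```

A curve with exactly the same shape but a constant offset keeps r2 at its maximum while the deviation stays non-zero, which is why the two regressions can select different parameters.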
The fit is accurate, differing by no more than 1% at any compared point of the curves, but it is not accurate enough at the end of the observed time because of the RG group of the patients. The deviation can be reduced by applying the RG principle of (26) (Figure 13). The , which is 2.5% higher, mirrors the RG part of the patient distribution.
The parametric decomposition into two WFs according to (24) gives a better fit (Figure 14), where the r2 has reduced drastically. The result shows a responding group (response rate (RR) 48%) and a non-responding one (52%). Note that the less-responding group could be regarded as a non-responding control arm.
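A two-component decomposition of this kind can be sketched in Python as a weighted sum of two Weibull survival functions. The weight 0.48 follows the response rate quoted above; the shape and scale values of the two components are hypothetical placeholders, not the fitted parameters of Figure 14, and the function names are assumptions.

```python
import math


def weibull_survival(t, n, t0):
    """Two-parameter Weibull survival function exp(-(t/t0)**n)."""
    return math.exp(-((t / t0) ** n))


def two_weibull_survival(t, w, long_params, short_params):
    """Weighted sum of a long-survival and a short-survival Weibull component."""
    n1, t1 = long_params
    n2, t2 = short_params
    return w * weibull_survival(t, n1, t1) + (1.0 - w) * weibull_survival(t, n2, t2)


def responder_share(t, w, long_params, short_params):
    """Share of the survivors at time t that belongs to the long-survival component."""
    total = two_weibull_survival(t, w, long_params, short_params)
    return w * weibull_survival(t, *long_params) / total


if __name__ == "__main__":
    W = 0.48              # responding fraction quoted in the text
    LONG = (1.2, 90.0)    # hypothetical (n, t0) of the responding group
    SHORT = (1.0, 20.0)   # hypothetical (n, t0) of the non-responding group
    for t in (0.0, 12.0, 36.0, 100.0):
        s = two_weibull_survival(t, W, LONG, SHORT)
        share = responder_share(t, W, LONG, SHORT)
        print(f"t = {t:5.1f}: survival = {s:.3f}, responder share = {share:.3f}")
```

The responder share starts at the weight of the long-survival component and rises with time, which illustrates why the tail of the composite curve is dominated by the responding group.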
The long-survival part of the KM plot has a higher entropy and shows more uncertainty of death in both approaches. A better fit can be achieved when we count RG. The RG is obtained from the remaining survival fraction in most of the actual cases, and its survival is measurably longer than the period over which the study follows the patients who had no event and were not censored earlier. RG is a part of the “censored” patients at the end of the study.
Figure 13. WF fit (dashed line) to the KM (solid line) from Figure 10. The applied RG is 7%. ; , , ; . The largest square of deviation of the point-pairs (LD) is 0.002 (0.2%).
Figure 14. The double Weibull fit to the overall survival of n = 1180 patients with malignant diseases, treated complementarily by the mEHT method. The longer survival (solid line) is the group of patients responding to the treatment, while the shorter survival (dashed line) is a component of the composite WF fit regarded as a non-responding group, which could be used as a reference cohort of patients. The sum of the two components (dotted line) fits the measured overall survival. The deviation of the regression is shown as a light solid line with values on the secondary axis. (a) Decomposition using both WFs without RG ( , , ; ); longer survival component ( , ; ); shorter survival component ( , ; ); (b) decomposition using RG ( , , ; ); longer survival component ( , ; ); shorter survival component ( , ; ). The agreement of the KM plot with the sum of the decomposed Weibull curves (solid line) is remarkable compared to the single fit (dotted line).
For an easier calculation of the WF fractions (components) of the KM plot, we may use the logarithmic evaluation of the survivals, which modifies the grouping more than the above decomposition. A linearly fitted function by of KM is shown in Figure 10. According to (22), it shows rather large deviations at the start and at the end of the curve (Figure 15).
The original WF fit shown in Figure 12(a) and the linear fit from the logarithmic approach of Figure 15 differ from each other (Figure 16). The deviation of the logarithmic fit is more than double in some intervals, so the direct fit of WF to KM is more accurate.
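The logarithmic evaluation can be illustrated with the usual double-logarithmic linearization of a Weibull survival function, ln(-ln S) = n·ln t - n·ln t0, so that a straight-line fit returns n as the slope and t0 from the intercept. Whether (22) has exactly this form is an assumption, since that equation is not reproduced here; the function name is hypothetical and the data below are synthetic.

```python
import math


def weibull_loglog_fit(times, survival):
    """Least-squares line through the points (ln t, ln(-ln S)).

    Returns (n, t0). Requires t > 0 and 0 < S < 1 for every point.
    """
    xs = [math.log(t) for t in times]
    ys = [math.log(-math.log(s)) for s in survival]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    n = slope                            # slope of the line is the shape factor
    t0 = math.exp(-intercept / slope)    # intercept equals -n * ln(t0)
    return n, t0


if __name__ == "__main__":
    # Synthetic, noise-free survival values generated with n = 1.043, t0 = 42.28
    # (the single-fit values of Figure 12, used only as an illustration).
    times = [5.0, 10.0, 20.0, 40.0, 80.0]
    survival = [math.exp(-((t / 42.28) ** 1.043)) for t in times]
    print(weibull_loglog_fit(times, survival))
```

On exact Weibull data the line recovers the generating parameters. On a real KM plot the logarithm stretches the earliest and latest points, which is one plausible reason for the larger deviations at both ends of the curve.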
Figure 15. The logarithmic fits of WF to KM of Figure 10. The linear fit to the complete curve gives two parameters: ; , ; ; .
Figure 16. The logarithmic fits of WF to KM of Figure 10. The linear fit to the complete curve gives two parameters: ; ; ; . The deviation from the parameters of the original fit ( ; , ; ; ) shown in Figure 12(a).
Despite its inaccuracy, the logarithmic evaluation has the great advantage of suggesting the subgroups of the patients through an optimal decomposition of the KM plot. The logarithmic curve in Figure 15 shows three well-distinguishable parts, for each of which the linear fit is accurate, and which divide the original KM into three subgroups (Figure 17).
The logarithmic fit by (22) gives different results than the direct fit. The reason is simply that the logarithmic fit considers only one part of the whole curve and fits to that part; consequently, a fit that is accurate for one part of the KM will not fit the other parts at all when the logarithmic curve is approximated piecewise. The observed KM, of course, includes all the patients. The overlapping fits from the logarithmic approach modify the KM plot. Consequently, only the fit to the original KM plot is relevant.
However, the logarithmic analysis is very useful for detecting the subgroups of the patients. It became clear that the survival contains three subgroups, Figure 17. Consequently, three partitions of the KM curve (Figure 10) would give a
Figure 17. The various logarithmic fits to the KM of Figure 10. (a) Linear fit to three fractions of the KM curve, , , ; (b) using the linear fits, the original curve may be divided into the three subgroups (solid, dotted and dashed lines).
Figure 18. Fitting the KM by WFs according to the logarithmic fit in Figure 17. (a) Curves, , ; ; , ; ; , ; . (b) Deviations by groups.
Figure 19. Decomposition of the KM in Figure 10 (Pts. = 1180) into three groups: long (18%): n1 = 1.46, , , ; medium (36%): , , , ; short (46%): , , , . The deviation of the fit remains under 0.0005 (0.05%).
We have shown the applicability of the two-parameter cumulative Weibull distribution for approximating the non-parametric Kaplan-Meier plot with high accuracy. We have shown the universality of the Weibull approach based on the general behaviors of living organisms, including cancer-tissue development. Self-organization and self-similarity, with their consequences, determine the strict connection of the parametric approach with the experimental non-parametric observations. Informational entropy allows the subgroups in a general set of patients to be distinguished by their overall survival.
We have demonstrated that the two-parameter WF provides a sufficient fit to the non-parametric KM survival curve in a real case of 1180 patients suffering from various malignant diseases. Two of the three characteristic points of the KM plot (namely the median, the mean, or the inflection point) are enough to reconstruct the parametric fit.
In summary, the Weibull parametric distribution, with a suitable refinement, can accurately approximate a KM survival plot even when individuals survive to the end-point of the study.
This work was supported by the Hungarian Competitiveness and Excellence Programme grant (NVKP_16-1-2016-0042).
 Szentgyorgyi, A. (2019) Life Is Nothing But an Electron Looking for a Place to Rest.
 Kurakin, A. (2011) The Self-Organizing Fractal Theory as a Universal Discovery Method: The Phenomenon of Life. Theoretical Biology and Medical Modelling, 8, 4.
 Goldberger, A.L., Amaral, L.A., Hausdorff, J.M., et al. (2002) Fractal Dynamics in Physiology: Alterations with Disease and Aging. PNAS Colloquium, 99, 2466-2472.
 West, G.B., Woodruff, W.H. and Brown, J.H. (2002) Allometric Scaling of Metabolic Rate from Molecules and Mitochondria to Cells and Mammals. Proceedings of the National Academy of Sciences of the United States of America, 99, 2473-2478.
 West, G.B. and Brown, J.H. (2005) The Origin of Allometric Scaling Laws in Biology from Genomes to Ecosystems: Towards a Quantitative Unifying Theory of Biological Structure and Organization. Journal of Experimental Biology, 208, 1575-1592.
 Brown, J.H., West, G.B. and Enquist, B.J. (2005) Yes, West, Brown and Enquist’s Model of Allometric Scaling Is Both Mathematically Correct and Biologically Relevant. Functional Ecology, 19, 735-738.
 Levine, L.E., Narayan, K.L. and Kelton, K.F. (1997) Finite Size Corrections for the Johnson-Mehl-Avrami-Kolmogorov Equation. Journal of Materials Research, 12, 124-131.
 Fanfoni, M., Persichetti, L. and Tomellini, M. (2012) Order and Randomness in Kolmogorov-Johnson-Mehl-Avrami-Type Phase Transitions. Journal of Physics: Condensed Matter, 24, Article ID: 355002.
 Cope, F.W. (1977) Detection of Phase Transitions and Cooperative Interactions by Avrami Analysis of Sigmoid Biological Time Curves for Muscle, Nerve, Growth, Firefly, and Infrared Phosphorescence, of Green Leaves, Melanin and Cytochrome C. Physiological Chemistry and Physics, 9, 443-459.
 Suckjoon, J. and Bechhoefer, J. (2005) Nucleation and Growth in One Dimension. II. Application to DNA Replication Kinetics. Physical Review E, 71, Article ID: 011909. https://doi.org/10.1103/PhysRevE.71.011909
 Augis, J.A. and Bennett, J.E. (1978) Calculation of the Avrami Parameters for Heterogeneous Solid State Reactions Using a Modification of the Kissinger Method. Journal of Thermal Analysis, 13, 285-291.
 Aloev, V.Z., Kozlov, G.V. and Zaikov, G.E. (2004) Relationship between the Exponent of the Kolmogorov-Avrami Equation and the Fractal Dimension in the Crystallisation of Uniaxially Stretched Crosslinked Polychloroprene. Kauchuk I Rezina, No. 3, 38-39.
 Cope, F.W. (1977) Detection of Phase Transitions and Cooperative Interactions by Avrami Analysis of Sigmoid Biological Time Curves for Muscle, Nerve, Growth, Firefly, and Infrared Phosphorescence, of Green Leaves, Melanin and Cytochrome C. Physiological Chemistry and Physics, 9, 443-459.
 Izquierdo-Kulich, E. and Nieto-Villar, J.M. (2013) Morphogenesis and Complexity of the Tumor Patterns. In: Rubio, R.G., Ryazantsev, Y.S., Starov, V.M., Huang, G.-X., Chetverikov, A.P., Arena, P., Nepomnyashchy, A.A., Ferrus, A. and Morozov, E.G., Eds., Without Bounds: A Scientific Canvas of Nonlinearity and Complex Dynamics. Understanding Complex Systems, Springer-Verlag, Berlin Heidelberg, 657-691.
 Enderling, H., Hahnfeldt, P., Hlatky, L. and Almog, N. (2012) Systems Biology of Tumor Dormancy: Linking Biology and Mathematics on Multiple Scales to Improve Cancer Therapy. Cancer Research, 72, 2172-2175.
 González, M.M., Joa, J.A.G., Cabrales, L.E.B., Pupo, A.E.B., Schneider, B., Kondakci, S., Ciria, H.M.C., Reyes, J.B., Jarque, M.V., O’Farril Mateus, M.A., Tamara Rubio González, T.R., Brooks, S.C.A., Cáceres, J.L.H. and González, G.V.S. (2017) Is Cancer a Pure Growth Curve or Does It Follow a Kinetics of Dynamical Structural Transformation? BMC Cancer, 17, 174.
 Izquierdo-Kulich, E.E., Alonso-Becerra and Nieto-Villar, J.M. (2011) Entropy Production Rate for Avascular Tumor Growth. Journal of Modern Physics, 2, 615.
 Agus, D.B., Alexander, J.F., Arap, W., Ashili, S., Aslan, J.E., Austin, R.H., Backman, V., Bethel, K.J., et al. (2013) A Physical Sciences Network Characterization of Non-Tumorigenic and Metastatic Cells. Scientific Reports, 3, Article No. 1449.
 Stehlik, M., Mrkvicka, T., Filus, J. and Filus, I. (2012) Recent Developments on Testing in Cancer Risk: A Fractal and Stochastic Geometry. Journal of Reliability and Statistical Studies, 5, 83-95.
 Guiot, C., Degiorgis, P.G., Delsanto, P.P., Gabriele, P. and Deisboeck, T.S. (2003) Does Tumor Growth Follow a “Universal Law”? Journal of Theoretical Biology, 225, 147-151.
 Bose, P., Brockton, N.T., Guggisberg, K., Nakoneshny, S.C., Kornaga, E., Klimowicz, A.C., Tambasco, M. and Dort, J.C. (2015) Fractal Analysis of Nuclear Histology Integrates Tumor and Stromal Features into a Single Prognostic Factor of the Oral Cancer Microenvironment. BMC Cancer, 15, 409.
 Neumann, J. (1928) Zur Theorie der Gesellschaftsspiele. Mathematische Annalen, 100, 295-320. (English Translation: Tucker, A.W. and Luce, R.D. (1959) On the Theory of Games of Strategy. Contributions to the Theory of Games, 4, 13-42.)
 Menon, S.N., Sasidevan, V. and Sinha, S. (2008) Emergence of Cooperation as a Non-Equilibrium Transition in Noisy Spatial Games. Frontiers in Physics, 6, 34.
 Kaplan, E.L. and Meier, P. (1958) Nonparametric Estimation from Incomplete Observations. Journal of the American Statistical Association, 53, 457-481.
 Etikan, I., Abubakar, S. and Alkassim, R. (2017) The Kaplan Meier Estimate in Survival Analysis. Biometrics & Biostatistics International Journal, 5, Article ID: 00128.
 Camazine, S., Deneubourg, J.-L., Franks, N.R., Sneyd, J., Theraulaz, G. and Bonabeau, E. (2003) Self-Organization in Biological Systems. Princeton Studies in Complexity, Princeton Univ. Press, Princeton, Oxford.
 Brown, W.K. and Wohletz, K.H. (1995) Derivation of the Weibull Distribution Based on Physical Principles and Its Connection to the Rosin-Rammler and Lognormal Distributions. Journal of Applied Physics, 78, 2758-2764.
 Batdorf, S.B. (1978) Fundamentals of the Statistical Theory of Fracture. In: Bradt, R.C., Hasselman, D.P.H. and Lange, F.F., Eds., Fracture Mechanics of Ceramics, Vol. 3, Plenum Press, New York, 1.
 Gompertz, B. (1825) On the Nature of the Function Expressive of the Law of Human Mortality and on a New Mode of Determining the Value of Life Contingencies. Philosophical Transactions of the Royal Society of London, 115, 513-585.
 Wilson, D.L. (1994) The Analysis of Survival (Mortality) Data: Fitting Gompertz, Weibull, and Logistic Functions. Mechanisms of Ageing and Development, 74, 15-33.
 Pham, H. (2008) Mortality Modeling Perspectives. In: Recent Advances in Reliability and Quality in Design, Springer Series in Reliability Engineering, Springer-Verlag, London, 509-516.
 Bru, A., Albertos, S., Subiza, J.L., García-Asenjo, J.L. and Bru, I. (2003) The Universal Dynamics of Tumor Growth. Biophysical Journal, 85, 2948-2961.
 Demicheli, R., Biganzoli, E., Boracchi, P., Greco, M., Hrushesky, W.J.M. and Retsky, M.W. (2006) Allometric Scaling Law Questions the Traditional Mechanical Model for Axillary Lymph Node Involvement in Breast Cancer. Journal of Clinical Oncology, 24, 4391-4396.
 Oguntunde, P.E., Balogun, O.S., Okagbue, H.I. and Bishop, S.A. (2015) The Weibull-Exponential Distribution: Its Properties and Applications. Journal of Applied Sciences, 15, 1305-1311.
 El-Bassiouny, A.H., El-Damcese, M.A., Mustafa, A. and Eliwa, M.S. (2017) Exponentiated Generalized Weibull-Gompertz Distribution with Application in Survival Analysis. Journal of Statistics Applications & Probability, 6, 7-16.
 Ricklefs, R.E. and Scheuerlein, A. (2002) Biological Implications of the Weibull and Gompertz Models of Aging. Journal of Gerontology: Biological Sciences, 57A, B69-B76.
 Waliszewski, P. and Konarski, J. (2005) A Mystery of the Gompertz Function. In: Losa, G.A., Merlini, D., Nonnenmacher, T.F. and Weibel, E.R., Eds., Fractals in Biology and Medicine, Birkhäuser, Basel, 277-286.
 Nijhout, H.F. and German, R.Z. (2012) Developmental Causes of Allometry: New Models and Implications for Phenotypic Plasticity and Evolution. Integrative and Comparative Biology, 52, 43-52.
 Hajian-Tilaki, K.O., Hanley, J.A., Joseph, L. and Collet, J.-P. (1997) A Comparison of Parametric and Nonparametric Approaches to ROC Analysis of Quantitative Diagnostic Tests. Medical Decision Making, 17, 94-102.
 Bex, P.J., Metha, A.B. and Makous, W. (1998) Psychophysical Evidence for a Functional Hierarchy of Motion Processing Mechanisms. Journal of the Optical Society of America A, 15, 769-777.
 Pelli, D.G. and Farell, B. (1995) Psychophysical Methods. In: Bass, M., Van Stryland, E.W., Williams, D.R. and Wolfe, W.L., Eds., Handbook of Optics, 2nd Edition, McGraw-Hill, New York, I (29.21-29.13).
 Wang, H., Wang, Z., Li, X., et al. (2011) A Robust Approach Based on Weibull Distribution for Clustering Gene Expression Data. Algorithms for Molecular Biology, 6, 14.
 Pugno, N.M. (2007) A Statistical Analogy between Collapse of Solids and Death of Living Organisms: Proposal for a “Law of Life”. Medical Hypotheses, 69, 441-447.
 Liu, S., Wang, Y., Xu, K., Wang, Z., Fan, X., Zhang, C., Li, S., Qiu, X. and Jiang, T. (2017) Relationship between Necrotic Patterns in Glioblastoma and Patient Survival: Fractal Dimension and Lacunarity Analyses Using Magnetic Resonance Imaging. Scientific Reports, 7, Article No. 8302.
 Weston, C.L., Douglas, C., Craft, A.W., Lewis, I.J. and Machin, D. (2004) Establishing Long-Term Survival and Cure in Young Patients with Ewing’s Sarcoma. British Journal of Cancer, 91, 225-232.
 Jones, G. and Rocke, D.M. (2002) Multivariate Survival Analysis with Doubly-Censored Data: Application to the Assessment of Accutane Treatment for Fibrodysplasia Ossificans Progressiva. Statistics in Medicine, 21, 2547-2562.
 Pourhoseingholi, M.A., Pourhoseingholi, A., Vahedi, M., Moghimi Dehkordi, B., Safaee, A., Ashtari, S. and Zali, M.R. (2011) Alternative for the Cox Regression Model: Using Parametric Models to Analyze the Survival of Cancer Patients. Iranian Journal of Cancer Prevention, 4, 1-9.
 Pourhoseingholi, M., Hajizadeh, E., Moghimi Dehkordi, B., Safaee, A., Abadi, A. and Zali, M.R. (2007) Comparing Cox Regression and Parametric Models for Survival of Patients with Gastric Carcinoma. Asian Pacific Journal of Cancer Prevention: APJCP, 8, 412-416.
 Hoyle, M.W. and Henley, W. (2011) Improved Curve Fits to Summary Survival Data: Application to Economic Evaluation of Health Technologies. BMC Medical Research Methodology, 11, 139.
 Barriga, G.D.C., Louzada-Neto, F., Ortega, E.M.M. and Cancho, V. (2010) A Bivariate Regression Model for Matched Paired Survival Data: Local Influence and Residual Analysis. Statistical Methods and Applications, 19, 477-495.
 Lambert, P.C., Thompson, J.R., Weston, C.L. and Dickman, P.W. (2007) Estimating and Modeling the Cure Fraction in Population-Based Cancer Survival Analysis. Biostatistics, 8, 576-594.
 Szasz, A., Dani, A., Varkonyi, A. and Magyar, T. (2005) Retrospective Analysis of 1180 Oncological Patients Treated by Electro-Hyperthermia in Hungary. Strahlentherapie und Onkologie (Radiation Oncology), 181, 121-122.