In the Late-December of 2019, the novel coronavirus outbreak took place in Wuhan City, Hubei province, China. Coronavirus disease is officially named by the World Health Organization as COVID-19 (Coronavirus Disease 2019) on 12th January 2020. It can result in influenza, or even Middle East Respiratory Syndrome and Severe Acute Respiratory Syndrome. Currently, it is commonly believed that COVID-19 is caused by the sale and consumption of bats. Since December 2019, coronavirus disease has spread through relocation diffusion and contagious diffusion from the center, Wuhan city.
The existing literature on contagious disease outbreak is mainly based on the researching of the pathology of novel coronavirus and drug development  , epidemic prediction , and disease transmission  . Among the analysis of disease transmission, most are concentrated on assessing the potential of human-to-human transmission. As this conclusion is now verified, less study, however, focused on researching influencing factors that affect trends of disease transmissions. Additionally, while the existing papers based their study areas on the center of the outbreak, Wuhan City, or on China as a whole, fewer papers have concentrated on minor provinces or municipalities that experienced an anomalous transmission. Therefore, this research will study four provinces (Heilongjiang province, Guangdong province, Guizhou province, Tianjin municipality) that experience an anomalous disease-spreading (that is, disease-spreading does not influence by geographical locations to the center) from 21st January 2020 to 19th February 2020, and analyzes the four influencing factors (temperature, transportation and passenger traffic volume, urban planning, and policy-making and individual consciousness) with respect to transmission. This paper will provide helpful guidance for disease prevention and control in the capital city of China, Beijing, other cities and countries across the world, and for future copings with similar contagious diseases.
The paper is organized as follows. Section 2 presents past literature on disease transmission and their characteristics. Section 3 introduces the data sources and methodology. Section 4 presents and analyzes the influencing factors for anomalous disease-spreading in four provinces. Section 5 discusses plausible suggestions for disease control and prevention in Beijing, and concludes the paper.
2. Literature Review
Most papers at present have focused on researching disease transmission. In particular, Riou et al. studied transmission patterns and the potential for sustained human-to-human transmission of 2019-nCoV in China . According to Lin et al., public health interventions implemented at both the social and personal levels are effective in preventing outbreaks of COVID-19 in Wuhan and the other 29 provinces in China . Based on 41 cases of 2019-nCoV in Wuhan City, Tang et al. find out that intensive contact tracing followed by quarantine and isolation can effectively reduce the control reproduction number and transmission risk .
The existing literature has mostly pinpointed their study areas in China as a whole. Fewer studies have narrowed down their focus on provinces and regions that experience anomalous disease-spreading (transmitting trends in provinces, municipalities, autonomous regions that are not influenced by geographical locations to the center). In addition, these studies put less focus on analyzing other influencing factors, such as government and residential awareness, temperature, passenger traffic volume, and household size and distribution. Thus, this paper is going to focus on studying four influencing factors in four provinces that manifested anomalous disease-spreading geographically.
3. Data Source and GIS Tools
3.1. Data Source
The data of accumulated confirmed cases of 2019-nCoV comes from Jinritoutiao (Today’s Headlines) APP by 19th February 2020 in all provinces/municipalities/autonomous regions in China mainland. The data of passenger traffic volume, number of large households, household distribution, and temperature derived from 2019 China yearbook, which reflects the statistics in 2018. We assume that the general rankings and trends of development for each influencing factor did not change significantly in 2019. For temperature, we adopted the average value in provincial capitals to represent the overall value in each province for correlation analysis. In addition, all the correlation analysis has eliminated the statistics of an obvious outlier, Hubei province. The research utilizes Python language in calculating the mean and ranking the statistics
3.2. GIS Tools
TIANDITU is a map website of China, which provides official free web mapping services. It features detailed street-level geographic data for China, and it is constructed and maintained by the National Geomatics Center of China (NGCC). This paper adopts the GIS Functions in TIANDITU (https://www.tianditu.gov.cn/) to gauge the distance between different province/municipality/autonomous regions and Wuhan City, China. Thus, this study derives regions that experience geographically anomalous disease spreading.
ArcGIS is a geographic information system (GIS) software for creating and editing maps and compiling and analyzing geographic data. This research visualizes the extent of confirmed coronavirus disease in each province/municipality/autonomous region in China mainland through ArcGIS. Confirmed cases are divided into seven intervals when presenting on maps in different colors, respectively 1 - 15, 15 - 60, 60 - 99, 99 - 499, 499 - 999, 999 - 4999, >4999 (unit: case).
4.1. Determination of Anomalous Cases
Table 1. Statistics of confirmed cases in each province/municipality/autonomous region by 19th Feb. 2020.
confirmed cases of the novel coronavirus disease are extensively concentrated in Hubei province, with 62,457 cases by 19th February. Figure 1 further reveals that geographical locations influences disease-spreading significantly. In
Figure 1. Spatial distribution of confirmed cases in China mainland by 19th February 2020.
particular, the top rankings mostly located within 600 km from Hubei Province: Henan province is approximately 467 km apart; Zhejiang province is about 562 km apart; Hunan province is around 297 km apart. However, in some provinces, geographical location remains one of the least influencing factors on disease-spreading. Guizhou province, in the proximity of Hubei, does not experience a remarkable outbreak of disease; Tianjin municipality, where people in Wuhan can easily arrive to for its developed railroad and highways, does not suffer greatly from coronavirus disease as well; Guangdong province and Heilongjiang province, situated more than 1000 km apart from Hubei province, has a large amount of confirmed diagnosis. Therefore, this paper will take these anomalous cases of disease-spreading into account and study the influencing factors that lead to the anomalies.
4.2. Case 1: Guangdong Province
4.2.1. Transportation and Passenger Traffic Volume
Table 2 and Figure 2 show the statistics and relationship between the number of confirmed cases and the passenger traffic volume in each province. Accordingly, the flow of population and the number of confirmed diagnoses have a moderately strong, approximately linear, positive correlation. Although Guangdong locates far from Hubei, it is one of the most developed provinces in China, with advanced transportation center at the mouth of the Pearl River and the largest seaport in southern China. Guangdong ranked top one in its passenger traffic volume (1,421,440 million people) in 2018. This results in Guangdong becoming a susceptible region to contagious diseases. Especially during the Spring Festival, the massive flow of population led to interaction and exchange between suspected cases and the public, resulting in the high confirmed cases in Guangdong.
Figure 2. Scatterplot of the passenger traffic volume and number of confirmed cases in each province/municipality/autonomous region by 19th Feb. 2020.
Table 2. Statistics of passenger traffic volume and number of confirmed cases in each province/municipality/autonomous region (sorted according to passenger traffic volume).
Figure 3 illustrates the correlation between average temperature of January and February and confirmed cases in each province/municipality/autonomous region. Accordingly, there is no relationship between confirmed cases and the temperature warmer than 0 degrees Celsius. For regions that are colder than 0 degrees Celsius, the low temperature is accompanied with low diagnoses. Besides Heilongjiang (an anomalous case), none of the provinces/municipalities/autonomous regions with negative temperatures have more than 400 confirmed cases. Although the novel coronavirus prefers colder environment, temperature appears to be a secondary influencing factor of 2019-nCoV transmission. According to the data in Table 3, Guangdong’s temperature ranked the top two among China. Based on the relationships illustrated in the scatterplot, Guangdong turns out to have a higher number of confirmed cases.
Table 3. Statistics of temperature and confirmed cases in each province/municipality/autonomous region by 19th February 2020.
Figure 3. Scatterplot of temperature and confirmed cases in each province/municipality/autonomous region by 19th February 2020.
4.2.3. Household Size and Distribution
Figure 4 illustrates the correlation between the number of medium- and large-sized households and the accumulated confirmed cases in each province/municipality/autonomous region in China mainland by 19th February 2020. There is a moderately strong, positive, approximately linear correlation. Since members within each household have close contact with each other every day, the larger families, the higher the possibility for infectious people to transmit 2019-nCoV to a broader group. Table 4 shows the statistics of medium- and large-sized households in each province. There are 17,860 million families that have more than two members in Guangdong. In this case, it is prone for Guangdong to have a greater scope of disease-spreading than other provinces.
Figure 5 indicates the correlation between the number of urbanized population and confirmed cases. As implied in the scatterplot, there is a moderately strong, positive, approximately linear correlation between the two variables. Urbanized population displays a more concentrated residential distribution compared to rural population. A denser distribution of people is linked with increased contacts between individuals, resulting in a higher risk of contagion. Guangdong has 80,220 million households that live in urban areas, ranking the first. This correlates to Guangdong’s 1332 cases of diagnosis, as shown in Table 5.
Figure 6 indicates, at the very beginning stage of coronavirus outbreak, a significant rate of increase in Guangdong compared to the other three provinces examined in this paper. This shows that Guangdong’s government and the public lacked serious awareness for disease control, which causes a large amount of confirmed diagnosis in later stages (in February).
4.3. Case 2: Tianjin Municipality
4.3.1. Transportation and Passenger Traffic Volume
Tianjin is located at the Bohai Bay in the Great Plains of North China. It has the
Figure 4. Scatterplot of medium- and large-sized households and confirmed cases in each provinces from 21st January to 19th February.
Figure 5. Scatterplot of the number of urbanized household and confirmed cases in each province/municipality/autonomous region by 19th February 2020.
Figure 6. Increasing trends of confirmed cases in Tianjin municipality, Guizhou province, Guangdong province, and Heilongjiang province from 21st January to 19th February 2020.
Table 4. Statistics of medium- and large-sized households and confirmed cases in each province from 21st January to 19th February.
largest seaport and water and land transportation facilities in northern China. Nevertheless, transportation at Tianjin is mainly constituted of maritime shipping and transporting activities with less population flow, as indicated above in Table 2. Therefore, Tianjin has successfully controlled the spread of disease.
Table 5. Statistics of urbanized household and confirmed cases in each province/municipality/autonomous region by 19th February 2020.
Tianjin locates in the Beijing-Tianjin-Hebei region, the northern part of China. The average temperature during January and February in Tianjin is −2.4 degrees Celsius. With a lower-than-0-degrees-Celsius temperature, Tianjin has fewer cases of confirmed 2019-nCoV.
During the disease outbreak, Tianjin has carried out “Implementation measures during coronavirus disease outbreak for promoting economic and social development” early in time. As illustrated in Figure 7, at the very beginning stage, Tianjin has a slow rate of increase in confirmed cases per day. Additionally, the Jinyun application has shared detailed trip information of diagnosed patients to inform residents about ongoing situations efficiently. As indicated in Figure 8, Tianjin has the lowest number of diagnoses within the Beijing-Tianjin-Hebei region: the confirmed cases in Beijing and Hebei province are about 3 to 4 times of those in Tianjin. This underscores that the government control through on-time policy-making and the update of detailed patients’ information has been especially critical for disease prevention and control.
4.4. Case 3: Guizhou Province
4.4.1. Transportation and Passenger Traffic Volume
According to Table 2, the population traffic volume of Guizhou is 930,250 million people in 2018, ranking in the top ten. However, during the outbreak, the heavy snow forced Guizhou to close roads and airport runways, which dissuades population movement and reduces the spread of disease.
Figure 7. Increasing trends of confirmed cases in Beijing-Tianjin-Hebei region from 21st January to 19th February 2020.
Figure 8. Number of confirmed cases in Beijing-Tianjin-Hebei region by 19th February 2020.
4.4.2. Household Size and Distribution
Guizhou is mainly constituted of mountainous, located at the Yungui Plateau. This reduces population density and concentration. In light of Table 4 and Table 5, Guizhou Province has only 6820 million urbanized households and 5080 million large households. The geography of Guizhou results in less urbanized residential concentrations and more scattered city arrangement across the provinces. Thus, the coronavirus cannot spread out easily.
Guizhou province has carried out five pieces of implementation measures to strengthen disease control: first, identify suspected cases from data; second, verify their detailed profile; third, acquire their daily routine and mobility information; fourth, strictly ask for following report of their condition; fifth, restricted suspected cases from inflowing into Guizhou . According to a detailed news report in Beijing Evening News, Guizhou has security officers collecting information about individuals’ mobility and body temperature . In addition, Guizhou has subsidized small and medium-sized enterprises to ensure their steady development during work extension. The high government awareness is conducive to disease control.
4.5. Case 4: Heilongjiang Province
4.5.1. Transportation and Passenger Traffic Volume
Though according to Table 2, the passenger traffic volume is 315,680 million people in Heilongjiang, ranking low, its Northern location makes Heilongjiang prone to infection. Trips to Heilongjiang are longer than those to other provinces, which increases the potential exposure to the virus and suspected cases. This further increases the possibility of infection, resulting in more confirmed cases in Heilongjiang.
Heilongjiang is located in the very northern part of China with an average outdoor temperature of −17.9 degree Celsius during January and February. The freezing weather dissuades people from room ventilations. Especially when people congregated, the indoor and enclosed areas make disease-spreading easier, causing a significant increase in diagnosis.
Heilongjiang’s policies and control are not so strict as Tianjin and Guizhou. Heilongjiang did not suffer from the outbreak of SARS in 2003, and neither the government institutions nor residents have taken timely measures and control during the 2019 outbreak of the novel coronavirus. The mobility information of infected people did not expose in time. The public has not taken high regard for self-isolation. According to Sohu news, 94 percent of cases of infection are due to indoor congregation in Heilongjiang . Furthermore, in accordance with Figure 9, Heilongjiang has 4 to 5 times of confirmed cases than Liaoning and
Figure 9. Confirmed cases in provinces in Northeastern China by 19th February 2020.
Jilin (Liaoning and Jilin have a similar geographical location as Heilongjiang, far from Hubei).
5. Discussions and Conclusions
This paper has studied the disease outbreak by 19th February 2020 in provinces and municipalities that experienced anomalous disease-spreading. Guangdong and Heilongjiang province have an unusually high quantity of confirmed diagnoses, whereas Tianjin municipality and Guizhou province are protected from a massive coronavirus transmission. This study focuses on four influencing factors that can account for these anomalies and obtains the following results. Transportation and passenger traffic volume and residential distribution and size are positively related to the extent of disease-spreading; the degree of government and individual consciousness has a negative correlation with disease-spreading. Although the novel coronavirus prefers colder environment, temperature appears to be a secondary influencing factor of 2019-nCoV transmission in this study, as regions with negative temperatures have fewer diagnoses. Disease transmission in Guangdong province is caused by the high volume of passenger traffic, large and urbanized households, and low awareness. Heilongjiang province is mainly a result of high passenger traffic volume, long travelling trips, and low public awareness. Guizhou province is benefited from high awareness, limited passenger volume, and scattered households. Tianjin municipality is protected from the severe disease-spreading owe to its beneficial temperature, low land transportation volume, and high public and government awareness.
Based on the results mentioned above, this paper will provide some suggestions for Beijing’s disease control and prevention. As a well-developed municipality and the center of political and cultural activities, Beijing deserves more attention, or it will affect a series of more regions.
According to the positive cases of disease control in Tianjin municipality and Guizhou province, this paper proposes the followings for the Beijing government and community officers. 1) Beijing government should take high regard for disease control. For example, it should try best to limit the population inflow on-time. It should also carry out policies like verifying the identities of drivers and passengers at traffic stations carefully, to avoid infected people from entering Beijing. 2) For communities, officers should keep detailed track of mobility and information of incoming visitors and residents.
However, as the capital of China, Beijing has a more complex procedure in its policy-making and implementation. So, at the same time, a higher awareness in public will be helpful for disease reduction, or Beijing may undergo a similar negative situation as Heilongjiang province. This paper further advises the public as follows. 1) Individual residents should reduce congregating activities and self-isolate at home. 2) For organizations and institutions, less congregating events should be held during the outbreak to reduce the possibility of wider 2019-nCoV transmission.
The shortcomings of this paper are, first, with increasing data in the future, this paper can conduct more accurate analysis; second, in the future study, this paper can adopt complex approaches and regressing methodologies to test correlations and to study disease transmission.
Y.L.: conceptualization, data processing, formal analysis, writing original draft, writing review and editing; Y.L. and Z.D.: visualization.
 Read, J.M., Bridgen, J.R.E., Cummings, D.A.T., et al. (2020) Novel Coronavirus 2019-nCoV: Early Estimation of Epidemiological Parameters and Epidemic Predictions.
 Nishiura, H., Jung, S.-M., Linton, N.M., Kinoshita, R., Yang, Y., Hayashi, K., Kobayashi, T., Yuan, B. and Akhmetzhanov, A.R. (2020) The Extent of Transmission of Novel Coronavirus in Wuhan, China. Journal of Clinical Medicine, 9, 330.
 Riou, J. and Althaus, C.L. (2020) Pattern of Early Human-to-Human Transmission of Wuhan 2019 Novel Coronavirus (2019-nCoV), December 2019 to January 2020. Eurosurveillance, 25, 2000058.
 Ralph, R., Lew, J., Zeng, T., et al. (2020) 2019-nCoV (Wuhan Virus), a Novel Coronavirus: Human-to-Human Transmission, Travel-Related Cases, and Vaccine Readiness. The Journal of Infection in Developing Countries, 14, 3-17.
 Tang, B., Wang, X., Li, Q., et al. (2020) Estimation of the Transmission Risk of the 2019-nCoV and Its Implication for Public Health Interventions. Journal of Clinical Medicine, 9, 462.