Microblogging has become an important platform for public opinion, media communication, and corporate brand and product promotion because it is built on sharing, interaction and openness. An enterprise microblog is an account that an enterprise opens on a microblogging platform. As an independent communication medium within social media, the enterprise micro-blog has considerable commercial value and has become an important means of online marketing, so studying it is of great practical significance. To mine the information in enterprise microblogs effectively and realize their commercial value, this paper presents an enterprise micro-blog analysis method based on knowledge networks. Chapter 2 briefly reviews related research, chapter 3 constructs the enterprise micro-blog knowledge network model, chapter 4 proposes the analysis method based on that model, and chapter 5 verifies the feasibility of the model using Club of Huawei as an example.
2. An Overview of Related Studies
An enterprise microblog, that is, an enterprise account on a microblogging platform, is an independent medium within social media and has significant commercial value. B. J. Jansen and M. Zhang analyzed 150,000 microblog posts on Twitter and found that 3.8% of them expressed sentiment toward a brand, and that microblogging sites were becoming a platform for enterprise marketing, customer relationship maintenance and word of mouth. However, microblog texts are short, contain few words and use nonstandard language, which makes effective analysis of enterprise microblogs difficult.
The concept of the knowledge network was first proposed by Swedish industry, and related research began in the mid-1990s. Beckmann, from an academic point of view, described the knowledge network as the institutions and activities that produce and disseminate scientific knowledge. Andreas regards the knowledge network as a social network among participants, through which participants at all levels produce and transfer knowledge and thereby create value. These characteristics fit well with the originality, communication and interactivity of microblogs, so building an enterprise micro-blog knowledge network can better uncover the inherent value of enterprise microblogs.
3. The Construction of Enterprise Micro-Blog Knowledge Network
3.1. Microblogging Content Acquisition
At present, the Sina microblogging API (Application Programming Interface) is not fully open: there are many restrictions on the number of results per query, on access to data resources and on call frequency. Obtaining comprehensive enterprise microblogging data through the API alone is therefore difficult and inefficient, so this paper uses Web crawler technology to obtain enterprise microblogging data.
A Web crawler is a client program used to obtain information from web pages. It works as follows:
1) The crawler establishes a connection with the server and sends an HTTP request for a web page;
2) The server receives the request and responds by returning the HTML source of the page to the crawler;
3) The crawler parses the HTML page, extracts the URLs it contains and adds them to the URL queue;
4) If the URL queue still contains URLs that have not been crawled, the crawler returns to step 1 and continues.
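The four steps above can be sketched as a breadth-first loop over a URL queue. This is a minimal illustration, not the crawler used in this paper: the names `crawl`, `LinkExtractor`, `fetch` and `max_pages` are hypothetical, and the page download is passed in as a function so that the sketch makes no assumption about how Sina pages are requested (login, cookies and rate limits are outside its scope).

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url, fetch, max_pages=100):
    """Breadth-first crawl; `fetch(url)` must return the page HTML as a string."""
    queue = deque([seed_url])   # the URL queue of step 3
    seen = {seed_url}           # URLs that have already been queued
    pages = {}                  # url -> HTML source
    while queue and len(pages) < max_pages:   # step 4: continue while URLs remain
        url = queue.popleft()
        html = fetch(url)                     # steps 1 and 2: request and response
        pages[url] = html
        parser = LinkExtractor()
        parser.feed(html)                     # step 3: parse the page
        for href in parser.links:
            absolute = urljoin(url, href)     # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return pages
```

In practice `fetch` could wrap `urllib.request.urlopen`, and a real crawler would also restrict the queue to the pages of the target account.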
3.2. Enterprise Micro-Blog Knowledge Network Node Acquisition
The collection of microblogging content obtained with the Web crawler is:

$$M = \{\, m_i \mid i = 1, 2, \ldots, a \,\} \tag{1}$$

where $i$ is the microblog number, $a$ is the total number of microblogs obtained, and $M$ is the collection of all microblogging content.

To obtain the keywords of each micro-blog text, we first segment the text into words, which yields the enterprise micro-blog node set $K$:

$$K = \{\, k_j \mid k_j \text{ occurs in some } m_i \in M \,\} \tag{2}$$

The word frequency of keyword $k_j$ is denoted $q(k_j)$, and the high-frequency keyword set $K_1$ is:

$$K_1 = \{\, k_j \in K \mid q(k_j) > \theta \,\} \tag{3}$$

where $\theta$ is the threshold value used to distinguish between high-frequency and low-frequency keywords.
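The frequency count and the threshold step can be sketched as follows. This is only an illustration under stated assumptions: each microblog is assumed to be already segmented into a list of words, word frequency is taken to be the total number of occurrences, and the function names and toy data are hypothetical.

```python
from collections import Counter


def keyword_frequencies(segmented_microblogs):
    """Count how often each keyword occurs over all segmented microblogs."""
    counts = Counter()
    for words in segmented_microblogs:
        counts.update(words)
    return counts


def high_frequency_keywords(counts, threshold):
    """Return the keywords whose frequency is strictly greater than the threshold."""
    return {word for word, freq in counts.items() if freq > threshold}


# Toy data standing in for the segmented microblog collection M.
M = [["pollen", "honor", "phone"], ["pollen", "honor"], ["pollen", "screen"]]
counts = keyword_frequencies(M)
K1 = high_frequency_keywords(counts, threshold=1)
```

With this toy data, only the keywords that occur more than once remain in `K1`.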
3.3. Network Weights
In enterprise microblogs, high-frequency keywords often represent the focus of microblog communication. From the word frequencies of the acquired keywords, we obtain the frequency weight set $Q(K)$ of the high-frequency keywords:

$$Q(K) = \{\, q(k_i) \mid k_i \in K_1 \,\} \tag{4}$$

In the enterprise micro-blog network, two keywords are linked through microblog content: the more often two keywords appear in the same microblog, the closer the relationship between the two topics. According to whether two high-frequency nodes appear in the same microblog, we can build the co-occurrence relationship set of the high-frequency nodes. Here $e_{ij} = 1$ indicates that the high-frequency nodes $k_i$ and $k_j$ co-occur in the same enterprise microblog, and $e_{ij} = 0$ indicates that there is no co-occurrence relationship between $k_i$ and $k_j$. Assuming a total of $n$ high-frequency keywords, the co-occurrence relationship set can be expressed as:

$$E = \{\, e_{ij} \mid i, j = 1, 2, \ldots, n;\ i \neq j \,\} \tag{5}$$

Based on the number of co-occurrences between high-frequency nodes, the co-occurrence weight set of the high-frequency nodes can be constructed as follows:

$$Q(E) = \{\, w_{ij} \mid i, j = 1, 2, \ldots, n;\ i \neq j \,\} \tag{6}$$

where $w_{ij}$ is the number of microblogs in which $k_i$ and $k_j$ both appear. The greater the value of $w_{ij}$, the stronger the link between the two keywords. By counting the number of times each pair of keywords appears in the same microblog, we can build the co-occurrence matrix of the enterprise micro-blog high-frequency keywords.
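A minimal sketch of the co-occurrence count is given below. It assumes that a keyword is counted at most once per microblog and stores each unordered pair under an alphabetically sorted key; the function name and the toy data are hypothetical.

```python
from itertools import combinations


def cooccurrence_weights(segmented_microblogs, high_freq):
    """Count, for each unordered keyword pair, the microblogs that contain both."""
    weights = {}
    for words in segmented_microblogs:
        # Keep only high-frequency keywords, once per microblog, in sorted order.
        present = sorted(set(words) & high_freq)
        for a, b in combinations(present, 2):
            weights[(a, b)] = weights.get((a, b), 0) + 1
    return weights


# Toy data: three segmented microblogs and a hypothetical high-frequency set.
microblogs = [["pollen", "honor", "phone"], ["pollen", "honor"], ["pollen", "screen"]]
pair_weights = cooccurrence_weights(microblogs, {"pollen", "honor", "phone"})
```

A pair that never co-occurs is simply absent from the dictionary, which corresponds to a zero entry in the co-occurrence matrix.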
3.4. The Construction of Enterprise Micro-Blog Knowledge Network
From the high-frequency keyword set obtained from (3), the keyword frequency set obtained from (4) and the co-occurrence counts obtained from (6), we obtain the enterprise micro-blog weighted knowledge network (EMKN) model:

$$\mathrm{EMKN} = \bigl(K_1,\ Q(K),\ Q(E)\bigr) \tag{7}$$

Formula (7) shows that the network carries two kinds of weights, $Q(K)$ and $Q(E)$, where $Q(K)$ gives the number of times each node appears and $Q(E)$ gives the number of co-occurrences between nodes. The importance of a node can be assessed from both.
4. Analysis Method of Enterprise Micro-Blog Based on Knowledge Network
4.1. Identification of Important Nodes Based on Word Frequency
Using the EMKN model, the nodes with larger weights can be identified and analyzed. The high-frequency keyword set consists of the keywords whose word frequency exceeds a high-frequency threshold. This set represents the points that the enterprise most wants to convey to users across all of its published microblogs. The threshold can be chosen by the enterprise according to its actual situation.
4.2. Centre-Point Identification Based on Co-Occurrence Matrix
In a social network, centrality measures a node's position in the overall network. Nodes with high centrality occupy the core of the network: other nodes are either directly connected to them or reach other nodes through them. We refer to these nodes as “central points” and measure their importance with two indices: degree centrality (CD) and betweenness centrality (CB).
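The larger the degree of a node, the higher its centrality and the greater its importance in the network.
Without considering the weights of the edges, the degree centrality of node $k_i$ is:

$$C_D(k_i) = \sum_{j = 1,\ j \neq i}^{n} e_{ij} \tag{8}$$

where $e_{ij} = 1$ if nodes $k_i$ and $k_j$ co-occur and $e_{ij} = 0$ otherwise.
Considering the weights of the edges, the degree centrality of node $k_i$ is:

$$C_D^{w}(k_i) = \sum_{j = 1,\ j \neq i}^{n} w_{ij} \tag{9}$$

where $w_{ij}$ is the number of co-occurrences of $k_i$ and $k_j$.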
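Both variants of the degree measure can be read directly off the co-occurrence counts. The sketch below assumes the network is stored as a dictionary that maps each unordered keyword pair to its co-occurrence count; this representation, the function name and the toy data are hypothetical, and the values returned are raw sums rather than normalized scores.

```python
def degree_centrality(weights):
    """Return (unweighted, weighted) degree of every node in an undirected network.

    `weights` maps an unordered keyword pair (a, b) to its co-occurrence count.
    """
    unweighted, weighted = {}, {}
    for (a, b), w in weights.items():
        if w <= 0:
            continue  # no co-occurrence means no edge
        for node in (a, b):
            unweighted[node] = unweighted.get(node, 0) + 1   # number of neighbours
            weighted[node] = weighted.get(node, 0) + w       # sum of edge weights
    return unweighted, weighted


# Toy co-occurrence counts for three keywords.
toy_weights = {("honor", "phone"): 1, ("honor", "pollen"): 2, ("phone", "pollen"): 1}
plain_degree, weighted_degree = degree_centrality(toy_weights)
```

In this toy network every keyword has two neighbours, so only the weighted version separates "phone" from the other two keywords.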
Betweenness centrality can be interpreted as follows: if two non-adjacent nodes s and t want to interact and node i lies on a path between them, then node i may control their interaction. Therefore, in the constructed knowledge network, a node that lies on the paths connecting many pairs of nodes occupies an important position in the network.
Similarly, the betweenness centrality of node $k_i$ is:

$$C_B(k_i) = \sum_{s < t;\ s, t \neq i} \frac{g_{st}(k_i)}{g_{st}} \tag{10}$$

where $g_{st}$ denotes the number of shortest paths from node $k_s$ to node $k_t$, and $g_{st}(k_i)$ denotes the number of those shortest paths that pass through $k_i$.
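The shortest-path counts in this measure can be computed with Brandes' algorithm. The sketch below is an illustration under assumptions, not the routine used in this study: it treats the network as unweighted and undirected, takes a hypothetical `adjacency` dictionary that maps each node to the set of its neighbours, and returns raw (unnormalized) values.

```python
from collections import deque


def betweenness_centrality(adjacency):
    """Brandes' algorithm for an unweighted, undirected graph."""
    centrality = {v: 0.0 for v in adjacency}
    for s in adjacency:
        # Breadth-first search from s, counting shortest paths (sigma).
        order = []
        preds = {v: [] for v in adjacency}
        sigma = {v: 0 for v in adjacency}
        dist = {v: -1 for v in adjacency}
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adjacency[v]:
                if dist[w] < 0:                 # w is reached for the first time
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:      # v lies on a shortest path to w
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Accumulate pair dependencies in order of decreasing distance from s.
        delta = {v: 0.0 for v in adjacency}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                centrality[w] += delta[w]
    # Each unordered pair (s, t) was visited once from each end.
    return {v: c / 2 for v, c in centrality.items()}


# Toy path network a - b - c: only b lies between another pair of nodes.
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b"}}
path_scores = betweenness_centrality(path)
```

Network analysis packages can also report a normalized version of this value, so raw scores from this sketch are not directly comparable with normalized ones.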
Nodes with high values on these centrality measures are therefore important for discovering the topics of the microblogs.
4.3. Cluster Identification Based on Cohesive Subgroups
Using the EMKN model, we can cluster the enterprise micro-blog keywords by cohesive subgroup analysis, and use the node and edge weights of the network for further analysis. This makes it possible to analyze not only the weight of each group and the relationships between groups, but also the composition of each group. The visualization of the results also helps to identify the key communication points of the enterprise micro-blog. In addition, the knowledge points of any subgroup can be represented by a knowledge subnet, with which we can analyze the internal structure, hot spots and association patterns of the subgroup in depth.
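To give a concrete feel for how edge weights can partition keywords into groups, the sketch below uses a deliberately simple stand-in: it keeps only the edges whose weight reaches a cutoff and returns the connected components. This is not the cohesive subgroup procedure applied in this paper; the function name, the `min_weight` parameter and the toy data are hypothetical.

```python
def threshold_groups(nodes, weights, min_weight):
    """Group nodes into connected components using only edges with weight >= min_weight."""
    parent = {v: v for v in nodes}

    def find(v):
        # Union-find root lookup with path halving.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for (a, b), w in weights.items():
        if w >= min_weight:
            parent[find(a)] = find(b)   # merge the two components

    groups = {}
    for v in nodes:
        groups.setdefault(find(v), set()).add(v)
    return sorted(sorted(group) for group in groups.values())


# Toy network: two strongly linked pairs and one weakly attached keyword.
toy_nodes = ["honor", "pollen", "phone", "battery", "metal"]
toy_edges = {("honor", "pollen"): 5, ("phone", "pollen"): 1, ("battery", "metal"): 4}
toy_groups = threshold_groups(toy_nodes, toy_edges, min_weight=2)
```

Raising or lowering the cutoff changes how finely the keywords are split, which is the main design choice in this simplified scheme.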
5. Examples and Application of Models
In this study, we selected Club of Huawei, an official Huawei account on the Sina microblogging platform, as the research object. In recent years, Huawei’s brand awareness and reputation have improved greatly. In the “2016 top 500 Chinese private enterprises” list released by the National Federation of Industry and Commerce, Huawei ranked first with annual revenue of 395.09 billion yuan. Club of Huawei is an interaction platform for Huawei’s fans: it answers fans’ questions, presents the latest product and service information promptly, and provides rich online content and offline interaction. Therefore, this paper selects this microblogging account for data collection.
5.1. Microblogging Access and Processing
First, we wrote a crawler to collect all the microblogs posted by Club of Huawei from January 1, 2015 to December 31, 2015, 2974 in total. Second, we segmented the microblog text: the NLPIR/ICTCLAS 2014 Chinese word segmentation system was used to preprocess the crawled text and extract the keywords of all the microblogs. The third step was to obtain the high-frequency keywords: we chose the 104 words with a frequency of more than 50. Table 1 shows some of the high-frequency words and their frequencies.
5.2. Construction of Knowledge Network
According to formula (4), we obtain the word frequencies of the high-frequency keywords. We then construct the matrix relating microblogs to keywords, a 2974 × 104 microblog-keyword matrix. In enterprise microblog text, keywords that co-occur are associated in some way, and the degree of association can be expressed by the frequency of co-occurrence. According to formula (6), we obtain the co-occurrence matrix of the 104 high-frequency keywords, a 104 × 104 co-word matrix.
According to the co-occurrence relationships between high-frequency keywords, the knowledge network model of Club of Huawei was constructed with the UCINET software, as shown in Figure 1.
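One way to read the step from the 2974 × 104 matrix to the 104 × 104 co-word matrix, assuming the entries of the first matrix are binary, is as a matrix product. The symbols $A$, $a_{mi}$, $C$ and $c_{ij}$ are introduced here for illustration only: $a_{mi} = 1$ if microblog $m$ contains keyword $i$ and $a_{mi} = 0$ otherwise.

$$C = A^{\mathsf{T}} A, \qquad c_{ij} = \sum_{m=1}^{2974} a_{mi}\, a_{mj}$$

Under this assumption, each off-diagonal entry $c_{ij}$ is the number of microblogs that contain both keyword $i$ and keyword $j$, and each diagonal entry $c_{ii}$ is the number of microblogs that contain keyword $i$. The dimensions agree with the text: $(104 \times 2974)(2974 \times 104) = 104 \times 104$.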
Table 1. High frequency keywords and word frequency table (part).
Figure 1. High frequency keywords co-occurrence network.
5.3. Identification and Analysis of Important Nodes
An enterprise microblog is a main channel for enterprise marketing and promotion and an important window for connecting directly with users, so its content often represents the main information the enterprise wants to pass to its users. In general, therefore, the high-frequency words in an enterprise microblog represent the focus of its content.
Table 1 shows that “Pollen”, “Huawei”, “mobile phone” and “honor” are the most frequently mentioned words in the microblogs of Club of Huawei. The list of high-frequency keywords indicates that Club of Huawei focused on Huawei’s mobile phone users, who are known as “Pollen”. In 2015, Huawei launched a number of new mobile phone models, including the honor series, Mate series, P series and Maimang series. The high-frequency vocabulary shows, however, that in 2015 Club of Huawei concentrated on promoting the honor series and “Huawei P8”. Although the high-end Mate series was a focus of Huawei’s 2015 launches, its share of the micro-blog marketing was not significant.
5.4. Identification and Analysis of Centre Point
To further study the importance of the key nodes in the Club of Huawei microblogs, this paper draws on social network centrality and uses the degree centrality and betweenness centrality of nodes to identify the central points. The analysis results are shown below:
As can be seen from Figure 2, the degree centrality of “pollen”, “honor”, “Huawei”, “topic” and “mobile phone” is relatively high, indicating that these keywords are directly connected to many other keywords. By contrast, keywords such as “Honor 7i” and “Honor Changwan 5X” have relatively high frequency, but their degree centrality is not correspondingly high. This means that, although these keywords appear often, they co-occur with few of the keywords used in other microblogs. Both the frequency and the degree centrality of “Huawei P8” are relatively high, which indicates that this model was the focus of Club of Huawei’s microblogs in 2015.
Table 2. High frequency keywords co-occurrence matrix (part).
Betweenness centrality reveals the importance of a node’s location in the whole knowledge network. As Figure 3 shows, the betweenness centrality of “pollen”, “Huawei”, “mobile phone” and “honor” is relatively high. These nodes lie at the core of the network and play an important mediating role.
It is noteworthy that, although the frequency and degree centrality of “Huawei P8” are relatively high, its betweenness centrality is only 17.241, which means that it is not very important in the network as a whole. This is because “Huawei P8” is an independent high-end model outside the honor series, so its intermediary role is not obvious.
Figure 2. Node degree centrality (partial).
Figure 3. Node betweenness centrality (partial).
5.5. Identification and Analysis of Cluster
Cohesive subgroup analysis of the enterprise micro-blog knowledge network makes it possible to divide the microblog topics into groups. The results of the analysis are as follows:
As shown in Figure 4, 103 high frequency keywords are divided into 8 subgroups by the cohesive subgroup analysis. The composition of each subgroup is as follows:
Table 3 shows that Club of Huawei uses different marketing methods for different models. Each group is analyzed below:
Group 1 mainly includes topics and activities that increase the stickiness of microblog fans, such as “Pollen Handy Photos”, “food” and “entertainment”. These microblogs relate to the daily life of fans and can effectively increase fan activity.
Group 2 mainly includes marketing activities for “Honor Changwan 5X”. The keywords in group 2 show that the microblog marketing activities for this product were mainly excellent purchase codes and lotteries.
Group 3 mainly includes marketing activities for “Huawei P8”. This group contains the most keywords, which shows that the marketing activities for this high-end model were the most diverse.
Group 4 mainly includes marketing activities for the new honor series products of 2015. Its composition shows that Club of Huawei launched a number of marketing activities for college students, which suggests that the honor series mainly targets young users, especially college students. This is consistent with the low-end positioning of honor series products.
Group 5 mainly includes “tablet”, “Honor Changwan 4C”, “Huawei Changxiang” and so on. Group 5 shows that these products were not the focus of Club of Huawei’s microblog marketing: their marketing activities lacked variety, and “price” and “brand” were their main selling points.
Group 6 contains the product information most relevant to users’ actual use, including “key”, “lens”, “video”, “information” and “function”.
Group 7 covers product design, including “battery”, “craft”, “metal” and “fuselage”.
Group 8 mainly includes keywords related to product features. These keywords show that Huawei mobile phones in 2015 made breakthroughs mainly in “screen”, “camera”, “fingerprint”, “chip”, “system” and “technology”.
Figure 4. The chart of cohesive subgroups of high frequency key words.
Table 3. High-frequency keyword classification table.
This paper constructs a micro-blog knowledge network in which the keywords of micro-blog text are the nodes, keyword frequencies are the node weights and co-occurrence relations between keywords are the weighted edges. Centrality analysis and cohesive subgroup analysis of this network identify the important nodes, the central nodes and the topic categories of microblog communication. Taking Huawei’s official micro-blog “Club of Huawei” as the research object, the paper then constructs the knowledge network of Club of Huawei, analyzes its key products, selling points and main marketing activities, and identifies the major improvements in Huawei’s mobile phone products in 2015. The example shows that the knowledge network model supports a thorough and comprehensive analysis of an enterprise’s micro-blog, which is of great significance for micro-blog marketing and enterprise competitive intelligence.