small worlds with a difference: new gatekeepers and the filtering of political information on...

21
Small Worlds with a Difference New Gatekeepers and the Filtering of Political Information on Twitter Pascal Jürgens University of Mainz [email protected] Andreas Jungherr University of Bamberg andreas.jungherr @uni-bamberg.de Harald Schoen University of Bamberg harald.schoen @uni-bamberg.de 1 full paper at the websci repository

Upload: pascal-juergens

Post on 29-Nov-2014

1.287 views

Category:

Education


2 download

DESCRIPTION

Political discussions on social network platforms represent an increasingly relevant source of political information, an opportunity for the exchange of opinions and a popular source of quotes for media outlets. We analyzed political communication on Twitter during the run-up to the German general election of 2009 by extracting a directed network of user interactions based on the exchange of political information and opinions. In consonance with expectations from previous research, the resulting network exhibits small-world properties, lending itself to fast and efficient information diffusion. We go on to demonstrate that precisely the highly connected nodes, characteristic for small-world networks, are in a position to exert strong, selective influence on the information passed within the network. We use a metric based on entropy to identify these New Gatekeepers and their impact on the information flow. Finally, we perform an analysis of their input and output of political messages. It is shown that both the New Gatekeepers and ordinary users tend to filter political content on Twitter based on their personal preferences. Thus, we show that political communication on Twitter is at the same time highly dependent on a small number of users, critically positioned in the structure of the network, as well as biased by their own political perspectives.

TRANSCRIPT

Page 2: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Political Tweets(German general election 2009)

“Ihr werdet euch noch wünschen wir wären Politikverdrossen [sic!].” – Max Winde (@343max)

rough translation:

“Soon you’ll be wishing we were through with politics”

2

Page 3: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

A Political QuestionAssuming that …

the web is a considerable source for political information

each user only perceives a selection of this information

facts and opinions have the potential to influence recipients and journalists

Q: what forces influence the visibility of information?

3

Page 4: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Old Answer

Journalists as gatekeepers

vs

Recipients as selective readers(“Uses & Gratifications”)

4

Page 5: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

New EnvironmentNetworked Information

Relevant – social networking sites see immense growth in adoption

New paradigms – real-time, recommended many-to-many communication

Perceived information varies, selection is distributed

Old answers neglect network effects in the process

5

Page 6: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

What We Know

Social networks tend to exhibit small world properties (so do networks of political communication on twitter)

Information flows fast and networks are relatively resistant to (random) disruptions

Information should be able to move unencumbered

6

Page 7: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

The ProblemsContagion model of information spread

Links can be substituted

But: Almost everyone has almost no friends (power law), key players have dominant role

Information relay via key players

Non-Random removal!

Problem moves from contagion to disruption(Borgatti’s KPP-Neg problem)

7

Page 8: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Comput Math Organiz Theor (2006) 12: 21–34 25

Fig. 3 Network in which removing the two most central nodes (“i” and “ j”) is not as disruptive as removinga different pair of nodes (“i” and “ j”)

Similarly, the graph-theoretic concept of a vertex cut-set generalizes the cutpoint to addressthe ensemble issue directly. A vertex cutset is a set of nodes whose removal would increasethe number of components in the graph. Most graph-theoretic work has focused on minimumweight cutsets, which are smallest sets that have the cutset property. However, cutsets retainmany of the same difficulties as cutpoints when applied to the KPP-Neg problem. First, wecannot specify the number of nodes in the set and then seek the set of that size that doesthe best job (rather, the measure of success is fixed and cutset methods are able to find asmallest set that achieves that level of success). In this sense, the graph-theoretic approachsolves the inverse of the problem we seek to solve. Second, no account is taken of the sizeof components created by the cut. A cut that isolates a single node is no better than onethat divides the graph into equal size components. Third, no account is taken of distancesamong nodes. Hence a set whose removal would not only cut the network in half but alsolengthen distances within each half would not be considered better than one that merely cutthe network in half.

A redundancy principle also applies to KPP-Pos. Consider the graph in Fig. 4. Nodes aand b are individually the best connected. Each is adjacent to five other nodes, more than anyother by far. But together they reach no more than either does alone. In contrast, if a is pairedwith c (which individually reaches only three nodes), the ensemble reaches every node in thenetwork. The reason that {a,c} is more effective than {a,b} is that a and c are less structurallyequivalent (Lorraine and White, 1971; Burt, 1976) than are a and b. Structural equivalencerefers to the extent to which two nodes have overlapping neighborhoods—i.e., are connectedto the same third parties. Structurally equivalent nodes are, by definition, redundant with

Fig. 4 Network in which the twomost central nodes taken together(“a” and “b”) are adjacent tofewer nodes than a different set ofnodes (“a” and “c”)

Springer

Borgatti (2010) p.39

Impact of Key Player Removal

8

Page 9: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Comput Math Organiz Theor (2006) 12: 21–34 25

Fig. 3 Network in which removing the two most central nodes (“i” and “ j”) is not as disruptive as removinga different pair of nodes (“i” and “ j”)

Similarly, the graph-theoretic concept of a vertex cut-set generalizes the cutpoint to addressthe ensemble issue directly. A vertex cutset is a set of nodes whose removal would increasethe number of components in the graph. Most graph-theoretic work has focused on minimumweight cutsets, which are smallest sets that have the cutset property. However, cutsets retainmany of the same difficulties as cutpoints when applied to the KPP-Neg problem. First, wecannot specify the number of nodes in the set and then seek the set of that size that doesthe best job (rather, the measure of success is fixed and cutset methods are able to find asmallest set that achieves that level of success). In this sense, the graph-theoretic approachsolves the inverse of the problem we seek to solve. Second, no account is taken of the sizeof components created by the cut. A cut that isolates a single node is no better than onethat divides the graph into equal size components. Third, no account is taken of distancesamong nodes. Hence a set whose removal would not only cut the network in half but alsolengthen distances within each half would not be considered better than one that merely cutthe network in half.

A redundancy principle also applies to KPP-Pos. Consider the graph in Fig. 4. Nodes aand b are individually the best connected. Each is adjacent to five other nodes, more than anyother by far. But together they reach no more than either does alone. In contrast, if a is pairedwith c (which individually reaches only three nodes), the ensemble reaches every node in thenetwork. The reason that {a,c} is more effective than {a,b} is that a and c are less structurallyequivalent (Lorraine and White, 1971; Burt, 1976) than are a and b. Structural equivalencerefers to the extent to which two nodes have overlapping neighborhoods—i.e., are connectedto the same third parties. Structurally equivalent nodes are, by definition, redundant with

Fig. 4 Network in which the twomost central nodes taken together(“a” and “b”) are adjacent tofewer nodes than a different set ofnodes (“a” and “c”)

Springer

Borgatti (2010) p.39

Impact of Key Player Removal

8

Page 10: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Comput Math Organiz Theor (2006) 12: 21–34 25

Fig. 3 Network in which removing the two most central nodes (“i” and “ j”) is not as disruptive as removinga different pair of nodes (“i” and “ j”)

Similarly, the graph-theoretic concept of a vertex cut-set generalizes the cutpoint to addressthe ensemble issue directly. A vertex cutset is a set of nodes whose removal would increasethe number of components in the graph. Most graph-theoretic work has focused on minimumweight cutsets, which are smallest sets that have the cutset property. However, cutsets retainmany of the same difficulties as cutpoints when applied to the KPP-Neg problem. First, wecannot specify the number of nodes in the set and then seek the set of that size that doesthe best job (rather, the measure of success is fixed and cutset methods are able to find asmallest set that achieves that level of success). In this sense, the graph-theoretic approachsolves the inverse of the problem we seek to solve. Second, no account is taken of the sizeof components created by the cut. A cut that isolates a single node is no better than onethat divides the graph into equal size components. Third, no account is taken of distancesamong nodes. Hence a set whose removal would not only cut the network in half but alsolengthen distances within each half would not be considered better than one that merely cutthe network in half.

A redundancy principle also applies to KPP-Pos. Consider the graph in Fig. 4. Nodes aand b are individually the best connected. Each is adjacent to five other nodes, more than anyother by far. But together they reach no more than either does alone. In contrast, if a is pairedwith c (which individually reaches only three nodes), the ensemble reaches every node in thenetwork. The reason that {a,c} is more effective than {a,b} is that a and c are less structurallyequivalent (Lorraine and White, 1971; Burt, 1976) than are a and b. Structural equivalencerefers to the extent to which two nodes have overlapping neighborhoods—i.e., are connectedto the same third parties. Structurally equivalent nodes are, by definition, redundant with

Fig. 4 Network in which the twomost central nodes taken together(“a” and “b”) are adjacent tofewer nodes than a different set ofnodes (“a” and “c”)

Springer

Borgatti (2010) p.39

Impact of Key Player Removal

8

Network of 8,609 german Twitter users who used political hashtags during the general election campaign 2009 (June 18 to September 30, 2009)

Edges are directed messages (RT and @) which contain political hashtags

Page 11: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Finding Key PlayersNeed general measure for a node’s impact on network’s capability to transfer information

» Ortiz-Arroyo (2010): Centrality Entropy metric (Hce)

9

Page 12: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Centrality Entropy

Calculates the entropy of a network = capacity for information transfer (~ease of transfer)

Iteratively remove nodes and re-calculate to determine largest impact = key players*(KPP-NEG)

Complexity is O(n3)

*note: no optimal solution for KPP-Neg

10

Page 13: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

ResultsNetwork centrality entropy: 12.1596

The most influential node reduces entropy by ~.1 when removed (N= 8,609)

Highest entropy impact of a node in Borgatti’s example networks: ~.1 (N=19)

Entropy impact declines rapidly (power law?)

» A small number of users can have a disproportionally large disruptive impact

11

Page 14: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Political Tweets II

During the election campaign, users explicitly coded party affiliations:

#party+ = positive (e.g. cdu+, spd+)

#party– = negative (e.g. cdu–, spd–)

This allows for the creation of party affiliation profiles

12

Page 15: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

User Bias

We calculated the distribution of party evaluations for outgoing and incoming* directed messages

A simple χ2 test was used to test for difference of those distributions

*messages from network neighbors

13

Page 16: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

ResultsA majority of users with testable volume of messages displayed a significant (p < 0.001) bias

All testable gatekeepers (top100) displayed bias

14

sig. bias no sig. biasnegative party

evaluations 2,866 201

positive party evaluations 3,835 34

Users with significantly differing distributions of incoming and outgoing party evaluations. N = 8,609Users with significantly differing distributions of incoming and outgoing party evaluations. N = 8,609Users with significantly differing distributions of incoming and outgoing party evaluations. N = 8,609

Page 17: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Example User

15

When removed together they decreased the network’s connectivity entropy by 0.6418447. To illustrate the destructive effect in traditional network terms: the number of strongly and weakly connected components increased from 5881 to 6574 and from 374 to 1350, respectively. At the same time, the average path length increased from 4.9885 to 5.9110.

Having identified New Gatekeepers, we turn to the question of how these users select information to spread in the network. In offline communication, traditional Gatekeepers were shown to select to present those elements of reality in their stories in accordance with personal, social and economical selection criteria [17]. A similar question arise with regard to the political twittersphere: Do Twitter users in general and New Gatekeepers in particular post political messages whose political position more or less reflects the political position of those messages that the users receive? Or, do the messages they published or chose to retweet show a political position significantly skewed from the political position of the messages which they receive? In short, do tweets accurately reflect the opinion space around a user, or are they a biased selection of these opinions based on choices by the Twitter user? The nature of our data set allows for a precise analysis of this question.

During the campaign of 2009 German users coded their Twitter messages according to their political convictions by including specific #keywords in their messages. If in agreement with a political party, they posted the party label followed by a + (#cdu+; #spd+). If in critique of a party, they posted the party label followed by a - (#cdu-; #spd-). For every user of our conversation network we computed the exact count of these positive or negative mentions of political parties contained in Twitter messages in her direct neighborhood in the network (the vertices a given vertex is directly linked to by an edge). We then computed the thus marked counts of positive or negative mentions of political parties in the Tweets the user herself published. In a next step we tested if the distribution of these counts showed significant differences with a chi square test. If true, we took this as evidence of bias. We used this procedure for all users in our political conversation network.

Table 2 exemplarily shows the bias of one New Gatekeeper in our data set. The table shows the relationship of outgoing and incoming messages in support of seven political parties for the Twitter account of the German politician Volker Beck, Member of Parliament for the Green Party (@volker_beck). The table shows that in the direct neighborhood of Volker Beck’s network a total of 1,269 messages in support of the FDP (liberals), containing the keyword #fdp+, were posted. We also see that Volker Beck did not post any message, containing #FDP+, in support of the FDP. Thus there is a bias, a systematic difference between the political opinions voiced in Volker Beck’s direct information environment and the political opinion Volker Beck chose to express in his own Tweets.

Table 2. Political bias of the Twitter account @volker_beck by the German politician Volker Beck (Bündnis 90/Die Grünen)

Outgoing2 Incoming3

CDU (conservatives) 0 511

CSU (conservatives) 0 58

SPD (social democrats) 0 876

FDP (liberals) 0 1,269

Grüne (green party) 19 630

Linke (socialists) 0 267

Piraten (pirate party) 10 4,944

Of the users in the conversation network, roughly 3,000 had enough in- and outgoing Twitter messages to statistically test for a bias. For negative mentions of parties, 2,866 users had a significant bias (p < .001) while a mere 201 users did not exhibit a significant bias. It should be noted that a bias could be identified for all of the testable gatekeepers (N=97). The situation is quasi identical for positive mentions of parties: 3,835 users with a significant bias stand against only 34 users without significant bias. Again, all of the gatekeepers (N=100) were found to have sent political messages with a significantly different distribution from their directly connected neighbors.

Consequently the messages of many Twitter users and those of nearly all critically placed New Gatekeepers do not accurately reflect the distribution of political opinion on Twitter but only the political opinion of that user. Add to that the heavy influence of a user’s structural position in the conversation network on the visibility of her opinion and we have a situation in which personal opinions of Twitter users are heavily amplified by the technological bias of the structure of conversation networks on Twitter. Journalists reporting on the political opinion on Twitter might thus only be reporting on the political opinion of a few individuals critically positioned in the structure of the conversation network.

6. NEW MEDIA, TECHNOLOGY, AND THE EMERGENCE OF NEW GATEKEEPERS Our paper thus contributes to the literature on small-world networks since we were able to show that in communication networks small-world structures do not necessarily lead to the fast and impartial distribution of information through network graphs. Instead, these networks are highly dependent on the willingness of users in critical network positions to distribute potentially expressive messages. Thus these individuals become information filters. This is especially relevant in the case of information distribution in highly partisan communication contexts, as for example politics. In our paper we showed the existence of New Gatekeepers, Twitter users in critical structural position in

2 Tweets in support of a political party published by Volker Beck. 3 Tweets in support of a political party published in Volker Beck’s

direct network neighborhood.

Page 18: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Takeaways

The network of political conversations during the general election campaign 2009 was dominated by a small number of key users which functioned as critical relays within the network

16

Page 19: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Takeaways

In their tweets, these key users did not mirror their network environment but instead exhibited an individual political bias

17

Page 20: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Takeaways

The visibility of political information on twitter is critically dependent on the network structure

18

Page 21: Small Worlds With a Difference: New Gatekeepers and the Filtering of Political Information on Twitter

Thank you for your (early morning) attention!Key References

Bakshy, E, Hofman, J. M., Mason, W. A., and Watts, D. J. 2011. Everyone’s an Influencer: Quantifying Influence on Twitter. Proceedings of the Fourth International Conference on Web Search and Data Mining (Hong Kong, China, February 09 – 12, 2011). WSDM 2011.

Borgatti, S. P.,2005. Centrality and network flow. Social Networks 27, 55-71. DOI - http://dx.doi.org/10.1016/j.socnet.2004.11.008.

Borgatti, S. P. 2006. Identifying sets of key players in a social network. Computational & Mathematical Organization Theory 12 (1), 21-34. DOI- http://dx.doi.org/10.1007/s10588-006-7084-x.

Cha, M., Haddadi, H., Benevenuto, F., and Gummadi, K. P. 2010. Measuring User Influence in Twitter: The Million Follower Fallacy. In Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media (Washington, DC, May 23 – 26, 2010). ICWSM. AAAI Press, Menlo Park, CA, 10-17.

Jungherr, A., Jürgens, P., and Schoen, H., 2012. Why the Pirate Party Won the German Election of 2009 or The Trouble With Predictions: A Response to Tumasjan, A., Sprenger, T. O., Sander, P. G., & Welpe, I. M. ‘Predicting Elections With Twitter: What 140 Characters Reveal About Political Sentiment’. Social Science Computer Review. Forthcoming. DOI- http://dx.doi.org/10.1177/0894439311404119.

Jürgens, P., and Jungherr, A. 2011. Wahlkampf vom Sofa aus: Twitter im Bundestagswahlkampf 2009. In: Schweitzer, E. J., and Albrecht, S. (eds.). Das Internet im Wahlkampf: Analysen zur Bundestagswahl 2009. VS-Verlag, Wiesbaden, 201-225. DOI- http://dx.doi.org/10.1007/978-3-531-92853- 1_8.

Ortiz-Arroyo, D. 2010. Discovering Sets of Key Players in Social Networks. In: Abraham, A., Hassanien, A.-E., and Snásel , V. (eds.). Computational Social Network Analysis. Springer Verlag, Dordrecht et al., 27-46. DOI- http://dx.doi.org/10.1007/978-1-84882-229-0_2.

Shannon, C.E. 1948. A mathematical theory of communication. The Bell System Technical Journal 27, 379– 423, 623–656.

Watts, D. J., and Strogatz, S. H. 1998. Collective dynamics of ‘small-world‘ networks. Nature 393, 440-442. DOI- http://dx.doi.org/10.1038/30918.

19