Yohsuke Murase (RIKEN Advanced Institute for Computational Science)
Wednesday 2016-08-10 11.00 – 12.00
Lecture hall AS3, TUAS building
What does Big Data tell? Sampling the social network by communication channels
Big Data has become the primary source of understanding the structure and dynamics of the society at large scale. The network of social interactions can be considered as a multiplex, where each layer corresponds to one communication channel and the aggregate of all them constitutes the entire social network. However, usually one has information only about one of the channels or even a part of it, which should be considered as a sample of the whole. Here we show by simulations and analytical methods that this sampling may lead to bias. For example, while it is expected that the degree distribution of the whole social network has a maximum at a value larger than one, we get with reasonable assumptions about the sampling method a monotonously decreasing distribution as observed in empirical studies of single channel data. We also find that assortativity may occur or get strengthened due to the sampling method. We analyze the far-reaching consequences of our findings.
Article: http://arxiv.org/abs/1511.08749, J. Torok, Y. Murase, H.-H. Jo, J. Kertesz, K. Kaski “What does Big Data tell? Sampling the social network by communication channels”.