Search Results

Now showing 1 - 2 of 2
  • Item
    Simultaneous statistical inference for epigenetic data
    (San Francisco, California, US : PLOS, 2015) Schildknecht, Konstantin; Olek, Sven; Dickhaus, Thorsten
    Epigenetic research leads to complex data structures. Since parametric model assumptions for the distribution of epigenetic data are hard to verify we introduce in the present work a nonparametric statistical framework for two-group comparisons. Furthermore, epigenetic analyses are often performed at various genetic loci simultaneously. Hence, in order to be able to draw valid conclusions for specific loci, an appropriate multiple testing correction is necessary. Finally, with technologies available for the simultaneous assessment of many interrelated biological parameters (such as gene arrays), statistical approaches also need to deal with a possibly unknown dependency structure in the data. Our statistical approach to the nonparametric comparison of two samples with independent multivariate observables is based on recently developed multivariate multiple permutation tests. We adapt their theory in order to cope with families of hypotheses regarding relative effects. Our results indicate that the multivariate multiple permutation test keeps the pre-assigned type I error level for the global null hypothesis. In combination with the closure principle, the family-wise error rate for the simultaneous test of the corresponding locus/parameter-specific null hypotheses can be controlled. In applications we demonstrate that group differences in epigenetic data can be detected reliably with our methodology.
  • Item
    Communication activity in a social network: Relation between long-term correlations and inter-event clustering
    (London : Nature Publishing Group, 2012) Rybski, D.; Buldyrev, S.V.; Havlin, S.; Liljeros, F.; Makse, H.A.
    Human communication in social networks is dominated by emergent statistical laws such as non-trivial correlations and temporal clustering. Recently, we found long-term correlations in the user's activity in social communities. Here, we extend this work to study the collective behavior of the whole community with the goal of understanding the origin of clustering and long-term persistence. At the individual level, we find that the correlations in activity are a byproduct of the clustering expressed in the power-law distribution of inter-event times of single users, i.e. short periods of many events are separated by long periods of no events. On the contrary, the activity of the whole community presents long-term correlations that are a true emergent property of the system, i.e. they are not related to the distribution of inter-event times. This result suggests the existence of collective behavior, possibly arising from nontrivial communication patterns through the embedding social network.