To read this content please select one of the options below:

Factor analysis of Internet traffic destinations from similar source networks

Felipe Mata (Departamento de Tecnología Electrónica y de las Comunicaciones, Universidad Autónoma de Madrid, Madrid, Spain)
José Luis García‐Dorado (Departamento de Tecnología Electrónica y de las Comunicaciones, Universidad Autónoma de Madrid, Madrid, Spain)
Javier Aracil (Departamento de Tecnología Electrónica y de las Comunicaciones, Universidad Autónoma de Madrid, Madrid, Spain)
Jorge E. López de Vergara (Departamento de Tecnología Electrónica y de las Comunicaciones, Universidad Autónoma de Madrid, Madrid, Spain)

Internet Research

ISSN: 1066-2243

Article publication date: 27 January 2012

1670

Abstract

Purpose

This study aims to assess whether similar user populations in the Internet produce similar geographical traffic destination patterns on a per‐country basis.

Design/methodology/approach

The authors collected a country‐wide NetFlow trace, which encompasses the whole Spanish academic network. Such a trace comprises several similar campus networks in terms of population size and structure. To compare their behaviors, the authors propose a mixture model, which is primarily based on the Zipf‐Mandelbrot power law to capture the heavy‐tailed nature of the per‐country traffic distribution. Then, factor analysis is performed to understand the relation between the response variable, number of bytes or packets per day, with dependent variables such as the source IP network, traffic direction, and country.

Findings

Surprisingly, the results show that the geographical distribution is strongly dependent on the source IP network. Furthermore, even though there are thousands of users in a typical campus network, it turns out that the aggregation level which is required to observe a stable geographical pattern is even larger.

Practical implications

Based on these findings, conclusions drawn for one network cannot be directly extrapolated to different ones. Therefore, ISPs' traffic measurement campaigns should include an extensive set of networks to cope with the space diversity, and also encompass a significant period of time due to the large transient time.

Originality/value

Current state of the art includes some analysis of geographical patterns, but not comparisons between networks with similar populations. Such comparison can be useful for the design of content distribution networks and the cost‐optimization of peering agreements.

Keywords

Citation

Mata, F., Luis García‐Dorado, J., Aracil, J. and López de Vergara, J.E. (2012), "Factor analysis of Internet traffic destinations from similar source networks", Internet Research, Vol. 22 No. 1, pp. 29-56. https://doi.org/10.1108/10662241211199951

Publisher

:

Emerald Group Publishing Limited

Copyright © 2012, Emerald Group Publishing Limited

Related articles