Seminario Interdipartimentale di Algoritmica
Monday, November 28, 2005, 12:00 noon
Traceroute-Like Exploration of Unknown Networks: A Statistical Analysis
Alain Barrat, Université de Paris-Sud
DIS - Department
of Computer and System Sciences
Room C3, second floor
Abstract:
The general methodology
used to construct Internet maps consists in merging all the discovered
paths obtained by sending data packets from a set of active computers
to a set of destination hosts, obtaining a graph-like representation of
the network. This technique, sometimes referred to as Internet
tomography, spurs the issue concerning the statistical reliability of
such empirical maps. We tackle this problem by modeling the network
sampling process and by using a mean-field approximation to obtain
expressions for the probability of edge and vertex detection in the
sampled graph. This allows a general understanding of the origin of
possible sampling biases.
Moreover, the inference of many of the most basic topological
quantities from traceroute measurements is in fact a version of the
so-called `species problem' in statistics, which are often quite
challenging. We focus on the most fundamental example of a
traceroute internet species: the number of nodes in a network. We
use statistical subsampling principles to derive two proposed
estimators, and we illustrate the performance of these estimators on
networks with various topological characteristics.
References: cs.NI/0510007, cs.NI/0412007, Phys. Rev. E 71 (2005) 036135.