Internet Flow Characterization -- Adaptive Timeout and Statistical Modeling
Bo Ryu (contact author) HRL Laboratories
David Cheney and Hans-Warner Braun
National Laboratories for Advanced Networking Research (NLANR), UC San Diego
There is a growing effort on understanding Internet traffic dynamics via the abstraction of flows from packets. We first present an adaptive flow timeout strategy, called Measurement-based Binary Exponential Timeout (MBET), for identifying and maintaining flow state information. In this study, we establish an urgent need for, and investigate the feasibility of adopting an adaptive flow timeout strategy that combines fixed and dynamic timeout values based on observed dynamics of intra-flow packet interarrival times. Applying fixed, identical timeout value across all flows, which is a common practice in several recent studies, makes it difficult to find an optimal timeout value, given that Internet flows show an extreme variability in flow statistics such as flow duration and volume. We demonstrate that, via an extensive analysis of a large number of traces collected at three different sites, the MBET achieves significant performance advantages over widely used fixed flow timeout scheme. This is possible since we find that there is a fair amount of flows exhibiting persistency in intra-flow packet arrivals dynamics and the MBET exploits it whenever possible. By persistency we mean that when a large (small) interarrival time is detected, it is more likely that future interarrival times are large (small) as well. If such persistency does exists in the majority of flows, it can serve as quite an effective means for detecting whether a flow will have a high throughput or not based on observing the packet dynamics during the initial period of the flow. For example, if a flow appears to exhibit high throughput in the beginning, a smaller timeout is likely to be sufficient to determine its end. Consequently, the advantages gained by MBET translate to faster detection of the end of flow, more accurate recognition of long-lived flows and substantial reduction in hardware resources.
Second, we present a highly versatile and parameterized statistical modeling framework for characterizing a broad spectrum of flow dynamics. With the growing interest in understanding and characterizing flow dynamics, it has become necessary and crucial to have a set of parameterized statistical modeling tools so that broadly different and yet faithfully realistic flow characteristics can be repeatedly generated. Such tools are essential for evaluating a number of new flow-based traffic engineering tasks such as dynamic routing. This flow characterization framework identifies several components key to the faithful description of flow dynamics in a statistical sense. Based on the analysis of a couple of very large traces, we find that the proposed modeling framework are capable of capturing some of the essential characteristics of flow dynamics. We also identify that some of the widely used assumptions about flow modeling, such as Poisson flow arrivals and uniform/Poisson packet arrivals within flows underestimate the burstiness of flow dynamics. Instead, we find that flow arrivals appear to follow mono-fractal characteristics over time scales greater than 100 msec, and intra-flow packet dynamics exhibit an array of different statistical behaviors, including non-fractal, mono-fractal, and multi-fractal patterns. While these deviations from common modeling assumptions make faithful flow dynamics modeling a challenge, the proposed modeling framework paves the way for accurate flow simulation and provides plausible causes for complex packet-level dynamics such as multi-fractals.