SelfSimilarity in Network Traffic Kevin Henkener 5292002 What

  • Slides: 25
Download presentation
Self-Similarity in Network Traffic Kevin Henkener 5/29/2002

Self-Similarity in Network Traffic Kevin Henkener 5/29/2002

What is Self-Similarity? n Self-similarity describes the phenomenon where a certain property of an

What is Self-Similarity? n Self-similarity describes the phenomenon where a certain property of an object is preserved with respect to scaling in space and/or time. n If an object is self-similar, its parts, when magnified, resemble the shape of the whole.

Pictorial View of Self-Similarity

Pictorial View of Self-Similarity

The Famous Data n n Leland Wilson collected hundreds of millions of Ethernet packets

The Famous Data n n Leland Wilson collected hundreds of millions of Ethernet packets without loss and with recorded time-stamps accurate to within 100µs. Data collected from several Ethernet LAN’s at the Bellcore Morristown Research and Engineering Center at different times over the course of approximately 4 years.

Why is Self-Similarity Important? n n n Recently, network packet traffic has been identified

Why is Self-Similarity Important? n n n Recently, network packet traffic has been identified as being self-similar. Current network traffic modeling using Poisson distributing (etc. ) does not take into account the self-similar nature of traffic. This leads to inaccurate modeling which, when applied to a huge network like the Internet, can lead to huge financial losses.

Problems with Current Models n Current modeling shows that as the number of sources

Problems with Current Models n Current modeling shows that as the number of sources (Ethernet users) increases, the traffic becomes smoother and smoother n Analysis shows that the traffic tends to become less smooth and more bursty as the number of active sources increases

Problems with Current Models Cont. ’d n Were traffic to follow a Poisson or

Problems with Current Models Cont. ’d n Were traffic to follow a Poisson or Markovian arrival process, it would have a characteristic burst length which would tend to be smoothed by averaging over a long enough time scale. Rather, measurements of real traffic indicate that significant traffic variance (burstiness) is present on a wide range of time scales

Pictorial View of Current Modeling

Pictorial View of Current Modeling

Side-by-side View

Side-by-side View

Definitions and Properties n Long-range Dependence q n covariance decays slowly Hurst Parameter q

Definitions and Properties n Long-range Dependence q n covariance decays slowly Hurst Parameter q q Developed by Harold Hurst (1965) H is a measure of “burstiness” n q q also considered a measure of self-similarity 0<H<1 H increases as traffic increases

Definitions and Properties Cont. ’d n n low, medium, and high traffic hours as

Definitions and Properties Cont. ’d n n low, medium, and high traffic hours as traffic increases, the Hurst parameter increases q i. e. , traffic becomes more self-similar

Self-Similar Measures n Background q q q Let time series: X = (Xt :

Self-Similar Measures n Background q q q Let time series: X = (Xt : t = 0, 1, 2, …. ) be a covariance stationary stochastic process autocorrelation function: r(k), k ≥ 0 assume r(k) ~ k-β L(t), as k ∞ where 0 < β < 1 n limt ∞ L(tx) / L(t) = 1, for all x > 0

Second-order Self-Similar n Exactly q q q n Asymptotically q q n A process

Second-order Self-Similar n Exactly q q q n Asymptotically q q n A process X is called (exactly) self-similar with selfsimilarity parameter H = 1 – β/2 if for all m = 1, 2, …. var(X(m)) = σ2 m-β r(m)(k) = r(k), k ≥ 0 r(m)(k) = r(k), as m ∞ aggregated processes are the same Current model shows aggregated processes tending to pure noise

Measuring Self-Similarity n n n time-domain analysis based on R/S statistic analysis of the

Measuring Self-Similarity n n n time-domain analysis based on R/S statistic analysis of the variance of the aggregated processes X(m) periodogram-based analysis in the frequency domain

Methods of Modeling Self-Similar Traffic n Two formal mathematical models that yield elegant representations

Methods of Modeling Self-Similar Traffic n Two formal mathematical models that yield elegant representations of self-similarity q q fractional Gaussian noise fractional autoregressive integrated movingaverage processes

Results n n Ethernet traffic is self-similar irrespective of time Ethernet traffic is self-similar

Results n n Ethernet traffic is self-similar irrespective of time Ethernet traffic is self-similar irrespective of where it is collected The degree of self-similarity measured in terms of the Hurst parameter h is typically a function of the overall utilization of the Ethernet and can be used for measuring the “burstiness” of the traffic Current traffic models are not capable of capturing the self-similarity property

Results Cont. ’d n n n There exists the presence of concentrated periods of

Results Cont. ’d n n n There exists the presence of concentrated periods of congestion at a wide range of time scales This implies the existence of concentrated periods of light network load These two features cannot be easily controlled by traffic control. q i. e. , burstiness cannot be smoothed

Results Cont. ’d n These two implications make it difficult to allocated services such

Results Cont. ’d n These two implications make it difficult to allocated services such that QOS and network utilization are maximized. n Self-similar burstiness can lead to the amplification of packet loss.

Problems with Packet Loss n Effects in TCP q q q n TCP guarantees

Problems with Packet Loss n Effects in TCP q q q n TCP guarantees that packets will be delivered and will be delivered in order When packets are lost in TCP, the lost packets must be retransmitted This wastes valuable resources Effects in UDP q q q UDP sends packets as quickly as possible with no promise of delivery When packets are lost, they are not retransmitted Repercussions for packet loss in UDP include “jitter” in streaming audio/video etc.

Possible Methods for Dealing with the Self-Similar Property of Traffic n n Dynamic Control

Possible Methods for Dealing with the Self-Similar Property of Traffic n n Dynamic Control of Traffic Flow Structural resource allocation

Dynamic Control of Traffic Flow n Predictive feedback control q q identify the on-set

Dynamic Control of Traffic Flow n Predictive feedback control q q identify the on-set of concentrated periods of either high or low traffic activity adjust the mode of congestion control appropriately from conservative to aggressive

Dynamic Control of Traffic Flow Cont. ’d n Adaptive forward error correction q q

Dynamic Control of Traffic Flow Cont. ’d n Adaptive forward error correction q q retransmission of lost information is not viable because of time-constraints (real-time) adjust the degree of redundancy based on the network state n increase level of redundancy when traffic is high q n could backfire as too much of an increase will only further aggrevate congestion decrease level of redundancy when traffic is low

Structural Resource Allocation n Two types: q q n bandwidth buffer size Bandwidth q

Structural Resource Allocation n Two types: q q n bandwidth buffer size Bandwidth q q increase bandwidth to accommodate periods of “burstiness” could be wasteful in times of low traffic intensity

Structural Resource Allocation Cont. ’d q buffer size n n q increase the buffer

Structural Resource Allocation Cont. ’d q buffer size n n q increase the buffer size in routers (et. al. ) such that they can absorb periods of “burstiness” still possible to fill a given router’s buffer and create a bottleneck tradeoff n increase both until they complement each other and begin curtailing the effects of self-similarity