Raptor Codes Amin Shokrollahi EPFL Communication on Multiple

Communication on Multiple Unknown Channels BEC(p 1) BEC(p 2) BEC(p 3) BEC(p 4) BEC(p

The erasure probabilities are unknown. Want to come arbitrarily close to capacity on each

Fountain Codes Sender sends a potentially limitless stream of encoded bits. Receivers collect bits

Universality and Efficiency [Universality] Want sequences of Fountain Codes for which the overhead is

LT-Codes • Invented by Michael Luby in 1998. • First class of universal and

LT-Codes LT-codes use a restricted distribution on Fix distribution Distribution where on is given

The LT Coding Process Input symbols Choose 2 Random original symbols XOR 2 Choose

$Raptor Codes Input symbols Traditional pre-code d – fraction erasures LT-light Output symbols$

Further Content • How do Raptor Codes designed for the BEC perform on other

Parameters Raptor Code with parameters Channel Overhead , if decoding is possible from many

Incremental Redundancy Codes Raptor codes are true incremental redundancy codes. A sender can generate

Sequences Designed for the BEC Type: Left-regular of degree 4, right Poisson, rate 0.

Normalized SNR 0. 000 Eb/N 0 0. 067 0. 135 0. 194 0. 267

Sequences Designed for the BEC Not too bad, but quality decreases when the amount

Gaussian Approximation Assume that the messages sent from input to output nodes are Gaussian.

Nodes of Degree 2 Use equality, differentiate, and compare values at 0 where and

Nodes of Degree 2 Use graph induced on input symbols by output symbols of

Nodes of Degree 2: BEC New output node of degree 2 Information Loss!

Fraction of Nodes of Degree 2 If there exists component of linear size (i.

The -ary symmetric channel (large ) Double verification decoding (Luby-Mitzenmacher): If and are correct,

The -ary symmetric channel (large ) More sophisticated algorithms: induced graph! If two input

The -ary symmetric channel (large ) More sophisticated algorithms: induced graph! More generally: if

The -ary symmetric channel (large ) Limiting case: Giant component consisting of correct edges,

$Nodes of Degree 2 What is the fraction of nodes of degree 2 for$

$General Symmetric Channels: Mimic Proof is information theoretic: if fraction of nodes of degree$

General Symmetric Channels: Mimic Proof Fraction of nodes of degree one for capacity-achieving Raptor

A better Gaussian Approximation Uses adaptation of a method of Ardakani and Kschischang. Heuristic:

$Raptor Codes for any Channel From the formula on the fraction of nodes of$

Conclusions • For LT- and Raptor codes, some decoding algorithms can be phrased directly

Slides: 35

Download presentation

Raptor Codes Amin Shokrollahi EPFL

Communication on Multiple Unknown Channels BEC(p 1) BEC(p 2) BEC(p 3) BEC(p 4) BEC(p 5) BEC(p 6)

Example: Popular Download

Example: Peer-to-Peer

Example: Satellite

The erasure probabilities are unknown. Want to come arbitrarily close to capacity on each of the erasure channels, with minimum amount of feedback. Traditional codes don’t work in this setting since their rate is fixed. Need codes that can adapt automatically to the erasure rate of the channel.

Fountain Codes Sender sends a potentially limitless stream of encoded bits. Receivers collect bits until they are reasonably sure that they can recover the content from the received bits, and send STOP feedback to sender. Automatic adaptation: Receivers with larger loss rate need longer to receive the required information. Want that each receiver is able to recover from the minimum possible amount of received data, and do this efficiently.

Universality and Efficiency [Universality] Want sequences of Fountain Codes for which the overhead is arbitrarily small [Efficiency] Want per-symbol-encoding to run in close to constant time, and decoding to run in time linear in number of output symbols.

LT-Codes • Invented by Michael Luby in 1998. • First class of universal and almost efficient Fountain Codes • Output distribution has a very simple form • Encoding and decoding are very simple

LT-Codes LT-codes use a restricted distribution on Fix distribution Distribution where on is given by is the Hamming weight of Parameters of the code are :

The LT Coding Process Input symbols Choose 2 Random original symbols XOR 2 Choose weight Weight table Insert header, and send Weight Prob 1 0. 055 2 0. 3 3 0. 1 4 0. 08 100000 0. 0004

$Raptor Codes Input symbols Traditional pre-code d – fraction erasures LT-light Output symbols$

Raptor Codes Input symbols Traditional pre-code d – fraction erasures LT-light Output symbols

Further Content • How do Raptor Codes designed for the BEC perform on other symmetric channels (with BP decoding)? • Information theoretic bounds, and fraction of nodes of degrees one and two in capacity-achieving Raptor Codes. • Some examples • Applications

Parameters Raptor Code with parameters Channel Overhead , if decoding is possible from many output symbols. Measure residual error probability as a function of the overhead for a given channel.

Incremental Redundancy Codes Raptor codes are true incremental redundancy codes. A sender can generate as many output bits as necessary for successful decoding. Suitably designed Raptor codes are close to the Shannon capacity for a variety of channel conditions (from very good to rather bad). Raptor codes are competitive with the best LDPC codes under different channel conditions.

Sequences Designed for the BEC Type: Left-regular of degree 4, right Poisson, rate 0. 98 Simulations done on AWGN( ) for various

Normalized SNR 0. 000 Eb/N 0 0. 067 0. 135 0. 194 0. 267 Turbo Threshold Best designs (so far) Sequences Designed for the BEC 0. 331 0. 391 0. 459 0. 522 0. 584 0. 650

Sequences Designed for the BEC Not too bad, but quality decreases when the amount of noise on the channel increases. Need to design better distributions. Idea: adapt the Gaussian approximation technique of Chung, Richardson, and Urbanke.

Gaussian Approximation Assume that the messages sent from input to output nodes are Gaussian. Track the mean of these Gaussians from one round to another. Degree distributions can be designed using this approximation. However, they don’t perform that well. Anything else we can do with this?

Nodes of Degree 2 Use equality, differentiate, and compare values at 0 where and It can be rigorously proved that above condition is necessary for error probability of BP to converge to zero.

Nodes of Degree 2 Use graph induced on input symbols by output symbols of degree 2.

Nodes of Degree 2: BEC

Nodes of Degree 2: BEC New output node of degree 2 Information Loss!

Fraction of Nodes of Degree 2 If there exists component of linear size (i. e. , a giant component), then next output node of degree 2 has constant probability of being useless. Therefore, graph should not have giant component. This means that for capacity achieving degree distributions we must have: On the other hand, if successfully. So, then algorithm cannot start for capacity-achieving codes:

The -ary symmetric channel (large ) Double verification decoding (Luby-Mitzenmacher): If and are correct, then they verify. Remove all of them from graph and continue. Can be shown that number of correct output symbols needs to be at least Times number of input symbols. (Joint work with Karp and Luby. )

The -ary symmetric channel (large ) More sophisticated algorithms: induced graph! If two input symbols are connected by a correct output symbol, and each of them is connected to a correct output symbol of degree one, then the input symbols are verified. Remove from them from graph.

The -ary symmetric channel (large ) More sophisticated algorithms: induced graph! More generally: if there is a path consisting of correct edges, and the two terminal nodes are connected to correct output symbols of degree one, then the input symbols get verified. (More complex algorithms. )

The -ary symmetric channel (large ) Limiting case: Giant component consisting of correct edges, two correct output symbols of degree one “poke” the component. So, ideal distribution “achieves” capacity.

$Nodes of Degree 2 What is the fraction of nodes of degree 2 for$

Nodes of Degree 2 What is the fraction of nodes of degree 2 for capacityachieving Raptor Codes? Turns out, that in the limit we need to have equality: where, in general and is the LLR of the channel.

$General Symmetric Channels: Mimic Proof is information theoretic: if fraction of nodes of degree$

General Symmetric Channels: Mimic Proof is information theoretic: if fraction of nodes of degree 2 is larger by a constant, then : • Expectation of the hyperbolic tangent of messages passed from input to output symbols at given round of BP is larger than a constant. • This shows that • So code cannot achieve capacity.

General Symmetric Channels: Mimic Proof Fraction of nodes of degree one for capacity-achieving Raptor Codes: Noisy observations of Therefore, if , and if of degree one, then So denote output nodes

A better Gaussian Approximation Uses adaptation of a method of Ardakani and Kschischang. Heuristic: Messages passed from input to output symbols are Gaussian, but not vice-versa. Find recursion for the means of these Gaussians, and apply linear programming.

$Raptor Codes for any Channel From the formula on the fraction of nodes of$

Raptor Codes for any Channel From the formula on the fraction of nodes of degree 2 we know that it is not possible to design truly universal Raptor codes. However, how does a Raptor code designed for a binary input memoryless symmetric channel perform for some other channel of this type? The question has been answered (joint work with O. Etesamit): if a code achieves capacity on the BEC, then its overhead over any binary input memoryless symmetric channel is not larger than log(e)-1 = 0. 442…

Conclusions • For LT- and Raptor codes, some decoding algorithms can be phrased directly in terms of subgraphs of graphs induced by output symbols of degree 2. • This leads to a simpler analysis without the use the tree assumption. • For the BEC, and for the q-ary symmetric channel (large q) we obtain essentially the same limiting capacityachieving degree distribution, using the giant component analysis. • An information theoretic analysis gives the optimal fraction of output nodes of degree 2 for general memoryless symmetric channels. • A graph analysis reveals very good degree distributions, which perform very well experimentally.