This is the peer reviewed version of the book chapter:

Radonjić, Aleksandar, Vujičić, Vladimir (2019), "Integer Asymmetric Error Control Codes for Short-Range Optical Networks", Advances in Engineering Research. Volume 29, New York: Nova Science, 2019.


This work is licensed under a Creative Commons Attribution Non Commercial No Derivatives 4.0 license

## Chapter

Integer Asymmetric Error Control Codes for Short-Range Optical Networks

Aleksandar Radonjic ${ }^{*}$<br>Institute of Technical Sciences of the Serbian Academy of Sciences and Arts, Belgrade, Serbia<br>Vladimir Vujicic<br>Institute of Technical Sciences of the Serbian Academy of Sciences and Arts, Belgrade, Serbia


#### Abstract

In most communication networks, error probabilities $1 \rightarrow 0$ and $0 \rightarrow 1$ are equally likely to occur. However, in short-range optical networks (SRONs), such as local and access networks, this is not the case. In these networks, photons may fade or fail to be detected, but new photons cannot be generated. Hence, if the receiver operates correctly, only asymmetric (1 $\rightarrow 0$ ) errors can occur. Motivated by this fact, the authors of this chapter have constructed four classes of integer codes capable of correcting various types of asymmetric errors. The most attractive feature of all these codes is their ability to be implemented "for free" (in software). This is achieved by using integer and lookup table operations, which are supported by all processors. The aim of this chapter is to overview four classes of integer asymmetric codes and to illustrate their potential for use in modern SRONs. Topics covered include: fundamentals in the design of integer codes, necessary and sufficient conditions for constructing integer asymmetric codes and the processor-based strategy for implementation of these codes.


Keywords: Integer codes, burst asymmetric errors, random asymmetric errors, short-range optical networks, multicore processors, theoretical decoding throughput.

[^0]
## 1. Introduction

In optical networks (ONs) using on-off keying, the data are directly encoded into an optical signal: 1's are represented by the presence of light pulses (photons) and 0 's by their absence [1]. When such a signal propagates through the fiber, it suffers from various degradition effects, such as attenuation and dispersion. These effects tend to reduce the number of photons ( $1 \rightarrow 0$ errors) causing partial data loss.

In order to mitigate this problem, the engineers apply two strategies: one for long-range ONs (LRONs), and other for short-range ONs (SRONs). The strategy for LRONs is based on combined use of error control codes (ECCs) and optical amplifiers. After the transmitted signal is protected with ECCs and converted into light pulses, it is periodically strengthened with optical amplifiers. These devices regenerate the signal, but also add noise. So, if they are often used, the received signal may contain pulses at the zero time slots ( $0 \rightarrow 1$ errors). Unlike in LRONs, the signals in SRONs are restored in the electrical domain. This is done using repeaters, which convert optical signals to electronic ones, amplify them, and again convert them to optical signals. Due to this reason, the regenerated signal is always identical or nearly identical to the original one.

With this in mind, in this chapter, we overview four classes of integer codes that are suitable for use in SRONs. The main advantage of these codes, compared to classical ECCs (CECCs), is their ability to exploit the high computing power of the network nodes (PCs, routers, switches, ONU units, etc.) [1]. This has been achieved by using integer and lookup table operations, which are supported by all processors. In addition, unlike CECCs, the presented codes can be interleaved without delay and without using dedicated hardware. Owing to this, they can be transformed into simple codes capable of correcting various combinations of asymmetric $(1 \rightarrow 0)$ errors.

The organization of this chapter is as follows. Section 2 provides necessary and sufficient conditions for constructing four classes of integer codes capable of correcting multiple asymmetric errors. The implementation strategy and theoretical decoding throughputs for these codes are described and evaluated in Section 3, while Section 4 concludes the chapter.

## 2. Integer Error Control Codes

Unlike CECCs, integer ECCs (IECCs) [9], [10], [11], [12], are designed to correct errors of a given type. This means that we can choose certain types of errors and after that construct IECCs capable of correcting them. This characteristic can be deduced from their encoding and decoding procedures.

### 2.1. Encoding and Decoding Procedures

Let $Z_{2^{b}-1}=\left\{0,1, \ldots, 2^{b}-2\right\}$ be the ring of integers modulo $2^{b}-1$ and let $C_{i}$ be integers such that $C_{i} \in Z_{2^{b^{n}}} \backslash\{0,1\}$, where $1 \leq \mathrm{i} \leq \mathrm{k}$. Now, suppose that the data are divided into k b -bit bytes, and that $\mathrm{B}_{\mathrm{i}}$ and $\underline{B}_{i}$ denote integer values of the i -th b-bit byte at the sender and receiver side, respectively. In that case, the encoder will compute the checkbyte using the expression
$C_{\mathrm{B}}=\left[C_{1} \cdot B_{1}+C_{2} \cdot B_{2}+\cdots+C_{k} \cdot B_{k}\right]\left(\bmod 2^{b}-1\right)=\sum_{i=1}^{k} C_{i} \cdot B_{i}\left(\bmod 2^{b}-1\right)$
(1)

At the receiver, the decoder will perform the same calculation
$\mathrm{C}_{\underline{\underline{B}}}=\left[C_{1} \cdot \underline{B}_{1}+C_{2} \cdot \underline{B}_{2}+\cdots+C_{k} \cdot \underline{B}_{k}\right]\left(\bmod 2^{b}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{i}\left(\bmod 2^{b}-1\right)$
(2)
after which the syndrome S will be formed
$S=\left[C_{\underline{B}}-\underline{C}_{\mathrm{B}}\right]\left(\bmod 2^{b}-1\right) \equiv \sum_{i=1}^{k+1}\left(\underline{B}_{i}-B_{i}\right) \cdot C_{i}\left(\bmod 2^{b}-1\right)=\sum_{i=1}^{k+1} e_{i} \cdot C_{i}\left(\bmod 2^{b}-1\right)$
(3)
where $\mathrm{C}_{\mathrm{k}+1}=-1$.
From (3) it is easy to see that the nonzero value of $S$ indicates the presence of errors $e_{i}$. Whether they can be corrected or not depends on the values of the $C_{i}$ 's. In this chapter, we will see what conditions the $\mathrm{C}_{\mathrm{i}}$ 's must satisfy in order to construct codes capable of correcting burst asymmetric (BA) errors within a byte, random asymmetric (RA) errors within a byte, BA and RA errors within a byte and double asymmetric (DA) errors within a codeword.
2.2. Necessary and Sufficient Conditions for Constructing Asymmetric IECCs

As explained in the previous section, in modern SRONs, the number of received photons never exceeds the number of sent ones. Hence, if the receiver operates correctly, only asymmetric errors may occur [2], [3], [4]. The number and distribution of these errors depends on the nodes' distance and protocol design. Accordingly, it can be said that burst errors (up to five bits) are typical for passive networks, while random errors are dominant in local networks [5], [6], [7], [8]. Having this in mind, we start this subsection by defining the conditions for constructing IECCs capable of correcting $t$ RA errors within a b-bit byte (integer $\mathrm{R}_{\mathrm{t} b} \mathrm{AEC}$ codes). These codes are suitable for use in Ethernet networks using self-synchronous scramblers (e.g. 10GBASE-LR).

### 2.2.1. Integer $\mathrm{R}_{\mathrm{t} / \mathrm{b}} \mathrm{AEC}$ Codes

Definition 1. An error is called a t/b RA error if within a b-bit there exist $t$ asymmetric errors, where $1 \leq \mathrm{t}<\mathrm{b}$.

Table 1. Coefficients for Integer $\mathrm{R}_{t b}$ AEC Codes with Parameters $t=3, b=32$ and $k \leq 64$.

| 2 | 15 | 31 | 71 | 83 | 89 | 101 | 119 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 127 | 139 | 141 | 143 | 149 | 157 | 163 | 167 |
| 173 | 177 | 179 | 181 | 189 | 191 | 199 | 203 |
| 211 | 223 | 227 | 229 | 233 | 239 | 251 | 253 |
| 263 | 269 | 271 | 277 | 281 | 283 | 305 | 307 |
| 313 | 317 | 331 | 339 | 349 | 353 | 359 | 361 |
| 367 | 373 | 379 | 383 | 389 | 395 | 397 | 401 |
| 409 | 421 | 431 | 433 | 443 | 463 | 465 | 467 |

Definition 2. The set of syndromes corresponding to t/b RA errors is defined as $s_{1}=\left\{-\left(2^{x_{1}}+2^{x_{2}}+\cdots+2^{x_{m}}\right) \cdot C_{i}\left(\bmod 2^{b}-1\right): 0 \leq x_{1}<x_{2}<\cdots<x_{m} \leq b-1,1 \leq m \leq t, 1 \leq i \leq k+1\right\}$ (4)

With these definitions we can prove the following theorem.
Theorem 1. The codes defined by (1) can correct all $\mathrm{t} / \mathrm{b}$ RA errors if there exist k mutually different coefficients $C_{i} \in Z_{2^{b^{h}} 1} \backslash\{0,1\}$ such that
$\left|s_{1}\right|=(k+1) \cdot \sum_{m=1}^{t}\binom{b}{m}$,
where $\left|s_{1}\right|$ denotes the cardinality of $\mathrm{s}_{1}$.
Proof. From (4) it is clear that the set $\mathrm{s}_{1}$ can be expressed as
$s_{1}=\bigcup_{u=1}^{k+1} X_{u}$
where
$X_{1}=\left\{\left[-\left(2^{x_{1}}+2^{x_{2}}+\cdots+2^{x_{m}}\right) \cdot C_{1}\right]\left(\bmod 2^{b}-1\right): 0 \leq x_{1}<x_{2}<\cdots<x_{m} \leq b-1,1 \leq m \leq t\right\}$
$X_{k}=\left\{\left[-\left(2^{x_{1}}+2^{x_{2}}+\cdots+2^{x_{m}}\right) \cdot C_{k}\right]\left(\bmod 2^{b}-1\right): 0 \leq x_{1}<x_{2}<\cdots<x_{m} \leq b-1,1 \leq m \leq t\right\}$
$X_{k+1}=\left\{2^{x_{1}}+2^{x_{2}}+\cdots+2^{x_{m}}: 0 \leq x_{1}<x_{2}<\cdots<x_{m} \leq b-1,1 \leq m \leq t\right\}$
From this it is easy to see that the syndromes caused by $\mathrm{t} / \mathrm{b}$ asymmetric errors will be nonzero and mutually different only if there exist k different coefficients $C_{i} \in Z_{2^{p_{1}}} \backslash\{0,1\}$ such that
$X_{1} \cap \cdots \cap X_{k} \cap X_{k+1}=\varnothing$
$\left|X_{1}\right|=\cdots=\left|X_{k}\right|=\left|X_{k+1}\right|$.
In that case, the set $\mathrm{s}_{1}$ will have
$\left|s_{1}\right|=\sum_{u=1}^{k+1}\left|X_{u}\right|=(k+1) \cdot \sum_{m=1}^{t}\binom{b}{m}$
nonzero elements.
In order to illustrate the applicability of Theorem 1, we show results of a computer-search for codes with parameters $\mathrm{t}=3, \mathrm{~b}=32$ and $\mathrm{k} \leq 64$ (Table 1).

### 2.2.2. Integer $\mathrm{B}_{\mathrm{l} / \mathrm{b}} \mathrm{AEC}$ Codes

In some SRONs, such as passive optical networks (e.g. 10G PONs), the errors tend to occur in bursts. Hence, in these networks, it is preferable to use codes capable of correcting l-bit BA errors or l-bit BA errors in a b-bit byte (integer $\mathrm{B}_{\mathrm{l} b} \mathrm{AEC}$ codes). The first step towards the construction of the latter codes is to define the integer values of BA errors confined to a b-bit byte. For that purpose, we will rely on the analysis from [9]. In that paper, it was shown that the integer value of a l-bit burst error within a b-bit byte is equal to $\mathrm{e}= \pm 2^{\mathrm{s}} \cdot(2 \mathrm{n}-1)$, where 0 $\leq \mathrm{s} \leq \mathrm{b}-1,1 \leq \mathrm{n} \leq 2^{\mathrm{u}-1}$ and $1 \leq \mathrm{u} \leq 1$. With this in mind, we can state the following definitions and theorem.

Definition 3. An error is called $1 / b$ BA error if within a b-bit there exists any number of asymmetric errors confined to $l(<b)$ adjacent positions.

Definition 4. The set of syndromes corresponding to $1 / b$ BA errors is defined as $s_{2}=\left\{\left[-2^{s} \cdot(2 n-1) \cdot C_{i}\right]\left(\bmod 2^{b}-1\right): 0 \leq s \leq b-l, 1 \leq n \leq 2^{u-1}, 1 \leq u \leq l, 1 \leq i \leq k+1\right\}$ (5)

Theorem 2. The codes defined by (1) can correct all $1 / b$ BA errors if there exist k mutually different coefficients $C_{i} \in Z_{2^{p^{b}-1}} \backslash\{0,1\}$ such that $\left|s_{2}\right|=(k+1) \cdot\left[2^{l-1} \cdot(b-l+2)-1\right]$.

Proof. Observe that the set $\mathrm{s}_{2}$ can be expressed as
$s_{2}=\bigcup_{u=1}^{l} Y_{u}$
where
$Y_{1}=\left\{\left[-2^{s} \cdot(1) \cdot C_{i}\right]\left(\bmod 2^{b}-1\right): 0 \leq s \leq b-1,1 \leq i \leq k+1\right\}$,
$Y_{2}=\left\{\left[-2^{s} \cdot(3) \cdot C_{i}\right]\left(\bmod 2^{b}-1\right): 0 \leq s \leq b-2,1 \leq i \leq k+1\right\}$,
!
$Y_{l}=\left\{\left[-2^{s} \cdot\left(2^{-1}+1,2^{2^{-1}}+3, \ldots, 2^{l}-1\right) \cdot C_{i}\right]\left(\bmod 2^{b}-1\right): 0 \leq s \leq b-l, 1 \leq i \leq k+1\right\}$.
Now, suppose that the coefficients $C_{i} \in Z_{2^{b_{1}-1}}\{0,1\}$ have values such that
$Y_{1} \cap Y_{2} \cap \cdots \cap Y_{l}=\varnothing$,
$\left|Y_{1}\right|=(k+1) \cdot b$,
$\left|Y_{h}\right|=(k+1) \cdot 2^{h-2} \cdot(b-h+1), 2 \leq h \leq l$.
In that case, it is easy to show that

$$
\left|s_{2}\right|=\sum_{u=1}^{l}\left|Y_{u}\right|=(k+1) \cdot\left[2^{l-1} \cdot(b-l+2)-1\right] \cdot \square
$$

The next step in constructing codes capable of correcting $1 / b \mathrm{BA}$ errors is to find the $\mathrm{C}_{\mathrm{i}}$ 's that satisfy the condition of Theorems 2 . For that purpose it is necessary to perform an exhaustive search on all possible candidates from the set $Z_{2^{p^{\prime}-1}}\{0,1\}$. In this chapter, we have restricted ourselves to the codes with

Table 2. Coefficients for Integer $\mathrm{B}_{l / b}$ AEC Codes with Parameters $l=5, b=32$ and $k \leq 64$.

| 2 | 33 | 35 | 37 | 41 | 43 | 47 | 53 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 59 | 61 | 67 | 71 | 73 | 79 | 83 | 89 |
| 97 | 101 | 103 | 107 | 109 | 113 | 117 | 127 |
| 131 | 137 | 139 | 149 | 151 | 157 | 163 | 167 |
| 173 | 179 | 181 | 191 | 193 | 197 | 199 | 211 |
| 223 | 227 | 229 | 233 | 239 | 241 | 251 | 255 |
| 257 | 263 | 269 | 271 | 277 | 281 | 283 | 293 |
| 307 | 311 | 313 | 317 | 331 | 337 | 343 | 347 |

parameters $\mathrm{l}=5, \mathrm{~b}=32$ and $\mathrm{k} \leq 64$ (Table 2).

### 2.2.3. Integer $\mathrm{B}_{1 / b} \mathrm{AEC}-\mathrm{R}_{\mathrm{tb}} \mathrm{AEC}$ Codes

The previous classes of IECCs are designed to correct t/b RA or 1/b BA errors. For this reason, they have limited practical applicability. One solution to overcome this drawback is to design codes with $\mathrm{B}_{1 / b} A E C-\mathrm{R}_{t b} A E C$ capability (integer $\mathrm{B}_{\mathrm{l} b} \mathrm{AEC}-\mathrm{R}_{\mathrm{tb}} \mathrm{AEC}$ codes). Such codes would have the potential to be used in both passive optical networks and Ethernet networks using self-synchronous


Fig. 1. Short $t / b$ RA errors and (b) long $t / b$ RA errors.
scramblers. However, before we move to the mathematical details, let us observe that $\mathrm{t} / \mathrm{b}$ RA errors spaced less than 1 bits (short $\mathrm{t} / \mathrm{b}$ RA errors) can be treated as $1 / \mathrm{b}$ BA errors. Hence, in addition to $1 / b$ BA errors, we need to consider t/b RA errors that are spaced $\mathrm{d}(\mathrm{l}<\mathrm{d}<\mathrm{b})$ bits apart (long t/b RA errors) (Fig. 1).
Now, we can give the following definition.
Definition 5. Let $3 \leq \mathrm{v}<1<\mathrm{b}$ and $\mathrm{s}+\mathrm{l} \leq \mathrm{z} \leq \mathrm{b}-1$. Then, the set of syndromes corresponding to long $\mathrm{t} / \mathrm{b}$ RA errors is defined as $s_{3}=\varepsilon_{1} \cup \varepsilon_{2}$ (6)
where
$\varepsilon_{1}=\left\{\left[-\left(2^{s}+2^{z}\right) \cdot C_{i}\right]\left(\bmod 2^{b}-1\right): 1 \leq i \leq k+1\right\}$
(7)
$\varepsilon_{2}=\left\{\left[-\left(2^{s}+2^{x_{1}}+\cdots+2^{x_{n-2}}+2^{z}\right) \cdot C_{i}\right]\left(\bmod 2^{b}-1\right): 0 \leq s<x_{1}<\cdots<x_{v-2} \leq z, 1 \leq i \leq k+1\right\}$
(8)

Although the expressions (5)-(8) provide a theoretical basis for the construction of integer $\mathrm{B}_{1 / b} \mathrm{AEC}-\mathrm{R}_{\mathrm{t} b} \mathrm{AEC}$ codes, they do not give the explicit information about the number of nonzero syndromes. Hence, we need the following theorem.

Theorem 3. The codes defined by (1) can correct all $1 / b$ BA and $t / b$ RA errors if there exist k mutually different coefficients $C_{i} \in Z_{2^{b}-1} \backslash\{0,1\}$ such that

1. $\left|s_{2}\right|=(k+1) \cdot\left[2^{l-1} \cdot(b-l+2)-1\right]$,
2. $\left|s_{3}\right|=(k+1) \cdot \sum_{i=0}^{b-l-1} \sum_{j=2}^{t}(b-l-i) \cdot\binom{l+i-1}{j-2}$,
3. $s_{2} \cap s_{3}=\varnothing$.

Proof. The proof for Condition 1 is the same as that given in Theorem 2. Hence, it will be omitted. As far as Condition 2 is concerned, it states that that long t/b RA errors generate $(k+1) \cdot \sum_{i=1}^{b--1} \sum_{j=2}^{t}(b-l-i) \cdot\binom{l+i-1}{j-2}$ syndromes that are nonzero. To prove this, note that the set $\mathrm{s}_{3}$ can be expressed as

$$
s_{3}=\bigcup_{u=0}^{b-l-1} Z_{u}
$$

where

$$
\begin{aligned}
& Z_{0}=\{[-(2^{s}+\underbrace{2^{e}+2^{f}+\cdots+2^{p}}_{t-2 \text { addends }}+2^{s+l}) \cdot C_{i}]\left(\bmod 2^{b}-1\right): 0 \leq s \leq b-l-1,1 \leq i \leq k+1\} \\
& Z_{1}=\{[-(2^{s}+\underbrace{2^{e}+2^{f}+\cdots+2^{p}}_{t-2 \text { addends }}+2^{s+l+1}) \cdot C_{i}]\left(\bmod 2^{b}-1\right): 0 \leq s \leq b-l-2,1 \leq i \leq k+1\} \\
& \vdots \\
& Z_{b-l-1}=\{[-(2^{s}+\underbrace{2^{e}+2^{f}+\cdots+2^{p}}_{t-2 \text { addends }}+2^{s+b-1}) \cdot C_{i}]\left(\bmod 2^{b}-1\right): s=0,1 \leq i \leq k+1\}
\end{aligned}
$$

and $\mathrm{s}<\mathrm{e}<\mathrm{f}<\ldots<\mathrm{p}<\mathrm{s}+1+\mathrm{u}$ for any $\mathrm{u}=0,1, \ldots, \mathrm{~b}-1-1$. Consequently, we can write

$$
\begin{aligned}
& \left|Z_{0}\right|=(b-l) \cdot\binom{l-1}{t-2} \cdot(k+1) \\
& \left|Z_{1}\right|=(b-l-1) \cdot\binom{l}{t-2} \cdot(k+1) \\
& \quad \vdots \\
& \left|Z_{b-l-1}\right|=1 \cdot\binom{b-2}{t-2} \cdot(k+1)
\end{aligned}
$$

wherefrom it follows that
$\left|s_{3}\right|=\sum_{u=0}^{b-l-1}\left|Z_{u}\right|=(k+1) \cdot \sum_{i=0}^{b-l-1} \sum_{j=2}^{t}(b-l-i) \cdot\binom{l+i-1}{j-2}$.
On the other hand, Condition 3 is a necessary condition for distinguishing l/b BA
errors from long t/b RA errors. So, integer codes satisfying conditions 1 to 3 are $\mathrm{B}_{1 / b} \mathrm{AEC}-\mathrm{R}_{\mathrm{tb}} \mathrm{AEC}$ codes.

Theorem 4. Let $\mathrm{t}=3$ and let $\xi_{1}=s_{2} \cup s_{3}$ be the error set for integer $\mathrm{B}_{1 / b} \mathrm{AEC}$ $\mathrm{R}_{\mathrm{tb} b} \mathrm{AEC}$ codes. Then,
$\left|\xi_{1}\right|=\left|s_{2}\right|+\left|s_{3}\right|=\left[2^{l-1} \cdot(b-l+2)-1+\frac{(b-l)^{2}+b-l}{2} \cdot b\right] \cdot(k+1)-\left[\frac{2 \cdot(b-l)^{3}+3 \cdot(b-l)^{2}+b-l}{6}\right] \cdot(k+1)$.
Proof. This theorem follows directly from Theorem 4.
To illustrate the applicability of Theorem 4, we show results of a computersearch for the codes with parameters $l=5, t=3, b=32$ and $k \leq 64$ (Table 3).

### 2.2.4. Integer DAEC Codes

In Ethernet networks not using self-synchronous scramblers (e.g. 10GBASELX4), the errors are randomly distributed. For this reason, in these networks, it is desirable to protect the data with DA error correcting (DAEC) codes. These codes are able to correct three types of errors: single asymmetric errors, DA errors corrupting one b-bit byte and DA errors corrupting two b-bit bytes.

Definition 6. The set of syndromes corresponding to single asymmetric errors is defined as
$s_{4}=\left\{\left(-2^{s} \cdot C_{i}\right)\left(\bmod 2^{b}-1\right): 0 \leq s \leq b-1,1 \leq i \leq k+1\right\}$
(9)

Definition 7. The set of syndromes corresponding to DA errors corrupting one b-bit byte is defined as
$s_{5}=\left\{\left[\left(-2^{r}-2^{s}\right) \cdot C_{i}\right]\left(\bmod 2^{b}-1\right): 0 \leq r<s \leq b-1,1 \leq i \leq k+1\right\}$
Definition 8. The set of syndromes corresponding to DA errors corrupting two b-bit bytes is defined as
$s_{6}=\left\{\left(-2^{r} \cdot C_{i}-2^{s} \cdot C_{j}\right)\left(\bmod 2^{b}-1\right): 0 \leq r, s \leq b-1,1 \leq i<j \leq k+1\right\}$
(11)

Now we are able to state the following theorem.
Theorem 5. The codes defined by (1) can correct all single asymmetric errors and all DA errors if there exist k mutually different coefficients $C_{i} \in Z_{2^{b^{b}-1}} \backslash\{0,1\}$ such that

Table 3. Coefficients for Integer $\mathrm{B}_{l / b} \mathrm{AEC}-\mathrm{R}_{t / b} \mathrm{AEC}$ Codes with Parameters $l=5, t=3, b=32$ and $k \leq 64$.

| 2 | 45 | 71 | 83 | 89 | 101 | 119 | 127 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 139 | 141 | 143 | 149 | 157 | 159 | 163 | 167 |
| 173 | 177 | 179 | 181 | 191 | 199 | 211 | 223 |
| 227 | 229 | 233 | 237 | 239 | 251 | 263 | 269 |
| 271 | 277 | 281 | 283 | 301 | 395 | 307 | 313 |
| 317 | 331 | 339 | 349 | 353 | 361 | 367 | 373 |
| 379 | 383 | 389 | 397 | 401 | 409 | 421 | 431 |
| 433 | 443 | 453 | 463 | 467 | 479 | 487 | 499 |


| Nonzero syndrome | Error location $(i)$ | Error vector $(i)$ |
| :--- | :--- | :--- |
| $\longleftarrow b \longrightarrow\left[\log _{2}(k+1)\right\rceil \rightarrow \longleftarrow$ | $\longrightarrow$ |  |

Fig. 2. Bit-width of one syndrome table entry in the case of IECCs correcting errors within one $b$-bit byte.

1. $\left|s_{4}\right|=(k+1) \cdot b$,
2. $\left|s_{5}\right|=(k+1) \cdot \frac{b \cdot(b-1)}{2}$,
3. $\left|s_{6}\right|=(k+1) \cdot \frac{b^{2} \cdot k}{2}$,
4. $s_{4} \cap s_{5} \cap s_{6}=\varnothing$

Proof. It can be easily proved that Conditions 1 is the necessary and sufficient condition for correcting single asymmetric errors, Condition 2 for correcting DA errors within a b-bit byte, and Condition 3 for correcting DA errors corrupting two b-bit bytes. Finally, Condition 4 ensures that the syndromes caused by single and DA errors are distinguishable.

Theorem 6. Let $\xi_{2}=s_{4} \cup s_{5} \cup s_{6}$ be the error set for integer DAEC codes. Then, $\left|\xi_{2}\right|=\left|s_{4}\right|+\left|s_{5}\right|+\left|s_{6}\right|=(k+1)^{2} \cdot \frac{b^{2}}{2}+(k+1) \cdot \frac{b}{2}$

Proof. This theorem follows from Theorem 5.
To illustrate the applicability of Theorem 5, we show results of a computersearch for the codes with parameters $\mathrm{b}=32$ and $\mathrm{k} \leq 64$ (Table 4).

### 2.3. Error Correction Procedure

After receiving the incoming packet, the decoder will generate the syndrome S. On the basis of its value, it will either accept the packet $(\mathrm{S}=0)$ or try to recover the original data $(\mathrm{S} \neq 0)$. In the latter case, the decoder will look up the syndrome

Table 4. Coefficients for Integer DAEC Codes with Parameters $b=32$ and $k \leq 64$.

| 19 | 93 | 197 | 485 | 1111 | 1771 | 2247 | 2625 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 3923 | 4721 | 6459 | 7463 | 10941 | 13277 | 14329 | 15263 |
| 20183 | 21329 | 26267 | 31609 | 36597 | 40453 | 44993 | 47465 |
| 63449 | 65371 | 82467 | 86767 | 92563 | 95947 | 105581 | 108297 |
| 110849 | 131365 | 138807 | 150609 | 152801 | 177615 | 197791 | 203213 |
| 229677 | 246945 | 263135 | 278391 | 303707 | 305785 | 332895 | 341235 |
| 352515 | 388459 | 428269 | 435293 | 457423 | 478493 | 490611 | 504469 |
| 545203 | 571633 | 590825 | 595143 | 674535 | 715827 | 740695 | 742475 |

table to get the error correction data. In the case of IECCs described in Sections 2.2.1, 2.2.2 and 2.2.3, this table will have $\mathrm{E}_{1}=\left\{\left|\mathbf{s}_{1}\right|,\left|\mathbf{s}_{2}\right|, \quad\left|\xi_{1}\right|\right\}$ entries (Theorems 1,2 and 4), where each entry describes the relationship between the nonzero syndrome, error location and error vector (Definitions 2, 4 and 5) (Fig. 2).

So, when $S \neq 0$, the decoder's task is to find the entry with the first b bits as that of the syndrome S . If the syndrome table is sorted in increasing order (according to the values of $S$ ), this task will be completed after $n_{T L}$ table lookups ( $1 \leq n_{\mathrm{TL}} \leq\left\lfloor\log _{2} E_{1}\right\rfloor+2$ ) [13]. In the next step, using the error correction data, the decoder will execute the operation
$B_{i}=\underline{B}_{i}+e_{i}\left(\bmod 2^{b}-1\right)$
(12)
where $1 \leq \mathrm{i} \leq \mathrm{k}+1$ and where $\mathrm{e}_{\mathrm{i}} \in \mathrm{s}_{1}, \mathrm{e}_{\mathrm{i}} \in \mathrm{s}_{2}$ or $\mathrm{e}_{\mathrm{i}} \in \xi_{1}$. The similar procedure applies for IECCs described in Sections 2.2.4. In their case, the syndrome table has $\mathrm{E}_{2}=\left|\xi_{2}\right|$ entries (Theorem 6), where each entry occupies $3 \cdot b+2 \cdot\left\lceil\log _{2}(k+1)\right\rceil$ bits (Definitions 6, 7 and 8) (Fig. 3).

Accordingly, if the syndrome table is sorted in increasing order (according to the values of S ), the decoder will perform $\mathrm{n}_{\mathrm{TL}}$ table lookups ( $1 \leq n_{\mathrm{TL}} \leq\left\lfloor\log _{2} E_{2}\right\rfloor+2$ ) to find the entry where the first $b$ bits matches that of the syndrome $S$. After that, it


Fig. 3. Bit-width of one syndrome table entry in the case of IECCs correcting errors within two $b$-bit bytes. will execute the operations
$B_{i}=\underline{B}_{i}+e_{i}\left(\bmod 2^{b}-1\right)$
(13)
$B_{j}=\underline{B}_{j}+e_{j}\left(\bmod 2^{b}-1\right)$
(14)
where $1 \leq \mathrm{i}<\mathrm{j} \leq \mathrm{k}+1$ and where 1) $\mathrm{e}_{\mathrm{i}}, \mathrm{e}_{\mathrm{j}} \in \xi_{2}$, 2) $\mathrm{e}_{\mathrm{i}} \in \xi_{2}, \mathrm{e}_{\mathrm{j}}=0$, or 3) $\mathrm{e}_{\mathrm{i}}=0$ and $\mathrm{e}_{\mathrm{j}} \in \xi_{2}$.

## 3. Evaluation and Implementation Strategy

From coding theory [14] it is known that practical implementation of CECCs is extremely expensive. For that purpose it is necessary to equip each node (PC, router, switch, OLT/ONU unit, etc.) with hardware that performs encoding and decoding operations. The software implementation is not feasible, since CECCs use Galois field arithmetic. This type of arithmetic entirely differs from the integer arithmetic of modern processors, which results in the fact that the software encoding and decoding of CECCs requires complex and time-consuming instructions.

Unlike CECCs, IECCs use integer and lookup table operations. Owing to this, they have the potential to be implemented "for free", i.e. without any hardware assistance. To illustrate this, in this section, we will consider two scenarios: the first one, where all nodes are equipped with quad-core processors and the second, where they are equipped with eight-core processors.
3.1. Scenario 1: Network nodes are equipped with quad-core processors

Suppose that all network nodes are equipped with quad-core processors (Fig. 4) having the following specifications [15], [16]:

1) clock rate: $\mathrm{C}_{\mathrm{R} 4}=3.5 \cdot 10^{9} \mathrm{~Hz}$,


Fig. 4. Block diagram of quad-core processor.
2) integer addition/subtraction operation: 1 cycle latency,
3) integer multiplication operation: 3 cycles latency,
4) 128-bit shift operation: 1 cycle latency,
5) modulo reduction operation: 1 cycle latency,
6) comparison operation: 1 cycle latency,
7) access to the L1 cache ( 32 KB per core): 4 cycles latency,
8) access to the L2 cache ( 256 KB per core): 11 cycles latency,
9) access to the L3 cache ( 16 MB shared): 28 cycles latency.

In addition, suppose that the coefficients $\mathrm{C}_{\mathrm{i}}$ are stored in each of the four L1 caches and that the syndrome table is placed into the L3 cache. In that case, instead of one, the decoder will (in parallel) generate four check-bytes:

- Core 1

$$
C_{\underline{B} 1}=\left[C_{1} \cdot \underline{B}_{1}+C_{2} \cdot \underline{B}_{5}+\cdots+C_{k} \cdot \underline{B}_{4 \cdot(k-1)+1}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{4 \cdot(i-1)+1}\left(\bmod 2^{32}-1\right)
$$

(15)

- Core 2

$$
\mathrm{C}_{\underline{B} 2}=\left[C_{1} \cdot \underline{B}_{2}+C_{2} \cdot \underline{B}_{6}+\cdots+C_{k} \cdot \underline{B}_{4 \cdot(k-1)+2}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{4 \cdot(i-1)+2}\left(\bmod 2^{32}-1\right)
$$

## (16)

- Core 3

$$
C_{\underline{B} 3}=\left[C_{1} \cdot B_{3}+C_{2} \cdot \underline{B}_{7}+\cdots+C_{k} \cdot \underline{B}_{4 \cdot(k-1)+3}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{4 \cdot(i-1)+3}\left(\bmod 2^{32}-1\right)
$$

(17)

- Core 4

$$
\begin{equation*}
C_{\underline{B} 4}=\left[C_{1} \cdot \underline{B}_{4}+C_{2} \cdot \underline{B}_{8}+\cdots+C_{k} \cdot \underline{B}_{4 \cdot(k-1)+4}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{4 \cdot(i-1)+4}\left(\bmod 2^{32}-1\right) \tag{18}
\end{equation*}
$$

If we add to this $\mathrm{K} / 128=4 \cdot \mathrm{k} \cdot \mathrm{b} / 128=\mathrm{k}$ shift operations ( K - the number of data bits) we easily calculate that each processing core requires $T_{1}=9 \cdot \mathrm{k}$ clock cycles ( k accesses to the L 1 cache, k integer multiplications, $\mathrm{k}-1$ integer additions, 1 modulo reduction and k shift operations) to compute the value of the check-byte. After finishing this task, each core will take $T_{2}=2$ clock cycles ( 1 integer subtraction and 1 modulo reduction) to perform the following operations:

- Core 1

$$
S_{1}=\left[C_{\underline{B} 1}-\underline{C}_{B 1}\right]\left(\bmod ^{32}-1\right)
$$

(19)

- Core 2

$$
S_{2}=\left[C_{B 2}-C_{B 2}\right]\left(\bmod 2^{32}-1\right)
$$

(20)

- Core 3

$$
S_{3}=\left[C_{\underline{B} 3}-\underline{C}_{B 3}\right]\left(\bmod 2^{32}-1\right)
$$

(21)

- Core 4

$$
S_{4}=\left[C_{B 4}-C_{B 4}\right]\left(\bmod 2^{32}-1\right)
$$

(22)

As explained in Section 2.3, if the data are received in error, the decoder will additionally perform $\mathrm{n}_{\mathrm{TL}}$ table lookups, $\mathrm{n}_{\mathrm{TL}}$ comparisons, 1 integer addition (or two integer additions in parallel) and 1 modulo reduction (or two modulo reductions in parallel). In our case, four such operations will be executed in parallel in $\mathrm{T}_{3}=29 \cdot \mathrm{n}_{\mathrm{TL}}+2$ clock cycles. Hence, if we sum up all processing times, we come to the conclusion that the processor requires
$\mathrm{T}_{4}=\mathrm{T}_{1}+\mathrm{T}_{2}+\mathrm{T}_{3}=9 \cdot k+29 \cdot n_{\mathrm{TL}}+4$
(23)
clock cycles to process K data bits, i.e. one second to decode
$G_{4}=\frac{C_{\mathrm{R} 4}}{\mathrm{~T}_{4} / K}=\frac{\left(3.5 \cdot 10^{9}\right) \cdot 128 \cdot k}{9 \cdot k+29 \cdot n_{\mathrm{TL}}+4}$
data bits. By substituting the values of k and $\mathrm{n}_{\mathrm{TL}}$ in (24) we obtain that $\mathrm{G}_{\min }=$ 15.91 Gbps and $\mathrm{G}_{\max }=27.46 \mathrm{Gbps}$. In other words, it can be concluded that all codes from Table 5, except $(2080,2048)$ DAEC code, have the potential to be used in 10G networks [1]. In addition, from (15)-(22) we see that the analyzed codes can correct asymmetric errors affecting four adjacent 32-bit bytes. This feature is very useful, since in practice channel errors often corrupt two adjacent bytes [5], [6], [7].

### 3.2. Scenario 2: Network nodes are equipped with eight-core processors

Suppose now that the network nodes are equipped with eight-core processors (Fig. 5) having the following specifications [17], [18]:

1) clock rate: $\mathrm{C}_{\mathrm{R} 8}=3.5 \cdot 10^{9} \mathrm{~Hz}$,
2) integer addition/subtraction/multiplication operation: 2 cycles latency,
3) 128-bit shift operation: 1 cycle latency,
4) modulo reduction operation: 1 cycle latency,
5) comparison operation: 1 cycle latency,
6) access to the L1 cache ( 64 KB per core): 3 cycles latency,

Table 5. Memory Requirements and Theoretical Decoding Throughputs for Some Four-Byte Interleaved Integer Codes.

| Integer Codes | $k$ | Memory <br> Requirements for Storing the Coefficients $C_{i}$ | Memory <br> Requirements for Storing the Syndrome Table | Number of Table Lookups | Minimum <br> Theoretical <br> Decoding <br> Throughput |
| :---: | :---: | :---: | :---: | :---: | :---: |
| $(1056,1024) \mathrm{B}_{5 / 32} \mathrm{AEC} \mathrm{Code}$ | 32 | $4 \times 128$ B | 0.14 MB | $1 \leq n_{\text {TL }} \leq 15$ | 19.72 Gbps |
| $(1056,1024) \mathrm{R}_{3 / 32}$ AEC Code | 32 | $4 \times 128$ B | 1.58 MB | $1 \leq n_{\text {TL }} \leq 19$ | 17.01 Gbps |
| $(1056,1032) \mathrm{B}_{5 / 32} \mathrm{AEC}-\mathrm{R}_{3 / 32} \mathrm{AEC} \mathrm{Code}$ | 32 | $4 \times 128$ B | 1.63 MB | $1 \leq n_{\text {TL }} \leq 19$ | 17.01 Gbps |
| $(1056,1032)$ DAEC Code | 32 | $4 \times 128$ B | 7.53 MB | $1 \leq n_{\text {TL }} \leq 21$ | 15.91 Gbps |
| $(2080,2048) \mathrm{B}_{5 / 32} \mathrm{AEC} \mathrm{Code}$ | 64 | $4 \times 256 \mathrm{~B}$ | 0.27 MB | $1 \leq n_{\text {TL }} \leq 16$ | 27.46 Gbps |
| $(2080,2048) \mathrm{R}_{3 / 32} \mathrm{AEC} \mathrm{Code}$ | 64 | $4 \times 256 \mathrm{~B}$ | 3.17 MB | $1 \leq n_{\text {TL }} \leq 20$ | 24.72 Gbps |
| $(2080,2048) \mathrm{B}_{5 / 32} \mathrm{AEC}-\mathrm{R}_{3 / 32} \mathrm{AEC} \mathrm{Code}$ | 64 | $4 \times 256 \mathrm{~B}$ | 3.25 MB | $1 \leq n_{\text {TL }} \leq 20$ | 24.72 Gbps |
| (2080, 2048) DAEC Code | 64 | $4 \times 256 \mathrm{~B}$ | 29.76 MB | $1 \leq n_{\text {TL }} \leq 23$ | -------- ${ }^{1}$ |

[^1]7) access to the L2 cache ( 256 KB per core): 8 cycles latency, 8) access to the L3 cache ( 32 MB shared): 25 cycles latency.


Fig. 5. Block diagram of eight-core processor.
Given this, suppose that the data word has $\mathrm{K}=8 \cdot \mathrm{~b} \cdot \mathrm{k}=256 \cdot \mathrm{k}$ bits and that the coefficients $\mathrm{C}_{\mathrm{i}}$ are stored in each of the eight L1 caches. In addition, assume that that the syndrome table is placed into the L3 cache. In that case, instead of one, we will have eight check-bytes. Their values are calculated as follows:

- Core 1
$\mathrm{C}_{\underline{B} 1}=\left[\mathrm{C}_{1} \cdot \underline{B}_{1}+\mathrm{C}_{2} \cdot \underline{B}_{9}+\cdots+\mathrm{C}_{k} \cdot \underline{B}_{8 \cdot(k-1)+1}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{8 \cdot(i-1)+1}\left(\bmod 2^{32}-1\right)$
(25)
- Core 2
$\mathrm{C}_{\underline{\mathrm{B}} 2}=\left[\mathrm{C}_{1} \cdot \underline{B}_{2}+C_{2} \cdot \underline{B}_{10}+\cdots+C_{k} \cdot \underline{B}_{8 \cdot(k-1)+2}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{8 \cdot(i-1)+2}\left(\bmod 2^{32}-1\right)$
(26)
- Core 3
$\mathrm{C}_{\underline{B} 3}=\left[\mathrm{C}_{1} \cdot \underline{B}_{3}+\mathrm{C}_{2} \cdot \underline{B}_{11}+\cdots+C_{k} \cdot \underline{B}_{8 \cdot(k-1)+3}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{8 \cdot(i-1)+3}\left(\bmod 2^{32}-1\right)$
(27)
- Core 4
$C_{\underline{B} 4}=\left[C_{1} \cdot \underline{B}_{4}+C_{2} \cdot \underline{B}_{12}+\cdots+C_{k} \cdot \underline{B}_{8 \cdot(k-1)+4}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{8 \cdot(i-1)+4}\left(\bmod 2^{32}-1\right)$
- Core 5
$C_{\underline{B} 5}=\left[C_{1} \cdot \underline{B}_{5}+C_{2} \cdot \underline{B}_{13}+\cdots+C_{k} \cdot \underline{B}_{8 \cdot(k-1)+5}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{8 \cdot(i-1)+5}\left(\bmod 2^{32}-1\right)$
(29)
- Core 6
$\mathrm{C}_{\underline{B} 6}=\left[\mathrm{C}_{1} \cdot \underline{B}_{6}+\mathrm{C}_{2} \cdot \underline{B}_{14}+\cdots+C_{k} \cdot \underline{B}_{8 \cdot(k-1)+6}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{8 \cdot(i-1)+6}\left(\bmod 2^{32}-1\right)$
- Core 7
$C_{\underline{B} 7}=\left[C_{1} \cdot \underline{B}_{7}+C_{2} \cdot \underline{B}_{15}+\cdots+C_{k} \cdot \underline{-}_{8 \cdot(k-1)+7}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{8 \cdot(i-1)+7}\left(\bmod 2^{32}-1\right)$
(31)
- Core 8
$\mathrm{C}_{\underline{B} 8}=\left[\mathrm{C}_{1} \cdot \underline{B}_{8}+C_{2} \cdot \underline{B}_{16}+\cdots+C_{k} \cdot \underline{B}_{8 \cdot(k-1)+8}\right]\left(\bmod 2^{32}-1\right)=\sum_{i=1}^{k} C_{i} \cdot \underline{B}_{8 \cdot(i-1)+8}\left(\bmod 2^{32}-1\right)$
(32)

If we add to this $\mathrm{K} / 128=2 \cdot \mathrm{k}$ shift operations, we calculate that each processor requires $\mathrm{T}_{5}=9 \cdot \mathrm{k}$ clock cycles ( k accesses to the L 1 cache, k integer multiplications, $\mathrm{k}-1$ integer additions, 1 modulo reduction and $2 \cdot \mathrm{k}$ shift operations) to compute the check-byte. After finishing this task, the processor will take $\mathrm{T}_{6}=3$ clock cycles ( 1 integer subtraction and 1 modulo reduction) to generate the syndromes

- Core 1

$$
S_{1}=\left[C_{\underline{B} 1}-\underline{C}_{\mathrm{B} 1}\right]\left(\bmod 2^{32}-1\right)
$$

(33)

- Core 2

$$
S_{2}=\left[C_{\underline{B} 2}-\underline{C}_{B 2}\right]\left(\bmod 2^{32}-1\right)
$$

(34)

- Core 3

$$
S_{3}=\left[C_{\underline{B} 3}-\underline{C}_{B 3}\right]\left(\bmod 2^{32}-1\right)
$$

(35)

- Core 4

$$
S_{4}=\left[C_{\underline{B 4} 4}-\underline{C}_{B 4}\right]\left(\bmod 2^{32}-1\right)
$$

(36)

- Core 5

$$
S_{5}=\left[C_{\underline{B} 5}-\underline{C}_{B 5}\right]\left(\bmod 2^{32}-1\right)
$$

(37)

- Core 6

$$
S_{6}=\left[C_{B 6}-\underline{C}_{B 6}\right]\left(\bmod 2^{32}-1\right)
$$

(38)

- Core 7

$$
S_{7}=\left[\mathrm{C}_{\underline{B} 7}-\underline{C}_{B 7}\right]\left(\bmod 2^{32}-1\right)
$$

(39)

- Core 8

$$
\begin{equation*}
S_{8}=\left[C_{\underline{B} 8}-\underline{C}_{\mathrm{B} 8}\right]\left(\bmod 2^{32}-1\right) \tag{40}
\end{equation*}
$$

As in the previous scenario, if the data are received in error, the decoder will perform $\mathrm{n}_{\mathrm{TL}}$ table lookups, $\mathrm{n}_{\mathrm{TL}}$ comparisons, 1 integer addition (or two integer
additions in parallel) and 1 modulo reduction (or two modulo reductions in parallel). In this particular case, eight such operations will be executed in parallel $\mathrm{T}_{7}=26 \cdot \mathrm{n}_{\mathrm{TL}}+6$ clock cycles. So, if we sum up all processing times, we come to the conclusion that the processor requires
$\mathrm{T}_{8}=\mathrm{T}_{5}+\mathrm{T}_{6}+\mathrm{T}_{7}=9 \cdot k+26 \cdot n_{\mathrm{TL}}+9$
(41)
clock cycles to process K data bits, i.e. one second to decode
$G_{8}=\frac{C_{\mathrm{R} 8}}{\mathrm{~T}_{8} / K}=\frac{\left(3.5 \cdot 10^{9}\right) \cdot 256 \cdot k}{9 \cdot k+26 \cdot n_{\mathrm{TL}}+9}$
(42)
data bits. By substituting the values of k and $\mathrm{n}_{\mathrm{TL}}$ in (42) we easily calculate that $\mathrm{G}_{\min }=34.01 \mathrm{Gbps}$ and $\mathrm{G}_{\text {max. }}=57.29 \mathrm{Gbps}$ (Table 6). In this scenario, unlike the previous one, all considered codes (Table 6) have the potential to be used in 10G networks. Moreover, we see that most of them could be candidates for use in 40G networks. In addition, unlike the previous scenario, in this one, the data bytes are interleaved to a depth of eight. This additionally reduces the probability that some errors may be undetected or miscorrected.

## 4. Conclusion

In this chapter, we have reviewed four classes of integer codes capable of correcting multiple asymmetric errors. We have shown that these codes have two important characteristics: first, they use processor-friendly operations, and second, they can be interleaved without delay and without using dedicated hardware. Thanks to these, they have potential to be implemented "for free" in short-range optical networks. To illustrate this, we have shown that the four-byte and eight-byte interleaved codes, implemented on the four-core and eigth-core processor, respectively, achieve theoretical throughputs of several tens of Gbps. In the future, we plan to extend our approach to codes capable of correcting symmetric errors. Unlike the presented ones, these codes would have the potential to be used in long-range optical networks.

Table 6. Memory Requirements and Theoretical Decoding Throughputs for Some Eight-Byte Interleaved Integer Codes.

| Integer Codes | $k$ | Memory <br> Requirements for Storing the Coefficients $C_{i}$ | Memory <br> Requirements for Storing the Syndrome Table | Number of Table Lookups | Minimum <br> Theoretical <br> Decoding <br> Throughput |
| :---: | :---: | :---: | :---: | :---: | :---: |
| $(1056,1024) \mathrm{B}_{5 / 32} \mathrm{AEC} \mathrm{Code}$ | 32 | $8 \times 128$ B | 0.14 MB | $1 \leq n_{\text {TL }} \leq 15$ | 41.74 Gbps |
| $(1056,1024) \mathrm{R}_{3 / 32} \mathrm{AEC}$ Code | 32 | $8 \times 128$ B | 1.58 MB | $1 \leq n_{\text {TL }} \leq 19$ | 36.25 Gbps |
| $(1056,1032) \mathrm{B}_{5 / 32} \mathrm{AEC}-\mathrm{R}_{3 / 32} \mathrm{AEC}$ Code | 32 | $8 \times 128$ B | 1.63 MB | $1 \leq n_{\text {TL }} \leq 19$ | 36.25 Gbps |
| $(1056,1032)$ DAEC Code | 32 | $8 \times 128$ B | 7.53 MB | $1 \leq n_{\text {TL }} \leq 21$ | 34.01 Gbps |
| $(2080,2048) \mathrm{B}_{5 / 32} \mathrm{AEC}$ Code | 64 | $8 \times 256$ B | 0.27 MB | $1 \leq n_{\text {TL }} \leq 16$ | 57.29 Gbps |
| $(2080,2048) \mathrm{R}_{3 / 32} \mathrm{AEC} \mathrm{Code}$ | 64 | $8 \times 256$ B | 3.17 MB | $1 \leq n_{\text {TL }} \leq 20$ | 51.90 Gbps |
| (2080, 2048) $\mathrm{B}_{5 / 32} \mathrm{AEC}-\mathrm{R}_{3 / 32} \mathrm{AEC}$ Code | 64 | $8 \times 256 \mathrm{~B}$ | 3.25 MB | $1 \leq n_{\text {TL }} \leq 20$ | 51.90 Gbps |
| (2080, 2048) DAEC Code | 64 | $8 \times 256 \mathrm{~B}$ | 29.76 MB | $1 \leq n_{\text {TL }} \leq 23$ | 48.47 Gbps |

## References

[1] R. Ramaswani, K. Sivarajan and G. Sasaki, Optical Networks: A Practical Perspective, 3rd ed., Elsevier, Inc., 2010.
[2] J. R. Pierce, "Optical Channels: Practical Limits with Photon Counting," IEEE Trans. Communications, vol. 26, pp. 1819-1821, Dec. 1978.
[3] P. Oprisan and B. Bose, "ARQ in Optical Networks," Proc. IEEE Int’l Symp. Pacific Rim Dependable Computing, pp. 251-257, Dec. 2001.
[4] S. Elmougy, "Some Contributions to Asymmetric Error Control Codes," PhD thesis, Oregon State Univ., 2005.
[5] D. Mello, E. Offer and J. Reichert, "Error Arrival Statistics for FEC Design in Four-Wave Mixing Limited Systems," Proc. Optical Fiber Communications Conference, pp. 529-530, Mar. 2003.
[6] L. James, "Error Behaviour in Optical Networks," PhD thesis, Univ. Cambridge, 2005.
[7] P. Anslow and O. Ishida, "Error Distribution in Optical Links," IEEE 802.3 HSSG Interim Meeting, Nov. 2007.
[8] K. Cho et al., "Performance of Forward-Error Correction Code in $10-\mathrm{Gb} / \mathrm{s}$ RSOA-Based WDM PON," IEEE Photon. Technology Letters, vol. 22, no. 1, pp. 57-59, Jan. 2010.
[9] A. Radonjic and V. Vujicic, "Integer Codes Correcting Burst Errors within a Byte," IEEE Trans. Computers, vol. 62, no. 2, pp. 411-415, Feb. 2013.
[10] A. Radonjic et al.,"Integer Codes Correcting Double Asymmetric Errors," IET Commun., vol. 10, no. 14, pp. 1691-1696, Sep. 2016.
[11] A. Radonjic and V. Vujicic, "Integer Codes Correcting Spotty Byte Asymmetric Errors," IEEE Commun. Lett., vol. 20, no. 12, pp. 2338-2341, Dec. 2016.
[12] A. Radonjic and V. Vujicic, "Integer Codes Correcting Burst and Random Asymmetric Errors within a Byte," J. Franklin Inst., vol. 355, no. 2, pp. 981996, Jan. 2018.
[13] K. Mehlhornand and P. Sanders, Algorithms and Data Structures: The Basic Toolbox, Springer, 2008.
[14] E. Fujiwara, Code Design for Dependable Systems: Theory and Practical Application, John Wiley \& Sons, Inc., 2006.
[15] J. Hennessy and D. Patterson, Computer Architecture: A Quantitative Approach, 5th ed., Elsevier, Inc., 2012.
[16] A. Fog, "The Microarchitecture of Intel, AMD and VIA CPUs: An Optimization Guide for Assembly Programmers and Compiler Makers," Tech. Univ. Denmark, Feb. 2017.
[17] L. Johnsson, "Introduction to HPC architecture," Dept. Computer Sciences, Univ. Houston, Jan. 2014.
[18] http://www.7-cpu.com/cpu/Power7.html


[^0]:    * Corresponding Author address Email: sasa_radonjic@yahoo.com

[^1]:    ${ }^{1}$ The size of the syndrome table exceeds the capacity of the cache.

