Locally testable code
A locally testable code is a type of error-correcting code for which it can be determined if a string is a word in that code by looking at a small (frequently constant) number of bits of the string. In some situations, it is useful to know if the data is corrupted without decoding all of it so that appropriate action can be taken in response. For example, in communication, if the receiver encounters a corrupted code, it can request the data be re-sent, which could increase the accuracy of said data. Similarly, in data storage, these codes can allow for damaged data to be recovered and rewritten properly.
In contrast, locally decodable codes use a small number of bits of the codeword to probabilistically recover the original information. The fraction of errors determines how likely it is that the decoder correctly recovers the original bit; however, not all locally decodable codes are locally testable.[1]
Clearly, any valid codeword should be accepted as a codeword, but strings that are not codewords could be only one bit off, which would require many (certainly more than a constant number) probes. To account for this, testing failure is only defined if the string is off by at least a set fraction of its bits. This implies words of the code must be longer than the input strings by adding some redundancy.
Definition
To measure the distance between two strings, the Hamming distance is used
The distance of a string  from a code
 from a code  is computed by
 is computed by
Relative distances are computed as a fraction of the number of bits
A code  is called
 is called  -local
-local  -testable if there exists a Turing machine M given random access to an input
-testable if there exists a Turing machine M given random access to an input  that makes at most
 that makes at most  non-adaptive queries of
 non-adaptive queries of  and satisfies the following:[2]
 and satisfies the following:[2]
-  For any  and and , ,![Pr[M^w(1^k)=1]=1](../I/m/c0de4ac479e5e2f82a6bc5d7f006d4bd.png) .  In other words, M accepts given access to any codeword of C. .  In other words, M accepts given access to any codeword of C.
-  For  such that such that , ,![Pr[M^w(1^k)=1]\leq1/2](../I/m/eb758fc7e93ea5b34781dcf503c9fd63.png) . M must reject strings . M must reject strings -far from C at least half the time. -far from C at least half the time.
 
-  For any 
Limits
It remains an open question whether there are any locally testable codes of linear size, but there are several constructions that are considered "nearly linear":[3]
-  Polynomial arbitrarily close to linear; for any  , , . .
-  Functions of the form  , where , where is a function tending toward 0. This makes n closer to linear as k increases. For example: is a function tending toward 0. This makes n closer to linear as k increases. For example:-   
-   for some for some 
-   for for 
 
-  
These have both been achieved, even with constant query complexity and a binary alphabet, such as with  for any
 for any  .
The next nearly linear goal is linear up to a polylogarithmic factor;
.
The next nearly linear goal is linear up to a polylogarithmic factor;  .  Nobody has yet to come up with a linearly testable code that satisfies this constraint.[3]
.  Nobody has yet to come up with a linearly testable code that satisfies this constraint.[3]
Parallels
Locally testable codes have a lot in common with probabilistically checkable proofs (PCPs).  This should be apparent from the similarities of their construction.  In both, we are given  random nonadaptive queries into a large string and if we want to accept, we must with probability 1, and if not, we must accept no more than half the time.  The major difference is that PCPs are interested in accepting
 random nonadaptive queries into a large string and if we want to accept, we must with probability 1, and if not, we must accept no more than half the time.  The major difference is that PCPs are interested in accepting  if there exists a
 if there exists a  so that
 so that  .  Locally testable codes, on the other hand, accept
.  Locally testable codes, on the other hand, accept  if it is part of the code.  Many things can go wrong in assuming a PCP proof encodes a locally testable code.  For example, the PCP definition says nothing about invalid proofs, only invalid inputs.
 if it is part of the code.  Many things can go wrong in assuming a PCP proof encodes a locally testable code.  For example, the PCP definition says nothing about invalid proofs, only invalid inputs.
Despite this difference, locally testable codes and PCPs are similar enough that frequently to construct one, a prover will construct the other along the way.[4]
Examples
Hadamard code
One of the most famous error-correcting codes, the Hadamard code is a locally testable code.  A codeword x is encoded in the Hadamard code to be the linear function  (mod 2).  This requires listing out the result of this function for every possible y, which requires exponentially more bits than its input.  To test if a string w is a codeword of the Hadamard code, all we have to do is test if the function it encodes is linear.  This means simply checking if
 (mod 2).  This requires listing out the result of this function for every possible y, which requires exponentially more bits than its input.  To test if a string w is a codeword of the Hadamard code, all we have to do is test if the function it encodes is linear.  This means simply checking if  for x and y uniformly random vectors (where
 for x and y uniformly random vectors (where  denotes bitwise XOR).
 denotes bitwise XOR).
It is easy to see that for any valid encoding  , this equation is true, as that is the definition of a linear function.  Somewhat harder, however, is showing that a string that is
, this equation is true, as that is the definition of a linear function.  Somewhat harder, however, is showing that a string that is  -far from C will have an upper bound on its error in terms of
-far from C will have an upper bound on its error in terms of  .  One bound is found by the direct approach of approximating the chances of exactly one of the three probes yielding an incorrect result.  Let A, B, and C be the events of
.  One bound is found by the direct approach of approximating the chances of exactly one of the three probes yielding an incorrect result.  Let A, B, and C be the events of  ,
,  , and
, and  being incorrect.  Let E be the event of exactly one of these occurring.  This comes out to
 being incorrect.  Let E be the event of exactly one of these occurring.  This comes out to
This works for  , but shortly after,
, but shortly after,  .  With additional work, it can be shown that the error is bounded by
.  With additional work, it can be shown that the error is bounded by
For any given  , this only has a constant chance of false positives, so we can simply check a constant number of times to get the probability below 1/2.[3]
, this only has a constant chance of false positives, so we can simply check a constant number of times to get the probability below 1/2.[3]
Long code
The Long code is another code with very large blowup which is close to locally testable. Given an input  (note, this takes
 (note, this takes  bits to represent), the function that returns the
 bits to represent), the function that returns the  bit of the input,
 bit of the input,  , is evaluated on all possible
, is evaluated on all possible  -bit inputs
-bit inputs  , and the codeword is the concatenation of these (of length
, and the codeword is the concatenation of these (of length  ).  The way to locally test this with some errors is to pick a uniformly random input
).  The way to locally test this with some errors is to pick a uniformly random input  and set
 and set  , but with a small chance of flipping each bit,
, but with a small chance of flipping each bit,  .  Accept a function
.  Accept a function  as a codeword if
 as a codeword if  .  If
.  If  is a codeword, this will accept
 is a codeword, this will accept  as long as
 as long as  was unchanged, which happens with probability
 was unchanged, which happens with probability  .  This violates the requirement that codewords are always accepted, but may be good enough for some needs.[5]
.  This violates the requirement that codewords are always accepted, but may be good enough for some needs.[5]
Other locally testable codes include Reed-Muller codes (see locally decodable codes for a decoding algorithm), Reed-Solomon codes, and the short code.
References
- ↑ Kaufman, Tali; Viderman, Michael. "Locally Testable vs. Locally Decodable Codes".
- ↑ Ben-Sasson, Eli; Sudan, Madhu. "Robust Locally Testable Codes and Products of Codes" (PDF).
- 1 2 3  Goldreich, Oded. "Short Locally Testable Codes and Proofs (Survey)". CiteSeerX: 10.1 ..1 .110 .2530 
- ↑ Cheraghchi, Mahdi. "Locally Testable Codes.".
- ↑ Kol, Gillat; Raz, Ran. "Bounds on Locally Testable Codes with Unique Tests" (PDF).




