Programming by permutation

Programming by permutation, sometimes called "programming by accident" or "by-try programming", is an approach to software development wherein a programming problem is solved by iteratively making small changes (permutations) and testing each change to see if it behaves as desired. This approach sometimes seems attractive when the programmer does not fully understand the code and believes that one or more small modifications may result in code that is correct.

This tactic is not productive when:

  1. a series of small modifications can easily introduce new undetected bugs into the code, leading to a "solution" that is even less correct than the starting point
  1. many false starts and corrections usually occur before a satisfactory endpoint is reached
  2. in the worst case the original state of the code may be irretrievably lost

Programming by permutation gives little or no assurance about the quality of the code produced -- it is the polar opposite of Formal verification.

Programmers are often compelled to program by permutation when an API is insufficiently documented. This lack of clarity drives others to copy and paste from reference code which is assumed to be correct, but was itself written as a result of programming by permutation.

In some cases where the programmer can logically explain that exactly one out of a small set of variations must work, programming by permutation leads to correct code (which then can be verified) and makes it unnecessary to think about the other (wrong) variations.

Example

For example, the following code sample in C (intended to find and copy a series of digits from a larger string) has several problems:

char* buffer = "123abc";
char destination[10];
                                                                                
int i = 0;
int j = 0;
int l = strlen(buffer);

while (i < l) {
    if (isdigit(buffer[i])) {
        destination[j++] = buffer[i++];
    }
    ++i;
}

destination[j] = '\0';
printf("%s\n", destination);

First of all, it doesn't give the right answer. With the given starting string, it produces the output "13", when the correct answer is "123". A programmer who does not recognize the structural problems may seize on one statement, saying "ah, there's an extra increment". The line "++i" is removed; but testing the code results in an infinite loop. "Oops, wrong increment." The former statement is added back, and the line above it is changed to remove the post-increment of variable i:

    if (isdigit(buffer[i])) {
        destination[j++] = buffer[i];
    }

Testing the code now produces the correct answer, "123". The programmer sighs in contentment: "There, that did it. All finished now." Additional testing with various other input strings bears out this conclusion.

Of course, other problems remain. Because the programmer does not take the trouble to fully understand the code, they go unrecognized:

While the solution is correct for a limited set of input strings, it is not fully correct, and since the programmer did not bother to understand the code, the errors will not be discovered until a later testing stage, if at all.

Also known as "Trial and Error", "Generate and Test", "Poke and Hope",[1] "The Birdshot Method" and the "Million Monkeys Programming Style".

References

  1. Mathematics and Computer Education (University of Michigan) 21: 78. 1987. Missing or empty |title= (help)
This article is issued from Wikipedia - version of the Thursday, November 12, 2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.