The Forum > General Discussion > What is the data that you're censoring?
What is the data that you're censoring?
- Pages:
-
- Page 1
- 2
-
- All
The National Forum | Donate | Your Account | On Line Opinion | Forum | Blogs | Polling | About |
Syndicate RSS/XML |
|
About Us :: Search :: Discuss :: Feedback :: Legals :: Privacy |
My purpose here is to show that the idea that a set of data contains particular content is simplistic, and that it doesn't have to be true.
Suppose that we want to make available some pictures. For the sake of simplicity, assume that each picture requires 1MB of data. Suppose we have picture files named a, b, c, d and e. We follow the following procedure:
1. Construct a file consisting of 1MB of random data. Call this A.
2. Perform byte-wise exclusive ORs of A and the picture file a, to produce the file B.
3. Now do the same with B, and the picture file b, to produce the file C.
4. Continue with this scheme to produce files D, E and F.
Now, clearly we can get back any of the pictures by obtaining two consecutive files in the set, but no individual file can be said to contain a picture. Further, all the files have the property that they appear to consist of random data. File A (our original) is not qualitatively different from the others.
Of course, we don't have to stick to this linear scheme. We could derive other sequences of files from any of the new files (including A). Given such a set of files, it would be impossible to determine the starting point.
Having created the files, we would upload them to different web sites. In such a scenario, it is not sensible to assert that a particular site holds some impugned content, or that someone was downloading it. A person wanting some particular content would just need the identities of the two files used to create it. The content only exists when they combine the two files.