DOISerbia - On the randomness that generates biased samples: The limited randomness approach - Lagogiannis, George; Kontopoulos, Stavros; Makris, Christos

Computer Science and Information Systems 2019 Volume 16, Issue 1, Pages: 205-225
https://doi.org/10.2298/CSIS180310015L
Full text ( 337 KB)


On the randomness that generates biased samples: The limited randomness approach

Lagogiannis George (Agricultural University of Athens, Department of Agricultural Economics and Rural Development, Athens, Greece)
Kontopoulos Stavros (University of Patras, Department of Computer Engineering and Informatics, Patras, Greece)
Makris Christos (University of Patras, Department of Computer Engineering and Informatics, Patras, Greece)

We introduce two new algorithms for creating an exponentially biased sample over a possibly infinite data stream. Such an algorithm exists in the literature and uses O(log n) random bits per stream element, where n is the number of elements in the sample. In this paper we present algorithms that use O(1) random bits per stream element. In essence, what we achieve is to be able to choose an element at random, out of n elements, by sparing O(1) random bits. Although in general this is not possible, the exact problem we are studying makes it possible. The needed randomness for this task is provided through a random walk. To prove the correctness of our algorithms we use a model also introduced in this paper, the limited randomness model. It is based on the fact that survival probabilities are assigned to the stream elements before they start to arrive.

Keywords: Biased reservoir sampling, Markov chain, random walk