S. Christodoulakis, "Estimating record selectivities", Information Systems, vol. 8, no. 2, pp. 105-115, 1983. doi: 10.1016/0306-4379(83)90035-2
https://doi.org/10.1016/0306-4379(83)90035-2
In this paper we examine the problem of modelling data base contents and user requests. This modelling is necessary in analytic data base performance evaluation studies in order to estimate the number of records of a file that have to be retrieved in response to user(s) requests. The cpu, io, and telecommunication costs of the system are directly or indirectly expressed in terms of these quantities.We first show that certain assumptions-used for modelling data base contents, data placement on devices and user requests often are not satisfied in actual data base environments. Thereafter we provide more detailed modelling techniques based on a multivariate statistical model, and we demonstrate their use in improving data base performance.