We consider the problem of estimating the probability of a string
drawn i.i.d. from an unknown distribution. The key feature of this
problem is that the length of the observed string is of the same order
as the size of the underlying alphabet. The classical Turing estimator
(introduced by I. J. Good) provides the traditional answer to this
problem. We introduce a natural scaling formulation to study this
problem and use it to show that probability estimation using the
traditional (Good-)Turing estimator is not  consistent (in a
statistical sense). Drawing from the intuition derived from this
exercise, we introduce a novel probability estimator that is indeed
strongly consistent with the natural scaling model.