Turing's formula revisited*

Author: Zhang Zhiyi   Huang Hongwei  

Publisher: Routledge Ltd

ISSN: 0929-6174

Source: Journal of Quantitative Linguistics, Vol.14, Iss.2-3, 2007-08, pp. : 222-241

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Previous Menu Next

Abstract

A simple frequentist's justification of Turing's formula, an improvement to Turing's formula by means of reduced bias, a clarification of the relationships among various objects related to Turing's formula, a conservative confidence interval to Turing's target, and a conservative testing procedure using observed rank-frequencies under a hypothesized known infinite-dimensional multinomial distribution are given in this paper. As an example, the authorship of the nine-stanza poem "Shall I Die?" is tested against Shakespeare's canon and statistically significant evidence is found for a difference in word type usage.