Writer invariant
Writer invariant, also called authorial invariant or author's invariant, is a property of a text that stays constant for its author: it is similar in all texts by a given author and different in texts by different authors. It can be used to detect plagiarism or to identify the real author of an anonymously published text. In handwritten text recognition, the term also denotes an author's characteristic pattern of writing a letter.[1]
While it is generally recognised that writer invariants exist,[2] there is no agreement on which properties of a text should be used.[2][3] Among the first to be used was the distribution of word lengths;[2] other proposed invariants include average sentence length,[2][3] average word length,[2][3] the frequency of noun, verb or adjective usage,[3] vocabulary richness,[2] and the frequency of function words in general[2][3] or of specific function words.[3]
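As an illustration, several of the candidate invariants listed above can be computed with a few lines of Python. This is a minimal sketch, not the procedure of any cited study: the function name `style_features`, the regex-based tokenisation, and the use of the type-token ratio as a stand-in for vocabulary richness are all assumptions made for the example.

```python
import re
from collections import Counter


def style_features(text):
    """Compute a few simple candidate invariants for a text (illustrative only)."""
    # Naive tokenisation (an assumption): a word is a run of letters,
    # optionally followed by an apostrophe and more letters.
    words = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    # Naive sentence splitting (an assumption): sentences end at '.', '!' or '?'.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")

    total = len(words)
    length_counts = Counter(len(w) for w in words)
    return {
        # Relative frequency of each word length.
        "word_length_distribution": {
            n: c / total for n, c in sorted(length_counts.items())
        },
        "avg_word_length": sum(len(w) for w in words) / total,
        # Average number of words per sentence.
        "avg_sentence_length": total / len(sentences),
        # Type-token ratio: distinct words / all words, one crude
        # measure of vocabulary richness (an assumption of this sketch).
        "type_token_ratio": len(set(words)) / total,
    }
```

Calling `style_features` on two texts and comparing the resulting values side by side gives a rough sense of how much each property varies between authors or within a single author's work.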
Of these, average sentence lengths can be very similar in works of different authors[2][3] or vary significantly even within a single work;[3] average word lengths likewise turn out to be very similar in works of different authors.[3] Analysis of function words shows promise because they are used by authors unconsciously.[2][3][4]
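A function-word comparison can be sketched as follows. The short word list `FUNCTION_WORDS`, the helper names, and the choice of the sum of absolute differences as a distance are hypothetical and chosen only for illustration; the cited sources do not prescribe this particular list or measure.

```python
import re
from collections import Counter

# Illustrative, deliberately short list of English function words
# (an assumption; real studies use larger, language-specific lists).
FUNCTION_WORDS = ("the", "of", "and", "to", "a", "in", "that", "it", "but", "with")


def function_word_profile(text, function_words=FUNCTION_WORDS):
    """Return the relative frequency of each function word in the text."""
    words = re.findall(r"[a-z]+", text.lower())
    if not words:
        raise ValueError("text contains no words")
    counts = Counter(words)
    return [counts[w] / len(words) for w in function_words]


def profile_distance(text_a, text_b, function_words=FUNCTION_WORDS):
    """Sum of absolute differences between two function-word profiles.

    A smaller value means the two texts use the listed function words
    at more similar rates. The distance measure is an assumption.
    """
    profile_a = function_word_profile(text_a, function_words)
    profile_b = function_word_profile(text_b, function_words)
    return sum(abs(x - y) for x, y in zip(profile_a, profile_b))
```

Under this sketch, texts by the same author would be expected to yield a smaller `profile_distance` than texts by different authors, provided the samples are long enough for the frequencies to be stable.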
References
- Nosary, Ali; Heutte, Laurent; Paquet, Thierry; Lecourtier, Yves. "A Step Towards the Use of Writer's Properties for Text Recognition" (PDF). Laboratoire Perception, Systèmes, Information (PSI), Université de Rouen. Retrieved 2007-09-06.
- Peng, Roger D.; Hengartner, Nicolas W. (August 2002). "Quantitative analysis of literary styles (General)" (PDF). The American Statistician 56: 175–185. doi:10.1198/000313002100. Retrieved 2009-05-27.
- Fomenko, A. T.; Fomenko, V. P.; Fomenko, T. G. (2005). "The authorial invariant in Russian literary texts. Its application: who was the real author of the 'Quiet Don'?". History: Fiction or Science?. Bellevue, WA: Delamere. pp. 425–444. ISBN 2-913621-06-6.
- Buckland, Warren. "Forensic Semiotics". The Semiotic Review of Books 10 (3). Retrieved 2007-09-06.