The paper addresses the problem of authorship attribution. We assume that texts are generated by distinct probability sources. The proposed method uses a resampling procedure to simulate samples from two texts. We then apply a k-nearest-neighbors two-sample test to check whether the samples were drawn from the same population. The method shows a high ability to distinguish texts of different origin.
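The procedure described above can be illustrated with a minimal sketch. The abstract does not specify the text representation, the resampling scheme, the value of k, or how significance is assessed, so everything below is an assumption made for illustration: samples are drawn with replacement from each text's word list, each sample is turned into a relative word-frequency vector, the statistic is the share of k-nearest neighbors that come from the same text, and the p-value comes from a label-permutation test. The function names `resample_features`, `knn_statistic`, and `knn_two_sample_test` are hypothetical and are not taken from the paper.

```python
import random
from collections import Counter


def resample_features(words, n_samples, sample_size, vocab, rng):
    """Draw n_samples word samples (with replacement) from one text and
    return one relative-frequency vector over vocab per sample."""
    vectors = []
    for _ in range(n_samples):
        counts = Counter(rng.choices(words, k=sample_size))
        vectors.append([counts[w] / sample_size for w in vocab])
    return vectors


def nearest_neighbors(points, k):
    """For every point, return the indices of its k nearest other points
    (squared Euclidean distance)."""
    neighbors = []
    for i, p in enumerate(points):
        dists = sorted(
            (sum((a - b) ** 2 for a, b in zip(p, q)), j)
            for j, q in enumerate(points)
            if j != i
        )
        neighbors.append([j for _, j in dists[:k]])
    return neighbors


def knn_statistic(neighbors, labels):
    """Share of (point, neighbor) pairs in which both carry the same label."""
    same = sum(
        labels[i] == labels[j]
        for i, nbrs in enumerate(neighbors)
        for j in nbrs
    )
    total = sum(len(nbrs) for nbrs in neighbors)
    return same / total


def knn_two_sample_test(x, y, k=3, n_perm=200, seed=0):
    """Permutation version of a k-NN two-sample test (an assumed variant).
    Returns the observed statistic and a one-sided p-value; a small p-value
    suggests the two sets of samples come from different populations."""
    rng = random.Random(seed)
    points = x + y
    labels = [0] * len(x) + [1] * len(y)
    neighbors = nearest_neighbors(points, k)  # independent of the labels
    observed = knn_statistic(neighbors, labels)
    exceed = 0
    for _ in range(n_perm):
        permuted = labels[:]
        rng.shuffle(permuted)
        if knn_statistic(neighbors, permuted) >= observed:
            exceed += 1
    return observed, (exceed + 1) / (n_perm + 1)
```

To compare two texts under these assumptions, tokenize each into a word list, build a shared vocabulary, call `resample_features` once per text, and pass the two lists of vectors to `knn_two_sample_test`. Because the neighbor lists do not depend on the labels, they are computed once and reused across permutations.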
Original language: English
Journal: Proceedings Elmar - International Symposium Electronics in Marine
State: Published - 2015
Externally published: Yes

    Research areas

  • Authorship Attribution, re-sampling, two-sample test, KNN

ID: 5781815