Beyond OCR: Handwritten manuscript attribute understanding

He, S., 2017, [Groningen]: University of Groningen. 157 p.

Research output: ThesisThesis fully internal (DIV)Academic

Copy link to clipboard


  • Title and contents

    Final publisher's version, 112 KB, PDF document

  • Chapter 1

    Final publisher's version, 44 MB, PDF document

  • Chapter 2

    Final publisher's version, 332 KB, PDF document

  • Chapter 3

    Final publisher's version, 3 MB, PDF document

  • Chapter 4

    Final publisher's version, 978 KB, PDF document

  • Chapter 5

    Final publisher's version, 792 KB, PDF document

  • Chapter 6

    Final publisher's version, 2 MB, PDF document

  • Chapter 7

    Final publisher's version, 825 KB, PDF document

  • Chapter 8

    Final publisher's version, 3 MB, PDF document

  • Bibliography

    Final publisher's version, 88 KB, PDF document

  • Summary

    Final publisher's version, 77 KB, PDF document

  • Samenvatting

    Final publisher's version, 75 KB, PDF document

  • Publications

    Final publisher's version, 67 KB, PDF document

  • Acknowledgements

    Final publisher's version, 48 KB, PDF document

  • Complete thesis

    Final publisher's version, 54 MB, PDF document

  • Propositions

    Final publisher's version, 30 KB, PDF document

  • Sheng He
Knowing the author, date and location of handwritten historical documents is very important for historians to completely understand and reveal the valuable information they contain. In this thesis, three attributes, such as writer, date and geographical location, are studied by analyzing the handwriting style contained in manuscript images and develop novel algorithms to estimate these attributes on the basis of pattern recognition methods.

Handwriting styles are different between different individuals and implicitly encoded in the handwritten patterns when they were written down. This information can be used for writer identification. In this thesis, different features, such as textural-based, textural-free and grapheme-based features, are
designed and extracted to present the handwriting style of historical handwritten documents in particular. These features are computational efficient and explainable to end users.

According to paleographical expertise, handwriting styles change gradually, continuously and in general within a relatively limited time frame, within 25 years. Modeling the gradual style evolution can be used to date and localize historical manuscripts. This thesis designed a system to date the charters
produced between 1300 and 1550 CE in the Medieval Dutch language area.

We have shown that designed shape features can be applied quickly and conveniently, without much training efforts on new data sets and problems, even in conditions where the amount of labeled data is relatively limited.
Original languageEnglish
QualificationDoctor of Philosophy
Awarding Institution
  • Schomaker, Lambert, Supervisor
  • Burgers, Jan , Supervisor, External person
  • Liu, Cheng-Lin, Assessment committee, External person
  • Biehl, M. , Assessment committee
  • Postma, Eric, Assessment committee, External person
Award date17-Mar-2017
Place of Publication[Groningen]
Print ISBNs978-90-367-9643-9
Electronic ISBNs978-90-367-9642-2
Publication statusPublished - 2017

View graph of relations

Download statistics

No data available

ID: 40224439