Contribution to a conference proceedings/Contribution to a book DZNE-2025-01114

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Transcription of Ottoman Documents using Transformer Based Models | Osmanlica Dok manlarin D n st r c Tabanli Modeller ile Transkripsiyonu

 ;  ;  ;

2025
IEEE

2025 33rd Signal Processing and Communications Applications Conference (SIU) : [Proceedings] - IEEE, 2025. - ISBN 979-8-3315-6655-5 - doi:10.1109/SIU66497.2025.11112382
33rd Signal Processing and Communications Applications Conference, SIU 2025, SileSile, Istanbul, 25 Jun 2025 - 28 Jun 20252025-06-252025-06-28
IEEE 1 - 4 () [10.1109/SIU66497.2025.11112382]

This record in other databases:

Please use a persistent id in citations: doi:

Abstract: Although access to a large number of Ottoman documents has become easier today, the Arabic-Persian-based Ottoman script remains a barrier for interested users in utilizing these documents. To address this challenge, there is a need for automatic transcription systems. While some deep learning-based commercial and academic models exist for Ottoman transcription, no studies have yet explored models based on transformer architectures. This paper introduces an Ottoman transcription system developed using TrOCR, a transformer-based model. Instead of the commonly used two-step approach in the literature, a model was designed to perform both optical character recognition and transcription into Turkish in one step. Additionally, the decoder responsible for language modeling was initialized with a BERT-based model trained on Turkish data, achieving results comparable to the original model. During testing, this model produced outputs more quickly due to improved tokenization performance.


Contributing Institute(s):
  1. Spatial Dynamics of Neurodegeneration (AG Gokce)
Research Program(s):
  1. 351 - Brain Function (POF4-351) (POF4-351)

Appears in the scientific report 2025
Click to display QR Code for this record

The record appears in these collections:
Document types > Events > Contributions to a conference proceedings
Document types > Books > Contribution to a book
Institute Collections > BN DZNE > BN DZNE-AG Gokce
Public records
Publications Database

 Record created 2025-09-22, last modified 2025-10-08


Restricted:
Download fulltext PDF Download fulltext PDF (PDFA)
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)