Hasil Pencarian

Ditemukan 2 dokumen yang sesuai dengan query

Muhammad Hanif Fahreza

Sistem Optical Character Recognition Untuk Huruf Arab Pegon = Optical Character Recognition System for Printed Pegon Manuscripts

Aksara Pegon adalah sistem penulisan berbasis Arab yang digunakan untuk menulis bahasa Jawa, Sunda, Madura, dan Indonesia. Karena berbagai alasan, aksara ini telah diturunkan ke ranah kolektor naskah sejarah dan pesantren, sehingga perlu dilestarikan. Salah satu metode pelestarian ini adalah melalui digitalisasi; lebih tepatnya dengan mentranskripsikan isi dari naskah-naskah yang ada ke dalam bentuk teks machine encoded, dimana proses tersebut jika dilakukan secara otomatis disebut juga sebagai OCR, atau Pengenalan Karakter Optik. Sampai saat ini belum ada literatur yang dipublikasikan mengenai sistem OCR untuk aksara ini. Oleh karena itu, penelitian ini bertujuan untuk menjembatani kesenjangan tersebut dengan menyediakan OCR untuk subset tertentu dari naskah Pegon, yaitu naskah Pegon yang dicetak. Penelitian ini memperkenalkan dataset yang disintesis dan yang dianotasi untuk pengenalan teks Pegon cetak. Dataset-dataset ini kemudian digunakan untuk mengevaluasi sistem OCR Arab konvensional yang sudah ada pada domain Pegon, baik versi asli maupun yang dimodifikasi, serta sistem berbasis teknik deep learning yang lebih baru dalam literatur. Hasilnya menunjukkan bahwa teknik deep learning mengungguli teknik konvensional, di mana teknik konvensional gagal mendeteksi teks Pegon sama sekali, sementara sistem yang diusulkan, khususnya menggunakan YOLOv5 untuk segmentasi baris dan arsitektur CTC-CRNN untuk pengenalan teks baris, mencapai nilai F1 sebesar 0,94 untuk segmentasi dan CER 0,03 untuk pengenalan teks.

The Pegon script is an Arabic-based writing system intended for writing the Javanese,
Sundanese, and Indonesian languages. Due to various reasons, this script has been
relegated to the domain of historical manuscript collectors and private Islamic boarding
schools or pesantren, presenting a need for preservation. One of these methods of
preservation is through digitization; more specifically, by transcribing the content of
these existing manuscripts into machine-encoded text, the automated process of which is
referred to as OCR. There has been heretofore no published literature on OCR systems
for this specific script. Hence, this research aims to bridge that gap by providing a
foray into the OCR of a specific subset of Pegon manuscripts, namely of printed Pegon
manuscripts. This research evaluates existing and modified versions of conventional
Arabic OCR systems on the domain of Pegon, as well as the more recent deep learning
techniques in the literature, along with introducing new datasets for use in developing
with said deep learning techniques. The results show the outperformance of these deep
learning techniques over the conventional techniques and with which components of a
Pegon OCR system is proposed.
"

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2023

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Mega Fransiska

Proses Otomatisasi Dan Otentikasi Pengisian Data Kartu Tanda Penduduk Berbasis Convolutional Neural Network = Automation and Authentication Process of ID Card Application Form Based on Convolutional Neural Network

"Digitalisasi proses pengisian data Kartu Tanda Penduduk (KTP) memerlukan proses

otomatisasi dan otentikasi, yang dapat dilakukan dengan proses pembacaan teks pada gambar

KTP oleh komputer secara otomatis serta mengevaluasi kemiripan wajah pada foto KTP dan

swafoto pendaftar. Proses pembacaan data pada KTP secara otomatis disebut juga secagai proses

Optical Character Recognition (OCR), sedangkan pengevaluasian kemiripan wajah dapat

dilakukan dengan model Siamese Network. Baik Siamese Network maupun model untuk OCR

merupakan model yang pada dasarnya digunakan untuk mengolah gambar. Oleh karena itu

digunakan Convolutional Neural Network (CNN) sebagai model dasar pada penelitian ini. Pada

proses OCR dan evaluasi kemiripan wajah dibutuhkan model yang mampu mendeteksi lokasi teks

dan wajah yang akan diekstrak dari gambar, model tersebut merupakan model text detection dan

face detection. Model text detection dan face detection merupakan aplikasi dari model object

detection. Pada model object detection terbaru, dikembangkan model modifikasi CNN yang

mampu mendeteksi obek yang berukuran sangat kecil dan sangat besar, model tersebut dinamakan

Bidirectional Feature Pyramid Network (BiFPN). Setelah mengekstrak lokasi teks, langkah dari

proses OCR selanjutnya adalah mengenali setiap karakter dalam teks (text recognition), yang

dapat dilakukan dengan model Bidirectional Long Short-Term Memory (BiLSTM). Sedangkan

dari wajah yang diekstrak selanjutnya ditentukan apakah berasal dari orang yang sama atau tidak

oleh model Siamese Network. Pada penelitian ini akan dibangun arsitektur CNN Effiception, yang

digabungkan menjadi CNN-BiFPN untuk proses object detection, CNN-BiLSTM, untuk proses

text recognition, dan CNN dalam bentuk Siamese Network untuk mengevaluasi kemiripan wajah

Digitization of ID card applications requires automation and an authentication process,
which can be done by computerized ID card information reading and face's similarity evaluating
on ID card's photo and applicant selfie. The computerized ID card information reading is named
Optical Character Recognition (OCR). While the face's similarity authentication is done by the
Siamese Network model. Both the Siamese Network and OCR model basically used to process
images. Therefore, the Convolutional Neural Network (CNN) became the base model for this
study. Each of OCR and face's similarity authentication required a model that can detect the
location of text and face to be extracted from the image. They are text detection and face detection
model, which are the applications of object detection. The latest object detection model,
EfficientDet, used CNN modification that capable to detect a tiny and huge object at the same
time, is called Bidirectional Feature Pyramid Network (BiFPN). After extracting the location of
the text, the next step of the OCR process is to recognize each character in the text (text
recognition), which can be done with the Bidirectional Long Short-Term Memory (BiLSTM).
Meanwhile, the extracted face, from the selfie and ID card's photo, then be determined either from
the same person or not, by the Siamese Network. The product of this study is the CNN
architecture, Effiception, which is combined into CNN-BiFPN for object detection process, CNNBiLSTM,
for text recognition process, and its modification into Siamese Network architecture to
evaluate the face's similarity"

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library

Hasil Pencarian :: Simpan CSV :: Kembali

Hasil Pencarian