Search Results

Found 171020 documents matching the query

Timotius Victory

Perbandingan Model Deep Learning dan Model Machine Learning dalam Kasus Analisis Sentimen pada Ulasan Pengguna Traveloka = Comparison of Deep Learning and Machine Learning Models in the Case of Sentiment Analysis of Traveloka User Reviews

"Indonesia has one of the largest social media user bases in the world. This drives product and service owners to use social media as a main channel for sales and customer service. Indonesian consumers tend to look for online reviews before deciding on a purchase, so user reviews strongly influence purchasing decisions and business success. Product and service owners therefore need to respond quickly to the sentiment of user reviews to maintain their reputation and avoid a decline in sales. Sentiment analysis is one way to gauge sentiment towards a product or service, and it can follow either a machine learning or a deep learning approach. Machine learning has been widely applied to sentiment analysis of Indonesian-language user reviews, but deep learning remains rarely explored. This study compares the CNN-BiLSTM and BiLSTM-CNN models with logistic regression, support vector machine (SVM), and naïve Bayes. In the first scenario, on Traveloka user reviews, the BiLSTM-CNN model has the highest Precision (85%) and an AUC of 82.14%, while the SVM model has an Accuracy of 83.25% and an F1-Score of 86.53%. In the second scenario, on user reviews of telecommunications providers, SVM performs best with an Accuracy of 78.15%, Precision of 68.78%, F1-Score of 76.33%, and AUC of 77.36%. Based on these results, the machine learning models outperform the deep learning models."

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2024

T-pdf

UI - Tesis Membership Universitas Indonesia Library
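
The record above reports Accuracy, Precision, and F1-Score. As a reading aid, the following minimal sketch shows how these three metrics follow from a binary confusion matrix. The function name `binary_metrics` and the counts are hypothetical and are not taken from the thesis; AUC is left out because it needs ranked scores rather than counts.

```python
def binary_metrics(tp, fp, fn, tn):
    """Return accuracy, precision, recall and F1 from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    # Guard against empty denominators when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical counts for a positive/negative review classifier.
scores = binary_metrics(tp=80, fp=20, fn=10, tn=90)
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

Because F1 combines precision and recall, a model can lead on precision while another leads on F1, which is the pattern the first scenario describes.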

Theresia Gowandi

Analisis kinerja model gabungan convolutional neural network dengan long short-term memory dan gated recurrent unit untuk analisis sentimen berbahasa Indonesia = Performance analysis of combined convolutional neural network with long short-term memory and gated recurrent unit model on sentiment analysis in Indonesian language

"Sentiment analysis is a field of Natural Language Processing that builds systems to recognize opinions in text and classify them as positive or negative sentiment. Many researchers have built models aimed at the best accuracy in sentiment analysis. Three of these models are the Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU), all of which are deep learning architectures. CNN is used for its ability to extract important features from sentence fragments, while LSTM and GRU are used because they retain a memory of previously processed inputs. GRU has a simpler structure than LSTM. The three models can be combined into the LSTM-CNN, CNN-LSTM, GRU-CNN, and CNN-GRU models. Previous studies reported that these combined models have better accuracy than the standard LSTM, GRU, and CNN models. This research analyzes the performance of the combined LSTM-CNN, CNN-LSTM, GRU-CNN, and CNN-GRU models and compares them with the standard LSTM, GRU, and CNN models. The models are implemented on Indonesian-language application review data. The results show that almost all of the combined models have better accuracy than the standard models."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Jauzak Hussaini Windiatmaja

Text Classification untuk Verifikasi Fakta pada Kanal Berita Bahasa Indonesia menggunakan Deep Learning dengan Ensemble Technique = Text Classification for Fact Verification for Indonesian News Channel Using Deep Learning with Ensemble Technique

"Information sources on online news networks are instruments that allow individuals to read news, publish news, and communicate. This has become a trend in a highly mobile society, which makes verifying the facts in news reports very important. With this in mind, a web service-based tool for fact verification was built using deep learning with an ensemble technique. An ensemble technique strategically combines several machine learning models so that a problem is solved using more than one model. To train the models, a dataset of claim and label pairs was built, with claims obtained by crawling Indonesian-language news channels. Three deep learning models with different network architectures and hyperparameters were built and trained on this dataset, and the three trained models were then aggregated to form a new model. To check that the aggregated model performs better than a single model, the deep learning ensemble was compared with the single deep learning models. The ensemble model has an accuracy of 85.18%, while the single models have accuracies of 83.9%, 83.19%, and 81.94%, respectively. These results indicate that the ensemble improves on the fact-verification performance of the three single models. The results also show that, on the same dataset, the deep learning methods outperform other machine learning methods such as naive Bayes and random forest. To validate the performance of the tool, the response time of the web service was measured; the average response time is 6447.9 milliseconds."

Depok: Fakultas Teknik Universitas Indonesia, 2021

T-Pdf

UI - Tesis Membership Universitas Indonesia Library
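
The record above describes aggregating three trained models into one ensemble but does not say how the aggregation is done. The sketch below assumes soft voting (averaging class probabilities and taking the most likely class), which is one common choice and not necessarily the one used in the thesis. The function name `soft_vote` and the probabilities are hypothetical.

```python
def soft_vote(prob_lists):
    """Average per-class probabilities from several models, then pick the argmax.

    prob_lists holds one probability vector per model, all of the same length.
    """
    n_models = len(prob_lists)
    n_classes = len(prob_lists[0])
    mean = [sum(probs[c] for probs in prob_lists) / n_models
            for c in range(n_classes)]
    label = max(range(n_classes), key=lambda c: mean[c])
    return label, mean


# Hypothetical outputs of three models for one claim: [P(class 0), P(class 1)].
label, mean = soft_vote([[0.6, 0.4], [0.3, 0.7], [0.2, 0.8]])
print(label, [round(p, 3) for p in mean])
```

In this example the first model alone would choose class 0, but the averaged prediction chooses class 1, which illustrates how an ensemble can correct a single model's mistake.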

Hartina Hiromi Satyanegara

Pengembangan dan Analisis Metode Hybrid Deep Learning (CNN-MLP dan CNN-LSTM) untuk Peningkatan Akurasi Deteksi Serangan Man In The Middle (MitM) = Development and Analysis of Hybrid Deep Learning (CNN-MLP and CNN-LSTM) Methods to Improve Man in the Middle Detection Accuracy

"Man in the Middle (MitM) attacks have a sizeable impact and can open the way for further attacks, such as phishing. This research discusses a hybrid deep learning approach that can help detect MitM attacks effectively. The hybrid deep learning methods used are CNN-MLP and CNN-LSTM, which combine CNN, MLP, and LSTM. The experiment scenarios apply various feature scaling methods (StandardScaler, MinMaxScaler, and MaxAbsScaler), as well as no feature scaling, before modeling, in order to determine the best hybrid deep learning method for detecting MitM attacks. The dataset used is the Kitsune Network Attack Dataset (ARP MitM Ettercap). The results show that CNN-MLP trained for 10 epochs with MaxAbsScaler has the highest accuracy, 99.93%. In second place, CNN-MLP trained for 10 epochs with StandardScaler has an accuracy of 99.89%."

Depok: Fakultas Teknik Universitas Indonesia, 2022

T-pdf

UI - Tesis Membership Universitas Indonesia Library
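
The record above compares three feature scaling methods. The following pure-Python sketch shows the per-feature arithmetic that StandardScaler, MinMaxScaler, and MaxAbsScaler conventionally perform in their default configuration. The helper names and the sample column are hypothetical; the thesis presumably used the library classes rather than code like this.

```python
import math


def standard_scale(xs):
    """Zero mean, unit variance (population standard deviation)."""
    mean = sum(xs) / len(xs)
    std = math.sqrt(sum((x - mean) ** 2 for x in xs) / len(xs)) or 1.0
    return [(x - mean) / std for x in xs]


def min_max_scale(xs):
    """Map the smallest value to 0 and the largest to 1."""
    lo, hi = min(xs), max(xs)
    span = (hi - lo) or 1.0  # a constant column is left at 0
    return [(x - lo) / span for x in xs]


def max_abs_scale(xs):
    """Divide by the largest absolute value; zeros and signs are preserved."""
    biggest = max(abs(x) for x in xs) or 1.0
    return [x / biggest for x in xs]


# Hypothetical values of one network-traffic feature.
feature = [-4.0, 0.0, 2.0, 4.0]
print(min_max_scale(feature))
print(max_abs_scale(feature))
print([round(v, 3) for v in standard_scale(feature)])
```

Unlike the other two, max-abs scaling does not shift the data, so zero entries stay zero.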

Muhammad Fauzi Rahmad

OPTIMASI HYBRID DEEP LEARNING UNTUK MENDETEKSI SERANGAN BOTNET PADA SISTEM IOT = Hybrid Deep Learning Optimization for Botnet Attacks in IoT Systems

"Deep learning model architectures are becoming more complex every day. However, the larger the model, the more computational power is needed to run it, so not all Internet of Things (IoT) devices can run such large models efficiently. Model optimization techniques are therefore needed. In this study, the author optimizes a hybrid deep learning model using layer weight regularization and a simplified model architecture. The dataset used is N-BaIoT, and model performance is evaluated using accuracy, the confusion matrix, and detection time. At the same level of accuracy, the proposed model's detection time is 0.8 ms faster than that of the reference model."

Depok: Fakultas Teknik Universitas Indonesia, 2022

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Adawiyah Ulfa

Evaluasi Klasifikasi Hubungan Kuantitatif Struktur Aktivitas Molekul dengan Model Hybrid Deep Learning dan Pemilihan Fitur Recursive Feature Elimination pada Inhibitor Dipeptidyl Peptidase-4 = Evaluation of the Classification in Quantitative Structures Activity Relationships of Molecular with Hybrid Deep Learning Models and Selection Features of Recursive Feature Elimination in Dipeptidyl Peptidase-4 Inhibitors

"The development of Dipeptidyl Peptidase-4 (DPP-4) inhibitors with low side effects is much needed for the treatment of type 2 Diabetes Mellitus. Quantitative structure-activity relationship (QSAR) modeling is an approach for analyzing the relationship between chemical structure and activity that is widely used in diabetes drug design. In this thesis, a classification QSAR model is built to predict the activity of compounds as DPP-4 inhibitors, which block the action of the DPP-4 enzyme. Molecules are represented with ECFP and FCFP circular fingerprints, which encode SMILES notation as binary vectors. ECFP and FCFP fingerprints with diameters of 4 and 6 serve as input data for building the classification QSAR model. Classification QSAR with a deep learning approach speeds up the virtual screening of active and inactive compounds among DPP-4 inhibitors. This study uses the hybrid deep learning model 1D CNN-LSTM to classify inhibitor compounds as active or inactive based on their biological activity values, with different proportions of training and test data. In the 1D CNN-LSTM architecture, the 1D CNN acts as the feature extraction stage, and the output of its convolution layer is fed to the LSTM layer. In addition, feature selection with the Random Forest-Recursive Feature Elimination (RF-RFE) method is used to obtain optimal features from the ECFP and FCFP datasets. The study then compares model performance with and without RF-RFE feature selection. The results show that the classification QSAR model using 1D CNN-LSTM with RF-RFE feature selection performs better than the model without feature selection. The 1D CNN-LSTM model with RF-RFE feature selection on ECFP_4 data with 80% training data has an accuracy of 0.9075, sensitivity of 0.9008, specificity of 0.9142, and an MCC of 0.8151."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

T-Pdf

UI - Tesis Membership Universitas Indonesia Library
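
The record above reports sensitivity, specificity, and the Matthews correlation coefficient (MCC). The sketch below shows how these are conventionally computed from confusion-matrix counts for an active/inactive classifier. The counts are hypothetical and are not the thesis data.

```python
import math


def sensitivity(tp, fn):
    """Share of truly active compounds that are predicted active."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Share of truly inactive compounds that are predicted inactive."""
    return tn / (tn + fp)


def mcc(tp, fp, fn, tn):
    """Matthews correlation coefficient, in the range [-1, 1]."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical test-set counts.
tp, fn, tn, fp = 90, 10, 85, 15
print(round(sensitivity(tp, fn), 4),
      round(specificity(tn, fp), 4),
      round(mcc(tp, fp, fn, tn), 4))
```

MCC uses all four cells of the confusion matrix, so it stays informative even when the active and inactive classes differ in size.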

Bella Septina Ika Hartanti

Segmentasi Banjir Daerah Urban di Indonesia dengan Teknik Deep Learning dan Pendekatan Semi-Supervised Learning = Flood Segmentation in Urban Region with Deep Learning Method and Semi-Supervised Approach

"Floods are among the natural disasters that occur frequently around the world, including in Indonesia. They happen when excessive water flow submerges land for a certain period of time. Climate change, extreme weather, uncontrolled urbanization, and complex geographical conditions have contributed to the increase in the frequency and intensity of floods, especially in urban areas. Automatic flood analysis and image-based detection can provide useful guidance and information for decisions that reduce destructive impacts such as loss of life and economic loss; one example is segmentation to support the creation of flood vulnerability maps. However, the small amount of high-resolution labeled data available makes segmentation difficult. The author therefore proposes a semi-supervised approach, mean teacher, built on deep learning techniques. The dataset consists of preprocessed Sentinel-1 C-band SAR imagery. The results show that the proposed model gives a fairly significant improvement of 5% in the IoU metric over a baseline that implements the pseudo-labeling technique."

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2023

T-pdf

UI - Tesis Membership Universitas Indonesia Library
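
Two ideas in the record above can be shown in a few lines: the mean teacher update, in which the teacher's weights are an exponential moving average (EMA) of the student's weights, and the IoU metric for binary segmentation masks. This is a minimal sketch on plain Python lists. The function names, the decay `alpha`, and the sample values are hypothetical, and the thesis's network and training loop are not reproduced.

```python
def ema_update(teacher, student, alpha=0.99):
    """Move each teacher weight a small step towards the student weight."""
    return [alpha * t + (1.0 - alpha) * s for t, s in zip(teacher, student)]


def iou(pred, target):
    """Intersection over union for flat binary masks (1 = flooded pixel)."""
    inter = sum(1 for p, t in zip(pred, target) if p == 1 and t == 1)
    union = sum(1 for p, t in zip(pred, target) if p == 1 or t == 1)
    return inter / union if union else 1.0  # two empty masks agree fully


# Hypothetical weights and masks.
teacher = ema_update(teacher=[0.0, 1.0], student=[1.0, 1.0], alpha=0.9)
score = iou(pred=[1, 1, 0, 0], target=[1, 0, 1, 0])
print(teacher, round(score, 4))
```

During training the student is updated by gradient descent, while the teacher only follows it through this average, which tends to give more stable targets for the unlabeled images.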

Syach Riyan Muhammad Ardiyansyah

Analisis Kinerja Metode BERT-IDEC untuk Deteksi Topik = BERT-IDEC Method Performance Analysis for Topic Detection

"Topic detection is the process of analyzing text data to find the topics it contains. In today's digital era, topic detection is often used to analyze topics and group information by topic. Machine learning makes topic detection faster and more efficient, especially on large text datasets. One machine learning approach that can be used for topic detection is clustering. However, the high dimensionality of the data makes some clustering methods less effective for topic detection. To overcome this, high-dimensional data first needs to undergo dimensionality reduction. Improved Deep Embedded Clustering (IDEC) is a clustering method that performs dimensionality reduction and clustering simultaneously. This study therefore performs topic detection with the IDEC clustering method. The data used are the online news datasets AG News, Yahoo! Answer, and R2. However, IDEC cannot take raw text directly as input; the text must first be converted into a vector representation. This study uses Bidirectional Encoder Representation from Transformers (BERT) as the text representation method. BERT first converts the text data into vector representations, which IDEC then takes as input to perform topic detection. The simulation compares the performance of the IDEC model with BERT text representation against the IDEC model with TF-IDF text representation. The simulation results show that the IDEC model with BERT text representation outperforms the IDEC model with TF-IDF text representation."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2022

S-pdf

UI - Skripsi Membership Universitas Indonesia Library
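
The record above uses TF-IDF as the comparison text representation for BERT. As an illustration, the sketch below computes one common TF-IDF variant (relative term frequency times the natural log of inverse document frequency). Libraries often apply smoothing or normalization that is omitted here, and the exact variant used in the thesis is not stated. The function name `tf_idf` and the toy documents are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: the number of documents in which each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical tokenized headlines.
docs = [["stock", "market"], ["market", "news"], ["football", "news"]]
vectors = tf_idf(docs)
print(vectors[0])
```

Each document becomes a sparse vector with one dimension per vocabulary term, which is why the dimensionality grows quickly on real corpora and why a method such as IDEC reduces it before clustering.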

Denanir Fadila Nasiri

KLASIFIKASI TINDAK PIDANA BERDASARKAN FAKTA PADA DOKUMEN PUTUSAN PENGADILAN DENGAN MODEL DEEP LEARNING = Criminal Offense Classification Based on Facts in Court Decision Documents with Deep Learning Models

"Legal reasoning is a method for applying rules or laws to the facts at hand in order to obtain legal arguments. One legal reasoning method is inductive reasoning, which is based on previously decided cases. Indonesia's Supreme Court, through its Court Decision Directory (Direktori Putusan Pengadilan) site, provides court decision documents online, and the collection is now very large. These documents can be used for legal reasoning activities such as classifying the type of criminal offense. In this research, the authors propose deep learning methods for classifying the type of criminal offense, which can give legal practitioners efficiency and references and make it easier for the public to understand the legal basis of a case. Pre-processing included converting documents to text and cleaning the text. Deep learning models such as LSTM, BiLSTM, CNN+LSTM, and LEAM (Label Embedding Attentive Model) were then compared. Instead of using only the label name in LEAM, experiments were also carried out by adding related keywords for each label to the label embedding. This model consistently performed well in the experiments, and LEAM with additional keywords obtained the best result on an imbalanced dataset, with a macro-average F1-score of 68%."

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2021

T-pdf

UI - Tesis Membership Universitas Indonesia Library
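
The record above reports a macro-average F1-score on an imbalanced dataset. The sketch below shows why that metric is informative in such a setting: it averages the per-class F1 scores with equal weight, so a rare class counts as much as a frequent one. The function name `macro_f1` and the labels are hypothetical and are not drawn from the thesis data.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    f1_scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Hypothetical imbalanced case: 8 documents of class "a", 2 of class "b",
# and a classifier that always predicts "a".
y_true = ["a"] * 8 + ["b"] * 2
y_pred = ["a"] * 10
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(accuracy, round(macro_f1(y_true, y_pred), 4))
```

Here accuracy looks acceptable even though the minority class is never predicted, while the macro-average F1 drops sharply.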

Alrafiful Rahman

Deep Learning Dengan Model Concatenater untuk Mendeteksi Penyakit Paru-Paru Covid-19 dengan Gambar CT Scan = Deep Learning with Concatenate Model to Detect Covid-19 Lung Disease with CT Scan Images

"COVID-19 is a respiratory disease, similar to pneumonia, that has killed millions of people. In January 2020, the World Health Organization (WHO) declared the COVID-19 outbreak a public health emergency of international concern, and it became known as a global pandemic. As of April 1, 2020, 205 countries around the world had reported more than 900,000 confirmed COVID-19 cases and nearly 50,000 deaths. According to the WHO report, the virus has a mortality rate of 2-3%. It is very important to perform diagnostic tests at an early stage, based on criteria such as clinical symptoms and Reverse-Transcription Polymerase Chain Reaction (RT-PCR), so that infected people can be isolated immediately. Diagnosing COVID-19 by imaging is more effective using chest CT images. The DenseNet201, MobileNet, Xception, InceptionV3, ResNet152V2, and VGG19 models are examined for their accuracy in image recognition. To analyze model performance, 1888 lung CT image samples were collected from the official Kaggle website.

Concatenate models built from pretrained CNN architectures gave the following results. The concatenation of ResNet152V2 and VGG19 has an accuracy of 99.65%, sensitivity of 99.66%, precision of 99.66%, recall of 99.66%, specificity of 99.64%, and F-measure of 99.66%. The combination of DenseNet201 and MobileNet at batch sizes 32 and 64 with a learning rate of 0.0001 has an accuracy of 99.65%, sensitivity of 99.64%, precision of 99.64%, recall of 99.64%, specificity of 99.66%, and F-measure of 99.64%. The combination of DenseNet201 and MobileNet at batch sizes 32 and 64 with a learning rate of 0.001, as well as the combination of InceptionV3 and Xception at batch size 32 with a learning rate of 0.0001, has an accuracy of 99.65%, sensitivity of 100%, precision of 99.28%, recall of 100%, specificity of 99.31%, and F-measure of 99.64%."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

T-pdf

UI - Tesis Membership Universitas Indonesia Library

