Hasil Pencarian

Ditemukan 149668 dokumen yang sesuai dengan query

Vinezha Panca

Klasifikasi multikelas kanker otak dengan metode multiple support vector machine recursive feature elimination dan twin support vector machine = Multiclass brain cancer classification using multiple support vector machine recursive feature elimination and twin support vector machine

"ABSTRAK

Kanker merupakan salah satu penyebab kematian terbesar di seluruh dunia. Secara khusus, kanker otak adalah kanker yang terjadi pada sistem saraf pusat. Salah satu hal yang dapat dilakukan untuk penelitian kanker otak menggunakan machine learning adalah melakukan pendeteksian jenis kanker otak dengan memanfaatkan microarray data. Permasalahan tersebut merupakan masalah klasifikasi multikelas. Dengan menggunakan pendekatan one versus one, akan terbentuk sebanyak k k-1 /2 masalah dua kelas, di mana k menunjukkan jumlah kelas. Karena data kanker otak memiliki fitur yang sangat banyak, perlu dilakukan seleksi fitur. Pada penelitian ini, akan diimplementasikan metode Multiple Multiclass Support Vector Machine Recursive Feature Elimination MMSVM-RFE sebagai metode seleksi fitur, dan Twin Support Vector Machine TWSVM sebagai metode klasifikasi. Pada metode MMSVM-RFE dilakukan pelatihan SVM-RFE pada setiap masalah dua kelas, sehingga setiap masalah dua kelas memiliki pengurutan fitur masing-masing. Sebagai metode klasifikasi, TWSVM memiliki tujuan untuk mencari hyperplane masing ndash; masing kelas sedemikian sehingga data kelas satu sedekat mungkin terhadap suatu hyperplane namun sejauh mungkin dengan hyperplane lainnya. Rata-rata akurasi tertinggi pada simulasi menggunakan kernel linear pada MMSVM-RFE dan kernel linear pada TWSVM adalah 95,33 dengan menggunakan 200 fitur. Rata-rata akurasi tertinggi pada simulasi menggunakan kernel linear pada MMSVM-RFE dan kernel RBF pada TWSVM adalah 87 dengan 70 fitur. Sedangkan apabila proses validasi juga dilakukan pada seleksi fitur, rata-rata akurasi tertinggi yang diperoleh adalah 90,67 dengan menggunakan 90 fitur.

ABSTRACT

Cancer is one of main causes of death worldwide. Brain cancer is a type of cancer which occurs at central nervous system. Taking advantage from microarray data, machine learning methods can be applied to help brain cancer prediction according to its types. This problem can be referred as a multiclass classification problem. Using one versus one approach, the multiclass problem with k classes can be transformed into k k 1 2 binary class problems. The huge amount of features makes it necessary to use feature selection. In this research, Multiple Multiclass Support Vector Machine Recursive Feature Elimination MMSVM RFE method is implemented as the feature selection method, and Twin Support Vector Machine TWSVM method is implemented as the classification method. The main concept of MMSVM RFE is to train SVM RFE at each binary problem so that each binary problem will have their own arrangements of feature. As a classification method, TWSVM is trained to find two hyperplanes, each representative of its own class. The data of one class must be as near as possible from its representative hyperplane while also must be as far as possible from the other hyperplane. In the simulation which uses linear kernel on MMSVM RFE and linear kernel on TWSVM, the highest average accuracy is 95,33 , using 200 features. In the simulation which uses linear kernel on MMSVM RFE and RBF kernel on TWSVM, the highest average accuracy is 87 , using 70 features. In the case where the feature selection process is included in doing validation, the highest average accuracy is 90,67 , using 90 features."

2016

S66302

UI - Skripsi Membership Universitas Indonesia Library

Selly Anastassia Amellia Kharis

Klasifikasi kanker paru-paru menggunakan support vector machine dengan pemilihan fitur berbasis support vector machine recursive feature elimination dan artificial bee colony = Classification of lung cancer using support vector machine and feature selection based on support vector machine recursive feature elimination and artificial bee colony

"Kanker merupakan kelompok penyakit yang ditandai dengan pertumbuhan dan penyebaran sel-sel abnormal yang tidak terkendali. Jika penyebaran sel tersebut tidak terkendali, hal ini dapat menyebabkan kematian. Berdasarkan American Cancer Society, pendeteksian dini terhadap sel kanker dapat meningkatkan angka harapan hidup seorang pasien lebih dari 97 . Banyak penelitian yang telah meneliti mengenai klasifikasi kanker menggunakan microarray data. Microarray data terdiri dari ribuan fitur gen namun hanya memiliki puluhan atau ratusan sampel. Hal tersebut dapat menurunkan akurasi klasifikasi sehingga perlu dilakukannya pemilihan fitur sebelum proses klasifikasi.

Pada penelitian ini dilakukan dua tahap pemilihan fitur. Pertama, support vector machine recursive feature elimination SVM-RFE digunakan untuk prefilter gen. Kedua, hasil pemilihan fitur SVM-RFE diseleksi kembali dengan menggunakan artificial bee colony ABC yang merupakan algoritma optimisasi berdasarkan perilaku lebah madu. Penelitian ini menggunakan dua dataset, yaitu data kanker paru-paru Michigan dan Ontario dari Kent Ridge Biomedical Dataset.

Hasil percobaan dengan menggunakan SVM-RFE dan ABC menunjukkan nilai akurasi klasifikasi yang lebih tinggi daripada tanpa pemilihan fitur, SVM-RFE, dan ABC, yaitu 98 untuk data kanker paru-paru Michigan dengan menggunakan 100 fitur dan 97 untuk data kanker paru-paru Ontario dengan menggunakan 70 fitur.

Cancer is a group of diseases characterized by the uncontrolled growth and spread of abnormal cells. If the spread is not controlled, it can result in death. Based on American Cancer Society, early detection of cancerous cells can increase survival rates for patients by more than 97 . Many study showed new aspect of cancer classification based microarray data. Microarray data are composed of many thousands of features genes and from tens to hundreds of instances. It can decrease classification accuracy so feature selection is needed before the classification process
In this paper, we propose two stages feature selection. First, support vector machine recursive feature elimination recursive feature elimination SVM RFE is used to prefilter the genes. Second, the SVM RFE features selection result is selected again using Artificial Bee Colony ABC which is an optimization algorithm based on a particular intelligent behavior of honeybee swarms. This research conducted experiments on Ontario and Michigan Lung Cancer Data from Kent Ridge Biomedical Dataset.
Experiment results demonstrate that this approach provides a higher classification accuracy rate than without feature selection, SVM RFE, and ABC, 98 for Michigan lung cancer dataset with using 100 features and 97 for Ontario lung cancer dataset with using 70 features."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2018

T49733

UI - Tesis Membership Universitas Indonesia Library

Alifah

Klasifikasi retinopati diabetik menggunakan Support Vector Machine (SVM) dengan metode seleksi fitur Recursive Feature Elimination (RFE) dan chi-square = Classification of retinopathy diabetic using Support Vector Machine (SVM) with feature selection method Recursive Feature Elimination (RFE) and chi-square

"Diabetes Melitus (DM) merupakan gangguan sistem metabolik akibat pankreas tidak memproduksi cukup insulin atau tubuh tidak mampu menggunakan insulin yang ada secara efektif. Menderita diabetes dalam jangka waktu panjang dapat mengakibatkan berbagai macam komplikasi salah satu di antaranya adalah Retinopati diabetik. Retinopati diabetik adalah kelainan pada bagian mata yang disebabkan oleh adanya kerusakan dan penyumbatan pada pembuluh darah di bagian belakang mata (retina). Pada penelitian kali ini akan di gunakan data retinopati diabetik dengan menggunakan metode seleksi fitur Recursive Feature Elimination (RFE) dan Chi-Square dan akan di klasifikasi menggunakan Support Vector Machine.

Diabetic retinopathy is one of the complication of diabetes, which is an eye disease that can cause blindness. Its happen because of damage of retina as a result of the long illness of diabetic melitus. People usually do research using image data in diabetic patients. This paper present about diabetic retinopathy will extracting with feature selection. In this study, we use data diabetic patients who will be extracted with a feature selection method. Feature selection used in this study is Recursive Feature Elimination (RFE) and Chi-Square. For classification of diabetic retinopathy has been done by Support Vector Machine (SVM). From the experimental result with various tunning hyperparameters, the classification model can obtain the accuracy between 97%-100% for both methods."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2019

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Muslar Alibasya

Deteksi penyakit kanker paru-paru berdasarkan data microarray dengan seleksi fitur Support Vector Machine-Recursive Feature Elimination (SVM-RFE) = Detection of lung cancer based on microarray data using feature selection Support Vector Machine-Recursive Feature Elimination (SVM-RFE)

"Kanker paru-paru merupakan jenis kanker yang dimulai dan tumbuh di dalam paru-paru. Kanker paru-paru terjadi ketika sel-sel yang melapisi bronkus dan bronkiolus tumbuh secara tidak terkendali. Hal ini dapat menyebabkan kematian jika tidak ditangani dengan cepat dan tepat. Pengklasifikasian dini merupakan salah satu solusi yang tepat untuk mengurangi jumlah kematian yang disebabkan oleh kanker paru-paru. Pendekatan machine learning dapat digunakan untuk mengklasifikasi kanker paru-paru. Dalam penelitian ini, pengklasifikasian dilakukan dengan menggunakan data microarray. Data microarray memiliki fitur yang sangat banyak. Oleh karena itu, dibutuhkan seleksi fitur agar proses klasifikasi berlangsung optimal. Pada penelitian ini, penulis mengusulkan metode Support Vector Machine-Recursive Feature Elimination (SVM-RFE) untuk metode seleksi fitur. Data microarray yang digunakan diambil dari National Center for Biotechnology Information (NCBI) yang merupakan sebuah website online database. Pada penelitian ini, penulis menggunakan SVM-RFE sebagai metode seleksi fitur untuk mengeliminasi fitur yang kurang relevan. Setelah itu pendekatan k-fold cross-validation digunakan sebagai pembagian data, dan beberapa machine learning classifier yaitu Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), dan Extreme Gradient Boosting (XGBoost) digunakan sebagai metode klasifikasi. Dari hasil simulasi menunjukkan bahwa hasil terbaik berdasarkan nilai akurasi, precision, recall dan running time diperoleh oleh metode klasifikasi SVM dengan nilai akurasi 100%, precision 100%, recall 100% dan running time 5,42 detik.

Lung cancer is a type of cancer that begins in the lungs. Lung cancer occurs when the cells that cover the bronchi and bronchioles grow uncontrollably. This can lead to death if not treated quickly and appropriately. Early classification is one of the appropriate solution to reduce the number of deaths caused by lung cancer. Machine learning approach can be used to classify lung cancer. In this research, classification is done using microarray data which has a lot of features. Therefore, feature selection is applied such that the classification process used the optimal number of features. In this study, the researcher proposes the Support Vector Machine-Recursive Feature Elimination (SVM- RFE) method for the feature selection method. The microarray data was taken from the National Center for Biotechnology Information (NCBI), which is an online database website. In this study, the researcher used SVM-RFE as a feature selection method to eliminate irrelevant features. Afterwards, the k-fold cross-validation method and several machine learning classifiers, namely Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Extreme Gradient Boosting (XGBoost) will be used as classification methods. In the final stage, the researcher will analyze the performance results of the proposed method based on the accuracy and running time of each classifier. The simulation results show that the best results based on the values of accuracy, precision, recall and running time are obtained by the SVM classification method with a value of 100% accuracy, 100% precision, 100% recall and running time of 5.42 seconds."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Nurul Maghfirah

Klasifikasi data kanker menggunakan support vector machines dengan pemilihan fitur menggunakan correlated based support vector machines-recursive feature elimination = Cancer classification using support vector machines with feature selection using correlated based support vector machines recursive feature elimination

"Kematian yang disebabkan oleh kanker diperkirakan akan terus meningkat, padahal jumlah kematian ini dapat dikurangi dengan adanya deteksi dini. Salah satunya adalah dengan klasifikasi data kanker. Data kanker yang digunakan merupakan data kanker berdimensi tinggi dengan ribuan fitur, tetapi tidak semua fitur yang ada merupakan fitur yang relevan. Oleh karena itu, perlu adanya proses seleksi fitur. Untuk meningkatkan tingkat akurasi yang dihasilkan, digunakan sebuah metode seleksi fitur yang meninjau adanya korelasi antar gen, yaitu CSVM-RFE. Pada metode tersebut, data yang ada diproyeksikan dan diubah menjadi sebuah data baru dengan ekstraksi fitur, dan kemudian dilakukan proses seleksi fitur. Penggunaan dua metode tersebut pada klasifikasi tiga data kanker yang ada terbukti menghasilkan tingkat akurasi yang tinggi, pada data kanker kolon tingkat akurasi yang didapatkan adalah sebesar 96.6, pada kanker prostat sebesar 98.9, dan pada kanker lymphoma sebesar 98,6.

The number of death caused by cancer expected to rise over two decades, whereas the number of death can be reduced by early detection. One of them is cancer classification. Cancer dataset is a high dimensional dataset that consist of thousands of features, but not all of these features are relevant. Therefore, it is necessary to remove the redundant features using feature selection. Feature selection can also improve the accuracy of classification. Many feature selection methods do not consider the correlated genes, so we need a new feature selection method that consider the correlated genes. It is CSVM RFE, in this method the existing data is projected and converted into a new data with feature extraction. These two methods are applied to the cancer datasets, and produce the accuracy of 96.6 using colon cancer dataset, 98.9 using prostate cancer dataset, and 98.6 using lymphoma cancer dataset."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2017

S69588

UI - Skripsi Membership Universitas Indonesia Library

Anas Bachtiar

Perbandingan Metode Seleksi Fitur Support Vector Machine-Recursive Feature Elimination (SVM-RFE) dan One Dimensional Naïve Bayes Classifier (1-DBC) pada Klasifikasi Kanker Prostat = Selecting Features Subsets based on Support Vector Machine-Recursive Features Elimination and One-Dimensional Naïve Bayes Classifier for Classification of Prostate Cancer

"Kematian yang disebabkan oleh kanker diperkirakan akan terus meningkat, terutama untuk kanker prostat. Penyakit ini adalah jenis kanker yang paling umum untuk pria di dunia. Jumlah kematian dapat dikurangi dengan deteksi dini menggunakan machine learning. Salah satunya adalah klasifikasi data kanker prostat. Data kanker yang digunakan memiliki berbagai fitur, tetapi tidak semua fitur adalah fitur penting. Dalam penelitian ini, kami menggunakan Support Vector Machine-Recursive Feature Elimination (SVM-RFE) dan One Dimensional Naïve Bayes Classifier (1-DBC) sebagai metode seleksi fitur. Dalam kedua metode itu akan mendapatkan peringkat untuk setiap fitur. Penggunaan kedua metode ini dalam klasifikasi data kanker prostat menghasilkan tingkat evaluasi yang tinggi. Kedua metode ini dapat menghasilkan tingkat akurasi 100%, precision 100%, dan recall 100% pada metode klasifikasi Random Forest. Dan menghasilkan tingkat akurasi 95%, precision 100%, dan recall 94,11% pada metode klasifikasi SVM. Dalam evaluasi tambahan, SVM-RFE memiliki running time lebih rendah dari 1-DBC.

Death caused by cancer is expected to continue to increase, especially for prostate cancer. This disease is the most common type of cancer for men in the world. The number of deaths can be reduced by early detection using machine learning. One of them is the classification of prostate cancer data. Cancer data used has various features, but not all features are essential features. In this study, we use Support Vector Machine-Recursive Feature Elimination (SVM-RFE) and One Dimensional Naïve Bayes Classifier (1-DBC) as a feature selection method. In both methods, it will get a rating for each feature. The use of these two methods in the classification of prostate cancer data produces a high level of evaluation. Both of these methods can produce 100% accuracy, 100% precision, and 100% recall in the Random Forest classification method. And it produces 95% accuracy, 100% precision, and 94.11% recall in the SVM classification method. In the additional evaluation, SVM-RFE has a running time lower than 1-DBC."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2019

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Dian Puspita Sari

Klasifikasi sekuens protein Coronavirus penyebab COVID-19 menggunakan metode Particle Swarm Optimization-Support Vector Machine dan Seleksi Fitur Random Forest-Recursive Feature Elimination = Classification of coronavirus protein sequences cause COVID-19 disease using Particle Swarm Optimization-Support Vector Machine Method and Feature Selection of Random Forest-Recursive Feature Elimination

"Coronavirus yaitu kelompok virus yang menginfeksi sistem pernapasan yang dapat menyebabkan infeksi pernapasan ringan maupun berat. Salah satu virus yang termasuk ke dalam coronavirus adalah SARS-CoV-2. Penyakit yang disebabkan oleh virus SARS-CoV-2 disebut COVID-19. COVID-19 pertama kali terdeteksi pada tahun 2019 di Wuhan, China. Penyebaran COVID-19 sangat cepat dengan tingkat kematian yang tinggi terus terjadi di berbagai negara sehingga penyakit ini berstatus pandemi. Skripsi ini menyelesaikan masalah klasifikasi virus SARS-CoV-2 dengan menggunakan data sekuens protein coronavirus. Seleksi fitur pada data sekuens protein coronavirus menggunakan metode seleksi fitur Random Forest-Recurisive Feature Elimination (RF-RFE). Setelah dilakukan seleksi fitur, dilakukan klasifikasi menggunakan pendekatan machine learning dengan metode Support Vector Machine (SVM) dan Particle Swarm Optimization-Support Vector Machine (PSO-SVM). Hasil terbaik performa rata-rata akurasi, spesifisitas, dan sensitivitas untuk metode SVM berturut-turut adalah 93,43%, 98,06%, dan 88,84% pada data pelatihan sebesar 80%. Untuk metode PSO-SVM, hasil terbaik rata-rata akurasi dan spesifisitas adalah 98,48% dan 98,57% pada data pelatihan sebesar 80%, sedangkan hasil terbaik rata-rata sensitivitas adalah 98,96% pada data pelatihan sebesar 90%. Oleh karena itu, pada penelitian ini dapat disimpulkan bahwa metode PSO-SVM menghasilkan performa yang lebih baik dibandingkan dengan metode SVM.

Coronaviruses are a group of viruses that infect the respiratory system that can cause mild or severe respiratory infections. One of the viruses that belongs to the coronavirus is SARS-CoV-2. The disease caused by the SARS-CoV-2 virus is called COVID-19. COVID-19 was first detected in 2019 in Wuhan, China. The spread of COVID-19 is very fast with a high mortality rate that continues to occur in various countries so that this disease has a pandemic status. This thesis solves the problem of classifying the SARS-CoV-2 virus using coronavirus protein sequence data. Feature selection on coronavirus protein sequence data used the Random Forest-Recursive Feature Elimination (RF-RFE) feature selection method. After feature selection, classification is carried out using a machine learning approach with the Support Vector Machine (SVM) and Particle Swarm Optimization-Support Vector Machine (PSO-SVM) methods. The best results of the average performance of accuracy, specificity, and sensitivity for the SVM method are 93.43%, 98.06%, and 88.84%, respectively, for training data of 80%. For the PSO-SVM method, the best results on average accuracy and specificity are 98.48% and 98.57% on training data of 80%, while the best results on average sensitivity are 98.96% on training data of 90%. Therefore, in this study it can be concluded that the PSO-SVM method produces better performance than the SVM method."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Arfiani

Klasifikasi data infark serebri menggunakan multiple support vector machine dengan seleksi fitur information gain = Cerebral infarction data classification using multiple support vector machine with information gain feature selection

"Stroke merupakan penyakit yang menempati urutan ketiga sebagai penyebab kematian terbesar di dunia setelah penyakit jantung dan kanker. Stroke juga menduduki posisi pertama sebagai penyakit yang dapat menyebabkan kecacatan, baik ringan maupun berat. Salah satu jenis stroke yang umum terjadi adalah infark serebri. Di Indonesia, jumlah penderita stroke, terutama infark serebri, semakin meningkat setiap tahunnya. Tidak hanya terjadi pada seseorang yang berusia lanjut, namun infark serebri juga dapat terjadi pada seseorang yang masih muda dan produktif. Oleh sebab itu, pendeteksian dini terhadap infark serebri sangatlah penting. Berbagai metode medis selalu digunakan untuk mengklasifikasi infark serebri, namun dalam penelitian ini, akan digunakan metode machine learning. Metode yang diusulkan yaitu Multiple Support Vector Machine dengan Seleksi Fitur Information Gain (MSVM-IG). MSVM-IG merupakan metode baru yang menggunakan support vector sebagai data baru untuk selanjutnya dilakukan seleksi fitur dan evaluasi performa. Data yang digunakan berupa data numerik hasil CT Scan yang diperoleh dari RSUPN dr. Cipto Mangunkusumo, Jakarta. Berdasarkan hasil uji coba, metode yang diusulkan mampu mencapai nilai akurasi sebesar 88,71%. Sehingga, metode MSVM-IG ini dapat menjadi salah satu alternatif untuk membantu praktisi medis dalam mengklasifikasi infark serebri.

Stroke is a disease that ranks third as the biggest cause of death in the world after heart disease and cancer. Stroke also occupies the first position as a disease that can cause disability, both mild and severe. One type of stroke that is common is cerebral infarction. In Indonesia, the number of stroke patients, especially cerebral infarction, is increasing every year. Not only occurs in someone who is elderly, but cerebral infarction can also occur in someone who is young and productive. Therefore, early detection of cerebral infarction is very important. Various medical methods are always used to classify cerebral infarction, but in this study, machine learning methods would be used. The proposed method is Multiple Support Vector Machine with Information Gain Feature Selection (MSVM-IG). MSVM-IG is a new method that uses support vector as a new dataset, then feature selection step and performance evaluation are performed. The data used in the form of numerical data results of CT scan obtained from RSUPN Dr. Cipto Mangunkusumo, Jakarta. Based on the results, the proposed method is able to achieve an accuracy value of 88.71%. Thus, the MSVM-IG could be an alternative to assist medical practitioners in classifying cerebral infarction."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2019

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Melati Vidi Jannati

Klasifikasi kanker paru-paru menggunakan support vector machine dengan pemilihan fitur berbasis fungsi kernel = Classification of lung cancer using support vector machine with feature selection based on kernel function

"Klasifikasi data kanker menggunakan microarray data menjadi salah satu cara untuk mendapatkan pengobatan yang lebih tepat. Kendala yang terdapat adalah karakteristik dari microarray yang memiliki fitur yang sangat banyak. Seringkali fitur tersebut tidak begitu informatif bagi pengklasifikasian sehingga perlu adanya suatu cara untuk memilih fitur-fitur yang mengandung informasi yang penting. Salah satu cara tersebut adalah dengan pemilihan fitur. Pada penelitian ini, metode pemilihan fitur yang digunakan berdasarkan clustering dengan fungsi kernel. Fitur-fitur yang sudah terpilih kemudian diklasifikasikan menggunakan metode Support Vector Machine.

Evaluasi dari klasifikasi pada penelitian ini melibatkan K-Fold Cross Validation, metode tersebut akan membagi data secara acak, tetapi merata sehingga akurasi yang didapat juga merata. Hasil akurasi tersebut dilakukan dengan berbagai uji terhadap parameter yang berkaitan seperti K partisi, nilai dan fitur-fitur yang digunakan. Pada proses klasifikasi tanpa pemilihan fitur tingkat akurasinya mencapai 89.68 dengan k partisi sebanyak 6 sementara dengan 5 fitur akurasinya menjadi 95.87 pada partisi sebanyak 10.

Classification of cancer using microarray data is one way to get a more precise treatment. The obstacle on classification data is the characteristics of microarray data that is having many features. These features are often not so informative for classification, so it needs a way to select the features that contain important information. One way is by selection feature. In this research, the method of selection features that are used based on clustering with kernel function. Features that are already selected then classified using Support Vector Machine.
Evaluation of classification in this research involves a K Fold Cross Validation, that methods split data randomly but uniformly so that it can reach all of accuracy. The results of accuracy data was done with different test against related parameters such as K partition, the value of and the features that are used. On the classification process without selection features rate of accuracy reached on 89.68 with k partition number 6 while with the 5 features obtained 95.87 on partition number 10."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2016

S66852

UI - Skripsi Membership Universitas Indonesia Library

Qisthina Syifa Setiawan

Perbandingan metode Naïve Bayes dan Support Vector Machine dengan seleksi fitur grey wolf optimization untuk klasifikasi data kanker serviks = Comparison of Naïve Bayes and Support Vector Machine with grey wolf optimization feature selection for cervical cancer data classification

"Serviks atau leher rahim merupakan salah satu bagian dari sistem alat reproduksi wanita. Salah satu penyakit yang dapat menyerang serviks adalah kanker. Di dunia, kanker serviks adalah salah satu kanker yang menyebabkan kematian dan keganasan yang paling umum terjadi pada wanita. Kanker serviks merupakan penyakit yang memiliki peluang sembuh cukup besar jika terdeteksi sejak dini. Seiring dengan perkembangan teknologi dalam berbagai bidang, termasuk dalam bidang medis, maka pendeteksian dini kanker serviks dapat dilakukan dengan klasifikasi menggunakan bantuan dari metode klasifikasi machine learning. Pada penelitian ini, metode klasifikasi machine learning yang digunakan untuk mengklasifikasikan kanker serviks adalah metode Naïve Bayes (NB) dan Support Vector Machine (SVM) dengan seleksi fitur Grey Wolf Optimization (GWO). Seleksi fitur GWO merupakan seleksi fitur metode wrapper yang digunakan pada penelitian ini untuk mengeliminasi fitur-fitur tidak relevan dalam mengklasifikasikan data kanker serviks, agar NB dan SVM dapat mengklasifikasi dengan lebih akurat. Sehingga, metode ini disebut sebagai metode NB–GWO dan SVM–GWO. Data kanker serviks yang digunakan pada penelitian ini merupakan data numerik dari hasil citra MRI yang diperoleh dari Departemen Radiologi RSUPN Dr. Cipto Mangunkusumo. Berdasarkan hasil penelitian dengan seleksi fitur GWO, metode NB–GWO menghasilkan rata-rata akurasi, recall, dan f1-score tertinggi masing-masing sebesar 96,30%, 96,08%, 97,93%, dan 96,30%, sedangkan metode SVM–GWO menghasilkan rata-rata akurasi dan f1-score tertinggi masing-masing sebesar 95,37% dan 95,36% dengan kernel Linier, rata- rata presisi tertinggi sebesar 97,56% dengan kernel Polinomial, serta rata-rata recall tertinggi sebesar 99,75% dengan kernel RBF. Kemudian, berdasarkan hasil klasifikasi tanpa seleksi fitur GWO, metode NB menghasilkan rata-rata akurasi, presisi, recall, dan f1-score tertinggi masing-masing sebesar 91,98%, 95,21%, 92,90%, 91,95%, sedangkan metode SVM menghasilkan rata-rata akurasi, recall, dan f1-score tertinggi sebesar 92,13%, 99,24%, dan 92,19% dengan kernel RBF, serta rata-rata presisi tertinggi sebesar 93,59% dengan kernel Polinomial. Dengan demikian, metode seleksi fitur GWO dapat meningkatkan kinerja dari NB dan SVM dalam mengklasifikasikan data kanker serviks. Selanjutnya, berdasarkan hasil perbandingan kinerja dari NB–GWO dan SVM–GWO, maka secara keseluruhan metode NB–GWO menghasilkan kinerja yang lebih baik dalam mengklasifikasikan data kanker serviks dibandingkan dengan SVM–GWO.

Cervix is one part of the female reproductive system. One of the diseases that can attack the cervix is cancer. In the world, cervical cancer is one of the cancers that cause death and malignancy that is most common in women. Cervical cancer is a disease that has a considerable chance of recovery if detected early. Along with the development of technology in various fields, including in the medical field, the early detection of cervical cancer can be done by classification using the help of machine learning classification methods. In this study, the machine learning classification method used to classify cervical cancer was Naïve Bayes (NB) and Support Vector Machine (SVM) with Grey Wolf Optimization (GWO) feature selection. GWO feature selection is a wrapper feature selection method used in this study to eliminate irrelevant features in classifying cervical cancer data, so that NB and SVM can classify more accurately. Thus, this method is referred to as the NB–GWO and SVM–GWO. Cervical cancer data used in this study is numerical data from MRI obtained from the Department of Radiology RSUPN Dr. Cipto Mangunkusumo. Based on the results of the study with GWO feature selection, NB– GWO produced the highest average accuracy, recall, and f1-score of 96.30%, 96.08%, 97.93%, and 96.30% respectively, while SVM–GWO produced the highest average accuracy and f1-score of 95.37% and 95.36% respectively with Linear kernel, the highest precision average of 97.56% with Polynomial kernel, and the highest recall average of 99.75% with RBF kernel. Then, based on the results of classification without GWO feature selection, the NB produced the highest average accuracy, precision, recall, and f1- score of 91.98%, 95.21%, 92.90%, 91.95% respectively, while SVM produced the highest average accuracy, recall, and f1-score of 92.13%, 99.24%, and 92.19% with RBF kernel, and the highest average precision of 93.59% with Polynomial kernel. Thus, GWO feature selection method was able to improve the performance of NB and SVM in classifying cervical cancer. Furthermore, based on the results of performance comparison from NB– GWO and SVM–GWO, the overall method of NB–GWO resulted in better performance in classifying cervical cancer data compared to SVM–GWO."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2021

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library

<< 1 2 3 4 5 6 7 8 9 10 >>

Hasil Pencarian :: Simpan CSV :: Kembali

Hasil Pencarian