Hasil Pencarian

Ditemukan 4 dokumen yang sesuai dengan query

Endang Fiansyah

Perbandingan Kinerja Metode Pemelajaran Mesin Dalam Mengklasifikasi dan Memprediksi Pelanggan Churn Dan Loyal Di Aplikasi Indosat Imkas Dan Pede = Comparison Of The Performance Of Machine Learning Methods In In Classifying And Predicting Churn And Loyal Customers In The Indosat Imkas And Pede Applications

"Perusahaan IAPS telah meluncurkan berbagai aplikasi, salah satunya ialah aplikasi Indosat Imkas dan Pede (Ponsel Duit) (berikutnya disebut aplikasi). Permasalahannya ialah masih terdapat sembilan belas persen pengguna valid yang tidak bertransaksi tiap bulan di aplikasi untuk mencapai target yaitu dua puluh persen pengguna valid bertransaksi tiap bulan di aplikasi. Dapat terjadi perubahan strategi bisnis apabila gagal mencapai target dan mengalami kerugian secara finansial. Peneliti menemukan satu masalah yang berdampak paling signifikan yaitu pemberian informasi dan promosi yang tidak berjalan efektif. Diperlukan teknik data mining dan klasifikasi churn yang menggunakan pemelajaran mesin untuk memprediksi pengguna yang tepat untuk pemberian informasi dan promosi. Pertanyaan penelitian ini adalah apa model pemelajaran mesin terbaik dalam melakukan prediksi pengguna yang churn dan loyal. Selain itu, apa faktor-faktor yang mempengaruhi keputusan pelanggan untuk churn dan loyal. Penelitian ini menghasilkan model yang dikembangkan menggunakan mesin pemelajaran dengan pengawasan menggunakan classifier Random Forest merupakan model pemelajaran mesin dengan performa terbaik untuk melakukan klasifikasi dan prediksi pengguna yang churn dan loyal. Selain itu, data ulasan pelanggan Google playstore yang di klasifikasikan kedalam push-pull classification dapat meningkatkan performa classifier. Beberapa fitur yang memengaruhi keputusan churn pelanggan yaitu “debetKreditRatio”, “push_service”. Kedua fitur itu berkaitan erat dengan tingkat layanan di aplikasi, semakin tinggi tingkat layanan yang diterima pengguna maka peluang akan churn rendah. Organisasi hendaknya membuat strategi untuk meningkatkan nilai “debetKreditRatio” dan “push service”. Selanjutnya organisasi dapat mengembangkan strategi retensi untuk pemberian informasi dan promosi yang berbeda untuk pelanggan yang akan churn dan loyal.

The IAPS company has has launched various applications, including the Indosat Imkas and Pede (Ponsel Duit) applications (from now on referred to as applications). The problem is that there is still nineteen percent of valid users who do not transact every month on the application to achieve the target, which is twenty percent of valid users who transact every month on the application. There can be a change in business strategy if it fails to achieve the target and suffers a financial loss. The researcher found one problem that had the most significant impact, namely promotions that did not work effectively. Data mining and churn classification techniques are required that use machines learning to predict the right users for targeted promotions or other strategies. The research question is what is the best machine learning model in predicting churn and loyal users. In addition, another research question is what are the factors that influence the customer's decision to churn and be loyal. This study resulted in a model developed using a supervised learning machine using the Random Forest classifier, which is the best-performing machine learning model for classifying and predicting churn and loyal users. In addition, customer reviews from Google play store data classified into a push-pull classification can also improve the classifier's performance. Several features affect customer churn decisions, namely "debetKreditRatio," "push_service." Both features are closely related to the level of service in the application. The higher the level of service the user receives, the lower the chance of churn. Organizations should develop strategies to increase the value of the "debit credit ratio" and "push service." Furthermore, the organization can develop different promotional approaches for customers who will churn and be loyal."

Jakarta: Fakultas Ilmu Komputer Universitas Indonesia, 2021

TA-pdf

UI - Tugas Akhir Universitas Indonesia Library

Darian Texanditama

Optimalisasi Kinerja Pemelajaran Mesin di Bidang Pendidikan dengan Contoh Kasus Prediksi Mahasiswa Putus Studi di Beberapa Perguruan Tinggi Indonesia = Optimizing Machine Learning Performance in the Field of Education with Case Example of Predicting Student Drop-out at Several Indonesian Universities

"Pemelajaran mesin dikenal sangat berguna dalam menyelesaikan permasalahan prediksi dan klasifikasi melalui pembelajaran pola dan perilaku data yang tersedia. Oleh karena itu, pemelajaran mesin dapat dimanfaatkan di berbagai bidang kehidupan dan industri modern. Namun, kinerja pemelajaran mesin sangat tergantung dari model pemelajaran mesin yang digunakan maupun dari kualitas data yang digunakan untuk pemelajaran. Data yang tidak bersih, tidak representatif, dan ketersediaannya terbatas akan mengurangi kualitas hasil prediksinya.

Penelitian ini bertujuan untuk menguji kombinasi beberapa metode pemrosesan data (yaitu MissForest, GAIN, ENN, dan TabGAN oversampling) dengan model pembelajaran mesin (yaitu model CatBoost dan model klasifikasi biner berbasis neural network) untuk memprediksi kasus mahasiswa putus studi di beberapa universitas di Indonesia menggunakan data dari PDDikti. Penambahan fitur dilakukan untuk memberi label bidang studi terhadap dataset tersebut. Selain penambahan fitur seleksi fitur relevan menggunakan korelasi Pearson serta feature importances juga dilakukan setelah pelatihan model awal. Google Colab dengan bahasa pemrograman Python digunakan untuk menjalankan algoritma pemrosesan data dan pelatihan model.

Hasil penelitian menunjukkan bahwa model CatBoost dengan kombinasi metode imputasi GAIN, undersampling ENN, dan tanpa fitur kelompok bidang studi memberikan F1-score tertinggi yaitu 66,38% dengan nilai precision 71,75% dan nilai recall 61,76%. Apabila digunakan model klasifikasi biner pemelajaran dalam akan didapatkan metrik terbaik F1-score 62,32%. Hasil terbaik penelitian ini menunjukkan peningkatan F1-score sebesar 2,15% dibandingkan dengan F1-score pada penelitian sebelumnya yang menggunakan model CatBoost bersama kombinasi Missforest dan ENN tanpa fitur kelompok

bidang studi. Penelitian ini menunjukkan bahwa oversampling dan undersampling memberikan dampak yang berlawanan terhadap metrik precision dan recall. Penelitian juga menemukan seleksi fitur dapat meningkatkan kinerja model namun tidak berdampak besar dibandingkan teknik-teknik lain misalnya balancing dan optimisasi hyperparameter.

Machine learning is known to be very useful in solving prediction and classification problems
by learning the patterns and behavior of available data. Therefore, machine learning can be utilized in various areas of modern life and industry. However, the performance of machine learning is highly dependent on the machine learning model used as well as on the quality of the data used for learning. Data that is not clean, not representative, and scarce will reduce the quality of the prediction results.
This study aims to test the combination of several data processing methods (namely MissForest, GAIN, ENN, and TabGAN oversampling) with machine learning models (CatBoost and binary classification models based on neural networks) to predict dropout cases at several Indonesian universities using data from PDDikti. The addition of features is done to label data with their respective fields of study. Other than adding features, selection of relevant features using Pearson’s correlation as well as feature importances is also carried out after initial model training. Google Colab with the Python programming language is used to run data processing algorithms and train models.
This study shows that CatBoost with the combination of GAIN imputation, ENN undersampling, and no field of study feature results in the highest F1-score of 66.38%, which are composed of 71.75% in precision and 61.76% in recall. If a deep learning binary classification model is used instead, the best F1-score result is 62.32%. The best result from this study shows an increase in F1-score of 2.15% compared to the F1-score of the previous study (64.23%) which used CatBoost along with a combination of Missforest, ENN and no field of study features. This research shows oversampling and undersampling produce opposite effects on precision and recall scores. Research has also
found that feature selection can improve model performance but does not have a large impact compared to other techniques such as balancing and hyperparameter optimization"

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2022

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Endang Fiansyah

Perbandingan kinerja metode pemelajaran mesin dalam mengklasifikasi dan memprediksi pelanggan Churn dan Loyal di aplikasi Indosat Imkas dan Pede = Comparison of the performance of machine learning methods in in classifying and predicting Churn and Loyal customers in the Indosat Imkas and Pede applications

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2021

TA-pdf

UI - Tugas Akhir Universitas Indonesia Library

Valentinus Paramarta

Perbandingan kinerja Support Vector Machine (SVM), Decision Tree, dan k-Nearest Neighbors untuk memprediksi adiksi internet dan kondisi kesehatan mental: studi kasus mahasiswa Saintek Universitas Indonesia = Performance comparison of Support Vector Machine (SVM), Decision Tree, and k-Nearest Neighbors for internet addiction and mental health status prediction: a study case of STEM Students at Universitas Indonesia

"Semakin tinggi penetrasi penggunaan Internet seseorang, maka akan semakin berpotensi terkena Gangguan Adiksi Internet (GAI) yang dapat berdampak buruk pada status kesehatan mental penggunanya. Mayoritas penduduk Indonesia telah menggunakan layanan Internet selama 2 sampai 3 tahun dengan penggunaan rata-rata di atas 8 jam

perhari. Hal tersebut menunjukkan penggunaan Internet dan potensi dampaknya pada kesehatan mental di Indonesia penting untuk diperhatikan sedini mungkin. Penelitian lain menunjukkan bahwa tingkat kesehatan mental yang dialami seseorang dapat mempengaruhi perilaku penggunaan Internetnya, sehingga menyebabkan munculnya keinginan yang tidak terkendali dan berlebihan dalam pengaksesan Internet. Secara tidak langsung, hal tersebut menyatakan bahwa kesehatan mental seseorang juga dapat diamati melalui tingkah laku serta kebiasaan seseorang dalam menggunakan Internet. Prediksi GAI dan gangguan kesehatan mental mahasiswa UI dilakukan dengan menggunakan algoritma pemelajaran mesin Support Vector Machine (SVM) berdasarkan perilaku penggunaan Internet yang dilakukan. Sampel diambil dari mahasiswa UI rumpun Ilmu Saintek (Ilmu Komputer, Teknik, dan MIPA). Data yang diambil adalah riwayat penulusuran halaman website yang diakses oleh mahasiswa dan hasil kuesioner Internet addiction test (IAT) dan General Health Questionnaire (GHQ-12). Riwayat penelusuran website dijadikan himpunan fitur yang merepresentasikan perilaku penggunaan Internet responden, sedangkan hasil skor kuesioner IAT dan GHQ-12 digunakan untuk menjadi ground truth atau label pada dataset. Tahapan preprocessing yang dilakukan adalah metode Synthetic Minority Over-Sampling Technique (SMOTE) untuk mengatasi ketidak seimbangan persebaran data pada kelas data yang digunakan. Metode SVM selanjutnya dibandingkan dengan performa lainnya seperti Decision Tree dan k-Nearest Neighbor (kNN). Untuk meningkatkan performa akurasinya, peneliti menggunakan metode grid search untuk mendapatkan parameter terbaik. Proses validasi dilakukan menggunakan cross-validation pada metode grid search. Hasil yang didapatkan menunjukkan bahwa performa akurasi tertinggi pada SVM untuk memprediksi GAI adalah 88% pada dataset kedua. Saat dilakukan perbandingan hasil dengan metode pemelajaran mesin Decision Tree dan kNN, didapatkan performa nilai akurasi tertinggi dicapai pada metode Decision Tree dengan nilai akurasi sebesar 96%. Sedangkan untuk prediksi gangguan kesehatan mental, metode SVM mendapatkan nilai performa akurasi tertinggi sebesar 71% pada dataset gabungan. Saat dilakukan perbandingan hasil performa akurasi dengan Decision

Tree dan kNN, didapatkan nilai performa akurasi tertinggi dicapai pada metode kNN sebesar 72%. Hasil penelitian ini menunjukkan bahwa metode grid search meningkatkan performa SVM, Decision Tree, dan kNN karena adanya perubahan nilai parameter.

Excessive internet usage lead to potential Internet Addiction Disorders (IAD) which affect user`s mental health. The mayority of Indonesian people have been used Internet services for 2 until 3 years in their lives with an average use of above 8 hours per day. It shows that an increase of internet usage has a positive potential impact to an increase in mental disorder. Other research shows that the level of mental health experienced by a person can influence his Internet usage behavior, thus causing an uncontrolled and excessive desire to access the Internet. It could be concluded that the mental health can also be observed through one`s behavior and habits in using the Internet. This study predicts the internet addiction disorder (IAD) and mental health disorder status of UI students by using machine learning based on Support vector Machine (SVM) algorithm. This study used behaviour of internet usage for the input. Samples used in this study were taken from Universitas Indonesia`s students with Science and Technology background. The data collection period was set before and after the exam period. Data collected in this study included history of website accessed by students and questionnaires based on Internet addiction test (IAT) and General Health Questionnaire (GHQ-12). Student`s website history would be used as feature data set that represent user internet usage behavior, while the IAT and GHQ-12 questionnaires results were used as the label. The preprocessing stage was carried out using Synthetic Minority Over-Sampling Technique (SMOTE) method to overcome the imbalance of data distribution in class used. Then, student`s website history would be analyzed using machine learning based on SVM algorithm to predict IAT and mental health status. This study also compared other algorithms such as Decision Tree and k-Nearest Neighbor (kNN). The optimization of machine learning model was conducted using grid search method to obtain the best
parameters. The validation of the model would be carried out using the cross-validation obtained from grid search method. Based on the results obtained, it shows that the highest accuracy for predicting internet addiction was obtained from SVM algorithm with 88% accuracy for the second dataset. Comparison with other models showed that Decision Tree obtained the highest accuracy value of 96% for predicting internet addiction. For the prediction of mental health disorder, SVM algorithm obtained the highest accuracy than Decision Tree or kNN. The SVM algorithm can predict with accuracy of 71% with combined dataset. When comparing the accuracy result with the accuracy of Decision Tree and kNN, the highest accuracy value of 72% was achieved by kNN method. The optimal value of accuracy is obtained when the grid search method is performed. The results of this study indicate that the grid search method has succeeded in improving the performance of SVM, Decision Tree, and kNN due to parameter value changes."

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2020

TA-pdf

UI - Tugas Akhir Universitas Indonesia Library

Hasil Pencarian :: Simpan CSV :: Kembali

Hasil Pencarian