Hasil Pencarian

Ditemukan 68847 dokumen yang sesuai dengan query

Vadie Akbar Kalamata

Rancang bangun portable public address system dengan tingkat speech intelligibility tinggi = Design and fabrication of a portable public address system with a high speech intelligibility level

"Public Address System merupakan sebuah sistem yang dapat menguatkan suara secara elektronik maupun akustik yang digunakan untuk menyampaikan informasi ke khalayak ramai. Perkembangan teknologi menjadikan sistem tersebut semakin kecil dan portable. Salah satu contoh portable public address system yang paling umum digunakan adalah megafon. Namun dalam perkembangannya, megafon hanya difokuskan untuk membuat sistem semakin lantang namun respons frekuensi dari sistem tidak disempurnakan. Hal ini menyebabkan speech intelligibility tingkat pemahaman berkurang karena banyak frekuensi yang tidak dapat direproduksi oleh megafon. Penelitian ini membahas tentang perancangan perangkat portable public address system yang memiliki tingkat speech intelligibility tinggi dengan harapan akan memiliki kejelasan suara yang lebih baik dari sistem yang sudah ada. Dari hasil pengujian ditemukan bahwa perangkat penelitian memiliki sound pressure level sebesar 107 dB; 4 dB lebih rendah dibandingkan dengan sistem megafon yang sudah ada. Hasil respons frekuensi perangkat lebih lebar dibandingkan dengan sistem megafon yang sudah ada, yakni dari rentang 200 Hz ndash; 16 KHz. Karakter suara yang dihasilkan perangkat memiliki sifat warm yang lebih nyaman di telinga.

Public Address System is a system designed to amplify sound electronically or acoustically which is used to address an information to the public. With the advancement of technology, the system grew more compact and portable. One of the most used and common portable public address system is the megaphone. Sadly, megaphones development solely focus on how to make the system louder, neglecting the improvement of the frequency response. Thus making the speech intelligibility of the system lessen due to the failure of the reproduction of the frequency response needed. This thesis discusses about the design of a portable public address system with high speech intelligibility level that have a clearer sound output compared to the existing system. From the test, results found that the device has a sound pressure level of 107 dB 4 dB less than the existing megaphone system. The device's frequency response is wider than the existing megaphone system that covers from 200 Hz ndash 16 KHz. The sound character that the device produces has a warmer feel that is more comfortable in the ear."

Depok: Universitas Indonesia, 2017

S68093

UI - Skripsi Membership Universitas Indonesia Library

Muhammad Harish Heruputra

Dampak Keterbukaan Ruang terhadap Speech Intelligibility pada Masjid = The Impact of Spaces' Openness on Speech Intelligibility in Mosques

"Masjid sebagai tempat ibadah umat muslim membutuhkan tingkat kejelasan suara yang tinggi sehingga kekhusyukan ibadah dapat tercapai. Namun, masjid saat ini kebanyakan memiliki kualitas akustik yang buruk akibat penggunaan material yang reflektif terhadap suara di ruangan tertutup. Skripsi ini bertujuan untuk memahami lebih dalam mengenai dampak keterbukaan ruang terhadap speech intelligibility pada masjid. Speech intelligibility adalah ukuran seberapa jelas suatu suara di dalam ruangan yang diukur dengan metode speech transmission index (STI). STI dipengaruhi oleh banyak faktor, di antaranya adalah bising dan dengung ruangan, yang menyebabkan keterbukaan ruang memiliki dampak besar terhadap STI. Skripsi dilakukan dengan menganalisis akustik ruang menggunakan simulasi pada software Fohhn Designer, dengan Masjid UI didesain terbuka dan tertutup sebagai objek untuk mengetahui persebaran STI, intensitas suara (SPL), dan waktu dengung (RT) kedua tipe masjid. Hasil studi menunjukkan bahwa masjid dengan desain terbuka memiliki nilai STI yang lebih tinggi, tetapi perbedaannya tidak begitu signifikan karena desain ruangan dan loudspeaker masjid sudah tergolong baik, ditambah barisan jamaah di kedua tipe masjid juga dapat meningkatkan STI.

Mosque as a worshiping place for muslims requires a high level of speech intelligibility so that the solemnity of worship can be achieved. However, today's mosques mostly have poor acoustic quality due to the use of sound-reflective materials in enclosed room. This thesis aims to understand more about the impact of spaces’ openness on speech intelligibility in mosques. Speech intelligibility is a measure of how clear a voice is in a room and measured by the speech transmission index (STI) method. STI is affected by many factors, two of them are background noise and reverberation of the room, which causes the openness of a space to have a big impact on STI. This thesis analyzes the rooms’ acoustics using Fohhn Designer software simulation, along Masjid UI with both exposed and enclosed design as the objects to determine the distribution of STI, sound intensity (SPL), and reverberation time (RT) of both mosques types. The results show that mosque with an open design has a higher STI, but the difference is not significant due to the good geometry and loudspeakers design the mosque already has, as well as the rows of worshipers that can also increase STI value in both mosques types."

Depok: Fakultas Teknik Universitas Indonesia, 2023

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

The determination of acoustical absorbing materials in the " A-Marwah" room of "A-Akbar" mosque in Surabaya using objective parametera approximation

"The room that used to hold the speech activity needs a good acoustical quality.To determine the acoustical quality of a room the experimental reserach was conducted to find the relationship between the determination of acoustical absorbing materials and speech intelligibility in the "Al-Marwah" room of "Al-Akbar" mosque in Surabaya....."

IPTEKAB

Artikel Jurnal Universitas Indonesia Library

Mohamad Irfan Fanani

Rancang Bangun Sistem Pencarian Kata pada Terjemahan Al-Qur’an Berbasis Website dengan Node.Js dan Google Speech-To-Text API = Design and Develop a Word Search System in Website-Based Translation of The Quran with Node.Js And Google Speech-To-Text API

"Penelitian ini membahas tentang pengembangan sistem pencarian kata pada terjemahan Al-Qur’an berbasis website dengan menggunakan Node.JS, Google Speech-to-Text API, dan alquran.cloud API sebagai komponen utamanya. Masukan sistem berupa suara yang pada proses selanjutnya diubah menjadi teks oleh Google Speech-to-Text API lalu teks digunakan sebagai kata kunci untuk mencari terjemahan menggunakan alquran.cloud API. Keluaran sistem berupa tampilan pada halaman website yang berisikan tabel daftar nama surat dan nomor ayat yang mengandung kata kunci. Pembuatan website menggunakan HTML, CSS, dan fungsi JavaScript untuk menyatukan web API dalam satu website. Fungsi JavaScript yang dibuat untuk melakukan fetch data dan menampilkan tabel keluaran dieksekusi di client, sedangkan fungsi untuk masukan sinyal suara dan transkripsi dieksekusi di server. Hasil percobaan menghasilkan akurasi sebesar 86% pada hasil transkripsi Google Speech-to-Text API yang digunakan pada sistem. Akurasi pada penguji perempuan lebih tinggi 14% dibanding penguji laki-laki dengan akurasi penguji perempuan sebesar 92%. Diamati juga waktu dalam kecepatan proses transkripsi, kecepatan fetch data dari alquran.cloud API, dan kecepatan sistem dalam menampilkan keluaran.

This study discusses the development of a word search system for website-based translation of the Qur'an using Node.JS, Google Speech-to-Text API, and alquran.cloud API as the main components. The system input is in the form of voice which in the next process is converted into text by the Google Speech-to-Text API and then the text is used as keywords to search for translations using the alquran.cloud API. The system output is in the form of a display on a website page that contains a table listing letter names and verse numbers containing keywords. Website are created using HTML, CSS, and JavaScript functions to unify the web API in one website. JavaScript functions that are created to fetch data and display an output table are executed on the client, while functions for voice signal input and transcription are executed on the server. The experimental results yield an accuracy of 86% on the transcription results of the Google Speech-to-Text API used in the system. The accuracy of the female examiners was 14% higher than the male examiners with the female examiners' accuracy of 92%. Also observed is the speed of the transcription process, the speed of fetching data from the alquran.cloud API, and the speed of the system in displaying the output."

Depok: Fakultas Teknik Universitas Indonesia, 2022

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library

Sandford, William Phillips

Effective business speech

New York: McGraw-Hill, 1960

808.5 SAN e

Buku Teks Universitas Indonesia Library

Arief Saferman

Rancang Bangun Sistem Automatic Speech Recognition untuk Bahasa Indonesia Berbasis Wav2Letter dengan Loss Function CTC dan ASG = Development of Automatic Speech Recognition System for Indonesian Language Based on Wav2Letter with Loss Function CTC and ASG

Selama masa pandemi COVID-19, teknologi Automatic Speech Recognition (ASR) menjadi salah satu fitur yang sering digunakan pada komputer untuk mencatat di kelas online secara realtime. Teknologi ini akan bekerja dimana setiap suara yang muncul akan langsung dikenali dan dicatat pada halaman terminal. Dalam penelitian ini, model ASR Wav2Letter akan digunakan menggunakan CNN (Convolution Neural Network) dengan loss function CTC (Connectionist Temporal Classification) dan ASG (Auto Segmentation Criterion). Selama proses pembuatannya, berbagai hyperparameter acoustic model dan language model dari model ASR Wav2Letter terkait dengan implementasi batch normalization¸ learning-rate, window type, window size, n-gram language model, dan konten language model diuji pengaruh variasinya terhadap performa model Wav2Letter. Dari pengujian tersebut, ditemukan bahwa model ASR Wav2Letter menunjukkan performa paling baik ketika acoustic model menggunakan metode ASG dengan learning-rate 9 × 10−5 , window size 0.1, window type Blackman, serta 6-gram language model. Berdasarkan hasil akurasi WER CTC unggul 1,2% dengan 40,36% berbanding 42,11% dibandingkan ASG, namun jika dilihat lamanya epoch dan ukuran file model, loss function ASG memiliki keunggulan hampir dua kalinya CTC, dimana ASG hanya membutuhkan setengah dari jumlah epoch yang dibutuhkan oleh CTC yakni 24 epoch berbanding dengan 12 epoch dan ukuran file model ASG setengah lebih kecil dibandingkan CTC yakni 855,2 MB berbanding dengan 427,8 MB. Pada pengujian terakhir, model ASR Wav2Letter dengan loss function ASG mendapatkan hasil terbaik dengan nilai WER 29,30%. Berdasarkan hasil tersebut, model ASR Wav2Letter dengan loss function ASG menunjukkan perfoma yang lebih baik dibandingkan dengan CTC.

During the COVID-19 pandemic, Automatic Speech Recognition technology (ASR) became one of features that most widely used in computer to note down online class in real-time. This technology works by writing down every word in terminal from voice that is recognized by the system. ASR Wav2Letter model will use CNN (Convolutional Neural Network) with loss function CTC (Connectionist Temporal Classification) and ASG (Auto Segmentation Criterion). While developing Wav2Letter, various hyperparameter from acoustic model and language model is implemented such as batch normalization, learning rate, window type, window size, n-gram language model, and the content of language model are examined against the performance of Wav2Letter model. Based on those examination, Wav2Letter shows best performance when it uses ASG loss function learning rate 9 × 10−5 , window size 0.1, window type Blackman, and 6-gram language model. With that configuration, WER of CTC outplay ASG around 1.2% with 40.36% compare to 42,11%, but another parameter shows ASG are way more superior than CTC with less time epoch training which are 24 epoch for CTC against 12 epoch for ASG and the size of memory model shows CTC has bigger size than ASG with 855.2 MB against 427.8 MB. In the last test, ASR Wav2Letter model with ASG loss function get the best WER value around 29.3%. Based on those results, ASR Wav2Letter Model shows its best performance with ASG loss function than CTC.

Depok: Fakultas Teknik Universitas Indonesia, 2022

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library

Mohammad Salman Alfarisi

Rancang Bangun Sistem Automatic Speech Recognition untuk Bahasa Indonesia Berbasis Wav2Vec 2.0 = Development of Automatic Speech Recognition System for Indonesian Language Based on Wav2Vec 2.0

Salah satu permasalahan yang terdapat pada sistem Automatic Speech Recognition (ASR) yang sudah ada adalah kurangnya transparansi dalam penanganan data suara, yang tentunya membuat adanya keraguan terhadap privasi data tersebut. Di sisi lainnya, untuk mengembangkan sebuah sistem ASR yang memiliki akurasi memadai dan dapat bekerja secara luring membutuhkan jumlah data yang banyak, khususnya data suara yang sudah diiringi dengan transkripnya. Hal ini menjadi salah satu hambatan utama pengembangan sistem pengenalan suara, terutama pada yang memiliki sumber daya minim seperti Bahasa Indonesia. Oleh karena itu, dalam penelitian ini dilakukan perancangan sistem pengenalan suara otomatis berbasis model wav2vec 2.0, sebuah model kecerdasan buatan yang dapat mengenal sinyal suara dan mengubahnya menjadi teks dengan akurasi yang baik, meskipun hanya dilatih data dengan label yang berjumlah sedikit. Dari pengujian yang dilakukan dengan dataset Common Voice 8.0, model wav2vec 2.0 menghasilkan WER sebesar 25,96%, dua kali lebih baik dibandingkan dengan model Bidirectional LSTM biasa yang menghasilkan 50% namun membutuhkan jumlah data dengan label 5 kali lipat lebih banyak dalam proses pelatihan. Namun, model wav2vec membutuhkan sumber daya komputasi menggunakan 2 kali lebih banyak RAM dan 10 kali lebih banyak memori dibandingkan model LSTM

One of the main problems that have plagued ready-to-use Automatic Speech Recognition (ASR) Systems is that there is less transparency in handling the user’s voice data, that has raised concerns regarding the privacy of said data. On the other hand, developing an ASR system from scratch with good accuracy and can work offline requires a large amount of data, more specifically labeled voice data that has been transcribed. This becomes one of the main obstacles in speech recognition system development, especially in low-resourced languages where there is minimal data, such as Bahasa Indonesia. Based on that fact, this research conducts development of an automatic speech recognition system that is based on wav2vec 2.0, an Artificial Model that is known to recognize speech signals and convert it to text with great accuracy, even though it has only been trained with small amounts of labeled data. From the testing that was done using the Common Voice 8.0 dataset, the wav2vec 2.0 model produced a WER of 25,96%, which is twice as low as a traditional Bidirectional LSTM model that gave 50% WER, but required 5 times more labeled data in the training process. However, the wav2vec model requires more computational resource, which are 2 times more RAM and 10 times more storage than the LSTM model.

Depok: Fakultas Teknik Universitas Indonesia, 2022

S-Pdf

UI - Skripsi Membership Universitas Indonesia Library