Improving Accuracy and Efficiency of Speaker Identification Using K-means and MFCC Algorithms in Noisy Environments

AL-Salem Nasr; Ali Ukasha

doi:10.63318/

Improving Accuracy and Efficiency of Speaker Identification Using K-means and MFCC Algorithms in Noisy Environments

المؤلفون

AL-Salem Nasr Electrical and Electronic Engineering Department, Faculty of Engineering - Wadi Alshatti University https://orcid.org/0000-0002-1937-4614
Ali Ukasha Electrical and Electronic Engineering Department, Faculty of Engineering - Wadi Alshatti University https://orcid.org/0000-0002-1937-4614

DOI:

https://doi.org/10.63318/

الكلمات المفتاحية:

K-means Algorithm، Speaker identification، Artificial Intelligence، Signal to noise ratio (SNR)، Mel-frequency cepstral، Coefficients (MFCC)، Mean Squared Error (MSE)

الملخص

Speaker identification is a critical challenge in audio processing, with significant applications in security and authentication systems. Efforts focus on developing fast and efficient AI-based techniques to identify speakers using features such as pitch and frequency. A speaker recognition system consists of two main stages: feature extraction and matching. This research presents an innovative model aimed at enhancing the accuracy of speaker recognition using K-means and MFCC algorithms. The results demonstrate that the K-means algorithm reduced the error rate from 20% to 0.85%, while the MFCC features achieved an accuracy range between 80% and 99.15%. Additionally, recognition time was significantly improved, decreasing from 0.4092 seconds to 0.0438 seconds, thereby increasing the system's efficiency. Moreover, the system's performance in noisy environments was evaluated using the Signal-to-Noise Ratio (SNR), while the Mean Squared Error (MSE) metric was employed to ensure reliability and confidence in the recognition results. These findings highlight the effectiveness of the proposed algorithms and underscore the system's potential for applications in voice-controlled systems and personal assistants.

التنزيلات

تنزيل البيانات ليس متاحًا بعد.

التنزيلات

PDF (الإنجليزية)

منشور

2025-03-11

إصدار

Volume 3, Issue 1, January-June 2025

القسم

Articles

الرخصة

هذا العمل مرخص بموجب Creative Commons Attribution-NonCommercial 4.0 International License.

تستخدم هذه المجلة رخصة المشاع الإبداعي-غير تجاري نَسب المُصنَّف 4.0 دولي (CC BY-NC 4.0)، والتي تسمح بالاستخدام والمشاركة والتوزيع والاستنساخ بأي وسيط أو صيغة، طالما أنك تعطي الفضل المناسب للمؤلف (المؤلفين) الأصلي والمصدر، وتوفر رابطًا لرخصة المشاع الإبداعي، وتشير إلى ما إذا تم إجراء تغييرات. للاطلاع على نسخة من هذا الترخيص، قم بزيارة /https://creativecommons.org/licenses/by-nc/4.0

حقوق الطبع والنشر للمقالات

يحتفظ المؤلفون بحقوق الطبع والنشر لمقالاتهم المنشورة في هذه المجلة.

كيفية الاقتباس

Nasr, A.-S., & Ukasha, A. (2025). Improving Accuracy and Efficiency of Speaker Identification Using K-means and MFCC Algorithms in Noisy Environments. مجلة جامعة وادي الشاطئ للعلوم البحتة والتطبيقية, 3(1), 107-113. https://doi.org/10.63318/

تنزيل الاقتباسات

Improving Accuracy and Efficiency of Speaker Identification Using K-means and MFCC Algorithms in Noisy Environments

المؤلفون

DOI:

الكلمات المفتاحية:

الملخص

التنزيلات

التنزيلات

منشور

إصدار

القسم

الرخصة

كيفية الاقتباس

إنشاء طلب نشر

المعلومات

اللغة

الكلمات المفتاحية

المنشورات الأخيرة

مطور من قبل

الاستعراض

Quick Links

Quick Links

Quick Links