Development of Speech Command Control Based TinyML System for Post-Stroke Dysarthria Therapy Device

Bambang Riyanta, Henry Ardian Irianta, Berli Paripurna Kamiel

Abstract


Post-stroke dysarthria (PSD) is a widespread outcome of a stroke. To help in the objective evaluation of dysarthria, the development of pathological voice recognition and technology has a lot of attention. Soft robotics therapy devices have been received as an alternative rehabilitation and hand grasp assistance for improving activity daily living (ADL). Despite the significant progress in this field, most soft robotic therapy devices use a complex, bulky, lack of pathological voice recognition model, large computational power, and stationary controller. This study aims to develop a portable wirelessly multi-controller with a simulated dysarthric vowel speech in Bahasa Indonesia and non-dysarthric micro speech recognition, using tiny machine learning (TinyMl) system for hardware efficiency. The speech interface using INMP441, compute with a lightweight Deep Convolutional Neural network (DCNN) design and embedded into ESP-32. Feature model using Short Time Fourier Transform (STFT) and fed into CNN. This method has proven useful in micro-speech recognition with low computational power in both speech scenarios with a level of accuracy above 90%. Realtime inference performance on ESP-32 using hand prosthetics, with 3-level household noise intensity respectively 24db,42db, and 62db, and has respectively resulted from 95%, 85%, and 50% Accuracy. Wireless connectivity success rate with both controllers is around 0.2 - 0.5 ms.


Keywords


Post Stroke Dysarthria; Dysarthic Speech Recognition; SCR; ASR; Micro Speech; KWS; TinyML; Edge Controller Devices; STFT-CNN.

Full Text:

PDF

References


C. M. J. M. Dourado Jr, S. P. P. da Silva, R. V. M. da Nóbrega, A. C. da S. Barros, P. P. R. Filho, and V. H. C. de Albuquerque, “Deep learning IoT system for online stroke detection in skull computed tomography images,” Computer Networks, vol. 152, pp. 25–39, Apr. 2019, doi: 10.1016/j.comnet.2019.01.019.

S. Wang, H. Zhai, L. Wei, B. Shen, and J. Wang, “Socioeconomic status predicts the risk of stroke death: A systematic review and meta-analysis,” Preventive Medicine Reports, vol. 19, p. 101124, Sep. 2020, doi: 10.1016/j.pmedr.2020.101124.

F. Herpich and F. Rincon, “Management of Acute Ischemic Stroke,” Critical Care Medicine, vol. 48, no. 11, pp. 1654–1663, Oct. 2020, doi: 10.1097/ccm.0000000000004597.

R. Chiaramonte and M. Vecchio, “A Systematic Review of Measures of Dysarthria Severity in Stroke Patients,” PM&R, vol. 13, no. 3, pp. 314–324, Oct. 2020, doi: 10.1002/pmrj.12469.

K. Brown and K. Spencer, “Dysarthria following Stroke,” Seminars in Speech and Language, vol. 39, no. 1, pp. 015–024, Jan. 2018, doi: 10.1055/s-0037-1608852.

Z. Mou, Z. Chen, J. Yang, and L. Xu, “Acoustic properties of vowel production in Mandarin-speaking patients with post-stroke dysarthria,” Scientific Reports, vol. 8, no. 1, Sep. 2018, doi: 10.1038/s41598-018-32429-8.

M.-Y. Liaw et al., “Respiratory muscle training in stroke patients with respiratory muscle weakness, dysphagia, and dysarthria – a prospective randomized trial,” Medicine, vol. 99, no. 10, p. e19337, Mar. 2020, doi: 10.1097/md.0000000000019337.

R. Chiaramonte, P. Pavone, and M. Vecchio, “Speech rehabilitation in dysarthria after stroke: a systematic review of the studies,” European Journal of Physical and Rehabilitation Medicine, vol. 56, no. 5, Nov. 2020, doi: 10.23736/s1973-9087.20.06185-7.

R. Islam, M. Tarique, and E. Abdel-Raheem, “A Survey on Signal Processing Based Pathological Voice Detection Techniques,” IEEE Access, vol. 8, pp. 66749–66776, 2020, doi: 10.1109/access.2020.2985280.

S. Pinto et al., “Treatments for dysarthria in Parkinson's disease,” The Lancet Neurology, vol. 3, pp . 547-556, 204, doi: doi: 10.1016/S1474-4422(04)00854-3.

A. F. Rumbach, E. Finch, and G. Stevenson, “What are the usual assessment practices in adult non-progressive dysarthria rehabilitation? A survey of Australian dysarthria practice patterns,” Journal of Communication Disorders, vol. 79, pp. 46–57, May 2019, doi: 10.1016/j.jcomdis.2019.03.002.

P. Balzan, C. Tattersall, and R. Palmer, “Non-invasive brain stimulation for treating neurogenic dysarthria: A systematic review,” Annals of Physical and Rehabilitation Medicine, vol. 65, no. 5, p. 101580, Sep. 2022, doi: 10.1016/j.rehab.2021.101580.

M. Icht, “Improving speech characteristics of young adults with congenital dysarthria: An exploratory study comparing articulation training and the Beatalk method,” Journal of Communication Disorders, vol. 93, p. 106147, Sep. 2021, doi: 10.1016/j.jcomdis.2021.106147.

I. Díaz et al., “Development of a robotic device for post-stroke home tele-rehabilitation,” Advances in Mechanical Engineering, vol. 10, no. 1, Jan. 2018, doi: 10.1177/1687814017752302.

A. A. Khan, S. K. Ranjha, M. U. Akram, S. G. Khawaja, and A. Shaukat, “Neurotransmission cognitive theory: A novel approach for non-invasive brain stimulation using mechanical vibrations for the rehabilitation of neurological patients,” Medical Hypotheses, vol. 143, p. 110078, Oct. 2020, doi: 10.1016/j.mehy.2020.110078.

R. S. Calabrò et al., “Does hand robotic rehabilitation improve motor function by rebalancing interhemispheric connectivity after chronic stroke? Encouraging data from a randomised-clinical-trial,” Clinical Neurophysiology, vol. 130, no. 5, pp. 767–780, May 2019, doi: 10.1016/j.clinph.2019.02.013.

I. Boukhennoufa, X. Zhai, V. Utti, J. Jackson, and K. D. McDonald-Maier, “Wearable sensors and machine learning in post-stroke rehabilitation assessment: A systematic review,” Biomedical Signal Processing and Control, vol. 71, p. 103197, Jan. 2022, doi: 10.1016/j.bspc.2021.103197.

K. Nuckols et al., “Proof of Concept of Soft Robotic Glove for Hand Rehabilitation in Stroke Survivors,” Archives of Physical Medicine and Rehabilitation, vol. 100, no. 12, p. e195, Dec. 2019, doi: 10.1016/j.apmr.2019.10.099.

C. Proulx, D. Gagnon, and J. Higgins, “Perceived Usability and Acceptability of a Soft Robotic Glove for Rehabilitation of Adults With Hand Hemiparesis: A Mixed-Method Study Among Occupational Therapists in Stroke Rehabilitation,” Archives of Physical Medicine and Rehabilitation, vol. 101, no. 11, p. e101, Nov. 2020, doi: 10.1016/j.apmr.2020.09.308.

J. D. Setiawan, M. Ariyanto, S. Nugroho, M. Munadi, and R. Ismail, “A Soft Exoskeleton Glove Incorporating Motor-Tendon Actuator for Hand Movements Assistance,” International Review of Automatic Control (IREACO), vol. 13, no. 1, p. 1, Jan. 2020, doi: 10.15866/ireaco.v13i1.18274.

B. B. Kang, H. Choi, H. Lee, and K.-J. Cho, “Exo-Glove Poly II: A Polymer-Based Soft Wearable Robot for the Hand with a Tendon-Driven Actuation System,” Soft Robotics, vol. 6, no. 2, pp. 214–227, Apr. 2019, doi: 10.1089/soro.2018.0006.

C.-Y. Chu and R. M. Patterson, “Soft robotic devices for hand rehabilitation and assistance: a narrative review,” Journal of NeuroEngineering and Rehabilitation, vol. 15, no. 1, Feb. 2018, doi: 10.1186/s12984-018-0350-6.

P. Tran, S. Jeong, S. L. Wolf, and J. P. Desai, “Patient-Specific, Voice-Controlled, Robotic FLEXotendon Glove-II System for Spinal Cord Injury,” IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 898–905, Apr. 2020, doi: 10.1109/lra.2020.2965900.

A. Dwivedi, L. Gerez, W. Hasan, C.-H. Yang, and M. Liarokapis, “A Soft Exoglove Equipped With a Wearable Muscle-Machine Interface Based on Forcemyography and Electromyography,” IEEE Robotics and Automation Letters, vol. 4, no. 4, pp. 3240–3246, Oct. 2019, doi: 10.1109/lra.2019.2925302.

A. Foroutannia, M.-R. Akbarzadeh-T, and A. Akbarzadeh, “A deep learning strategy for EMG-based joint position prediction in hip exoskeleton assistive robots,” Biomedical Signal Processing and Control, vol. 75, p. 103557, May 2022, doi: 10.1016/j.bspc.2022.103557.

F. Wang, Y. Chen, Y. Wang, Z. Liu, Y. Tian, and D. Zhang, “A soft pneumatic glove with multiple rehabilitation postures and assisted grasping modes,” Sensors and Actuators A: Physical, vol. 347, p. 113978, Nov. 2022, doi: 10.1016/j.sna.2022.113978.

Q. Liu, J. Zuo, C. Zhu, and S. Q. Xie, “Design and control of soft rehabilitation robots actuated by pneumatic muscles: State of the art,” Future Generation Computer Systems, vol. 113, pp. 620–634, Dec. 2020, doi: 10.1016/j.future.2020.06.046.

M. V. M. Neves, L. Furlan, F. Fregni, L. R. Battistella, and M. Simis, “Robotic-Assisted Gait Training (RAGT) in Stroke Rehabilitation: A Pilot Study,” Archives of Rehabilitation Research and Clinical Translation, vol. 5, no. 1, p. 100255, Mar. 2023, doi: 10.1016/j.arrct.2023.100255.

P. Caliandro et al., “Exoskeleton-assisted gait in chronic stroke: An EMG and functional near-infrared spectroscopy study of muscle activation patterns and prefrontal cortex activity,” Clinical Neurophysiology, vol. 131, no. 8, pp. 1775–1781, Aug. 2020, doi: 10.1016/j.clinph.2020.04.158.

T. Triwiyanto, S. Luthfiyah, I. Putu Alit Pawana, A. Ali Ahmed, and A. Andrian, “Bilateral mode exoskeleton for hand rehabilitation with wireless control using 3D printing technology based on IMU sensor,” HardwareX, vol. 14, p. e00432, Jun. 2023, doi: 10.1016/j.ohx.2023.e00432.

D. Mulfari, G. Meoni, M. Marini, and L. Fanucci, “Towards a Deep Learning Based ASR System for Users with Dysarthria,” Computers Helping People with Special Needs, pp. 554–557, 2018, doi: 10.1007/978-3-319-94277-3_86.

Y.-Y. Lin et al., “A Speech Command Control-Based Recognition System for Dysarthric Patients Based on Deep Learning Technology,” Applied Sciences, vol. 11, no. 6, p. 2477, Mar. 2021, doi: 10.3390/app11062477.

M. S. Yakoub, S. Selouani, B.-F. Zaidi, and A. Bouchair, “Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network,” EURASIP Journal on Audio, Speech, and Music Processing, vol. 2020, no. 1, Jan. 2020, doi: 10.1186/s13636-019-0169-5.

B. Vachhani, C. Bhat, and S. K. Kopparapu, “Data Augmentation Using Healthy Speech for Dysarthric Speech Recognition,” Proc. Interspeech 2018, pp. 471-475, Sep. 2018, doi: 10.21437/interspeech.2018-1751.

S. R. Shahamiri, “Speech Vision: An End-to-End Deep Learning-Based Dysarthric Automatic Speech Recognition System,” IEEE Transactions on Neural Systems and Rehabilitation Engineering, vol. 29, pp. 852–861, 2021, doi: 10.1109/tnsre.2021.3076778.

A. A. Joshy and R. Rajan, “Automated Dysarthria Severity Classification: A Study on Acoustic Features and Deep Learning Techniques,” IEEE Transactions on Neural Systems and Rehabilitation Engineering, vol. 30, pp. 1147–1157, 2022, doi: 10.1109/tnsre.2022.3169814.

W. Ye, Z. Jiang, Q. Li, Y. Liu, and Z. Mou, “A hybrid model for pathological voice recognition of post-stroke dysarthria by using 1DCNN and double-LSTM networks,” Applied Acoustics, vol. 197, p. 108934, Aug. 2022, doi: 10.1016/j.apacoust.2022.108934.

B. A. D. la C. Sánchez, M. A. Montiel, and E. L. González, “EMG-controlled hand exoskeleton for assisted bilateral rehabilitation,” Biocybernetics and Biomedical Engineering, vol. 42, no. 2, pp. 596–614, Apr. 2022, doi: 10.1016/j.bbe.2022.04.001.

F. Putri, W. Caesarendra, E. D. Pamanasari, M. Ariyanto, and J. D. Setiawan, “Parkinson Disease Detection Based on Voice and EMG Pattern Classification Method for Indonesian Case Study,” Journal of Energy, Mechanical, Material and Manufacturing Engineering, vol. 3, no. 2, p. 87, Dec. 2018, doi: 10.22219/jemmme.v3i2.6977.

J. Hou, Y. Shi, M. Ostendorf, M.-Y. Hwang, and L. Xie, “Region Proposal Network Based Small-Footprint Keyword Spotting,” IEEE Signal Processing Letters, vol. 26, no. 10, pp. 1471–1475, Oct. 2019, doi: 10.1109/lsp.2019.2936282.

A. Ghandoura, F. Hjabo, and O. A. Dakkak, “Building and benchmarking an Arabic Speech Commands dataset for small-footprint keyword spotting,” Engineering Applications of Artificial Intelligence, vol. 102, p. 104267, Jun. 2021, doi: 10.1016/j.engappai.2021.104267.

R. Bhalley, “TensorFlow Basics,” Deep Learning with Swift for TensorFlow, pp. 143–169, 2021, doi: 10.1007/978-1-4842-6330-3_4.

V. J. Reddi et al., “Widening Access to Applied Machine Learning with TinyML,” Harvard Data Science Review, Jan. 2022, doi: 10.1162/99608f92.762d171a.

A. M. Rostami, A. Karimi, and M. A. Akhaee, “Keyword spotting in continuous speech using convolutional neural network,” Speech Communication, vol. 142, pp. 15–21, Jul. 2022, doi: 10.1016/j.specom.2022.06.001.

E. van der Westhuizen, H. Kamper, R. Menon, J. Quinn, and T. Niesler, “Feature learning for efficient ASR-free keyword spotting in low-resource languages,” Computer Speech & Language, vol. 71, p. 101275, Jan. 2022, doi: 10.1016/j.csl.2021.101275.

L. Liu, M. Yang, X. Gao, Q. Liu, Z. Yuan, and J. Zhou, “Keyword spotting techniques to improve the recognition accuracy of user-defined keywords,” Neural Networks, vol. 139, pp. 237–245, Jul. 2021, doi: 10.1016/j.neunet.2021.03.012.

S. Cai et al., “A Voice-Activated Switch for Persons with Motor and Speech Impairments: Isolated-Vowel Spotting Using Neural Networks,” Proc. Interspeech 2021, pp. 4823-4827, Aug. 2021, doi: 10.21437/interspeech.2021-330.

K. Dokic, D. Mandusic, and B. Radisic, “Analysis of ESP32 SoC for Feed-Forward Neural Network Applications,” Innovation in Information Systems and Technologies to Support Learning Research, pp. 165–175, Dec. 2019, doi: 10.1007/978-3-030-36778-7_18.

M. Z. H. Zim, “TinyML: analysis of sekar Xtensa LX6 microprocessor for neural network applications by ESP32 SoC,” Machine Learning, Jun. 2021, doi: arXiv:2106.10652.

R. S. Iborra and A. F. Skarmeta, “TinyML-Enabled Frugal Smart Objects: Challenges and Opportunities,” IEEE Circuits and Systems Magazine, vol. 20, no. 3, pp. 4–18, Aug. 2020, doi: 10.1109/mcas.2020.3005467.

S. Asutkar, C. Chalke, K. Shivgan, and S. Tallur, “TinyML-enabled edge implementation of transfer learning framework for domain generalization in machine fault diagnosis,” Expert Systems with Applications, vol. 213, p. 119016, Mar. 2023, doi: 10.1016/j.eswa.2022.119016.

M. M. Shibl, L. S. Ismail, and A. M. Massoud, “A machine learning-based battery management system for state-of-charge prediction and state-of-health estimation for unmanned aerial vehicles,” Journal of Energy Storage, vol. 66, p. 107380, Aug. 2023, doi: 10.1016/j.est.2023.107380.

P. P. Ray, “A review on TinyML: State-of-the-art and prospects,” Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 4, pp. 1595–1623, Apr. 2022, doi: 10.1016/j.jksuci.2021.11.019.

H. Rahman et al., “IoT enabled mushroom farm automation with Machine Learning to classify toxic mushrooms in Bangladesh,” Journal of Agriculture and Food Research, vol. 7, p. 100267, Mar. 2022, doi: 10.1016/j.jafr.2021.100267.

D. M. Matilla, Á. L. Murciego, D. M. J. Bravo, A. S. Mendes, and V. R. Q. Leithardt, “Low-cost Edge Computing devices and novel user interfaces for monitoring pivot irrigation systems based on Internet of Things and LoRaWAN technologies,” Biosystems Engineering, vol. 223, pp. 14–29, Nov. 2022, doi: 10.1016/j.biosystemseng.2021.07.010.

Lu, Xugang, Sheng Li, and M. Fujimoto, “Automatic speech recognition,” Speech-to-speech translation, pp. 21-38, 2020.

M. Yankayiş, “Performance Evaluation of Feature Extraction and Modeling Methods for Speaker Recognition,” Annals of Reviews & Research, vol. 4, no. 3, Nov. 2018, doi: 10.19080/arr.2018.04.555639.

B. Ustubioglu, G. Tahaoglu, and G. Ulutas, “Detection of audio copy-move-forgery with novel feature matching on Mel spectrogram,” Expert Systems with Applications, vol. 213, p. 118963, Mar. 2023, doi: 10.1016/j.eswa.2022.118963.

G. Parisi, A. Coluccia, and A. Fascista, “On time-frequency correlation in spectrogram samples with application to target detection,” Signal Processing, vol. 200, p. 108648, Nov. 2022, doi: 10.1016/j.sigpro.2022.108648.

S. Jothimani and K. Premalatha, “MFF-SAug: Multi feature fusion with spectrogram augmentation of speech emotion recognition using convolution neural network,” Chaos, Solitons & Fractals, vol. 162, p. 112512, Sep. 2022, doi: 10.1016/j.chaos.2022.112512.

D. Kim and J. Lee, “Predictive evaluation of spectrogram-based vehicle sound quality via data augmentation and explainable artificial Intelligence: Image color adjustment with brightness and contrast,” Mechanical Systems and Signal Processing, vol. 179, p. 109363, Nov. 2022, doi: 10.1016/j.ymssp.2022.109363.

T. Alam and A. Khan, “Lightweight CNN for Robust Voice Activity Detection,” Lecture Notes in Computer Science, pp. 1–12, 2020, doi: 10.1007/978-3-030-60276-5_1.

Sutikno, K. Anam, and A. Saleh, “Voice Controlled Wheelchair for Disabled Patients based on CNN and LSTM,” 2020 4th International Conference on Informatics and Computational Sciences (ICICoS), pp. 1-5, Nov. 2020, doi: 10.1109/icicos51170.2020.9299007.

J. Kwon and D. Park, “Hardware/Software Co-Design for TinyML Voice-Recognition Application on Resource Frugal Edge Devices,” Applied Sciences, vol. 11, no. 22, p. 11073, Nov. 2021, doi: 10.3390/app112211073.

A. Suryarasmi, C. -C. Chang, R. Akhmalia, M. Marshallia, W. -J. Wang, and D. Liang, “FN-Net: A lightweight CNN-based architecture for fabric defect detection with adaptive threshold-based class determination,” Displays, vol. 73, p. 102241, Jul. 2022, doi: 10.1016/j.displa.2022.102241.

C. Chen, H. Seo, and Y. Zhao, “A novel pavement transverse cracks detection model using WT-CNN and STFT-CNN for smartphone data analysis,” International Journal of Pavement Engineering, vol. 23, no. 12, pp. 4372–4384, Jun. 2021, doi: 10.1080/10298436.2021.1945056.

S. Duan, H. Zheng, and J. Liu, “A Novel Classification Method for Flutter Signals Based on the CNN and STFT,” International Journal of Aerospace Engineering, vol. 2019, pp. 1–8, Apr. 2019, doi: 10.1155/2019/9375437.

J. Huang, B. Chen, B. Yao, and W. He, “ECG Arrhythmia Classification Using STFT-Based Spectrogram and Convolutional Neural Network,” IEEE Access, vol. 7, pp. 92871–92880, 2019, doi: 10.1109/access.2019.2928017.

S. M. Beeraka, A. Kumar, M. Sameer, S. Ghosh, and B. Gupta, “Accuracy Enhancement of Epileptic Seizure Detection: A Deep Learning Approach with Hardware Realization of STFT,” Circuits, Systems, and Signal Processing, vol. 41, no. 1, pp. 461–484, Jul. 2021, doi: 10.1007/s00034-021-01789-4.

A. Pandey and D. Wang, “A New Framework for CNN-Based Speech Enhancement in the Time Domain,” in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 27, no. 7, pp. 1179-1188, July 2019, doi: 10.1109/TASLP.2019.2913512.

Y. Lee, H. J. Park, I. H. Bae, and G. Kim, “Resonance Characteristics in Epiglottic Cyst: Formant Frequency, Vowel Space Area, Vowel Articulatory Index, and Formant Centralization Ratio,” Journal of Voice, Oct. 2021, doi: 10.1016/j.jvoice.2021.09.008.

P. Warden, “Speech commands: A dataset for limited-vocabulary speech recognition,” arXiv preprint arXiv:1804.03209, Apr. 2018, doi: https://doi.org/10.48550/arXiv.1804.03209.

M. M. Goodwin, “The STFT, Sinusoidal Models, and Speech Modification,” Springer Handbook of Speech Processing, pp. 229–258, 2008, doi: 10.1007/978-3-540-49127-9_12.

S. A. Alim and N. K. A. Rashid, Some Commonly Used Speech Feature Extraction Algorithms. London, UK: IntechOpen, 2018.

X. Wang, T. Ying, and W. Tian, “Spectrum Representation Based on STFT,” 2020 13th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), pp. 435-438, 2020, doi: 10.1109/CISP-BMEI51763.2020.9263516.

J. Benesty, J. Chen, and E. A. P. Habets. Speech enhancement in the STFT domain. Springer Science & Business Media, 2011.

L. Rabiner and B. W. Juang. Fundamentals of speech recognition. Prentice-Hall, Inc., 1993.

N. Kehtarnavaz, “Frequency Domain Processing,” Digital Signal Processing System Design, pp. 175–196, 2008, doi: 10.1016/b978-0-12-374490-6.00007-6.

K. Bhangale and K. Mohanaprasad, “Speech Emotion Recognition Using Mel Frequency Log Spectrogram and Deep Convolutional Neural Network,” Lecture Notes in Electrical Engineering, pp. 241–250, Oct. 2021, doi 10.1007978-981-16-4625-6_24.0

M. Loughlin, Z. Xie, Y. Song, H. Phan, and R. Palaniappan, “Time–Frequency Feature Fusion for Noise Robust Audio Event Classification,” Circuits, Systems, and Signal Processing, vol. 39, no. 3, pp. 1672–1687, Jul. 2019, doi: 10.1007/s00034-019-01203-0.

M. T. Nguyen, W. W. Lin, and J. H. Huang, “Heart Sound Classification Using Deep Learning Techniques Based on Log-mel Spectrogram,” Circuits, Systems, and Signal Processing, vol. 42, no. 1, pp. 344–360, Aug. 2022, doi: 10.1007/s00034-022-02124-1.

H. Xu, J. Zhang, and L. Dai, “Differential Time-frequency Log-mel Spectrogram Features for Vision Transformer Based Infant Cry Recognition,” Proc. Interspeech 2022, pp. 1963-1967, Sep. 2022, doi: 10.21437/interspeech.2022-18.

D. Gao, X. Tang, M. Wan, G. Huang, and Y. Zhang, “EEG driving fatigue detection based on log-Mel spectrogram and convolutional recurrent neural networks,” Frontiers in Neuroscience, vol. 17, Mar. 2023, doi: 10.3389/fnins.2023.1136609.

M. M. Oo and L. L. Oo, “Fusion of Log-Mel Spectrogram and GLCM Feature in Acoustic Scene Classification,” Studies in Computational Intelligence, pp. 175–187, Jul. 2019, doi: 10.1007/978-3-030-24344-9_11.

H. Meng, T. Yan, F. Yuan, and H. Wei, “Speech Emotion Recognition From 3D Log-Mel Spectrograms With Deep Learning Network,” IEEE Access, vol. 7, pp. 125868–125881, 2019, doi: 10.1109/access.2019.2938007.

Z. Diao, J. Yan, Z. He, S. Zhao, and P. Guo, “Corn Seedling Recognition Algorithm Based on Hyperspectral Image and Lightweight-3d-Cnn,” SSRN Electronic Journal, 2022, doi: 10.2139/ssrn.4162664.

O. Attallah, “CerCan·Net: Cervical cancer classification model via multi-layer feature ensembles of lightweight CNNs and transfer learning,” Expert Systems with Applications, vol. 229, p. 120624, Nov. 2023, doi: 10.1016/j.eswa.2023.120624.

J. Yang, L. Zhang, X. Tang, and M. Han, “CodnNet: A lightweight CNN architecture for detection of COVID-19 infection,” Applied Soft Computing, vol. 130, p. 109656, Nov. 2022, doi: 10.1016/j.asoc.2022.109656.

H. I. Hussein, A. O. Mohammed, M. M. Hassan, and R. J. Mstafa, “Lightweight deep CNN-based models for early detection of COVID-19 patients from chest X-ray images,” Expert Systems with Applications, vol. 223, p. 119900, Aug. 2023, doi: 10.1016/j.eswa.2023.119900.

Y. Wang, S. Li, H. Zhang, and T. Liu, “A lightweight CNN-based model for early warning in sow oestrus sound monitoring,” Ecological Informatics, vol. 72, p. 101863, Dec. 2022, doi: 10.1016/j.ecoinf.2022.101863.

K. Sanjar, A. Rehman, A. Paul, and K. JeongHong, “Weight Dropout for Preventing Neural Networks from Overfitting,” 2020 8th International Conference on Orange Technology (ICOT), Dec. 2020, doi: 10.1109/icot51877.2020.9468799.

L. Li and M. Spratling, “Understanding and combating robust overfitting via input loss landscape analysis and regularization,” Pattern Recognition, vol. 136, p. 109229, Apr. 2023, doi: 10.1016/j.patcog.2022.109229.

O. İrsoy and E. Alpaydın, “Dropout regularization in hierarchical mixture of experts,” Neurocomputing, vol. 419, pp. 148–156, Jan. 2021, doi: 10.1016/j.neucom.2020.08.052.

S. H. Khan, M. Hayat, and F. Porikli, “Regularization of deep neural networks with spectral dropout,” Neural Networks, vol. 110, pp. 82–90, Feb. 2019, doi: 10.1016/j.neunet.2018.09.009.

Q. K. Pham, T. V. Vo, and P. T. Tran, “On the Implementation of a Low-Cost Mind-Voice-and-Gesture-Controlled Humanoid Robotic Arm Using Leap Motion and Neurosky Sensor,” Journal of Electrical Engineering & Technology, vol. 17, no. 1, pp. 665–683, Sep. 2021, doi: 10.1007/s42835-021-00903-5.

W. Batayneh, E. Abdulhay, and M. Alothman, “Comparing the efficiency of artificial neural networks in sEMG-based simultaneous and continuous estimation of hand kinematics,” Digital Communications and Networks, vol. 8, no. 2, pp. 162–173, Apr. 2022, doi: 10.1016/j.dcan.2021.08.002.

J. Ramirez, A. Rubiano, and P. Castiblanco, “Soft Driving Epicyclical Mechanism for Robotic Finger,” Actuators, vol. 8, no. 3, p. 58, Jul. 2019, doi: 10.3390/act8030058.

J. Park, I. Hwang, and W. Lee, “Wearable Robotic Glove Design Using Surface-Mounted Actuators,” Frontiers in Bioengineering and Biotechnology, vol. 8, Sep. 2020, doi: 10.3389/fbioe.2020.548947.

D. Kim et al., “Eyes are faster than hands: A soft wearable robot learns user intention from the egocentric view,” Science Robotics, vol. 4, no. 26, Jan. 2019, doi: 10.1126/scirobotics.aav2949.

J. Shor et al., “Personalizing ASR for Dysarthric and Accented Speech with Limited Data,” Proc. Interspeech 2019, pp. 784-788, Sep. 2019, doi: 10.21437/interspeech.2019-1427.

A. Jalali, R. Mallipeddi, and M. Lee, “Sensitive deep convolutional neural network for face recognition at large standoffs with small dataset,” Expert Systems with Applications, vol. 87, pp. 304-315, 2017, doi: 10.1016/j.eswa.2017.06.025.

E. Li, L. Wang, Q. Xie, R. Gao, Z. Su, and Y. Li, “A novel deep learning method for maize disease identification based on small sample-size and complex background datasets,” Ecological Informatics, vol. 75, p. 102011, Jul. 2023, doi: 10.1016/j.ecoinf.2023.102.




DOI: https://doi.org/10.18196/jrc.v4i4.15918

Refbacks

  • There are currently no refbacks.


Copyright (c) 2023 Bambang Riyanta, Henry Ardian Irianta, Berli Paripurna Kamiel

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

 


Journal of Robotics and Control (JRC)

P-ISSN: 2715-5056 || E-ISSN: 2715-5072
Organized by Peneliti Teknologi Teknik Indonesia
Published by Universitas Muhammadiyah Yogyakarta in collaboration with Peneliti Teknologi Teknik Indonesia, Indonesia and the Department of Electrical Engineering
Website: http://journal.umy.ac.id/index.php/jrc
Email: jrcofumy@gmail.com


Kuliah Teknik Elektro Terbaik