Repository logo
 

Medication adherence classification for non-communicable disease patients through machine learning approaches = Ukuhambisana nendlela yokuthatha imithi ngokwezigaba zeziguli ezinezifo ezingathelelani ngokuthobela kwazo imithetho yemithi kusetshenziswa izindlela zokufunda zomshini.

dc.contributor.advisorEzugwu, Absalom El-Shamir.
dc.contributor.advisorFonou-Dombeu, Jean Vincent.
dc.contributor.authorKanyongo, Wellington.
dc.date.accessioned2025-01-15T18:37:43Z
dc.date.available2025-01-15T18:37:43Z
dc.date.created2024
dc.date.issued2024
dc.descriptionDoctoral Degree. University of KwaZulu-Natal, Pietermaritzburg.
dc.description.abstractMedication non-adherence is a significant public health issue that results in poor treatment outcomes, unnecessary hospitalisations, increased healthcare expenses, and increased risks of morbidity, disability, and mortality. According to the World Health Organisation, only 50% of people with chronic diseases adhere to treatment recommendations, despite receiving guidance on the need for medication adherence (MA). Notwithstanding considerable endeavours to tackle this concern, traditional methods have demonstrated low efficacy in predicting and intervening in non-adherent behaviours. In recent years, the advent of Fourth Industrial Revolution (4IR) technologies, such as artificial intelligence (AI) and machine learning (ML) has brought about promising solutions to MA problems. This study developed seven ML models (SVM, KNN, DT, Naïve Bayes, DNN, LR, and RF) to classify MA in a dataset of diabetes and hypertension patients in Zimbabwe, a developing country. The thesis's full structure includes numerous studies chronologically arrayed as follows: (1) an expansive systematic literature review on ML approaches to MA analytics among NCD (NCD) patients; (2) feature selection and the importance of predictors of NCDs MA from ML research perspectives; (3) the generation of a novel stateof-the-art dataset entitled "Data Wrangling and Generation for ML Models in MA Analytics: A Practical Standpoint Using Patient-Level and Medical Claims Data"; (4) development and validation of seven ML models, and (5) development of twinned machine learning algorithms for MA analytics in patients with diabetes and hypertension. The relevant data for diabetics and hypertensives were filtered from the initial raw dataset based on the International Classification of Diseases 10th Revision Code (ICD-10). Since adherence metrics measure the percentage of patients covered by medication refill claims for the same medication in the same therapeutic class, the created dataset only included medication refill data and patient-level data for 2022, from January 1 to December 31, 2022. Non-adherence was defined as medication refills that accounted for less than 75% of the projected 12-month claims, whereas adherence required at least 75% of the refills. This study acknowledges the absence of a universally accepted compendium of data wrangling (DW) steps while developing the novel dataset. However, it accepts certain building blocks and functionalities with significant overlaps as typical DW elements. In light of this, the study employs a pragmatic approach to dataset generation, encompassing nine essential DW building blocks and tasks that are employed iteratively and incrementally, comparable to an agile approach to dataset generation. An ML model was then developed using the generated dataset to evaluate the novel dataset's soundness, and its 81% accuracy demonstrated a generally good level of performance. During the ML model development process, the study used an RF feature importance mechanism to identify the important variables needed for building MA prediction models. The study also produced Shapley Additive Explanations (SHAP) beeswarm plots to aid in the interpretation of the model outputs and evaluate the contribution of the features to the prediction process. These factors included the annual quantity of medical supplies, the annual claim amount, the patient's age, their membership in a wellness programme, the medical aid cover, their contribution to the cover, comorbidity, diagnosis, the type of hospital cover, the occurrence of complications, gender, and the type of medical aid scheme. The total number of medical supplies dispensed annually was revealed as the most important predictor of MA. The ML classifiers had a classification accuracy of 84.7% to 87.6%, with AUC values ranging from 0.857 to 0.934. The most robust classifier was the RF, an ensemble learning technique with 87.6% accuracy, an AUC of 0.9351, and superior precision, recall, and F1-score. The hybridization of classical SVM with tree-based ML algorithms (RF and DT) demonstrated improved results, with the newly developed TWINNED-SVM-RF and TWINNED-SVMDT classifiers outperforming the standard SVM. Among these, the TWINNED-SVM-RF model showed superior performance. Additionally, the application of LIME (Local Interpretable Model-Agnostic Explanations) enhanced the explainability of the models, providing valuable insights into their decision-making processes. The ensemble learner (RF) and the TWINNED-SVM-RF show promise as prognostic tools for improving MA and determining patient adherence levels. These findings contribute to reducing discrepancies in medication refills and adherence rates among NCD patients. The ML model serves as a foundation for the creation of intelligent MA and intervention apps to promote patient MA among chronic disease patients. The outcomes, techniques, and insights proffered in novel dataset generation inform future researchers, data scientists, and analysts on how to conduct data mining, dataset enrichment, and DW. The dataset generated can spark additional research in areas such as evaluating MA and performing ML tasks such as classifying, predicting, and clustering MA behaviours. Iqoqa. Ukungathobeli imithetho yemithi kuyinkinga enkulu yempilo yomphakathi eholela emiphumeleni yokwelashwa engemihle, ukulaliswa esibhedlela okungenasidingo, ukwanda kwezindleko zokunakekelwa kwempilo, kanye nezingozi zokugula, ukukhubazeka, nokufa. Ngokwabakwa-World Health Organisation, bangama-50% kuphela abantu abanezifo ezingamahlalakhona abathobela okunconyiwe kokwelashwa, naphezu kokuyalwa mayelana nesidingo sokudla imithi (i-MA). Naphezu kwemizamo emikhulu yokubhekana nale nkinga, izindlela zendabuko ziye zabonisa ukusebenza ngezinga eliphansi ekuqageleni nasekungeneleni ekuziphatheni okungahambisani nalokhu. Eminyakeni yamuva nje, ukufika kobuchwepheshe boGuquko Lwesine Lwezimboni (i-4IR), njengobuhlakani bokweziwa (i-AI) nokufunda ngomshini (i-ML) kuze nezixazululo ezithembisayo ezinkingeni ze-MA. Lolu cwaningo lwakha amamodeli ayisi-7 e-ML (i-SVM, i-KNN, i-DT, i-Naïve Bayes, i-DNN, i-LR, ne-RF) ukuze luhlukanise i-MA emaqoqweni emininingo esifo sikashukela neziguli ezinomfutho wegazi ophakeme eZimbabwe, izwe elisathuthuka. Isakhiwo esigcwele sombhalo wocwaningo sihlanganisa izifundo eziningi ngokulandelana kwezikhathi ezihlelwe kanje: (1) ukubukezwa okubanzi kwezincwadi ezihlelekile mayelana nezindlela ze-ML zokuhlaziya i-MA ezigulini ze-NCD (NCD); (2) ukukhethwa kwesici nokubaluleka kwababikezeli be-NCDs, i-MA emibonweni yocwaningo lwe-ML; (3) isizukulwane seqoqo lwemininingo elisha elisezingeni eliphakeme elinesihloko esithi "Data Wrangling and Generation for ML Models in MA Analytics: A Practical Standpoint" Ukusebenzisa Imininingo Yezinga Lesiguli Neyemibono Yezokwelapha; (4) ukusungulwa nokuqinisekiswa kwamamodeli ayisikhombisa e-ML, kanye (5) nokusungulwa kwemithetho ezolandelwa ekufundeni komshini we-MA wokuhlaziya iziguli ezinesifo sikashukela nomfutho wegazi ophakeme. Imininingo efanele yabanesifo sikashukela nomfutho wegazi ophezulu yahlungwa eqoqweni lemininingo yokuqala eyayingakenziwa lutho esekelwe ku-International Classification of Diseases 10th Revision Code (ICD-10). Njengoba izilinganiso zokubambelela zibala ngephesenti leziguli ezibuyayo zizolanda imithi efanayo ekilasini elifanayo lokwelashwa, iqoqo lemininingo elakhiwa lufaka kuphela imininingo yokugcwaliswa kwemithi nemininingo esezingeni lesiguli kwango-2022, kusukela ngoMasingana kuya kuZibandlela. Ukunganamatheli ohlelweni lwemithi kwakubonakala ngokugcwaliswa kwemithi okwakungaphansi kwama-75% kunalokho okwakulindelekile, bese ukunamathela kona kwakusho ukubonakala kokulandwa kwemithi okungaphezu kwama-75%. Lolu cwaningo luvuma ukungabi khona kwemibhalo eyamukelekile ezingeni lomhlaba wonke okuqoqela ndawonye imininingo (i-DW) kube kusungulwa iqoqo lemibuzo elisha. Nokho, lwamukela izinto ezithile zokwakha nezinto ezisebenzayo ezinokugqagqana okuphawuleka njengezinto ezivamile ze-DW. Ngenxa yalokhu, ucwaningo lusebenzisa indlela eqondile yokukhiqizwa kwemininingo, ehlanganisa izinto eziyisishiyagalolunye ezibalulekile zokwakha ze-DW nemisebenzi esetshenziswa ngokuphindaphindiwe nangokwengeziwe, efana nendlela esheshayo yokukhiqizwa kwemininingo. Imodeli ye-ML yabe isisungulwa kusetshenziswa iqoqo lemininingo elikhiqizwayo ukuze kuhlolwe umsindo weqoqo lwemininingo elisha, futhi ukunemba kwayo okungama-81% kubonise izinga elihle lokusebenza. Ngenkathi kuthuthukiswa imodeli ye-ML, ucwaningo lusebenzise indlela yokubaluleka kwesici se-RF ukuze kutholakale izinto eziguquguqukayo ezibalulekile ezidingekayo ekwakheni amamodeli okubikezela e-MA. Ucwaningo luphinde lwakhiqiza i-Shapley Additive Definanations (i-SHAP) beeswarm plots ukuze kusize ekuchazeni imiphumela yemodeli futhi kuhlolwe negalelo lezici enqubweni yokuqagela. Lezi zici kufakwa kuzo inani lonyaka lezinto zokwelapha, inani lokusetshenzisiwe lonyaka, iminyaka yesiguli, ukuba yilungu kohlelo lwempilo enhle, ikhava yosizo lwezempilo, igalelo labo kukhava, izifo ezihlezi emzimbeni ngesikhathi esisodwa, ukuhlonzwa kwesifo, uhlobo lwekhava yesibhedlela, ukuba khona kwezinkinga, ubulili, nohlobo lohlelo losizo lwezempilo. Ingqikithi yenani lezinto zokwelapha ezikhishwa minyaka yonke kwembulwe njengokuyikona okubikezela ngokubaluleke kakhulu kwe-MA. Izigaba ze-ML zazinokunemba kokuhlukaniswa okungama-84.7% kuya kuma-87.6%, okunamanani e-AUC asukela kuma-0.857 kuya kuma-0.934. Isigaba esinamandla kunazo zonke kwakuyi-RF, indlela yokufunda ehlanganisiwe enokunemba okungama-87.6%, i-AUC engama-0.9351, nokunemba okuphezulu, ukukhumbula, kanye ne-F1-score. Ukuhlanganiswa kwe-SVM yakudala enemithetho ye-ML asekelwe ngomumo wesihlahla (i-RF ne-DT) kwaveza imiphumela ethuthukile, ngezigaba ezintsha zemithetho ye-TWINNED-SVM-RF ne-TWINNED-SVM-DT ezidlula i-SVM ejwayelekile. Phakathi kwalokhu, imodeli ye-TWINNED-SVM-RF yabonisa ukusebenza kwezinga eliphezulu. Ngaphezu kwalokho, ukusebenzisa I-LIME (Local Interpretable Model-Agnostic Definanations) kuthuthukise ukuchazwa kwamamodeli, kwanikeza ulwazi olubalulekile enqubweni yokuthathwa kwezinqumo. Iqoqo lokufunda (i-RF) kanye ne-TWINNED-SVMRF kubonisa isithembiso njengamathuluzi okubikezela okuthuthukisa i-MA nokunquma amazinga okuthobela imithetho yemithi kweziguli. Lokhu okutholakele kunomthelela ekunciphiseni ukungafani ekugcwaliseni kabusha imithi kanye namanani okunamathela emithini kweziguli ze-NCD. Imodeli ye-ML isebenza njengesisekelo sokusungulwa kwe-MA ehlakaniphile nezinhlelo zama-ephu ukugqugquzela i-MA yesiguli phakathi kweziguli ezinezifo ezingamahlalakhona. Imiphumela, amasu, nemibono eboniswe ekukhiqizweni kwemininingo emisha yazisa abacwaningi besikhathi esizayo, ososayensi bemininingo, nabahlaziyi ngendlela yokuqoqa imininingo, ukucebisa iqoqo lemininingo, kanye ne-DW. Iqoqo lemininingo eqoqiwe lingavusa ucwaningo olwengeziwe ezindaweni ezinjengokuhlola i-MA nokwenza imisebenzi ye-ML enjengokuhlukanisa ngokwezigaba, ukuqagela, nokubeka ngononina ngokuhambisanayo nokuziphatha ngokwe-MA.
dc.identifier.urihttps://hdl.handle.net/10413/23561
dc.language.isoen
dc.subject.otherMedication adherence.
dc.subject.otherMachine learning.
dc.subject.otherNon-communicable diseases.
dc.subject.otherDiabetes.
dc.subject.otherHypertension.
dc.titleMedication adherence classification for non-communicable disease patients through machine learning approaches = Ukuhambisana nendlela yokuthatha imithi ngokwezigaba zeziguli ezinezifo ezingathelelani ngokuthobela kwazo imithetho yemithi kusetshenziswa izindlela zokufunda zomshini.
dc.typeThesis
local.sdgSDG3 SDG9 SDG12

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kanyongo_Wellington_2024.pdf
Size:
35.46 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.64 KB
Format:
Item-specific license agreed upon to submission
Description: