Monday, September 16, 2024

Data Analysis for Accuracy of Provisional Diagnosis in Healthcare

 

Title Picture Reference - AI Generated Image at website - https://designer.microsoft.com/image-creator

Summary of the Blogpost – This blogpost is an attempt to illustrate an idea for doing performance assessment of healthcare facilities based on accuracy of provisional diagnosis for a given disease. To do this, the blogpost relies on the HIMS (Hospital Information Management System) data captured across the facilities (which are taken into consideration for comparison of performance). It is suggested that data for symptoms, provisional diagnosis and final diagnosis for facilities for a given time period can be subjected to Confusion Matrix for calculation of respective Performance of Facilities (represented through Sensitivity and Specificity for Provisional Diagnosis for the disease). It is further suggested that relative performance of facilities for accuracy of Provisional Diagnosis can be plotted on a S – S (Sensitivity and Specificity) Cartesian for classification of facilities. However, the analysis presents just a possible idea based on imaginary data as real data is not available to test the presented idea. The blogpost is written in the form of questions and answers.

- - - - - - - - - - - - 


What is Health Management?

In brief, health management provides guidance and leadership to administer health at the individual, organizational and systemic level. Health management embraces a holistic vision of health, in which health is impacted by behavioral, social, and environmental determinants. (Further details are available under APPENDIX – C of this blogpost)

 

What is the role of Information Technology in Health Management?

HMS (Hospital Management System) or HIMS (Hospital Information Management System) provides the underpinnings for decision-making and has four key functions: data generation, compilation, analysis and synthesis, and communication and use. (Further details are available under APPENDIX – C of this blogpost)

 

Why should data be analyzed for Provisional and Final Diagnosis?

Provisional diagnosis (PD) is the first considered diagnosis which initiates the first phase of management, whereas confirmatory or final diagnosis (FD) is the chronological organization and critical evaluation of information obtained from the history, physical examination, and investigations. (NCBI).

A technical series published by World Health Organization (WHO) in 2016 states that the most important task performed by primary care providers is diagnosis (World Health Organization, 2016).

Research shows the existence of gaps between provisional and final diagnosis in both private and government institutions. Both knowledge and skill gaps were evident in caregivers and gap in documentation was existent in medical records. (Chatterjee S, Ray K, Das AK. Gap analysis between provisional diagnosis and final diagnosis in government and private teaching hospitals: A record-linked comparative study. J Family Med Prim Care. 2016 Jul-Sep;5(3):637-640. doi: 10.4103/2249-4863.197318. PMID: 28217597; PMCID: PMC5290774.). Matching of provisional diagnosis with discharge diagnosis (final diagnosis) with greater accuracy and lesser number of investigations can lead to greater patient satisfaction along with lesser burden on health resources of the state. Accurate provisional diagnosis in the Emergency Department (ED) is considered to be more important as it has a significant impact on safety (Adnan & Baharuddin "A Study on the Diagnostic Discrepancy between Admission and Discharge in Hospital Universiti Sains Malaysia" January 2021 Malaysian Journal of Medicine and Health Sciences 17(1):105-110 17(1):105-110).

Historically, diagnosis relied more on symptoms. With the development of systematic methods of clinical examination more reliance was placed on signs. However major breakthroughs in diagnosis were achieved when medical technology provided a wide array of accurate and reliable laboratory and radiological investigations. (Chattopadhyay, Amitabha, et al. "Gap analysis between provisional diagnosis on admission and final diagnosis during discharge–A comparative study." J Dent Med Sci 8 (2013): 28-31.). A brief on Symptoms and Signs can be accessed under APPENDIX - A for ready reference.

Analysis of HIMS/HMS generated (non-repudiable) data (mined from large sample of anonymized patient records) for (disease wise) matching of provisional and final diagnosis done at facility level may be helpful in many respects, like: -

1.      Improving Differential Diagnosis - Differential diagnosis is defined as the process of differentiating between the probability of one disease versus that of other diseases with similar symptoms that could possibly account for illness in a patient. (Refer - Deepti Lamba, William H. Hsu, Majed Alsadhan, Chapter 1 - Predictive analytics and machine learning for medical informatics)

2.      Observing Changing Signs and Symptoms of Diseases – Manifestations of diseases  may keep on changing with changing times (Bennett Lorber, Changing patterns of infectious diseases, The American Journal of Medicine, Volume 84, Issue 3, Part 2, 1988, ISSN 0002-9343). Such changes may happen due to numerous factors like (say) changing lifestyles of a population or (say) mutating pathogens. The results of such changes could be observed as variations in symptoms and in signs associated with diseases.

3.      Bringing Consistency in Diagnosis - Detailed analysis of mismatch in provisional and final diagnosis across a large set of facilities may (in some cases) help in bringing consistency in diagnosis of some of the diseases. In general, conducting such an analysis periodically could even lead to improvement in quality of diagnosis in a given set of facilities.

4.      Developing a better understanding on the way Patients express Symptoms of a Disease to Care-Givers – Usually, patients come to a facility from extremely diverse backgrounds (say in terms of languages spoken, education levels, religious and social beliefs) and therefore may have different ways to articulate medical complains. These complaints become the basis for recording of the symptoms. Quality of (interactive) communication between a patient and clinical caregiver (or say the doctor) may be critical for improving accuracy of symptoms narrated. These symptoms (and observed signs) in-turn become the basis for provisional diagnosis by the doctor (or an authorized clinical professional).

A possible cause of poor matching of provisional diagnosis with the final diagnosis could be poor quality of interactive communication between patient and doctor. This aspect may get attention and most likely get improved with frequent assessment of gap in provisional and final diagnosis.

 

Can you suggest a way to do an analysis for Provisional and Final Diagnosis?

Let us try to develop an idea to do an analysis of provisional and final diagnosis from data captured through HIMS. In this analysis, we will try to apply Confusion Matrix to assess healthcare facilities for correct provisional diagnosis of a given disease. This assessment will be done for data collated during a fixed period of time across a given set of facilities with an assumption that all other related parameters remain the same for these facilities.

It is assumed that the following data is available for a given facility, which has implemented HIMS -  a) symptoms reported by patients for a disease at triaging, b) the provisional diagnosis (likely-positive (likely to have a disease) or likely-negative (not likely to have a disease)) and c) the final diagnosis (positive or negative) for the disease. In this analysis we will analyze capability of the facility to be correct in doing provisional diagnosis for the disease. This capability of the facility will be expressed in terms of two parameters Sensitivity and Specificity (described later in this blogpost). Once capability of a facility is determined, the same will be repeated for a given set of facilities. Thus, relative capabilities of facilities chosen for analysis can be determined (in terms of respective Sensitivity and Specificity values).

A Confusion Matrix is commonly used in assessment of accuracy and reliability of outputs of Machine Learning models. Essentially, a confusion matrix is a matrix that summarizes the performance of a machine learning model on a set of test data. It is a means of displaying the number of accurate and inaccurate instances based on the model’s predictions. It is often used to measure the performance of classification models, which aim to predict a categorical label for each input instance. (geeksforgeeks.org). In table – 1, a confusion matrix is prepared by matching data of Provisional Diagnosis with corresponding data for the Final Diagnosis.

The matrix displays the number of instances produced by the model on the test data. These instances in reference to table – 1 (representing our idea of analysis) are as under (geeksforgeeks.org): -

1.      True Positive (TP): The model correctly predicted a positive outcome (the actual outcome was positive). This is represented as ‘a’ in Table – 1 in reference to this analysis.

2.      True Negative (TN): The model correctly predicted a negative outcome (the actual outcome was negative). This is represented as ‘d’ in Table – 1 in reference to this analysis.

3.      False Positive (FP): The model incorrectly predicted a positive outcome (the actual outcome was negative). Also known as a Type I error. This is represented as ‘b’ in Table – 1 in reference to this analysis.

4. False Negative (FN): The model incorrectly predicted a negative outcome (the actual outcome was positive). Also known as a Type II error. This is represented as ‘c’ in Table – 1 in reference to this analysis.


Table – 1: Confusion Matrix for Provisional Diagnosis and Final Diagnosis in Healthcare


We will apply the confusion matrix (as above in table-1) to the sample of data available for a critical disease from a given set of facilities (let us take 15 imaginary facilities and therefore 15 corresponding Matrix with corresponding values of TP, FP, FN and TN).  In context of India, the critical disease could be any of the diseases with high prevalence (Communicable or Non-Communicable). However, as an illustrative example, we will take Tuberculosis (because, India accounts for about 25% of global TB burden, with an estimated TB incidence of 2.77 million in 2022). In application of confusion matrix, the sample size of data is recommended to be statistically significant (say typically around 500 cases from every facility for applying to the matrix), the period of capture of data is to be considered reasonably recent and quality of data be considered good in terms of completeness, cleanliness, accuracy and reliability.

In relation to our analysis, the provisional diagnosis will be assessed on Sensitivity and Specificity. In reference to the Matrix shown in the table – 1, the formula and explanation for these two parameters are given as under (a,b,c and d used in the following formula are taken from the Table – 1): -

1.      Sensitivity = [a/(a+c)]×100

2.      Specificity = [d/(b+d)]×100

Sensitivity - The sensitivity of a test is also called the true positive rate (TPR) and is the proportion of samples that are genuinely positive that give a positive result using the test in question. For example, a test that correctly identifies all positive samples in a panel is very sensitive. Another test that only detects 60 % of the positive samples in the panel would be deemed to have lower sensitivity as it is missing positives and giving higher a false negative rate (FNR). Also referred to as type II errors, false negatives are the failure to reject a false null hypothesis (the null hypothesis being that the sample is negative).

Specificity - The specificity of a test, also referred to as the true negative rate (TNR), is the proportion of samples that are genuinely negative that give a negative result using the test in question. For example, a test that identifies all healthy people as being negative for a particular illness is very specific. Another test that incorrectly identifies 30 % of healthy people as having the condition would be deemed to be less specific, having a higher false positive rate (FPR). Also referred to as type I errors, false positives are the rejection of a true null hypothesis (the null hypothesis being that the sample is negative).

A possible process for carrying out collection of data for the analysis is illustrated as under: -


Figure – 1: Suggestive Process for Data Capture


A patient with symptoms of a disease is triaged and sent to a doctor (in a of the given facility being considered for this analysis). Such patients (with due consent) will be considered as subjects for this analysis. A subject can be Provisionally Diagnosed as “Likely Positive” or “Likely Negative” for Tuberculosis (TB) by the attending Doctor. Based on provisional diagnosis the doctor will start suitable medication and at the same time recommend confirmatory tests (for possible evidence(s) of TB). Once the results of tests are available, the doctor will write the Final Diagnosis as a “Positive” or a “Negative” case of TB. The data for Symptoms, for Provisional Diagnosis and for Final Diagnosis will be used to be fed to analytical framework used (Confusion Matrix).

In a real scenario, for every facility chosen for analysis (like we have chosen 15 facilities for our imaginary case), one Confusion Matrix will be prepared based on data captured at that facility (say clean dataset in the range of 500 cases). Sensitivity and Specificity will be calculated for every Confusion Matrix. These parameters (Sensitivity and Specificity) will represent performance of diagnosis (on correctness of the provisional diagnosis) for the given disease across the chosen facilities.

Since we do not have real data, let us try to work-out on the above analytical idea through some imaginary random data. So, with an imaginary data for Sensitivity and Specificity of 15 Imaginary Facilities (assumed to have been derived from huge data collected during 01-JAN-2023 to 31-DEC-2023 in respective facilities – typically valid dataset of around 500 for every facility), an S-S (Sensitivity and Specificity) Cartesian Plot has been prepared. This is a deviation form ROC curve (between values of ‘Sensitivity’ and ‘1-Specificity’) plotted by data analysts to test their models. The deviation is taken considering that we are not testing different provisional diagnosis AI model, but we are exploring an indicator for performance of facilities on Provisional and Final diagnosis. The imaginary data used is given under APPENDIX – B. The S-S Plot is as under: -


Figure – 2: Suggestive Sensitivity and Specificity Plot (S – S Plot) for a selected set of Facilities based on Provisional Diagnosis and Final Diagnosis data collected over a given period of time


Suggestive interpretations based on the above S – S curve is given in the following table. In case, it turns out that the S – S curve is found relevant on application of this idea for analysis (analytical framework) on real data, the suggestive interpretations may be used by policy makers and administrators for improving provisioning of healthcare.




Table – 2: Suggestive Interpretation of the S – S Plot with respect to the performance of Facilities on accuracy of Provisional Diagnosis


The assumptions made for the above S – S plot include: -

1.      Reliability of Confirmatory Test is assured

2.      Clarity in communication between Patient and Service Providers is observed across all facilities

3.      Availability of required Clinical and Non-Clinical Infrastructure is good in all facilities

4.      Patient Load per Doctor do not vary significantly across facilities

5.      Qualification of Clinical and Non-Clinical Staff do not vary significantly across facilities

6.      Patient Load with multiple diseases (which may include or exclude the disease in question) is largely uniform across the facilities

7.      Asymptomatic patients are rare (statistically negligible) and will remain excluded from this analysis

8. Prevalence of the disease does not vary in high magnitude across the facilities chosen and does not impact diagnostic accuracy in any significant way across the facilities


Should the suggested Analytical Framework (if found relevant on real data) be used by Healthcare Administrators or Policy Makers?

Measurement of performance in healthcare is a critical issue and needs adequate care in selection of indicators for administrative purposes. Complexities in health administration include: -

a.      Freedom, Flexibility, Discretion and Authority of Healthcare Service Providers

b.      Mutual trust between Patient and Treating Doctor

c.      Right to Information, Freedom of Choice and Right to Privacy of Patients

d.      Community and Societal Values and Government Regulations

e.      Tradition and Beliefs of stakeholders

f.        Diversity in adherence to divergent, contradictory or competing School of Thoughts in Medical Fraternity

g.      Resource optimization challenges with prevailing Patient Load, Services Rendered, Funding, Locality and Infrastructure availability at Facilities

h.      Cost of Medical Expense to the Patient

i.        Temptation of a facility to improve on measured indicators (at times, which may unknowingly turn out to introduce biases in treatment)

I believe that the matter of healthcare administration be left to the administrative experts. Ideas suggested by data analysts may be scrutinized in the larger context of healthcare administration. Only those ideas, which could stand scrutiny of healthcare experts be put to use for administering healthcare facilities (and that too after incorporating suggested modifications).

In this case, the idea expressed in this blogpost is at this stage, just a possibility. The idea is not even put to test through real data (which I do not have access to). The idea is not even debated and improved with domain experts. Thus, at the best this blogpost can be considered as an idea open to debate and corrections to get refined for a possible consideration to be tested with real data by real data analysts or researchers.


/***********************************/


APPENDIX – A: Brief on Symptoms and Signs

Symptoms and Signs are briefed as under: -

§  Symptoms - Something that a person feels or experiences that may indicate that they have a disease or condition. Symptoms can only be reported by the person experiencing them. They cannot be observed by a health care provider or other person and do not show up on medical tests. Some examples of symptoms are pain, nausea, fatigue, and anxiety. (NCI)

§  Signs - In medicine, a sign is something found during a physical exam or as a result of a laboratory or imaging test that shows that a person may have a condition or disease. Signs can be observed by a health care provider or other person. Some examples of signs are fever, swelling, skin rash, high blood pressure, and high blood glucose. (NCI)

 

APPENDIX – B: Imaginary Data to Plot the Illustrative S - S CURVE


APPENDIX – C: Information Technology in Health Management

 

What is Health Management?

Health management is complex, and its meaning can vary depending on local contexts. EHMA (European Health Management Association) with our members have developed a definition that captures the needs of health systems all across Europe. In brief, health management provides guidance and leadership to administer health at the individual, organisational and systemic level. Health management embraces a holistic vision of health, in which health is impacted by behavioral, social, and environmental determinants. Health management includes and goes beyond healthcare management, which comprises community, primary, secondary, and tertiary care provision. It also happens outside of care settings and builds synergy with other related policy and societal areas in line with the ‘One health’ concept. Health management encompasses the entire health ecosystem in which health managers collaborate with patients, informal and formal caregivers, patient organisations, legislators, educators, policy makers and regulators, public health experts, researchers, health insurance experts, and pharmaceutical industries. Together, they aim to create a clear health vision and alignment strategy, as well as lay down the organisational, societal and technological conditions to achieve optimal health outcomes for individual patients and the entire community. Health managers are jointly responsible for establishing effective and holistic governance structures, built on a co-design and co-production model. (EHMA)

 

What is the role of Information Technology in Health Management?

The evolving paradigm shift resulting from IT, social and technological changes has created a need for developing an innovative knowledge-based healthcare system, which can effectively meet global healthcare system demands and also cater to future trends.  The Hospital Information Management System (HIMS) is developed with this sole aim in mind, which helps in processing and management of hospital information not only inside the boundary, but also beyond the hospital boundary, e.g., telemedicine or e-healthcare. (Wadhwa S, Saxena A, Wadhwa B. Hospital information management system: an evolutionary knowledge management perspective. Int J Electron Healthc. 2007;3(2):232-60. doi: 10.1504/IJEH.2007.013103. PMID: 18048272.)

HMS (Hospital Management System) or HIMS (Hospital Information Management System) provides the underpinnings for decision-making and has four key functions: data generation, compilation, analysis and synthesis, and communication and use. HIMS collects data from the health sector and other relevant sectors, analyses the data and ensures their overall quality, relevance and timeliness, and converts data into information for health-related decision-making. (Health Metrics Network Framework and Standards for Country Health Information Systems, World Health Organization, January 2008)