Introduction
Diagnosing a urinary tract infection (UTI) in nursing home residents is difficult. This is due to a variety of factors, including the lack of appropriate diagnostic tests for UTIs to differentiate them from asymptomatic bacteriuria (ASB), which is highly prevalent in this population. Reference Nicolle, Gupta and Bradley1 Expression of specific urinary signs and symptoms due to cognitive impairments is often difficult for residents, Reference Nicolle, Strausbaugh and Garibaldi2 and from the side of caregivers, assessment of these symptoms is complex. Reference D’Agata, Loeb and Mitchell3 Nonspecific symptoms, such as confusion, are highly prevalent and historically attributed to UTIs. Reference D’Agata, Loeb and Mitchell3,Reference Arinzon, Shabat, Peisakh and Berner4
The role of nonspecific symptoms is unclear in nursing home residents suffering from UTI. The latest international guidelines and criteria (such as the Loeb criteria), although created for surveillance purposes, do not consider nonspecific symptoms in the diagnosis of UTI in older adults. Reference Hertogh and Haaijman5–Reference Loeb, Bentley and Bradley11 The recommendations are based on inconsistent results on the role of nonspecific symptoms and UTI, Reference van Buul, Vreeken and Bradley9,Reference Sundvall, Ulleryd and Gunnarsson12–Reference Boscia, Kobasa, Abrutyn, Levison, Kaplan and Kaye19 including 3 meta-analysis providing conflicting evidence.
To our knowledge, the diagnostic value of the guideline-based criteria for UTI in older adults upon implementation has not been described. Several prediction models for UTI diagnosis in other populations, for example, children and women, or settings such as emergency departments have been developed, Reference Shaikh, Hoberman and Hum20–Reference Gágyor, Haasenritter, Bleidorn, McIsaac, Schmiemann, Hummers-Pradier and Himmel23 as well as a model to predict adverse outcomes in geriatric patients with UTI. Reference Ginde, Rhee and Katz24 Caterino and colleagues Reference Caterino, Kline and Leininger18 developed a model to predict whether an altered mental status and malaise/lethargy affect the posttest probability of infection in older adults admitted to the emergency department. The probability of infections overall did not increase, but the probability of a UTI increased moderately when using their model.
The objective of this study was to assess the clinical relevance of the implemented guideline-based diagnostic criteria for UTI and the added value of nonspecific symptoms to support clinical decision-making.
Methods
Design
This was a preplanned secondary analysis of a cross-sectional study (PROGRESS) with the overall aim to improve UTI diagnosis in nursing homes. A full description of the study design and procedures has been described elsewhere. Reference Kuil, Hidad and Fischer25
Setting and participants
The data were collected between November 2017 and August 2019 in 13 Dutch nursing homes. The intention was to resume the study after an initial interim analysis-related break for the primary study objective. At the beginning of 2020, we decided to end the study permanently due to the closing of the nursing homes in the coronavirus disease 2019 (COVID-19) pandemic. Consenting older adults (≥65 years old) residing at a psychogeriatric, somatic, or rehabilitation ward with a presumed UTI based on clinical assessment of the attending clinician were eligible for enrollment. Exclusion criteria were (1) presumed respiratory tract infection (RTI) or other infection requiring antibiotics, (2) previous enrollment, or (3) urine collected >24 hours after initiation of antibiotic therapy. In Dutch nursing homes, elderly care physicians (physicians who specialize in long-term care for frail elderly people for 3 years) are responsible for the medical care of residents. Specially trained nurses can perform delegated tasks such as clinical assessment and diagnostic decisions.
Reference test for UTI
As there is no reference test for UTI diagnosis in this population, a stringent composite definition for “true UTI” was used. To fulfill this definition, one should meet all of the following criteria: (1) a positive urine leucocyte esterase test, (2) uropathogens in a bacterial culture at ≥104 colony-forming units/mL; (3) a maximum of 2 uropathogens; and (4) improvement of symptoms during follow-up while adequate antibiotic treatment was received (ie, proven susceptibility of isolated uropathogens to the administered antibiotic). The type of symptoms was not part of the “true UTI” definition. Treating physicians decided whether or not to treat based on their own clinical decision-making, without having access to the results of study procedures. As such, the clinical practice in the nursing home was not altered by the study activities. All primary and secondary uropathogens in the European Consensus Guideline were considered as uropathogens. Reference Aspevall, Hallander, Gant and Kouri26 Other bacterial species were considered as non-uropathogens. Whether the initial symptoms improved was reported at day 5 and 10 during follow-up. The improvement of symptoms at day 10 was used to determine the presence of a “true UTI.” When data at day 10 was missing or reported to be unknown, symptom improvement at day 5 was used. When the improvement was reported as unknown at day 5, this was considered to be no improvement.
Data collection
The presence of specific urinary symptoms (dysuria, frequency, urgency, new or worsened urine incontinence, urethral purulence, lower abdominal pain, costovertebral angle tenderness, and hematuria), systemic signs and symptoms (fever, chills, delirium), nonspecific symptoms (confusion, malaise, and loss of appetite), and an indwelling urinary catheter were reported on a clinical report form (CRF) by the attending clinician at the time of suspicion of UTI (day 0). When the presence of symptoms was unknown, this was considered to be an absence of symptoms. The procedures for urine collection, urine dipstick tests, and bacterial cultures were described previously. Reference Kuil, Hidad and Fischer25 Participants and attending clinicians were not informed about urine dipstick and bacterial culture results. Symptom improvement was assessed and registered on the CRF at day 5 and 10 of follow-up.
Outcome and analysis
We used a decision curve analysis as proposed by Vickers and Elkin. Reference Vickers and Elkin27 The Net Benefit is the outcome measure of a decision curve analysis to quantify the performance of different prediction models. The Net Benefit denotes the difference between the true positive and false positive classifications of the outcome (UTI) based on the prediction model and is calculated by the following formula: (TP – (w * FP))/N, where TP = true positives, w = weight (threshold probability), FP = false positives, and N = total number of individuals. The threshold probability in this formula is defined as the minimum probability of disease above which the physician considers a UTI to be present and decides to treat. The treating physician chooses a reasonable threshold probability that reflects clinical considerations on the risk of under- and overtreatment. Reference Vickers, van Calster and Steyerberg28 Personal preferences and attitudes of the prescriber may also play a role. Reference Ebell, Locatelli, Mueller, Senn and Morgan29 Diagnostic and treatment thresholds of 19.1% and 42.3%, respectively, were described previously in suspected UTI in primary care. Reference Harris, Pineles and Baghdadiv30 The Net Benefit of the prediction models is compared with “treating all” or “treating none” of the individuals. An elderly care physician might only need a low clinical suspicion for UTI to treat a patient with the idea that antibiotics might improve the presenting symptoms and prevent the worsening toward pyelonephritis. On the other hand, antibiotics may lead to side effects such as Clostridioides difficile infection, drug-drug interactions, or antimicrobial resistance development. These considerations might result in elderly care physicians to require a higher clinical suspicion of UTI before starting treatment. Decision curve analysis incorporates these different considerations in thresholds probabilities for treatment. It translates the minimal pretest probability for the outcome (UTI) at which the physician is willing to start treatment into the clinical threshold with different weights for false positives (risk for adverse events and antimicrobial resistance) and false negatives (risk for persisting or worsening of symptoms) of the prediction model. When a threshold probability of 10% is chosen, the harm of delaying antimicrobial treatment is considered 9 times higher than the harm of unnecessarily treating with antibiotics for that particular patient (10% reflects a ratio 1:9).
We compared 2 different models. The first is the guideline-based model, in which UTI is assumed to be present if there is at least 1 specific urinary symptom, together with a positive leukocyte or nitrite dipstick results. Reference Hertogh and Haaijman5,Reference van Buul, Vreeken and Bradley9 This model reflects the prevailing guideline in the Netherlands. The second is the extended model, in which a UTI is assumed if the criteria of the guideline model are met, together with at least 1 nonspecific symptom.
The Net Benefit of the 2 prediction models was visualized as a function of the threshold probability in a decision curve. We calculated the Net Benefit for a range of thresholds (0%–40%), as thresholds are known to vary between type of practitioners and number of years in practice. Reference Harris, Pineles and Baghdadiv30 A model is clinically useful when the Net Benefit is higher than the Net Benefit of the models that reflect to treat all or treat none of the patients.
The assessment of predictors was blinded to the outcome measure, as we classified for the outcome (true UTI) after data collection, because symptom improvement was part of our stringent definition. No prestudy sample size calculations were performed. All enrollments were used for analysis; there were no possibilities to extend the study due to COVID-19. No repeated enrollments were used for analysis to ensure independence of the observations. For the reference true UTI definition, complete case analysis was used, and enrollments with missing test results were excluded.
Calculations and decision curves were performed in RStudio (dca function) version 1.4.1103 (Integrated Development for R. RStudio, Inc., Boston, MA).
The reporting of the study is in accordance with the TRIPOD guideline (transparent reporting of a multivariate prediction model for individual prognosis or diagnosis). Reference Collins, Reitsma, Altman and Moons31
Results
Participants
The study enrolled 313 presumed UTI cases. After exclusions, there were 196 eligible presumed UTI cases of which 180 had all mandatory items documented for the stringent “true UTI” definition that represented the study population (Figure 1).
Characteristics of participants
Most participants were female (78.9%), and the median age was 87 years (range 66–107). Participants enrolled stayed at psychogeriatric, rehabilitation, and somatic wards (72.8%, 13.9%, and 13.3%, respectively) which is in line with the composition of nursing home wards in the Netherlands. The frequency of reported specific and nonspecific symptoms are listed in Supplementary Table S1–S2.
Frequency of “true UTIs”
Of the 180 presumed UTI cases, the urine leukocyte esterase and/or nitrite test was positive in 151 cases (83.9%; Table 1). In 71 of 151 cases, bacterial cultures were positive according to our definition (47.0%), and 59 cases were treated with adequate antibiotics according to the susceptibility test results (83.1%, 59/71). The most common uropathogens in this study were Escherichia coli (62%), Proteus spp. (8.9%), and Aerococcus spp. (8.3%). In 43 cases, the initial symptoms improved during antibiotic therapy, fulfilling the definition of “true UTI” (23.9%). Most UTIs were treated with nitrofurantoin (64.0%) and amoxicillin-clavulanic acid (19.3%).
Note. UTI, urinary tract infection.
a Positive urine leucocyte esterase test: ≥1+ leukocytes detected by Combur2 dipstick analysis.
b Positive bacterial culture defined as the presence of 1 or 2 uropathogens at ≥104 CFU/mL.
c Most bacterial cultures (23) were negative among the dipstick-negative urine specimens (79.3%).
d Proven susceptibility of isolated uropathogens to the administered antibiotic, in case of positive urine culture.
e The symptoms improved in 9 of the 12 residents treated with an antibiotic not covering the identified uropathogen.
f Improvement of the initially presented symptoms during follow-up; unknown symptom improvement equals the absence of improvement; in case of missing data on symptom improvement at day 10, symptom improvement at day 5 was used.
Primary outcome: the Net Benefit for UTI prediction
The guideline-based model showed a higher Net Benefit compared with “treating all” or “treating none,” indicating clinical usefulness (Figure 2). However, this model lacked clinical usefulness for the threshold probabilities below 21% and above 28%.
The extended model showed a higher Net Benefit and a wider corresponding threshold probability range (10%–28%) compared with the guideline-based model. The extended model was clinically useful for physicians who consider missing a UTI to be about 2.5 to 9 times worse than unnecessarily treating with antibiotics. Physicians who consider lower (<10%) or higher (>28%) thresholds for treatment (very worried about UTI or very worried about adverse events, respectively) should avoid using the extended model and rather treat all individuals or no individuals with a suspected UTI, respectively. The Net Benefit of the extended model ranged from 0.07 (at a risk threshold of 28%) to 0.157 (at a risk threshold of 10%). The unit of the Net Benefit is the net true positives. Reference Van Calster, Wynants and Verbeek32 This equates to the treatment of 7–15.7 patients with a true UTI per 100 patients suspected for having a UTI by the extended model, with no patients unnecessarily treated with antibiotics.
When comparing the extended model and the guideline-based model, the additionally detected true UTIs ranged from 0.6 to 10.1 per 100 patients with a suspected UTI (Figure 3).
To illustrate the translation of the guideline-based model compared with the extended model into clinical usefulness, we use an example. Let us assume that an elderly care physician considers missing a UTI to be 4 times worse than unnecessarily treating a particular patient with antibiotics. This 4:1 ratio is equal to a preferred threshold probability of 20%. In this case, the Net Benefits of the guideline-based model and the extended model are 0.0486 and 0.0819, respectively (Table 2). This means that the difference in Net Benefit between the guideline-based and the extended model is 0.0333, which translates into 3.33 additionally detected UTIs per 100 patients for whom antibiotic treatment is appropriate when the extended model is used (Table 2). The extended model improved the prediction of a UTI, compared with the guideline-based model, but the threshold probability range was narrow.
Net Benefit = (TP – (w * FP))/N, where TP = true positives, w = weight (threshold probability), FP = false positives, and N = total number of patients. Guideline-based model: specific urinary symptoms and positive urine dipstick result; extended model: guideline-based model or ≥1 nonspecific symptoms with positive dipstick results.
Discussion
The clinical usefulness of diagnosing UTI based on the current guideline algorithms is limited. Our extended model which adds nonspecific symptoms to the guideline algorithm improved the prediction of UTI; however, even this extended model is clinically useful for only a small range of physicians’ preferred threshold probabilities. Outside these thresholds, treating all or no patients with a presumed UTI is preferred.
The results of this study indicate that there is only a small window of opportunity to improve decision-making by using nonspecific symptoms. Whether this small threshold probability range is clinically useful depends on the particular physicians’ preferences, which implicitly incorporates the anticipated treatment effect (the more effective the treatment for health benefits, the lower the threshold) and the estimated risks of harm by under- and overtreatment for the specific patient (individualized medicine). Threshold probability ranges are by definition subjective and vary between physicians. Reference Van Calster, Wynants and Verbeek32 The number of false positives receiving unnecessary antibiotics is high using both models (Supplementary Tables 4 and 5). The harms of unnecessary antimicrobial use in terms of adverse events might be acceptable at the individual level, but the harm of antimicrobial resistance development has consequences at local or global population levels, and whether resistance development affects individual treatment decisions is unsure. Reference Nicholson, Tennant and White33,Reference Zhuo, Labbate and Norris34 Although based on our results no direct recommendations to incorporate nonspecific symptoms to treatment algorithms can be made, the observed limited usefulness of the guideline-based model based on specific symptoms and positive dipstick results is noteworthy. This finding is supported by previous meta-analysis illustrating a limited diagnostic value of specific urinary symptoms in older adults. Reference Gbinigie, Ordóñez-Mena, Fanshawe, Plüddemann and Heneghan35 Furthermore, a decrease in diagnostic value of specific urinary symptoms with increasing age has been reported by others. Reference Holm, Siersma and Cordoba36 The limiting factor of these 2 studies is that they use bacterial cultures as a reference test for UTI which is less informative in a setting with a high prevalence of asymptomatic bacteriuria. We tried to minimize this limitation by adding treatment response in our UTI definition. When using a very stringent UTI definition, one may expect a high number of true positives and a low number of false negatives. In our study, only 24 of the 43 “true UTIs” were detected using the guideline-based model, compared with 39 UTIs when using the extended model (Supplementary Tables 4 and 5). This illustrates that using solely specific urinary symptoms and positive dipstick results may contribute to the underdiagnosis of UTIs in nursing home residents. The guideline model to predict UTI thus seems barely clinically useful, and because appropriate diagnostic tests to differentiate UTI from ASB are currently lacking, physicians need their clinical expertise to make a judgment call with regard to treatment.
Decision curve analyses have not previously been described for the evaluation of diagnostic strategies for UTI in this population. Decision analysis is a research tool and is not designed to be used for individual residents. It is a method to try to determine the value of a prediction model in clinical practice. We used the guideline-based model without assessing its discriminatory power, as the model is already fully implemented in Dutch nursing homes. The same holds true for the extended model as it adds just a single criterion to the implemented guideline-based model.
It is known that assessment of clinical symptoms varies among physicians, Reference Hughes, Ellard and Campbell37 that nonspecific symptoms can cause diagnostic uncertainty, and that a diagnostic test to support decision-making is preferred. Reference Kuil, Schneeberger, van Leth, de Jong and Harting38 Our findings that show the limited usefulness of symptomatology further underline the need for appropriate diagnostic tests to distinguish patients with a UTI from ASB.
The added value of this study to the existing literature is the use of an analysis approach to assess the clinical usefulness of a model by weighing true positives and false positives, rather than traditional measures for discrimination, as treatment decisions are likely to vary among physicians. Using traditional measures like receiver operating curves with false positives and false negatives equally weighted results in more weight for positive outcomes when a positive outcome is less common (as in our study). Reference Steyerberg, Vickers and Cook39 This study illustrates the potential usefulness of the decision curve methodology as an alternative solution in further studies to define the best approach to the diagnosis of UTI in nursing home residents.
A limitation of our study was the very few events of UTI. Second, we used a stringent definition for “true UTIs” consisting of 4 items, including symptom resolution during adequate antibiotic treatment, to reduce misclassification of the outcome. This stringent definition may have underestimated the true number of UTIs because treatment response was part of the definition and not every patient was treated. On the other hand, the number of true UTIs may have been overestimated when symptoms unrelated to UTI resolved spontaneously. We deliberately included the therapy effect because it leads to the least overestimation of the UTIs. By leaving the treatment decision entirely to the physician, the results can be generalized to a large extent. Without a reference test available for UTI, we consider using this stringent definition as a reasonable solution. For future studies, a research reference standard could be used for which efforts are already being made to develop one. 40 Third, the presence of symptoms used in the prediction models was based on the clinical assessment of attending clinicians rather than using standardized assessment scales, Reference Wei, Fearing, Sternberg and Inouye41 but we assumed this reflected the clinical practice. Finally, the criterion for inclusion was a presumed UTI based on clinical assessment, which may have varied between physicians or with time due to the publication of the latest UTI guideline in October 2018. This may have caused clinical heterogeneity, affecting the pretest probability of having a UTI.
Conclusions and implications
The current guideline-based diagnostic algorithm seems to have a limited clinical usefulness in the prediction of UTI requiring antibiotic treatment. Using nonspecific symptoms might improve the prediction of UTI to a limited extent. Our findings imply that symptomatology offers little guidance for clinicians, that current guidelines focusing on specific urinary symptoms may need to be nuanced, and that there is a need for a reliable diagnostic tool to guide appropriate antibiotic treatment for UTIs in the geriatric population.
Supplementary material
To view supplementary material for this article, please visit https://doi.org/10.1017/ash.2024.345.
Data availability statement
The data sets are available on a dedicated project page: doi: 10.17605/OSF.IO/GDR8F.
Acknowledgments
The authors thank all residents for participating in this study, and all physicians and nurses of the participating nursing home wards who completed the clinical report forms and collected the urine samples.
Author contribution
Study concept and design: FvL, MdJ, CS, and SK. Acquisition of data: SK. Analysis and interpretation of data: SK and FvL. Drafting of the manuscript: SK. Critical review of the manuscript for important intellectual content: FvL, CS, and MdJ. Revision of manuscript: SK and FvL. Secured funding: MdJ, CS, and FvL.
Financial support
This work was supported by the Netherlands Organization for Health Research and Development, ZonMw (grant number 541001003). The funder had no role in study design, data collection, and interpretation or in submitting the work for publication.
Competing interests
All authors report no conflicts of interest relevant to this article.
Ethical standard
The PROGRESS study protocol was approved by the ethics committee of the VU University Medical Center (ref. no. 2017.350) and the National Central Committee on Research involving Human Subjects (CCMO ref. no. NL62067.029.17).
Written informed consent prior to enrollment was obtained from participants or their legal representatives when incapacitated. Residents’ capacity was assessed by the treating clinician. The study was conducted in accordance with Good Clinical Practice guidelines and the principles of the Declaration of Helsinki.
Consent for publication
Not applicable.