Year of Publication
OBJECTIVES: To evaluate the performance of an engineered machine learning algorithm to identify kidney stones and measure stone characteristics without the need for human input.
METHODS: We performed a cross-sectional study of 94 children and adults who had kidney stones identified on non-contrast CT. A previously developed deep learning algorithm was trained to segment renal anatomy and kidney stones and to measure stone features. The performance and speed of the algorithm to measure renal anatomy and kidney stone features were compared to the current gold standard of human measurement performed by 3 independent reviewers.
RESULTS: The algorithm was 100% sensitive and 100% specific in detecting individual kidney stones. The mean stone volume segmented by the algorithm was smaller than that of human reviewers and had moderate overlap (Dice score: 0.66). There was substantial variation between human reviewers in total segmented stone volume (Jaccard score: 0.17) and volume of the single largest stone (Jaccard score: 0.33). Stone segmentations performed by the machine learning algorithm more precisely approximated stone borders than those performed by human reviewers on qualitative assessment.
CONCLUSION: An engineered machine learning algorithm can identify and characterize stones more accurately and reliably than humans, which has the potential to improve the precision and efficiency of assessing kidney stone burden.