To log in, click the teal "Login" button in the upper right-hand corner of this page. If you are logged in but still do not have access, please check your 2025 Annual Meeting registration.
To log in, click the teal "Login" button in the upper right-hand corner of this page. If you are logged in but still do not have access, please check your 2025 Annual Meeting registration.
Moderator
Yousuf Khalifa, MD
Panelists
Prasanna V Ramesh, MS, DNB
Tam Ch Hoang, MD
Viewing Papers
Expand a paper title to the right to view the paper abstract and authors, and view the full session above.
Presenting Author
Prasanna V Ramesh, MS, DNB
Purpose
An AI protractor tailored specifically for the intricate analysis of the iridocorneal angle was innovated by us for Anterion (high-resolution swept-source OCT) images.
Methods
This technology swiftly and precisely annotates angles with human-in-the-loop machine learning, meticulously measuring their degrees, and autonomously categorising them into wide open, narrow, and closed angles. Trained on a meticulously curated dataset comprising 1500 images, partitioned into 1370 for training and 130 for testing, this AI protractor exhibited a progressive increase in accuracy over time.
Results
In three distinct tests, the AI demonstrated exceptional sensitivity and specificity, showcasing its robust performance; Test 1 - Sensitivity 82.61% & Specificity 92.59%; Test 2 - Sensitivity 88.24% & Specificity 94.44%; and Test 3 - Sensitivity 95.24% & Specificity 95.83%.
Conclusion
This AI model also overcomes the black box dilemma thereby enhancing diagnostic precision with constant training via the feedback mechanism.
Presenting Author
Aisling Higham, FRCOphth, MBBS, MSc
Co-Authors
Ernest Lim MBBS, BSc, Nick De Pennington MBA, BM BCh, FRCS, Andrew Lee MD, Saif Aldeen AlRyalat MD, Sanjana Jaiswal MBBS
Purpose
Large-language models (LLMs) for healthcare lack scalable evaluation methods. We test the hypothesis that evaluation of LLM generated responses to patient queries can be automated using readability metrics and automated qualitative evaluations, ultimately supporting development of more usability and empathetic AI-driven clinical conversations.
Methods
A comparative study was conducted using 78 anonymized clinician-curated questions posed to Dora, a clinical agent used in post-cataract surgery follow-ups in the UK. Responses from 2 Texas-based ophthalmologists were compared with responses from three different LLMs using retrieval-augmented generation (RAG). Metrics compared response length, word complexity and readability scores in both groups. Chain-of-thought prompting was employed for LLMs to rate answers on a 1-5 likert scale for the following domains: patient concern management, empathy, and information provision. Two separate ophthalmologists blind-ranked responses from the LLM versus clinicians for each domain.
Results
Automated metrics revealed variations in response length and readability between LLM-generated and clinician-generated responses. LLM responses had fewer words overall (39%), and with shorter sentences (52%) than the clinicians’. They were also more readable than clinician responses, with an average Flesh Kincaid reading grade required of 5.6 versus 10.2 respectively. The LLMs’ qualitative assessment demonstrated comparable abilities with a trend preference for clinicians for ‘managing concerns’, and ‘information provision’, and ‘empathy’ for LLMs. Agreement between clinician ranking and the automated likert evaluation was 74%, comparable to inter-clinician ranking agreement of 75%.
Conclusion
We surface interesting differences in usability patterns between clinician and AI-generated responses to real-world, cataract patient post-op queries. This study supports the feasibility of using automated metrics for evaluating model responses, guiding the design and development of empathetic and usable AI systems.
Presenting Author
Mohammad Soleimani, MD
Purpose
Microbial keratitis (MK) is a major cause of corneal blindness, with outcomes dependent on timely, accurate diagnosis. In low and middle-income countries, limited access to expertise and equipment hampers diagnostics. This study investigates using deep learning (DL) to diagnose and differentiate MK subtypes from smartphone-captured images.
Methods
The dataset comprised 889 cases of bacterial keratitis (BK), fungal keratitis (FK), and acan-thamoeba keratitis (AK) collected from 2020 to 2023. A convolutional neural network-based model was developed and trained for classification.
Results
The study demonstrates the model's overall classification accuracy of 83.8%, with specific accu-racies for AK, BK, and FK at 81.2%, 82.3%, and 86.6%, respectively, with an AUC of 0.92 for the ROC curves.
Conclusion
The model exhibits practicality, especially with the ease of image acquisition using smartphones, making it applicable in diverse settings.
Presenting Author
Paula E Maturana, MD
Co-Authors
Daniela M López, MSc,
Claudia L Goya, MSc,
Daniela B. Cabrerizo, MD,
Remigio Lopez-Solís, PhD,
Leonidas Traipe, MD, MSc
Purpose
To describe ultrasound findings in meibomian glands (MG) and lacrimal glands(LG) of patients with DED due to MGD (meibomian gland dysfunction), SS (Sjögren’s syndrome) and others AID (autoimmune diseases).
Methods
Descriptive study of 60 patients with DED diagnosis according to DEWS II criteria (19 patients with SS, 19 with AID and 22 with MGD) who underwent Doppler ultrasound of MG and LG in both eyes (24 and 70 MHz transducer). The results were analyzed by a radiologist sub-specialized in ultrasound and morphometric parameters were registered for both glands, such as: intra and inter-gland ultrasound density, vascularization characteristics and structural changes (diameter, thickness, and density) in acini and ducts.
Results
Sixty patients were enrolled, with a mean age of 49.3 ± 12.70 (range: 16 - 77), and 88.67% were women. Regarding MG, intra- and inter-acinus echogenicity alterations were observed in all cases (primarily hypoechogenicity). Structural alterations were present in 86% of patients, and hypervascularization was noted in 95% (with a higher systolic peak in patients with AID, averaging 10.48 cm/s). For LG, echogenicity, structural alterations, and hypervascularization were observed in 88%, 10%, and 75% of cases, respectively, especially in SS patients.
Conclusion
Ultrasound evaluation offers valuable insights into the structural changes of LG and MG in DED. Alterations in gland echogenicity and hypervascularization were the most frequently findings. This technique can reveal adaptive changes and inflammatory effects that are not detectable by other methods and lead to the development of new research.