Dear Editor,

We would like to discuss “Translating musculoskeletal radiology reports into patient-friendly summaries using ChatGPT-4” [1]. The study evaluated layperson summaries of 60 radiology reports generated by ChatGPT-4. Its primary goal was to assess the accuracy and completeness of these summaries, which were required to be succinct, well organized, and written at an eighth- to ninth-grade reading level. To this end, three independent readers rated each summary on a scale of 1 to 3 for both accuracy and completeness.
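As an aside on how such a readability requirement can be verified, the sketch below is an illustration only, not the authors’ method: it uses the open-source textstat package to estimate the Flesch-Kincaid grade level of a hypothetical summary.

```python
import textstat

# Hypothetical layperson summary (illustrative only, not from the study)
summary = (
    "Your MRI shows a small tear in one of the tendons of your shoulder. "
    "The bones and joints look normal. Your doctor may suggest physical "
    "therapy or other treatment to help with pain and movement."
)

# Flesch-Kincaid grade level; a value of about 8-9 corresponds to the
# eighth- to ninth-grade reading level targeted in the study
grade = textstat.flesch_kincaid_grade(summary)
print(f"Estimated reading grade level: {grade:.1f}")
```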

The study reported that all 60 summaries met the word-count and readability standards. However, the three readers’ accuracy ratings differed: readers two and three rated accuracy significantly higher, with mean scores of 2.71 and 2.77, respectively, than reader one, whose mean score was 2.58. Completeness ratings also varied, with mean scores of 2.87, 2.73, and 2.87 for readers one, two, and three, respectively. Agreement between the two primary readers was low, with kappa values of 0.29 for accuracy and 0.33 for completeness. Unexpectedly, the addition of a third reader had no discernible effect on inter-reader agreement.
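For context on the agreement statistic, the sketch below (using hypothetical ratings, not the study’s data) shows how Cohen’s kappa for two readers’ 1-to-3 ratings can be computed with scikit-learn; values in the range of roughly 0.2 to 0.4 are conventionally interpreted as only fair agreement beyond chance.

```python
from sklearn.metrics import cohen_kappa_score

# Hypothetical 1-3 accuracy ratings from two readers for ten summaries
# (illustrative values only, not the study's data)
reader_one = [3, 2, 3, 3, 2, 3, 1, 3, 2, 3]
reader_two = [3, 3, 3, 2, 2, 3, 2, 3, 3, 3]

# Cohen's kappa corrects raw percent agreement for the agreement
# expected by chance alone
kappa = cohen_kappa_score(reader_one, reader_two)
print(f"Cohen's kappa: {kappa:.2f}")
```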

Several of the study’s shortcomings should be noted. First, the differences in accuracy and completeness scores suggest inconsistencies in the three readers’ assessment processes. Second, the poor inter-reader agreement indicates that judgments of the quality of the layperson summaries were not consistent. Finally, the study provides no information about factors that might have influenced the ratings or the inter-reader agreement.

Given these limitations, several directions for further research are worth considering. First, the review process should be standardized to ensure greater consistency among readers. Second, further work is needed to identify the factors driving the differences in accuracy and completeness ratings across readers. Third, future studies would benefit from examining which prompts or features of the radiology reports may have influenced the readers’ evaluations. It is also worth considering additional guidelines or safeguards to raise the quality of layperson summaries. Finally, expanding the research to a larger and more varied set of radiology reports would strengthen the generalizability of the results. Establishing norms or guidelines and examining the ethical concerns surrounding the use of AI-generated content would further help ensure a responsible and transparent use of this technology [2].