Elsevier

Clinical Radiology

Volume 60, Issue 11, November 2005, Pages 1205-1212
Clinical Radiology

Voice recognition for radiology reporting: Is it good enough?

https://doi.org/10.1016/j.crad.2005.07.002Get rights and content

AIM

To compare the efficiency and accuracy of radiology reports generated by voice recognition (VR) against the traditional tape dictation–transcription (DT) method.

MATERIALS AND METHODS

Two hundred and twenty previously reported computed radiography (CR) and cross-sectional imaging (CSI) examinations were separately entered into the Radiology Information System (RIS) using both VR and DT. The times taken and errors found in the reports were compared using univariate analyses based upon the sign-test, and a general linear model constructed to examine the mean differences between the two methods.

RESULTS

There were significant reductions (p<0.001) in the mean difference in the reporting times using VR compared with DT for the two reporting methods assessed (CR, +67.4; CSI, +122.1 s). There was a significant increase in the mean difference in the actual radiologist times using VR compared with DT in the CSI reports; −14.3 s, p=0.037 (more experienced user); −13.7 s, p=0.014 (less experienced user). There were significantly more total and major errors when using VR compared with DT for CR reports (−0.25 and −0.26, respectively), and in total errors for CSI (−0.75, p<0.001), but no difference in major errors (−0.16, p=0.168). Although there were significantly more errors with VR in the less experienced group of users (mean difference in total errors −0.90, and major errors −0.40, p<0.001), there was no significant difference in the more experienced (p=0.419 and p=0.814, respectively).

CONCLUSIONS

VR is a viable reporting method for experienced users, with a quicker overall report production time (despite an increase in the radiologists' time) and a tendency to more errors for inexperienced users.

Introduction

Over the previous decade, there has been an increase in both the number and complexity of radiology reports. Historically, reports have been produced by several methods: Direct dictation to a secretary, handwritten onto the back of the request form for subsequent typing, and dictation on a tape handset with subsequent transcription (tape dictation–transcription, DT). These methods are time-consuming for the radiologist, expensive in terms of transcriptionist costs, and may lead to inevitable delays in the availability of the reports. Newer methods are being tried and there have been major advances in speech recognition systems. Digital dictation with subsequent speech recognition and corrections done by clerical staff (Philips SP6000) has been reported as a good alternative for report generation when compared with word-processed reports performed by the radiologist.1 Automated speech recognition (ASR MediSpeak) has been compared with conventional DT in CT report generation. The errors made were comparable, and the report turnaround time was reduced, although the time taken by the radiologist increased.2

Guidelines issued by the Royal College of Radiologists (RCR) suggest that all reports should ideally be proofread and authorized immediately at the time of transcription and whilst reviewing the image being reported,3 in order to minimize the clinical risk from errors. This is not possible with the current working practices of many radiology departments where DT methods are in use. Few would have time to be able to review the images whilst proofreading. The guidelines state that departments need to seek ways of developing immediate transcription methods to minimize this source of risk.

The Norfolk and Norwich University Hospital is a large teaching hospital performing over 250,000 radiology examinations annually. DT is the most frequent method of report transcription, but with this we have experienced delays of up to 7 working days from dictation of the report by the radiologist and its subsequent transcription and availability on the hospital information systems. This was due to clerical shortages and constant interruptions by clinicians chasing reports that were still “on tape”. The situation has changed markedly since the introduction of voice recognition (VR) reporting 3 years ago. An initial comparison of VR and DT reporting here suggested VR had both a time and error advantage, although limited by study size.4 We performed a study to compare reporting times and error rates for the two techniques in our department, to assess the efficiency and accuracy of both methods.

Section snippets

Methods and materials

The Norfolk and Norwich University Hospital used a hospital-wide PACS (GE Pathspeed) and McKesson Total Care Radiology Information System (McKesson TC-rad v8.9 RIS). Well-developed software was used (GHG VoicePro for Radiology) to provide an interface between RIS and the VR speech engine (Dragon NaturallySpeaking Professional v.5). GHG customized the VR reporting system to enable RIS to function using both command mode of VR (to navigate around the system), as well as text mode (to talk the

Results

Of the 220 reports used in the study 160 (73%) were for CR, the remaining for CSI. For 99 reports (45%) VR was carried out first. The average word length was 53 words.

Discussion

The present study demonstrates a total reporting time advantage of VR over DT for long reports, and this time advantage remains when adjusted for length of report, irrespective of the reporting method or experience of VR use. The DT method is complex, involving several stages and individuals in the pathway. The VR method incorporates the dictation and transcription into one stage. The report turnaround time (time from the end of the examination to the time the report is available on RIS) is

References (7)

  • H. Arndt et al.

    Anwendung der digitalen Spracherkennung in der radiologischen routine (use of digital speech recognition in diagnostic radiology)

    Fortschr Rontgenstr

    (1999)
  • U.K.M. Teichgraber et al.

    Digitale Spracherkennung bei der erfassung computertomograthischer befundtexte (automated speech recognition for report generation in computed tomography)

    Fortschr Rontgenstr

    (1999)
  • Risk management in clinical radiology

    (2002)
There are more references available in the full text version of this article.

Cited by (46)

  • Benefits of Integrated RIS/PACS/Reporting Due to Automatic Population of Templated Reports

    2019, Current Problems in Diagnostic Radiology
    Citation Excerpt :

    Regarding time savings, report automation allows the radiologist to spend more time per examination reviewing the images and synthesizing findings into a clinically relevant impression, rather than on dictating automatable noninterpretive but required fields. The use of templates, along with experience in VR, has been shown to increase the efficiency of dictation.7,8 No studies, however, have demonstrated the effects of a fully automated template system by which all noninterpretive fields are automatically populated.

  • Laterality errors in radiology reports generated with and without voice recognition software: Frequency and clinical significance

    2013, Journal of the American College of Radiology
    Citation Excerpt :

    Previous studies have evaluated the use of voice recognition software in radiology [7,10-18], but the majority focused on cost-effectiveness and reporting time. Of those analyzing accuracy, several found increases in the frequency of errors without the use of voice recognition [10,11,14]. However, these studies evaluated for errors in syntax, grammar, spelling, and so on, and did not specifically address laterality errors, which are more likely to be clinically relevant.

  • The radiology report - Are we getting the message across?

    2011, Clinical Radiology
    Citation Excerpt :

    LaFortune et al. argued that a court could easily find the radiologist responsible for harm done to a patient as a result of a careless unchecked report and adds that the use of phrases such as “dictated but not checked” and other phrases to limit responsibility also reduce the reliability of a report.17 Although voice recognition is a viable method of reporting for experienced users, with a quicker overall report time, this comes at the price of an increase in the radiologists’ time and a tendency to more errors for inexperienced users.43 Other errors (such as grammatical or spelling errors) can also make the radiologist look rather careless, as demonstrated in Table 8, which are all excerpts from verified radiology reports.

View all citing articles on Scopus
View full text