2.3 Clinically Relevant Scoring Mechanism (RFQI)
The paper proposes a novel clinically relevant scoring mechanism RFQI. This metric considers the exact synonym which can be directly fetched from the dictionary of radiological findings created previously (section 3.1). It also factors in the relative region (e.g., left, right, upper, mid, lower). The weighted sum of these metrics are then taken to report the overall accuracy.
The overall formula for RFQI for a given class c can be written as:
RFQIc =∑Ni=1(wcsyn ∗T Pci,syn +wcloc ∗T Pci,loc) ∑Ni=1(wcsyn ∗T Pci,syn +wcloc ∗T Pci,loc)
where N represents the number of positive training examples of a given class, wcsyn and wcloc represent the weights for synonym and localization matching terms, and T Pci,syn and T Pci,loc represent the binary value (1 for matching and 0 for non-matching). The scale of this metric is kept as 0-1 meaning that the sum of these weights for a given class is 1 i.e.
wcsyn +wcloc = 1
Note that in case there is no mention of the region in the ground truth/report, we consider it as a positive match for region for obvious reasons.