CPC G16H 15/00 (2018.01) [G06F 40/20 (2020.01); G06F 40/279 (2020.01); G06V 30/30 (2022.01); G06V 30/416 (2022.01); G16H 10/60 (2018.01); G16H 30/20 (2018.01)] | 20 Claims |
15. A system comprising:
one or more processors; and
a non-transitory computer-readable medium storing a plurality of instructions executable by the one or more processors to perform a method comprising:
receiving an image file containing a pathology report;
performing an image recognition operation on the image file to extract input text strings;
detecting, using a natural language processing (NLP) model, entities from the input text strings, each entity including a label and a value;
extracting, using the NLP model, values of the entities from the input text strings;
converting, based on a mapping table that maps entities and values to pre-determined terminologies, the values of at least some of the entities to corresponding pre-determined terminologies; and
generating a post-processed pathology report including the entities detected from the input text strings and the corresponding pre-determined terminologies,
wherein the input text strings are first input text strings; and
wherein parameters of the image recognition operation are determined based on an accuracy of recognizing entities from second input text strings by the NLP model, the second input text strings being generated by the image recognition operation using the parameters.
|