Show simple item record

dc.contributor.authorSerhan, Gener
dc.contributor.authorParker, Dattilo
dc.contributor.authorDhruv, Gajaria
dc.contributor.authorAlexander, Fusco
dc.contributor.authorAli, Akoglu
dc.date.accessioned2023-11-28T02:34:23Z
dc.date.available2023-11-28T02:34:23Z
dc.date.issued2023-09-20
dc.identifier.citationSerhan, G., Parker, D., Dhruv, G., Alexander, F., & Ali, A. (2023). Gpu-based and streaming-enabled implementation of pre-processing flow towards enhancing optical character recognition accuracy and efficiency. Cluster Computing, 1-13.en_US
dc.identifier.issn1386-7857
dc.identifier.doi10.1007/s10586-023-04137-0
dc.identifier.urihttp://hdl.handle.net/10150/670151
dc.description.abstractResearch has demonstrated that digital images can be pre-processed through operations such as scaling, rotation, and blurring to enhance the accuracy of optical character recognition (OCR) by emphasizing important features within the image. Our study employed the open-source Tesseract OCR and found that accuracy can be improved through pre-processing techniques including thresholding, rotation, rescaling, erosion, dilation, and noise removal, based on a dataset of 560 phone screen images. However, our CPU-based implementation of this process resulted in an average latency of 48.32 ms per image, which can hinder the processing of millions of images using OCR. To address this challenge, we parallelized the pre-processing flow on the Nvidia P100 GPU and executed it through a streaming approach, which reduced the latency to 0.825 ms and achieved a speedup factor of 58.6x compared to the serial execution. This implementation enables the use of a GPU-based OCR engine to handle multiple sources of data streams with large-scale workloads.en_US
dc.description.sponsorshipNational Science Foundationen_US
dc.language.isoenen_US
dc.publisherSpringer Science and Business Media LLCen_US
dc.rights© 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.en_US
dc.rights.urihttps://rightsstatements.org/vocab/InC/1.0/en_US
dc.subjectCUDAen_US
dc.subjectGPUen_US
dc.subjectmage processingen_US
dc.subjectLeptonicaen_US
dc.subjectOptical Character Recognition (OCR)en_US
dc.subjectTesseracten_US
dc.titleGpu-based and streaming-enabled implementation of pre-processing flow towards enhancing optical character recognition accuracy and efficiencyen_US
dc.typeArticleen_US
dc.identifier.eissn1573-7543
dc.contributor.departmentDepartment of Bioethics and Medical Humanism, College of Medicine-Phoenix, University of Arizonaen_US
dc.identifier.journalCluster Computingen_US
dc.description.note12 month embargo; first published 20 September 2023en_US
dc.description.collectioninformationThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.en_US
dc.eprint.versionFinal accepted manuscripten_US
dc.identifier.pii4137
dc.source.journaltitleCluster Computing
dc.source.volume26
dc.source.issue6
dc.source.beginpage3407
dc.source.endpage3419


Files in this item

Thumbnail
Name:
Gpu-based and streaming-enabled ...
Size:
3.390Mb
Format:
PDF
Description:
Final Accepted Manuscript

This item appears in the following Collection(s)

Show simple item record