Blind Aid: State of the art for Scene Text Detector and Text to Speech

Bibliographic Details
Published in: 2022 International Conference on Advanced Computing Technologies and Applications (ICACTA), pp. 1-5
Main Authors: Kotagiri, Srividya; Venkataramana, Attada; Kiran, Gogula
Format: Conference Proceeding
Language: English
Published: IEEE 04-03-2022
Description
Summary: This paper focuses on people who are blind. The prototype helps a blind person recognize text in front of them, and the paper describes the entire process of this blind aid. First, the user is given a camera attached to their spectacles. Whenever they want to read something, they take a snapshot of that location. The text in the image is then detected using EAST (Efficient and Accurate Scene Text Detector), a fully convolutional network (FCN) built on PVANet, which uses max pooling during feature extraction. After detection, a Tesseract-based OCR engine recognizes the text in the image. The recognized text is then converted to speech using the Python package pyttsx3 and played to the user through a speaker. Finally, the paper introduces Modified EAST, which extends the pre-built model to increase the accuracy of the prototype.
DOI: 10.1109/ICACTA54488.2022.9753094
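
The recognition and speech stages described in the summary could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the helper names (`reading_order`, `recognize`, `speak`), the `line_tolerance` parameter, and the Tesseract `--psm 7` setting are all hypothetical choices, and the text boxes are assumed to come from an EAST-style detector that is left outside the sketch. The third-party packages `pytesseract` and `pyttsx3` are imported lazily, so the box-ordering helper runs without them.

```python
from typing import List, Tuple

# (x, y, width, height) of one detected text region, in pixels.
# Assumption: an EAST-style detector has already produced these boxes.
Box = Tuple[int, int, int, int]


def reading_order(boxes: List[Box], line_tolerance: int = 10) -> List[Box]:
    """Order boxes top-to-bottom, then left-to-right within each text line.

    Boxes whose top edges lie within `line_tolerance` pixels of the first
    box of a line are treated as belonging to that line (hypothetical rule).
    """
    lines: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: (b[1], b[0])):
        if lines and abs(box[1] - lines[-1][0][1]) <= line_tolerance:
            lines[-1].append(box)
        else:
            lines.append([box])
    return [b for line in lines for b in sorted(line, key=lambda b: b[0])]


def recognize(image, boxes: List[Box]) -> str:
    """Run Tesseract on each detected region and join the results.

    `image` is assumed to be a NumPy array (for example, a camera frame).
    """
    import pytesseract  # third-party; needs the Tesseract binary installed

    words = []
    for x, y, w, h in reading_order(boxes):
        crop = image[y:y + h, x:x + w]
        # "--psm 7" asks Tesseract to treat the crop as a single text line.
        text = pytesseract.image_to_string(crop, config="--psm 7").strip()
        if text:
            words.append(text)
    return " ".join(words)


def speak(text: str) -> None:
    """Read the recognized text aloud through the default audio device."""
    import pyttsx3  # third-party text-to-speech package

    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()
```

With a captured frame and detector output in hand, a call such as `speak(recognize(frame, boxes))` would chain the two stages. Sorting the boxes into reading order before OCR is a design choice of this sketch, made so that the spoken output follows the order in which a sighted reader would read the text.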