Audio Narration of a Scene for Visually Disabled using Smart Goggle

Authors

  • Pratyush Pratap Singh, Student, Department of Information Science and Engineering, Dayananda Sagar Academy of Technology and Management, Bengaluru, India
  • Sharath S. Hegde, Student, Department of Information Science and Engineering, Dayananda Sagar Academy of Technology and Management, Bengaluru, India
  • R. Varun, Student, Department of Information Science and Engineering, Dayananda Sagar Academy of Technology and Management, Bengaluru, India
  • Vivek Hegde, Student, Department of Information Science and Engineering, Dayananda Sagar Academy of Technology and Management, Bengaluru, India
  • K. A. Sumithra Devi, Professor, Department of Information Science and Engineering, Dayananda Sagar Academy of Technology and Management, Bengaluru, India

Keywords:

Raspberry Pi, Tesseract OCR engine, Raspberry Pi camera board, OpenCV, Natural Language Processing, Natural Language Generation, Text to Speech (TTS) engine, Optical Character Recognition (OCR), Object detection

Abstract

This work helps visually disabled people understand what is in a captured image. Using multimedia information processing techniques, the proposed device first acquires image attributes via the Pi Camera and then performs image-to-text conversion using the Tesseract and OpenCV libraries. Previously proposed approaches used computer vision to determine labels, or exploited existing descriptions of the training images to transfer or compose a new description for the image under test. We propose an approach that uses image annotations to generate image descriptions, and we show that, with accurate object and attribute detection, human-like descriptions of images can be generated. A Text to Speech (TTS) engine converts the generated text to speech, and the system is implemented in Python.
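
To make the annotation-to-description step concrete, the following is a minimal sketch, not the authors' implementation. It assumes a hypothetical annotation format (a list of dicts with a required "label" and an optional "attribute") and a simple sentence template; the function names `describe_scene` and `narrate`, the template wording, and the naive pluralisation are all illustrative assumptions. Camera capture, OCR and speech output are left outside the sketch and passed in as plain values or callables.

```python
from collections import Counter


def describe_scene(annotations):
    """Build one sentence from detected objects and their attributes.

    `annotations` is a hypothetical format: a list of dicts, each with a
    required "label" and an optional "attribute".
    """
    # Count identical "attribute label" phrases, keeping first-seen order.
    phrases = Counter()
    for ann in annotations:
        attribute = ann.get("attribute", "").strip()
        label = ann["label"].strip()
        phrases[f"{attribute} {label}".strip()] += 1

    if not phrases:
        return "No objects were detected in the scene."

    parts = []
    for phrase, count in phrases.items():
        if count == 1:
            article = "an" if phrase[0].lower() in "aeiou" else "a"
            parts.append(f"{article} {phrase}")
        else:
            # Naive pluralisation (assumption): simply append "s".
            parts.append(f"{count} {phrase}s")

    if len(parts) == 1:
        listing = parts[0]
    else:
        listing = ", ".join(parts[:-1]) + " and " + parts[-1]
    return f"The scene contains {listing}."


def narrate(annotations, ocr_text="", speak=print):
    """Compose the narration and hand it to a speech callable.

    `speak` defaults to `print` so the sketch runs without audio hardware;
    on a device it would be replaced by a call into a TTS engine.
    """
    message = describe_scene(annotations)
    text = " ".join(ocr_text.split())  # collapse whitespace from OCR output
    if text:
        message += f" Visible text reads: {text}."
    speak(message)
    return message
```

On the device, the `ocr_text` argument could plausibly be the string returned by an OCR call such as `pytesseract.image_to_string` on a captured frame, and `speak` could wrap a TTS engine; both of those bindings are assumptions about a possible wiring, since the abstract names the components but not how they are connected.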

Published

18-04-2022

Issue

Section

Articles

How to Cite

[1]
P. P. Singh, S. S. Hegde, R. Varun, V. Hegde, and K. A. S. Devi, “Audio Narration of a Scene for Visually Disabled using Smart Goggle”, IJRESM, vol. 5, no. 4, pp. 73–75, Apr. 2022, Accessed: Apr. 20, 2024. [Online]. Available: https://journal.ijresm.com/index.php/ijresm/article/view/1943