Image Recognition and Voice Translation for Visually Impaired

Authors

  • Sandeep Pasupuleti Department of Computer Science and Engineering, Hindustan Institute of Technology and Science, Chennai, India
  • Lahari Dadi Department of Computer Science and Engineering, Hindustan Institute of Technology and Science, Chennai, India
  • Manikumar Gadi Department of Computer Science and Engineering, Hindustan Institute of Technology and Science, Chennai, India
  • R. Krishnaveni Department of Computer Science and Engineering, Hindustan Institute of Technology and Science, Chennai, India

Keywords:

image recognition, voice translation, Flickr_8k, VGG, LSTM

Abstract

Image recognition and voice translation for the visually impaired is principally centered on aiding the visually impaired individuals which can guide them through changing the image into voice. Image captioning is that generating the caption of the given image. The arrival of image recognition and voice translation system can facilitate visually impaired individuals to visualize the globe. Image recognition needs to understand the numerous things, their properties, and their connections during an image. The “Flickr_8k” dataset is employed for this project with having 8000 pictures. During this project, we have a tendency to principally specialize in the eye mechanism, that extracts the options through VGG16 and converts the options into language through LSTM. During this project, we've projected a changed LSTM which can enhance the addition of data that have a tendency to get from a picture as further input to each unit of the block, which enhances the higher results on the recognition and translation. The sentence that's generated from the image is evaluated exploitation blue cheese Score. Moreover, the benefits and therefore the short comings of the prevailing ways and projected technique area unit mentioned and therefore the analysis criteria has been non commissioned. Hence, we have a tendency to area unit developing this model that generates a caption for the given image and that we show our model advances the state of art on tasks that need the joint process of Image and sentence description and finally converting it into voice.

Downloads

Download data is not yet available.

Downloads

Published

18-05-2021

Issue

Section

Articles

How to Cite

[1]
S. Pasupuleti, L. Dadi, M. Gadi, and R. Krishnaveni, “Image Recognition and Voice Translation for Visually Impaired”, IJRESM, vol. 4, no. 5, pp. 18–23, May 2021, Accessed: Jan. 28, 2025. [Online]. Available: https://journal.ijresm.com/index.php/ijresm/article/view/713