VisionSpeak: Virtual Interactive Object Detection and Voice Narration
Keywords:
Real-time object recognition, Voice instruction, Educational
Abstract
VisionSpeak is an application that uses a device camera to recognize objects in real time and describes them to the user through spoken instructions. It is designed primarily for children, helping them learn about the objects in their surroundings and the characteristics of those objects. Learning from the immediate environment rather than through traditional methods makes the process more interactive and fun, which can improve children's ability to grasp new concepts. Beyond education, VisionSpeak can also serve as an assistive technology for visually impaired individuals: the camera identifies objects and supports navigation, while voice output provides descriptions or instructions, so that users can move through their environment more easily and with greater independence in their daily lives. By supporting both learning and accessibility, VisionSpeak has the potential to make a real difference in people's lives by promoting independence and self-reliance. It helps bridge the gap between the visually impaired community and the sighted world while offering children a fun, interactive, and educational experience.
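The abstract does not name a detection model or a speech engine, so the following is only a minimal sketch of the step that links the two: turning a list of detections into one sentence that can be spoken. The `Detection` type, the `describe` and `narrate` functions, the 0.5 confidence threshold, and the sentence wording are all assumptions made for illustration, not part of VisionSpeak itself. The speech engine is passed in as a plain callable so that any text-to-speech backend could be substituted.

```python
from collections import Counter
from typing import Callable, Iterable, NamedTuple


class Detection(NamedTuple):
    """One detector result (hypothetical shape, assumed for this sketch)."""
    label: str         # class name reported by the detector
    confidence: float  # score in the range [0, 1]


def describe(detections: Iterable[Detection], min_confidence: float = 0.5) -> str:
    """Build one short sentence listing the confidently detected objects."""
    # Count labels, ignoring detections below the (assumed) threshold.
    counts = Counter(d.label for d in detections if d.confidence >= min_confidence)
    if not counts:
        return "I do not see any objects I recognize."
    # Naive pluralization: append "s" when more than one object is seen.
    parts = [f"{n} {label}{'s' if n > 1 else ''}" for label, n in counts.items()]
    if len(parts) == 1:
        listing = parts[0]
    else:
        listing = ", ".join(parts[:-1]) + " and " + parts[-1]
    return f"I can see {listing}."


def narrate(
    detections: Iterable[Detection],
    speak: Callable[[str], None] = print,
    min_confidence: float = 0.5,
) -> str:
    """Describe the detections and hand the sentence to a speech callable.

    `speak` defaults to `print`; a real text-to-speech function would be
    supplied here instead (assumption: it accepts a single string).
    """
    sentence = describe(detections, min_confidence)
    speak(sentence)
    return sentence


if __name__ == "__main__":
    frame = [
        Detection("cup", 0.9),
        Detection("cup", 0.8),
        Detection("book", 0.7),
        Detection("dog", 0.2),  # dropped: below the threshold
    ]
    narrate(frame)
```

Keeping the sentence builder separate from the speech call means the wording can be adjusted for children or for navigation prompts without touching the detector or the audio output.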
License
Copyright (c) 2025 International Journal for Research Publication and Seminar

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.