DD SPEAK-TO (DUMB & DEAF SPEAKING SYSTEM)

2025-06-28 16:31:04 - Adil Khan

Project Title

DD SPEAK-TO (DUMB & DEAF SPEAKING SYSTEM)

Project Area of Specialization

Internet of Things

Project Summary

Deaf and mute people use sign language to communicate. It is essential to keep them involved and connected to each other. The Dumb and Deaf Speaking System helps mute people convey their thoughts to others and understand others' replies in the form of gestures. First, the user's gestures are captured by a Kinect camera, recognized by a gesture recognition method, and output as text or sound. After receiving the message, the hearing person answers in text or speech, which is converted back into gestures by a 3D model displayed on the screen. The system also includes a text-to-speech conversion block that interprets the matched gestures as speech.

Project Objectives

Project Implementation Method

Methodology:
5.1 Gesture to Text Conversion:
First, a Kinect camera captures the gestures of the mute person as video. The video is then split into frames, and image processing is applied to each frame: background removal, repositioning and rescaling, and edge marking. After image processing, the gestures are recognized by a gesture recognition method that involves several algorithms.
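The per-frame steps described above (background removal, repositioning, rescaling, and edge marking) can be sketched roughly as follows. This is a minimal illustration, not the project's implementation: frames are modelled as small grayscale grids (lists of lists), and every function name, the threshold, and the target size are assumptions made for the example. A real system would more likely use an image processing library on camera frames.

```python
# Illustrative sketch only. Frames are grayscale grids (lists of lists, 0-255).
# All names, the threshold, and the target size are assumptions for this example.

def remove_background(frame, background, threshold=30):
    """Set pixels to 0 when they differ from the background by less than threshold."""
    return [
        [p if abs(p - b) >= threshold else 0 for p, b in zip(row, bg_row)]
        for row, bg_row in zip(frame, background)
    ]

def crop_to_content(frame):
    """Reposition: crop the frame to the bounding box of its non-zero pixels."""
    rows = [r for r, row in enumerate(frame) if any(row)]
    cols = [c for c in range(len(frame[0])) if any(row[c] for row in frame)]
    if not rows:
        return frame  # frame contains only background
    return [row[cols[0]:cols[-1] + 1] for row in frame[rows[0]:rows[-1] + 1]]

def rescale(frame, size):
    """Rescale to size x size using nearest-neighbour sampling."""
    h, w = len(frame), len(frame[0])
    return [
        [frame[r * h // size][c * w // size] for c in range(size)]
        for r in range(size)
    ]

def mark_edges(frame):
    """Mark a foreground pixel as an edge (1) if it touches background or the border."""
    h, w = len(frame), len(frame[0])

    def is_background(r, c):
        return r < 0 or r >= h or c < 0 or c >= w or frame[r][c] == 0

    neighbours = ((-1, 0), (1, 0), (0, -1), (0, 1))
    return [
        [
            1 if frame[r][c] and any(is_background(r + dr, c + dc) for dr, dc in neighbours) else 0
            for c in range(w)
        ]
        for r in range(h)
    ]

def preprocess(frame, background, size=4):
    """Run the four steps in order on a single frame."""
    foreground = remove_background(frame, background)
    return mark_edges(rescale(crop_to_content(foreground), size))
```

Each function covers one step, so a step can be swapped for a stronger technique (for example, depth-based background removal from the camera) without changing the rest of the pipeline.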

[Figure: DD SPEAK-TO (DUMB & DEAF SPEAKING SYSTEM)_1582918092.png]

 Gesture Recognition Algorithms: 
a) 3D Model & Skeletal Model: 

This algorithm builds a three-dimensional (3D) model that detects an object and its position within a certain range of the camera in order to identify the gesture being made. Note that a gesture can be translated into text only if it is found in the database.
The skeletal model is a spatial model that uses a simpler set of parameters, such as joint angles and segment lengths. A gesture is analyzed through the position and orientation of each joint and segment and their relationships to one another.
The system then searches the database for a message matching this set of values. Once a match is found, the message is retrieved and converted into phrases. The gesture is thus converted into a text message, which is also played as sound through a speaker. Scikit-learn will be used to classify the gesture and output its meaning as text.
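To make the skeletal-model idea concrete, the sketch below computes a joint angle and two segment lengths from three 3D joint positions and looks for the closest stored gesture. It is only an illustration under stated assumptions: the joint names, the feature layout, the sample database entries, and the distance cutoff are all hypothetical, and a plain nearest-match search stands in for the scikit-learn classifier the project plans to use.

```python
import math

def segment_length(a, b):
    """Euclidean length of the segment between two 3D joint positions."""
    return math.dist(a, b)

def joint_angle(a, joint, b):
    """Angle in degrees at `joint` between the segments joint->a and joint->b."""
    v1 = [p - q for p, q in zip(a, joint)]
    v2 = [p - q for p, q in zip(b, joint)]
    dot = sum(x * y for x, y in zip(v1, v2))
    norm = math.hypot(*v1) * math.hypot(*v2)
    cosine = max(-1.0, min(1.0, dot / norm))  # clamp against rounding error
    return math.degrees(math.acos(cosine))

def arm_features(shoulder, elbow, wrist):
    """Hypothetical feature vector: elbow angle plus upper-arm and forearm lengths."""
    return (
        joint_angle(shoulder, elbow, wrist),
        segment_length(shoulder, elbow),
        segment_length(elbow, wrist),
    )

def match_gesture(features, database, max_distance=15.0):
    """Return the label of the closest stored gesture, or None if nothing is close.

    Angles (degrees) and lengths (metres) are mixed in one distance here,
    which is crude; a real system would scale the features first.
    """
    best_label, best_distance = None, float("inf")
    for label, stored in database.items():
        distance = math.dist(features, stored)
        if distance < best_distance:
            best_label, best_distance = label, distance
    return best_label if best_distance <= max_distance else None

# Hypothetical database entries: (elbow angle, upper-arm length, forearm length).
GESTURE_DATABASE = {
    "hello": (90.0, 0.30, 0.25),
    "thank you": (170.0, 0.30, 0.25),
}
```

Returning None when no stored gesture is close enough mirrors the constraint above that a gesture can be translated only if it exists in the database.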
b) Text to Speech Conversion: 
Text-to-speech conversion is often abbreviated TTS. Festival (the Festival Speech Synthesis System) is a general framework for building speech synthesis systems; it includes example modules and offers a full text-to-speech system. We will use it through the Python wrapper Pyfestival. Using this existing library keeps our focus on gesture recognition and reduces development time. The text input for Pyfestival will come from the gesture interpretation stage.

5.2  Speech or Text into Gestures:

The spoken or typed reply is converted back into gestures, which a 3D model performs on the screen for the mute person.

a) Speech to Text Conversion (STT): 

This stage focuses on real-time recognition of spoken language. SpeechRecognition, a Python library distributed on PyPI, supports several speech recognition engines and APIs, including CMU Sphinx.

b) Text to Pakistan Sign Language (PSL): 

The text obtained from spoken English will be matched to the corresponding PSL model. The PSL posture or gesture will then be passed to the next stage to be simulated on the computer screen.
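The matching step can be pictured as a dictionary lookup from recognized words or phrases to PSL gesture models. The sketch below is a hypothetical illustration: the gesture identifiers and the vocabulary are invented for the example, and the fingerspelling fallback for unknown words is an assumption of this sketch rather than something the proposal specifies.

```python
# Hypothetical gesture identifiers; a real system would reference PSL 3D animation assets.
PSL_GESTURES = {
    "hello": "psl_hello",
    "thank you": "psl_thank_you",
    "water": "psl_water",
}

def text_to_psl(text, gestures=PSL_GESTURES):
    """Map recognized text to an ordered list of PSL gesture identifiers."""
    words = [w.strip(".,!?") for w in text.lower().split()]
    words = [w for w in words if w]
    sequence = []
    i = 0
    while i < len(words):
        # Prefer the longest known phrase that starts at position i.
        for n in range(len(words) - i, 0, -1):
            phrase = " ".join(words[i:i + n])
            if phrase in gestures:
                sequence.append(gestures[phrase])
                i += n
                break
        else:
            # Assumption: unknown words are fingerspelled letter by letter.
            sequence.extend(f"psl_letter_{ch}" for ch in words[i] if ch.isalpha())
            i += 1
    return sequence
```

For instance, `text_to_psl("Hello, thank you")` yields the two gesture identifiers for "hello" and "thank you", which the next stage would then animate on screen.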

[Figure: DD SPEAK-TO (DUMB & DEAF SPEAKING SYSTEM)_1582918093.png]

We now have a fully functional smart speaking system that helps mute people communicate with hearing people. Each conversation is recorded as video using an Android app and can later be broadcast to convey the message to other deaf and mute people around the world.
 

Benefits of the Project

 Outcomes/ anticipated results  

Technical Details of Final Deliverable

Tools And Techniques:

Final Deliverable of the Project

HW/SW integrated system

Type of Industry

Education, Medical, Media, Telecommunication

Technologies

Internet of Things (IoT), Blockchain, Others

Sustainable Development Goals

Good Health and Well-Being for People; Industry, Innovation and Infrastructure; Reduced Inequality

Required Resources
Item Name                                    Type           No. of Units  Per Unit Cost (in Rs)  Total (in Rs)
Kinect Camera                                Equipment      1             25000                  25000
Book For Dumb And Deaf                       Miscellaneous  1             1000                   1000
15.6                                         Equipment      1             2000                   2000
Printing Cost                                Miscellaneous  1             1000                   1000
XBOX 360 LT3 with Wired Controller and GTA V Equipment      1             6800                   6800
Total (in Rs)                                                                                    35800
