Language Learning Application for Children with Special Needs

Our project is a Language Learning Application catered towards children suffering with Autism Spectrum Disorder (ASD). ASD renders these children semi-verbal to non-verbal. The focus of our application is to help these children to communicate effectively in everyday situations, and also help them le

2025-06-28 16:33:57 - Adil Khan

Project Title

Language Learning Application for Children with Special Needs

Project Area of Specialization Artificial IntelligenceProject Summary

Our project is a Language Learning Application catered towards children suffering with Autism Spectrum Disorder (ASD). ASD renders these children semi-verbal to non-verbal. The focus of our application is to help these children to communicate effectively in everyday situations, and also help them learn basic to complex language skills. In developed countries, Augmentative and Alternative Communication (AAC) devices are used to cater to semi to non-verbal individuals. Unfortunately in developing countries like Pakistan, such softwares are rarely used due to high cost and language barriers.

Our application will be the first of a kind low cost AAC device which will incorporate the use of our in-house built Artificial Intelligence Language Models to cater to the child's needs. Furthermore, our monthly reports will provide value for the therapists, who can use of reports to monitor whether the child is progressing in his/her language skills or not.

We plan to distribute our application to the public, so that anyone whose child is suffering from such intellectual disability can use it. We also aim to incorporate our application into schools which cater to these special children, as this will allow for early adoption of language skills and hence enable effective learning from an early age.

Project Objectives

Our application aims to create a social impact on the lives of children suffering from Autism Spectrum Disorder (ASD), dyslexia, cerebral palsy, and other intellectual disabilities, which weakens their spoken comprehension skills. The solution to dealing with these children is through Augmentative and Alternative Communication (AAC) devices. Our objectives are:

Project Implementation Method

Our project consists of a number of different modules, and we’ll describe the implementation method for each below:

Benefits of the Project

Currently in Pakistan, there are roughly around 1.4 million children who might be suffering from some intellectual or neural disorder, resulting into potential disability. According to conservative estimates, more than 500,000 children are non-verbal in Pakistan. Most of these are lifelong disabilities without a cure. But with timely help, these children can adjust well into our functioning society.

Our Language Learning Application will help these semi-verbal to non-verbal children better communicate in everyday situations. They will be able to express their thoughts and ideas effectively. Not being able to say what you want leads to anxiety and frustration in the child. The child will also not be socially cast out, and will be able to blend in with other people. Our application gives the children an opportunity for empowerment and social inclusion through formal education.

Better communication leads to lower hit-and-miss scenarios, where the parent or caretaker has to guess as to what the child is trying to say.

Furthermore not being able to communicate with your teacher means you can’t ask questions and clear your doubts. This leads to ineffective learning in early school years, which are essential foundations of further education. Our application can integrate easy within schools and allow the teacher and child to communicate with ease and lead to better learning.

The reports that we generate contain important information about the child’s interaction with our application, and the reports and quiz that we generate provide great value to the child’s therapist, who can properly monitor the progress and growth of the child’s language skills.

Technical Details of Final Deliverable

Our Machine Learning models implied a one-to-many architecture in text sequences. Before our work, such architecture was mostly prominent in images, more specifically in “caption generation” tasks. Textual data incorporated a seq2seq, in which the input given is a sequence, and the output is also a sequence. We wanted an architecture which could give us as output a sequence, but as input it would only take in one word, and not a whole sequence.

So our first approach was using N gram models to cater to this issue. And then we tried Hidden Markov Models (HMM’s). But we soon realized that both of these methods suffered predominantly from the same issue, which was “variable context capture” issue.

We decided finally to use Recurrent Neural Networks (RNN’s) to cater to our problem, and use a variant of RNN’s called Long Short-Term Memory (LSTM) networks, which are the state-of-the-art when it comes to generating sequences with variable context.

Our novelty came in how we decided to basically tune our dataset. Since we wanted to generate sequences from one word, we had to make sure that our model was trained on that specific word. But at the same time, we also had to make sure that our model generated semantically and syntactically correct sequences, and in order to do this we also had to capture the positional information of the word. To cater to the position information, we decided to split our dataset and train it on verbs. Verbs are actions and these are mostly used in everyday scenarios to describe what you want to do, what you need, your emotions, etc. So we decided that verbs would be the perfect place to start. So we divided each sequence in the training dataset into two parts; 1) first part containing the sequence up till the verb, 2) second part containing the remainder of the sequence. Then we inverted the first half of the sequence to train our Backward Language Model. The second part of the sequence was sent to be trained on our Forward Language Model.

Then after sentence generation, if we had another constraint to incorporate apart from verb, we would use cosine similarity to between word embeddings to calculate the closest word in the generated sequence to replace the other constraint with.

Final Deliverable of the Project Software SystemType of Industry Education , IT , Health Technologies Artificial Intelligence(AI), Cloud InfrastructureSustainable Development Goals Good Health and Well-Being for People, Quality EducationRequired Resources
Item Name Type No. of Units Per Unit Cost (in Rs) Total (in Rs)
Total in (Rs) 70375
HP 23.8 Equipment12100021000
ADATA DDR4 RAM Equipment3650019500
WACOM Intuos Art Tablet Equipment12210022100
Visit To Psychologists, Therapists and Autistic Children. Miscellaneous 55002500
Design Expo Photo-paper Printouts Miscellaneous 20601200
Design Expo Flow Diagram Prints Miscellaneous 2015300
Final Defense Binding Charges Miscellaneous 25001000
Final Defense CD-R Charges Miscellaneous 3100300
Final Defense Colour Printing Miscellaneous 2325575
Final Defense Normal Printing Miscellaneous 805400
Project Poster Print in 2x3 Feet Miscellaneous 115001500

More Posts