Extended Urdu Natural Language Toolkit (Ex unlt)
In this research, we have developed three novel computational linguistics analysis tools for Urdu language spell checker, hybrid POS tagger and text classifier. We have designed and developed these three tools completely. We have shown each and every phase of the development process. These three too
2025-06-28 16:32:29 - Adil Khan
Extended Urdu Natural Language Toolkit (Ex unlt)
Project Area of Specialization Artificial IntelligenceProject SummaryIn this research, we have developed three novel computational linguistics analysis tools for Urdu language spell checker, hybrid POS tagger and text classifier. We have designed and developed these three tools completely. We have shown each and every phase of the development process. These three tools were designed using rule-based, dictionary lookup, stochastic methods and hybrid methods. We have also developed benchmark corpus for these tools and described the corpus generation process. As corpus generation was one of the main phase. We have also explained the methodology of these tools individually. The methodology include system diagram, user-interaction diagram, flowchart and the interface for our application. Then we have explained the techniques, approaches or models we are using. Finally, we have concluded our results. For improving our results, we have defined some rules. Our finalized results were very convincing. We have also concluded evaluation measures for our three text processing tools.
Project Objectives- Explore the problem of identifying spell checking, text classification, and POS tagging.
- Explore state-of-the-art techniques for spell checking, text classification, and POS tagging.
- Develop supporting resources required for these resources.
- Design and develop these tools for Urdu language.
- Evaluate these basis preprocessing tools.
Text processing tools formulated around rule-based, dictionary lookup, and stochastic methods.
Benefits of the ProjectThese tools provides the facility to Urdu Scholars for Urdu language analysis.
Technical Details of Final DeliverableWe delivers 2 FYP reports and 1 final document.
Final Deliverable of the Project Software SystemType of Industry IT Technologies Artificial Intelligence(AI)Sustainable Development Goals Quality EducationRequired Resources| Item Name | Type | No. of Units | Per Unit Cost (in Rs) | Total (in Rs) |
|---|---|---|---|---|
| Total in (Rs) | 0 | |||
| n/a | Equipment | 0 | 0 | 0 |