Urdu DBPedia

DBpedia is a community effort to  extracts structured information from Wikipedia, interlinks it with other databases,maps it with DBpedia Ontology  and makes the data available on web. Lately DBpedia has started mapping various cross-language editions of Wikipedia.This project aims to crea

2025-06-28 16:36:31 - Adil Khan

Project Title

Urdu DBPedia

Project Area of Specialization Artificial IntelligenceProject Summary

DBpedia is a community effort to  extracts structured information from Wikipedia, interlinks it with other databases,maps it with DBpedia Ontology  and makes the data available on web. Lately DBpedia has started mapping various cross-language editions of Wikipedia.This project aims to create mappings of Urdu wikipedia with DBpedia Ontology as there exist neither any Urdu DPedia Ontology created nor mapped with English DBpedia properties and classes. Thus collaboration between DBpedia and Urdu contributors is required and we aspire to provide recipies for Urdu DBpedia creation.

Project Objectives

Creation of Urdu DBPedia  Ontology and mappings with English DBpedia properties and classes.

Project Implementation Method

This project requires infobox extraction of Urdu Wikipedia. After Crwaling infobox data, the attributes and values will be mapped to classes in English DBpedia with the help of extractors and  link translation functions. Infobox crawling will be performed by  Data Bricks Amazon Cloud servers for better and relaible performance, data extraction and data storage.

Benefits of the Project

Urdu chapter of DBpedia is still not created and it will be a contribution in DBPedia international community.

Technical Details of Final Deliverable

It require acccess of DBpedia community for mapping urdu wikipedia attriutes to DBpedia properties and for creating new mappings for Urdu Language and Web Crwaling using Cloud server to extract Urdu wikipedia infoboxes.

Final Deliverable of the Project Software SystemType of Industry IT Technologies Artificial Intelligence(AI), Cloud Infrastructure, OthersSustainable Development Goals Quality Education, Industry, Innovation and InfrastructureRequired Resources
Item Name Type No. of Units Per Unit Cost (in Rs) Total (in Rs)
Total in (Rs) 51400
Coursera Java Training Miscellaneous 150005000
Coursera Web Crawling training: Using Python to access Web Data Miscellaneous 150005000
Printing Equipment72001400
Data Bricks Amazon Cloud Server Equipment14000040000

More Posts