Adil Khan 11 months ago
AdiKhanOfficial #FYP Ideas

Urdu DB Pedia

Following the success of the web of documents (World Wide Web), there has been a big enthusiasm in creating a Web of Data by publishing data in a manner that can be easily  understood by software programs.         In 2014, there were more than 1,000 publicly availab

Project Title

Urdu DB Pedia

Project Area of Specialization

Artificial Intelligence

Project Summary

Following the success of the web of documents (World Wide Web), there has been a big enthusiasm in creating a Web of Data by publishing data in a manner that can be easily  understood by software programs.
        In 2014, there were more than 1,000 publicly available    datasets containing more than 900,000 documents. Among these datasets, DBpedia stands out as the central hub of Linked Open Data (LOD) because it provides a vast amount of information and most other datasets in the LOD cloud link to DBpedia.
    
This is the Urdu DBpedia Project  to extract structured knowledge from WikiPedia and English DBpedia to make
it freely available on the Web using Semantic Web and Linked Data technologies. 
The project will extract knowledge from  English language edition of Wikipedia and then will compare with urdu DBpedia to find out what are the missing attributes.

The Urdu DBpedia project comprises three main areas:

1) Structured Data Extractors & Transformers:

which extract entities, entity relationship types, and entity relationships from Wikipedia documents.

2)Deployment of Linked Open Data:

that makes entity relationships available to the Web in Linked Open Data form.

Project Objectives

The main objective is to extract structure Knowledge and then find what is missing in the attributes and synchronizing of Knowledge that is extracted from English and Urdu WikiPedia.And to provide  Wikipedia-knowledge in a form compatible with tools covering business intelligence & analytics, entity extraction, natural language processing, reasoning & inference, machine learning services,and artificial intelligence in general.

Project Implementation Method

This phase involves:
Training of Web Crawling 
And then Extraction of WikiPedia Infoboxes by Crawling over
Cloud Server
Generation Of Classes 
Urdu DBPedia Ontology
Mapping with DBPedia Classes and Properties
And then experimentation and testing of the Project.

Benefits of the Project

As we know that a significant percentage of the information
stored in the English DBpedia is not available in the Urdu DBpedia. This fact places the English DBpedia as a valuable and exclusive source of  information.

 Also we have to remark that the Urdu DBpedia does not contain all the information stored in the other DBPedias like English DBpedia, but only a minimum subset.

By extracting knowledge from the English DBpedaia and WikiPedia and making it available on the web in the structured form will benefit all the Urdu DBpedia readers whether they belong to  the field of Education,
Medicine,Agriculture,Industry or Infrastructure.

Technical Details of Final Deliverable

Project  involves web crawling of English and Urdu Wikipedia, After comparing them the missing attributes will be identified. It  involves Android Java, PHP and the implementaion of apatche nutch.                                                                          The  project will  provide semantics to the data (mapping) by relating data from Wikipedia articles to elements of the Urdu DBpedia ontology by using wiki-based tools. The extraction process will read a Wikipedia page which containsan infobox and will extract its attribute-value pairs.

Final Deliverable of the Project

Software System

Type of Industry

IT

Technologies

Artificial Intelligence(AI), Cloud Infrastructure, Others

Sustainable Development Goals

Quality Education, Industry, Innovation and Infrastructure

Required Resources

Item Name Type No. of Units Per Unit Cost (in Rs) Total (in Rs)
Java Training Miscellaneous 150005000
Web Crawling Training:Using Python to access Web Data Miscellaneous 150005000
Printing Equipment72001400
Data Bricks Amazon Cloud Server Equipment14000040000
Total in (Rs) 51400
If you need this project, please contact me on contact@adikhanofficial.com
0
112
Intelligent police management system

The police management system is an application that allows all the paper work that are don...

1675638330.png
Adil Khan
11 months ago
Aquatic Trash Bin

Our world is 70-71% water. Water is very essential for life on earth, yet water pollution...

1675638330.png
Adil Khan
11 months ago
eHEALTH Medicine recommendation system

In real-world scenarios disease diagnosis and effective treatment is a real challenge. Cur...

1675638330.png
Adil Khan
11 months ago
Smart Street Light System

Project Summary: Statistics show that almost three-fourth of the demands of energy in the...

1675638330.png
Adil Khan
11 months ago
smart irrigation system using wireless sensors network

Pakistan is an agricultural country and agriculture plays major role in the gross domestic...

1675638330.png
Adil Khan
11 months ago