Adil Khan 9 months ago

AdiKhanOfficial #FYP Ideas

Urdu sentiment Analysis

Project Title

Project Area of Specialization

Artificial Intelligence

Project Summary

Sentiment analysis (aspect-based opinion mining) is the task of automatically generating a polarity of an opinionated sentence or document from a set of opinions. Commercial and non-commercial organization’s use online feedback systems to receive reviews from customers about their product(s), which causes availability of large amount of opinionated text on the web. As web has become a multi lingual resource of information where same text is available in more than one languages. User’s preferred to give feedback in their native language, due to which same product reviews are available in various languages of the world. Manual inspection of this rich text is quite difficult, requires huge work force along with handsome amount of time. There are many software applications available in the market to automatically review opinions and generate useful statistical results for users. One of the major limitations of such applications is that they are developed for rich languages like English, German, Spanish, Arbic etc. Lack of availability of resources is major barrier in the development and evaluation of such type of software systems for less popular lanauges of the world. This research study focuses on the development of comparative product reviews corpus for Urdu and Roman Urdu languages. Furthermore, corpus will be studied as supervised classification problem to explore how much the corpus will be useful for the development and evaluation of automatic product reviews systems for Urdu and Roman Urdu languages

Project Objectives

250 million people of the world population speaks Urdu language and many of the online application like social web sites like Twitter and Facebook support Urdu or people use Roman Urdu to express their views. So our project is to build an application sentimental analysis of the point of view of people that is written in Urdu or Roman Urdu about different products. The motivation is not only to make a corpus of comparison but also to promote Urdu language internationally.

Goals and Objectives :

• Development of comparative review based corpus for Urdu and Roman Urdu language.

• Explore different available sentimental analysis techniques for developed corpus.

• Corpus will also be studied as supervised classification problem

Project Implementation Method

There are three approaches we can use it according our choice:

1.Machine learning:

Bayesian Networks

Naive Bayes Classification

Maximum Entropy

Neural Networks Support Vector Machine

1.1 Features

Term presence and frequency

Part of speech

information Negations Opinion words and phrases

2.Lexicon based

Dictionary based approach

Novel Machine Learning Approach

Corpus based approach

Ensemble Approaches

2.1 Features

Manual construction,

Corpus-based

Dictionary- based

3.Hybrid

Machine learning Lexicon based

3.1 Features

Sentiment lexicon constructed using public resources for initial sentiment detection Sentiment words as features in machine learning method.

Benefits of the Project

There are multilingual resources available online for sentiment analysis that has bulky volume of data in different language. Most of the resources are developed for the rich language like English, German, French and Italian. People like to express their feelings in their native language, that also furnishes the need of corpora to be develop in Urdu Language.There are almost 7105 languages spoken in the world and Urdu ranks 19th among them. Urdu is the most spoken language in the South Asia. Due to large Diasporaof IndoPak Subcontinent citizens it is spreading into the West. Pakistan has Urdu as national language.In teaching from public schools for junior to mid-level administration and in the print and electronic media also Urdu is used as a way of communication. Urdu doesnt have its roots only on Pakistan but it is also spoken in the Afghanistan, Bangladesh, Indiaand Nepal. Also it has become the culturallanguage among the South Asian Muslims and spoken outside the Indo-Pak subcontinent, mainly in the Middle East, Europe, Canada and the United States.

Technical Details of Final Deliverable

Example for test

Steps for training a classi?er for sentiment analysis.

1.Firstly, data have to be prepared in order to obtain a data set namely

2.the training set – by means of preprocessing and feature selection methods.

3.Then, such a data set is involved in the learning

step, which uses ML algorithms and yields a trained classi?er. Finally, the classi?er has to be tested on a di?erent data set – namely,

4.the test set

Final Deliverable of the Project

Software System

Type of Industry

IT , Manufacturing , Others , Telecommunication

Technologies

Artificial Intelligence(AI)

Sustainable Development Goals

Decent Work and Economic Growth, Industry, Innovation and Infrastructure, Sustainable Cities and Communities, Responsible Consumption and Production, Partnerships to achieve the Goal

Required Resources

Item Name	Type	No. of Units	Per Unit Cost (in Rs)	Total (in Rs)
Overall hardware & Software	Equipment	8	5000	40000
stationary & etc	Miscellaneous	10	500	5000
			Total in (Rs)	45000

If you need this project, please contact me on contact@adikhanofficial.com

Comments 0

Prediction Based Electricity Consumption Alerts

Prediction based electricity consumption alerts is based on ATmega family controllers, Nod...

Adil Khan

9 months ago

DIALYZER REPROCESSING UNIT FOR PATIENT WITH ESRD

Hemodialysis is a treatment to filter wastes and water from the blood, as your kidneys did...

Adil Khan

9 months ago

Job Intel

This project focuses on the issues that are faced by many organizations in their job selec...

Adil Khan

9 months ago

Multispectral Monitoring of Crops using GIS

The project's idea is about providing farmers in agriculture with simple and real-time ass...

Adil Khan

9 months ago

Physical Activity Recognition and Tracking of Diabetic Patients using...

We propose to recognize and track the physical activities performed by a diabetic patient....

Adil Khan

9 months ago