With the advent of technology and its continuous growth, social media platforms such as Twitter, Facebook and Instagram can be utilised by extremist outfits to propagate messages and ideologies far and wide in a rapid and efficient manner. The most preferred medium of communication for exchanging id
Detection of Radicalization through tweets using Machine Learning
With the advent of technology and its continuous growth, social media platforms such as Twitter, Facebook and Instagram can be utilised by extremist outfits to propagate messages and ideologies far and wide in a rapid and efficient manner. The most preferred medium of communication for exchanging ideas and sharing opinions is Twitter - thereby enabling vulnerable individuals to fall prey to the radicalization process - by making the process of reaching out to extremists relatively easier. Users expressing positive ideas about extremism, propagating hate-speech towards entities or expressing allegiance to an extremist school of thought are identified to be at risk. In this project, we will address the need to detect and classify radicalisation risks by employing machine learning models and performing analysis on written text in tweets using traditional Natural Language Processing based approaches.Social Media serves as a powerful tool for extremists to disseminate their propaganda, recruit assets and conduct psychological warfare. Social Media enables religious extremists to exploit the vulnerability factors linked to socio-economic and demographic conditions of individuals thereby making them suitable targets for their radicalization. To curb the rise of radicalism-led militancy using online tools, we use sentiment analysis techniques for the detection and classification of tweets exhibiting extremist tendencies
1. Identification of tweets embracing Islamic Fundamentalism.
2. Measuring the performance of Machine Learning models such as Support Vector Machine, Random Forest, and Deep Neural Networks in identifying radicalized tweets.
3. Implementing State of of the art Models such as T5 Model, BERT and GPT3 for identifying radicalized tweets.
4. Detecting the sentiments of users through tweets using Python Libraries such as NLTK/CoreNLP.
Our aim is to propose and develop an effective radicalism detection solution. After acquiring labelled dataset from Kaggle , the data has been cleaned through different pre-processing techniques: removing hyperlink, tags and RTs, null and short tweets removal, removing punctuation, stop-words removal, word lemmatization and Feature Extraction. To classify tweets displaying extremist tendencies, different Machine Learning Algorithms such as Support Vector Machine, Deep NN, Recurrent NN ,BERT T5 and Random Forest will be used. The algorithm with the best accuracy is implemented. For the identification of emotions, opinions, sentiments showing allegiance to extremist causes in tweets, Python Libraries based on natural language processing such as NLTK/CoreNLP have been used for sentiment analysis to be categorized into positive, negative or neutral sentiments
• Identification of tweets containing radical content related to religious cyber-extremism.
• To measure the performance of Machine Learning models such as Naïve Bayes, Support Vector Machine, Random Forest, and Deep Neural Networks in identifying radicalized tweets.
• Performing sentiment analysis on tweets; identifying users with radicalized mindset and mining their emotions through their tweets.
• To mine emotions through tweets using Python’s Libraries for natural language processing
Our product will be a piece of software that can be implemented in other social platforms. The Machine Learning algorithms such XGBoost Random Forest and Support Vector Machine can be trained using CPU but Deep Learning Models such as Vanilla RNN ,Transformers and Transfer learning of BERT and T5 may take months to get trained on CPU , so for that purpose RTX Graphical Processing Unit is required for the fast training of models as RTX GPU contains Tensor cores which aligns with Tensor Flow Deep Learning Library of Google and make training fast on GPUs.
| Item Name | Type | No. of Units | Per Unit Cost (in Rs) | Total (in Rs) |
|---|---|---|---|---|
| RTX 2060 | Equipment | 1 | 66000 | 66000 |
| Documentations and printing | Miscellaneous | 1 | 10000 | 10000 |
| Total in (Rs) | 76000 |
This project aims to recognize human activities from series of images considering differen...
Many doctors face difficulties when they need to consult or discuss the case of their pati...
The purpose of this project is to understand mechanical behavior of composite solid propel...
In this project we propose idea to resolve the security and safety issues of markets on re...
Deforestation is one of the major issue behind all the global warming and increasing heat...