Crime detection in Pakistan starts with the local police when they are contacted by people in need. The major drawback is that local law enforcement is not equipped with the latest equipment to be able to handle crimes even on a small scale. Problems are being dealt with using primitive methods that
Crime Detection in Local Languages
Crime detection in Pakistan starts with the local police when they are contacted by people in need. The major drawback is that local law enforcement is not equipped with the latest equipment to be able to handle crimes even on a small scale. Problems are being dealt with using primitive methods that have been used for decades.
Crime detection using audio data in local languages could help drastically reduce the crime rate at a local level. Not only will it enable the police to be able to track down criminals, but it can also aid in preventing crime. Predicting future events based on the information gathered can play a vital role in destabilizing crime rings at the neighborhood level.
To implement a system that can detect crime in audio data, we will need a speech recognition system. Speech recognition systems require the design and development of speech corpus, language models, and grammar specifications related to the language for which the system is to be developed. Corpus development includes the collection, careful annotation, cleaning, and verification of speech data. These resources are limited for the Urdu language hence speech recognition for the Urdu language is still at a very basic level. We have an unlimited supply of unfiltered data that can be used to train even the most complex systems. We aim to target this area and design a model that can benefit society.
This project offers the ability to distinguish between harmless conversations and meaningful intelligence so organizations such as the FIA and PTA can greatly benefit from it.
The front end of our project will basically be an interface that allows the user to upload audio files and will receive a rating that will determine whether the sentence spoken is in a good or bad context.
On the backend, we will be applying machine learning algorithms that will be able to perform sentiment analysis and determine the context behind the use of criminal words.
In runtime, the audio files will be converted into text using the suitable API, after which the data will be passed through the algorithm and we will determine the meaning.
| Item Name | Type | No. of Units | Per Unit Cost (in Rs) | Total (in Rs) |
|---|---|---|---|---|
| NVIDIA GeForce GTX 1060 6 GB | Equipment | 1 | 35000 | 35000 |
| Boya By-M1 Professional Collar Microphone | Equipment | 2 | 2000 | 4000 |
| Research and Implementation | Miscellaneous | 1 | 5000 | 5000 |
| Urdu and Punjabi Raw Speech Corpus | Equipment | 1 | 20000 | 20000 |
| U Shape 2 in 1 Audio Splitter Jack 3.5 mm to dual female | Equipment | 2 | 300 | 600 |
| Total in (Rs) | 64600 |
The interest to find an alternative mode of vehicles has seen a considerable growth in the...
From the past few years Semantic Analyzers are doing only semantic analysis of Engli...