Text Mining Approach Applied to Reveal piRNA and Disease Associations


2025-06-28 16:36:18 - Adil Khan

Project Title

Text Mining Approach Applied to Reveal piRNA and Disease Associations

Project Area of Specialization

Biomedical Engineering

Project Summary

Many studies published in recent years have examined the role of piRNAs in disease, reporting their potential as biomarkers in various diseases and their functional role in regulating key genes; disruption of this regulation can lead to disease. Several databases of piRNA clusters are available, such as piRNABank, piRNAQuest, and piRBase, which provide sequence and location information for various organisms, but none of them covers piRNA-disease associations. We will therefore apply a text-based data mining approach to extract the published data on PIWI-interacting RNAs (piRNAs) associated with diseases. We will then validate the extracted data using a machine learning approach, followed by a final manual validation. We have already published PiRDisease v1.0, a manually curated database of piRNA-disease relationships. We will now update it to PiRDisease v2.0, which uses a text-based data mining approach to reveal piRNA-disease associations.

Project Objectives

We want to apply text-based data mining approaches to extract piRNA-disease data from the literature, identify associations, and update PiRDisease v1.0 to PiRDisease v2.0.

Project Implementation Method

We will use Python scripts to mine piRNA-disease data and develop an algorithm to identify associations. Finally, we will build PiRDisease v2.0 using MySQL and connect it to a web interface.
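As a rough illustration of what such a mining script could look like, the sketch below pairs piRNA identifiers with disease terms that appear in the same sentence of an abstract. It is not the project's actual algorithm: the identifier pattern, the small disease list, and the function names (`split_sentences`, `extract_pairs`, `count_associations`) are all assumptions made for this example, and a real pipeline would likely use a curated disease vocabulary and a proper sentence tokenizer.

```python
import re
from collections import Counter

# Assumed identifier pattern (e.g. "piR-123" or "piR-hsa-456"); real piRNA
# naming schemes vary, so this regex is only illustrative.
PIRNA_PATTERN = re.compile(r"\bpiR-(?:[a-z]{3}-)?\d+\b")

# Tiny hypothetical disease vocabulary; a real run would load a curated list.
DISEASE_TERMS = ["gastric cancer", "breast cancer"]


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def extract_pairs(text):
    """Return (piRNA, disease) pairs that co-occur in the same sentence."""
    pairs = []
    for sentence in split_sentences(text):
        lowered = sentence.lower()
        pirnas = sorted(set(PIRNA_PATTERN.findall(sentence)))
        diseases = [d for d in DISEASE_TERMS if d in lowered]
        for pirna in pirnas:
            for disease in diseases:
                pairs.append((pirna, disease))
    return pairs


def count_associations(abstracts):
    """Count, for each pair, the number of abstracts that mention it."""
    counts = Counter()
    for abstract in abstracts:
        counts.update(set(extract_pairs(abstract)))
    return counts


if __name__ == "__main__":
    # Invented sentences for demonstration only, not real findings.
    demo = [
        "piR-123 is altered in gastric cancer. It was also studied in breast cancer.",
        "Expression of piR-123 and piR-hsa-456 changed in gastric cancer.",
    ]
    for pair, n in sorted(count_associations(demo).items()):
        print(pair, n)
```

Sentence-level co-occurrence is deliberately simple: it produces candidate pairs cheaply, which fits a workflow where candidates are later checked by a classifier and by manual curation.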

Benefits of the Project

The database will help the scientific community, including new researchers and students, search for established piRNA-disease associations so that they can design their studies accordingly.

Technical Details of Final Deliverable

DATA MINING AND DATABASE DEVELOPMENT 
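
To make the database side of the deliverable more concrete, here is a minimal sketch of how mined associations might be stored and queried. The proposal names MySQL; this example uses Python's built-in `sqlite3` module purely as a stand-in so that it runs without a server. The table name `association`, its columns, and the helper functions are hypothetical and do not describe the actual PiRDisease schema.

```python
import sqlite3


def build_database(rows, path=":memory:"):
    """Create a hypothetical association table and load (pirna, disease, pmid) rows."""
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS association (
            id      INTEGER PRIMARY KEY,
            pirna   TEXT NOT NULL,
            disease TEXT NOT NULL,
            pmid    TEXT NOT NULL,
            curated INTEGER NOT NULL DEFAULT 0,  -- 1 once manually validated
            UNIQUE (pirna, disease, pmid)
        )
        """
    )
    # INSERT OR IGNORE skips rows that violate the UNIQUE constraint,
    # so re-running the miner does not create duplicates.
    conn.executemany(
        "INSERT OR IGNORE INTO association (pirna, disease, pmid) VALUES (?, ?, ?)",
        rows,
    )
    conn.commit()
    return conn


def search_by_disease(conn, disease):
    """Return (pirna, pmid) tuples for one disease, in a stable order."""
    cur = conn.execute(
        "SELECT pirna, pmid FROM association WHERE disease = ? ORDER BY pirna, pmid",
        (disease,),
    )
    return cur.fetchall()


if __name__ == "__main__":
    # Placeholder rows for demonstration; the identifiers are not real records.
    rows = [
        ("piR-123", "gastric cancer", "PMID-PLACEHOLDER-1"),
        ("piR-123", "gastric cancer", "PMID-PLACEHOLDER-1"),  # duplicate, ignored
        ("piR-456", "gastric cancer", "PMID-PLACEHOLDER-2"),
    ]
    conn = build_database(rows)
    print(search_by_disease(conn, "gastric cancer"))
```

A web interface would call a query like `search_by_disease` behind its search form; the `curated` flag is one possible way to separate automatically mined entries from manually validated ones.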

Final Deliverable of the Project

HW/SW integrated system

Core Industry

Education

Other Industries

Core Technology

Big Data

Other Technologies

Sustainable Development Goals

Quality Education

Required Resources

Item Name | Type | No. of Units | Per Unit Cost (in Rs) | Total (in Rs)
Web Hosting and Domain | Equipment | 2 | 35000 | 70000
Total (in Rs) | | | | 70000
