Text mining approach applied to reveal PiRNA and disease Associations
Topical investigation of piRNAs role in disease association has been elucidated in many studies published in recent years revealing piRNAs exclusive role as biomarker in various diseases, and functional implications regulating key genes, disruption of regulation of these genes can lead to disease pr
2025-06-28 16:36:18 - Adil Khan
Text mining approach applied to reveal PiRNA and disease Associations
Project Area of Specialization Biomedical EngineeringProject SummaryTopical investigation of piRNAs role in disease association has been elucidated in many studies published in recent years revealing piRNAs exclusive role as biomarker in various diseases, and functional implications regulating key genes, disruption of regulation of these genes can lead to disease prognosis. There are several databases are available for piRNAs cluster, such as pirnabank, piRNAQuest, piRbase, provides sequence and location information in various organism, but there is no database related piRNA disease association. So we will apply text-based data mining approach to dig out all the published data related to PIWI-interacting RNAs (piRNAs) associated with diseases. Further, we will validate the data using machine learning approach, final validation will be done manually. we have already published PiRDisease v1.0: A manually curated database for piRNA disease relationship. Now, We will update it to PiRDisease v2.0: Text-based data mining approach to reveal piRNA disease association.
Project Objectiveswe want to apply text-based data mining approaches to filter out piRNA disease-associated data in the literature, find association and build updates version of PiRDisease v1.0 to PiRDisease v2.0.
Project Implementation Methodwe will use Python scripts to mine the piRNA disease-related data, develop an algorithm to find out associations. Finally, we will construct PiRDiease v2.0 using my SQL and connect it to the web interface.
Benefits of the ProjectIt will be useful for scientific community and novel researchers and students to search already established piRNA associations in disease. So they can design their studies accordingly.
Technical Details of Final DeliverableDATA MINING AND DATABASE DEVELOPMENT
Final Deliverable of the Project HW/SW integrated systemCore Industry EducationOther IndustriesCore Technology Big DataOther TechnologiesSustainable Development Goals Quality EducationRequired Resources| Item Name | Type | No. of Units | Per Unit Cost (in Rs) | Total (in Rs) |
|---|---|---|---|---|
| Total in (Rs) | 70000 | |||
| WebHosting and Domain | Equipment | 2 | 35000 | 70000 |