Adil Khan 9 months ago
AdiKhanOfficial #FYP Ideas

Multiple Voice Separation with Speaker Diarization

Speech Separation is a special scenario of source separation problem and a challenging task. We will present a method to separate a mixed audio sequence, in which multiple speakers speak simultaneously. The classification model will estimate the number of speakers and will train different models for

Project Title

Multiple Voice Separation with Speaker Diarization

Project Area of Specialization

Artificial Intelligence

Project Summary

Speech Separation is a special scenario of source separation problem and a challenging task. We will present a method to separate a mixed audio sequence, in which multiple speakers speak simultaneously. The classification model will estimate the number of speakers and will train different models for each speaker. We will evaluate our model under both clean and noisy data. The expected results will be that our model will separate multiple voices and then additionally it will transcribe the speech into text and display the transcribed results with speaker diarization on our website.

Project Objectives

The project is aimed at developing a software that will help individuals and businesses separate and identify voices from any audio. It is also aimed at converting the audio of speakers in form of text. We are expecting our blind source separation model to predict separate multiple voices with at least 50% accuracy and we will try to increase its accuracy as well.

Project Implementation Method

Major application components include

  • Dataset Collection
  • Data Pre processing
  • Training
  • Validation
  • Prediction
  • Web Application

Benefits of the Project

In terms of business, our goals are:

  • To improve the quality of voice assistance.
  • To use the software in IoT devices.
  • To be used in live streaming devices to translate the audio to text.
  • To help identify the speaker of the audio.

Technical Details of Final Deliverable

This project aims to build a model that will separate multiple voices/signals from a mixed-signal and transcribe the text with speaker diarization.

Final Deliverable of the Project

Software System

Core Industry

IT

Other Industries

Education

Core Technology

Artificial Intelligence(AI)

Other Technologies

Sustainable Development Goals

Industry, Innovation and Infrastructure

Required Resources

Item Name Type No. of Units Per Unit Cost (in Rs) Total (in Rs)
Gtx1650 Equipment17000070000
Total in (Rs) 70000
If you need this project, please contact me on contact@adikhanofficial.com
SWAP AND SELL

SWAP AND SELL is a mobile application where we offer community a platform on which they ca...

1675638330.png
Adil Khan
9 months ago
Departmental Management System

This project Departmental Management system (online system) proposed to replace the curren...

1675638330.png
Adil Khan
9 months ago
Kinetic energy harvesting using triboelectric nanogenerator

Concern has grown in recent years about increasing energy consumption and its impact on th...

1675638330.png
Adil Khan
9 months ago
AI Based Sign Language Interpreter For Disabled Persons

Disabled persons often face communication problems while interacting with general public....

1675638330.png
Adil Khan
9 months ago
Tripals

?Tripals? is an android mobile application which is used to provide a platform to a user t...

1675638330.png
Adil Khan
9 months ago