We are a product development company working on a voice processing application. In this application we want to perform speaker recognition from the audio files user uploads to our backend. Here user is allowed to upload voice recording from our iOS / Android mobile app to the backend application. After receiving the upload, the backend application should compare the voice against existing voice samples of the user and identify whether users voice is there in the newly uploaded audio file. Here is the flow:
1. User create profile for the first time by entering email / mobile, password etc
2. User upload his 4 sample voice files at the time of signup. Each samples files could be 10-15 secs length. These samples voice can be used for comparison when user uploads his voice recordings
3. When user finish signup, application take user to home screen
4. User record a new speech from the application and upload it to server
5. Server should receive the file and validate it
6. After validation is done, it should verify whether user's voice is there in this audio file by comparing this against sample audio files that user uploaded at the time of signup
7. If user's voice is identified in the audio file, we should update it in the database that user's voice is found in it. Then upload the audio file in AWS S3 and send response back to mobile app
This is the process of this functionality. All the registered users in our application should be able to upload their audio file and do this speaker recognition as mentioned above. When multiple users are uploading the audio file at the same time to server, this speaker recognition module should perform comparison without any hiccup. The speaker recognition feature should provide at least 80% accuracy while comparing the voice.
We also tried to use Speaker Recognition API provided by Azure cloud. But the accuracy it provided is really bad. Thats the reason we decided to build this feature by our own.
Our backend application is build with Python Flask framework. The database we used in our application is Postgres and MongoDB. If you can build this speaker recognition module in Python with support of C++ is good with us, as Python have powerful packages to build any kind of mathematical stuff. If you wish to use some other programming language feel free to do so. But we should get accuracy not less than 80%.
Please feel free to ping me for any questions. I can try to clarify your queries.
12 freelancers are bidding on average ₹60055 for this job
Hello! I want to work for your project. I'm very interested in your project. I'd like you to call me on chat. Please give me your project detail. If you give me the task, I'm very glad with you. I'll give you the best Daha Fazla
Hello. I am an expert in developing android and iPhone and voice processing. I have read your request and I have enough experience to complete your project. I have ever built some recording apps and I want to know i Daha Fazla
Hi, I am professional in speaker recognition. I can read the your proposal. I can do this project perfectly. Thank you.
I ll use libraries like friture for evaluation of sound and comparison in between. I ll try few libraries and choose the best one according to their accuracies.
BlueCoded provides smartphone & web application development services to clients anywhere in the world With over 500 apps in the App store and Google Play. Please check some of our works below: 2 Sample Web devel Daha Fazla
Hi, I am rakesh representing the company named Ranks Digital Media [url removed, login to view] are a team of 40+ creative people who cater the market of web & software design and development since past 6 years. ///////////////////// Daha Fazla
Before Awarding Project to Any Freelancer Please Discuss With Us -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- We have gone through the details that you have provided, and we are wil Daha Fazla
I have 2+ years history of hands on experience as an android developer make me an excellent match for the unique demand of this opportunity.I am competent enough to be able to handle multiple projects simultaneously ow Daha Fazla
I assume you need only the voice recognition [url removed, login to view] confirm. Also please provide what kind of interface is required to integrate this module with the system. I am 14 years experienced software engineer having h Daha Fazla
We are a team of experts with more than 8 years of rich cloud experience in AWS [url removed, login to view], we have worked extensively on Machine Learning. We have developed recommendation/statistical engines, web scrapers/crawlers Daha Fazla