
Closed
Posted
We need a developer to build a real-time voice transformation application that changes our voice during live phone calls (both inbound and outbound) using the ElevenLabs Speech-to-Speech API. What We Need: The app should capture microphone input, stream it to the ElevenLabs voice changer API, and route the transformed audio back through a virtual audio device (like VB-Cable or VoiceMeeter) so it can be used as a microphone input for any calling app — Zoom, Google Voice, Teams, RingCentral, or any softphone. Key Requirements: Real-time voice transformation with minimal latency (target sub-500ms) Works with any phone/calling application via virtual audio device Supports both inbound and outbound calls Simple desktop GUI with voice selection, on/off toggle, and audio level meters Ability to browse and switch between ElevenLabs voices Background noise removal support Hotkey to toggle voice changer on/off during a call Fallback to natural voice if API connection drops Required Skills: Real-time audio processing and streaming (PyAudio, PortAudio, or similar) API integration experience (REST/streaming) Virtual audio device setup (VB-Cable, VoiceMeeter, BlackHole) Desktop app development (Python, Electron, C#, or similar) Experience with ElevenLabs API is a strong plus
Project ID: 40207403
105 proposals
Remote project
Active 10 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
105 freelancers are bidding on average $24 USD/hour for this job

Hello, I understand you need a real-time voice changer for live calls that uses ElevenLabs Speech-to-Speech API, streams mic input, and feeds transformed audio to a virtual audio device for use in any calling app, with sub-500ms latency, inbound/outbound support, a simple GUI, voice browsing, noise removal, hotkey toggle, and a fallback to natural voice if the API drops. I will implement a PyAudio-based pipeline for low-latency streaming, robust API integration, and a cross-platform GUI (PyQt or Electron). The app will integrate VB-Cable/VoiceMeeter, provide voice selection, on/off toggle, and audio level meters, plus a hotkey for quick toggling during a call. I will structure the code for easy deployment, include clear docs, and deliver an MVP within two weeks, followed by enhancements. Next: confirm UI tech preference and target voice list. What is your target latency and which virtual audio driver (VB-Cable or VoiceMeeter) do you prefer for the output? Best regards,
$25 USD in 24 days
8.2
8.2

Hello, To hit that sub-500ms latency target while maintaining high-fidelity voice transformation, we need to focus on a buffered stream architecture using Python and ElevenLabs' WebSocket API rather than standard REST requests. I will build a lightweight desktop wrapper using PyAudio and CustomTkinter that acts as the bridge between your hardware and the cloud. Here is how I’ll handle the technical requirements: => Use a chunk-based WebSocket streaming pipeline with small audio frames for near-instant speech-to-speech response. => Route transformed audio directly to VB-Cable, so Zoom, Teams, and RingCentral detect it as a normal microphone. => Add a watchdog fallback to instantly switch to raw mic audio if API latency spikes or the connection drops. => Apply noise suppression (RNNoise / WebRTC VAD) to keep input clean and improve synthesis quality. Why This Works: Having integrated ElevenLabs for several real-time projects, I know the biggest bottleneck is the "First Byte Latency." I optimize this by using PCM 16kHz or 24kHz mono streams to reduce bandwidth overhead without sacrificing the characteristic "ElevenLabs quality" your callers expect. Question:- Would you prefer the first version as a portable Windows .exe, or should we plan for Windows + macOS from the start? Best, Niral
$15 USD in 40 days
7.9
7.9

Hello, As an experienced team who excels in developing real-time audio applications, API integrations, and desktop app development utilizing virtual audio devices, we are the perfect match for your real-time voice transformation project. Our proficiency with key skills such as Python programming, REST/streaming, and knowledge of virtual audio devices like VB-Cable and VoiceMeeter positions us well to meet your unique requirements. Additionally, our expertise in Asterisk PBX is an added advantage that can seamlessly integrate your application with various phone systems. At Live Experts®, we've successfully executed numerous projects involving audio streaming and manipulation with minimal latency. We understand that reliability is crucial in voice transformation, so we're committed to providing fail-safe mechanisms such as the automatic fallback to natural voice in case of API drops. Our workability with ElevenLabs API and ability to browse and switch between voices efficiently ensures an optimal user experience. Most importantly, we strive for client satisfaction at every stage of project development. Apart from ensuring all your specific needs are met, we also provide strong post-deployment support to ensure a smooth running of your application. Choose us today, and together let's transform your project idea into a reality! Thanks!
$50 USD in 796 days
7.3
7.3

Hi I have strong experience in real-time audio processing, low-latency streaming, and virtual audio device routing, which aligns well with building a live voice-transformation app using the ElevenLabs Speech-to-Speech API. The main technical challenge is achieving sub-500ms latency while capturing mic input, streaming it to the API, and returning processed audio through a virtual device, and I address this with optimized audio buffers, async pipelines, and stable fallback logic. I can implement a clean desktop GUI with voice selection, level meters, hotkeys, and on/off toggles, ensuring it works seamlessly with VB-Cable, VoiceMeeter, or BlackHole. Noise removal and graceful failover to natural voice will be integrated directly into the stream flow. The solution will be compatible with Zoom, Teams, Google Voice, RingCentral, and any softphone that reads from virtual microphones. My background includes API-driven audio tools, ElevenLabs integration, and cross-platform desktop apps built in Python and C#. Thanks, Hercules
$50 USD in 40 days
6.7
6.7

✅ Lovable AI Expert | AI Development | Game Development ✅ Hi, Thank you for considering this opportunity! I bring extensive experience in implementing custom solutions powered by LLMs, conversational AI, and intelligent automation. Recently I have been working on Lovable AI for developing a gaming platform using it, complete with chat-based agent logic, expressive front-ends, and backend integrations. In other project, implemented a fully automated AI agent system for intelligent meeting creation using ElevenLabs Conversational AI and Gemini (via a custom agent brain). The flow integrates voice interaction, natural language processing, location precision, and frontend. Due to NDAs, links aren’t public—but once you open the chat, I’ll share live demos and walkthroughs. Whether you're building an internal assistant, a public-facing voice agent, or an integrated AI productivity tool, I can help bring your vision to life with robust, scalable architecture and a human-like user experience. I would love to connect and explore how we can contribute to your AI initiative. (Note: Budget is flexible — we can finalize it after reviewing the complete scope.) Thanks & Regards, Kajal
$20 USD in 40 days
6.7
6.7

Dear , We carefully studied the description of your project and we can confirm that we understand your needs and are also interested in your project. Our team has the necessary resources to start your project as soon as possible and complete it in a very short time. We are 25 years in this business and our technical specialists have strong experience in Python, Audio Services, Asterisk PBX, Voice Talent, VoIP, Node.js, Twilio, Audio Engineering, SIP, ElevenLabs and other technologies relevant to your project. Please, review our profile https://www.freelancer.com/u/tangramua where you can find detailed information about our company, our portfolio, and the client's recent reviews. Please contact us via Freelancer Chat to discuss your project in details. Best regards, Sales department Tangram Canada Inc.
$25 USD in 5 days
7.6
7.6

With a robust background in audio processing and streaming, API integration, virtual audio device configuration, desktop app development, and an exceptional command over Python, I can guarantee the swift and seamless construction of your real-time voice transformation application utilising ElevenLabs Speech-to-Speech API. Not only that, but I can also proficiently handle any potential challenges that may arise during the process. Having worked on projects varying from CRM systems to mobile apps and custom websites, I'm versatile enough to develop a coherent and efficient application design for your specific needs. Moreover, my experience utilizing virtual audio devices like VB-Cable and VoiceMeeter will ensure smooth compatibility with diverse phone/calling applications - assuring you of more flexibility. My expertise in Python will be instrumental for creating the simple yet effective GUI you desire, featuring voice selection options, on/off toggles, audio level meters and more. My proven track record in delivering successful projects attests to my reliability and proficiency in translating complex requirements into elegant solutions. Given my skills in Asterisk PBX, Node.js, SIP and VoIP along with my determination to continuously improve myself as a developer, I believe I'm the perfect fit for your project! Let's not miss this opportunity to realize your vision - together! With Regards!
$15 USD in 40 days
6.7
6.7

Hi there, I am a top freelancer based in California with extensive experience in real-time audio processing and API integration, and I am excited about the opportunity to develop a voice transformation application for your live phone calls. I thoroughly understand your requirements, including the need for robust real-time processing with minimal latency, and I am confident in my ability to deliver an application that seamlessly integrates with various calling platforms through a virtual audio device. My background includes building applications that utilize audio APIs, and I have successfully implemented similar functionalities using technologies like PyAudio and VoiceMeeter. I can create a user-friendly desktop GUI that allows for easy voice selection, audio level monitoring, and hotkey support, ensuring a smooth user experience even during calls. I am eager to discuss your project further and outline a timeline for its completion. Please feel free to message me at your earliest convenience to get started! Could you provide more details on the specific use cases or scenarios you envision for this application?
$30 USD in 36 days
6.1
6.1

As an experienced and multi-talented electrical engineer, my passion for creating intelligent, connected systems perfectly aligns with the complex endeavor of your real-time voice transformation project. With profound knowledge in real-time audio streaming and processing, I can swiftly navigate Python, Electron or C# to create a simple yet powerful desktop GUI which will contain all the essential features you've mentioned - voice selection, on/off toggle, audio level meters, background noise removal support and a hotkey for toggling the voice changer during calls. My expertise does not stop there: API integration is a crucial aspect of this task you are presenting, and my extensive experience in REST and streaming APIs makes me an excellent fit for this role. If there are any drop issues stemming from the ElevenLabs API connection, I will flawlessly design it to fall back to the natural voice input impeccably.
$25 USD in 40 days
6.0
6.0

Hi there, I’m offering a 25% discount for this project. With expertise in real-time audio processing and API integration, I will develop a solution to change your voice during live phone calls using the ElevenLabs API. I specialize in creating responsive, low-latency systems that maintain voice quality while applying real-time effects, ensuring a seamless experience for the caller. The process includes reviewing your requirements, integrating ElevenLabs API for live voice modulation, and implementing a robust, reliable system for phone calls. I will handle API communication, real-time audio processing, and any necessary error handling to ensure smooth operation across devices. You’ll receive a fully functional real-time voice changer for live calls, optimized for performance and reliability, along with documentation of the setup and usage. My goal is to deliver an efficient, polished solution that works seamlessly without requiring your technical involvement. Best regards, Sohail
$15 USD in 1 day
5.9
5.9

I have extensive experience in Python, Audio Services, Asterisk PBX, Voice Talent, and VoIP, making me a perfect fit for the Real-Time Voice Changer for Live Phone Calls Using ElevenLabs API project. I am confident in my ability to meet the key requirements, including real-time voice transformation with minimal latency, API integration, virtual audio device setup, and desktop app development. The budget can be adjusted once we discuss the full project scope, and I am committed to delivering quality work within your budget. Let's discuss the job details and get started right away. Please go through my profile to see my 15 years of experience and commitment to client satisfaction. Looking forward to hearing from you.
$18 USD in 3 days
5.4
5.4

⭐Hello, I’m ready to assist you right away!⭐ I believe I’d be a great fit for your project since I have extensive experience in real-time audio processing and streaming, API integration, and virtual audio device setup. I have successfully developed applications with similar requirements, ensuring minimal latency and seamless integration with various calling apps. This project aligns perfectly with my technical skills and expertise, allowing me to deliver high-quality results efficiently. I have a proven track record in developing desktop GUI applications with innovative features, meeting client expectations and delivering exceptional user experiences. Leveraging my expertise in real-time voice transformation and API integration, I am confident in providing a robust solution that fulfills all your project requirements. If you have any questions, would like to discuss the project in more detail, or would like to know how I can help, we can schedule a meeting. Thank you. Maxim
$20 USD in 25 days
5.4
5.4

Hi there, I’m Ahmed from Eastvale, California — a Senior Full-Stack Engineer with over 15 years of experience building high-quality web and mobile applications. After reviewing your job posting, I’m confident that my background and skill set make me an excellent fit for your project — Real-Time Voice Changer for Live Phone Calls Using ElevenLabs API . I’ve successfully completed similar projects in the past, so you can expect reliable communication, clean and scalable code, and results delivered on time. I’m ready to get started right away and would love the opportunity to bring your vision to life. Looking forward to working with you. Best regards, Ahmed Hassan
$40 USD in 40 days
4.9
4.9

As an adaptable and skilled freelance developer, I possess all the necessitated proficiencies for your real-time voice changing app. My practical understanding of audio services allows me to seamlessly deal with real-time audio processing and streaming. I have a commendable knowledge of APIs and have efficiently integrated them in previous projects. As I've also worked with virtual audio devices like VB-Cable and VoiceMeeter, setting them up for your project is second nature to me. My development expertise extends into Python, which can be an effective tool in creating your desired streamlined desktop GUI and enhancing your overall app performance. Furthermore, my competence in Node.js aligns well with the need to establish a stable connection with ElevenLabs API, which is not only a skill I possess, but one that I exercise proficiently. With respect to my work style, I emphasize on maintaining open communication throughout the project. This ensures not only timely addressing of concerns but also regular updates on progress, thus ensuring your comfort at every step of the process. Let's join hands on this project, and take a step towards building something remarkable together!
$20 USD in 40 days
4.9
4.9

Hi, the real challenge here is getting latency low enough that the voice changer feels natural during live calls. I’d approach it as a real-time audio pipeline: mic → low-latency capture → streaming to ElevenLabs Speech-to-Speech → immediate playback through a virtual audio device, with smart buffering to stay under the ~500ms target. I’ve worked with real-time audio, virtual devices like VB-Cable/VoiceMeeter, and streaming APIs, so the routing works cleanly with Zoom, Teams, RingCentral, or any softphone without per-app hacks. On top of that, I’d ship a lightweight desktop UI with voice browsing, meters, hotkeys, noise suppression, and a graceful fallback to the natural mic if the API drops. If you’re curious, I can explain how I’d structure the audio thread, buffering strategy, and failover so this stays stable even in long calls. Best
$28 USD in 40 days
5.1
5.1

✋ Hi there. I can build a real-time voice changer for live calls using the ElevenLabs API, with minimal latency and support for any softphone or calling app via a virtual audio device. ✔️ I have solid experience in real-time audio processing, API streaming, and desktop app development with Python and Electron. In a previous project, I built an audio transformation tool that captured microphone input, processed it through a voice API, and routed it via virtual audio devices with sub-500ms latency, including GUI controls for voice selection and on/off toggles. ✔️ For your project, I will implement real-time inbound and outbound call processing, integrate ElevenLabs voices, add background noise removal, and include hotkeys for toggling. The app will fallback to natural voice if the API connection drops, and provide audio level meters and a simple desktop interface for managing voices. ✔️ I will deliver tested, documented code with setup instructions for virtual audio devices and ElevenLabs integration, ensuring smooth performance across all calling applications. Let’s chat to finalize your requirements and start development. Best regards, Mykhaylo
$20 USD in 40 days
5.0
5.0

Hi, I’m Karthik, a developer with 10+ years of experience in real-time applications, audio processing, and API-driven systems. I’ve worked on low-latency streaming and voice/audio pipelines, so a live voice changer using ElevenLabs is right in my wheelhouse. Why I’m a strong fit • Hands-on with real-time audio capture/streaming (PyAudio/PortAudio/WebRTC concepts) • Experience integrating AI/voice APIs and handling streaming responses • Familiar with virtual audio routing (VB-Cable/VoiceMeeter style setups) • Built desktop tools in Python and Electron with clean, simple UIs • Performance-focused to keep latency low and audio stable How I’ll approach your app – Capture mic input → stream to ElevenLabs S2S → return transformed audio in real time – Route output via virtual audio device for universal softphone support – Optimize buffering for sub-500 ms latency – Simple GUI: voice picker, toggle, meters, hotkeys – Noise reduction and connection monitoring – Auto-fallback to natural voice if API drops I prioritize stability during live calls and smooth UX for non-technical users. Happy to discuss platform (Windows/macOS), timeline, and share relevant work. Ready to start. Best regards, Karthik
$30 USD in 40 days
5.0
5.0

Hello RePrimeGroup, I am Vishal Maharaj, with 20 years of expertise in Python and Node.js. I have carefully reviewed the requirements for the real-time voice changer project using ElevenLabs API. To achieve this, I propose to develop a desktop application that captures microphone input, streams it to the ElevenLabs API for voice transformation, and routes the modified audio through a virtual audio device for use in any calling application. The solution will include real-time audio processing, API integration, virtual audio device setup, a user-friendly GUI, voice selection options, background noise removal, and hotkey functionalities. I am confident in delivering a seamless and efficient voice transformation system for both inbound and outbound calls. Please initiate a chat to discuss further details. Cheers, Vishal Maharaj
$20 USD in 40 days
5.3
5.3

Hello, I will implement a low-latency desktop voice changer that captures microphone input, streams small audio chunks to the ElevenLabs Speech-to-Speech API, and routes transformed audio to a virtual audio device (VB‑Cable / VoiceMeeter / BlackHole) so any calling app can use it. Approach: build an audio pipeline in Python (PortAudio/sounddevice) with async streaming to ElevenLabs, RNNoise/WebRTC for background-noise suppression, loopback capture for inbound calls, and a lightweight Electron GUI for voice selection, on/off toggle, meters and hotkey control. I’ll prioritize sub-500ms latency via 64–128ms frames, non-blocking buffers, and immediate fallback to natural mic if the API drops. To start I’ll deliver a working mic→ElevenLabs→VB‑Cable prototype, then add inbound routing, GUI polish and hotkeys. What I need from you to begin: ElevenLabs API key, target OS (Windows/macOS/linux), preferred virtual audio device, and a sample calling app to test. Which OS should I target first (Windows/macOS/Linux), and do you already have an ElevenLabs API key plus a preferred virtual audio device for testing? Best regards,
$30 USD in 16 days
4.3
4.3

Hello, "Secure trading begins with trust, and trust is built on transparency." I can build your real‑time voice transformation application using the ElevenLabs Speech‑to‑Speech API, ensuring sub‑500ms latency and seamless integration with virtual audio devices like VB‑Cable or VoiceMeeter. The app will capture microphone input, stream it to the API, and route the transformed audio back so it can be used in Zoom, Teams, Google Voice, RingCentral, or any softphone. It will feature a simple desktop GUI with voice selection, on/off toggle, audio level meters, hotkey support, and background noise removal. Fallback to natural voice will be included if the API connection drops, and the system will support both inbound and outbound calls. I have experience with real‑time audio processing, API integration, virtual audio device setup, and desktop app development in Python, Electron, and C#.
$20 USD in 40 days
4.2
4.2

East Hartford, United States
Member since Dec 19, 2025
$15-25 USD / hour
$100-300 USD
$15-25 USD / hour
$15-25 USD / hour
$15-25 USD / hour
$10-30 USD
₹750-1250 INR / hour
₹1250-2500 INR / hour
$750-1500 AUD
$30-250 USD
$250-750 AUD
₹750-1250 INR / hour
$30-250 USD
₹12500-37500 INR
₹600-1500 INR
€8-30 EUR
$250-750 USD
$15-25 USD / hour
$15-25 USD / hour
$2-8 USD / hour
$750-1500 AUD
$10-30 USD
$30-250 USD
$10-30 USD
$10-30 USD