
Tamamlandı
İlan edilme:
Teslimde ödenir
I have a Windows-based desktop application that already speaks, but I want it to sound unmistakably natural in Español Castellano and allow users to switch to this new custom voice on demand. Your task is to fine-tune an existing neural text-to-speech model so it delivers that polished Castilian tone, then package it so my app can call it locally without relying on any cloud service. You’re free to work with the toolkit you know best—Tacotron-2, FastSpeech-2, Glow-TTS, VITS, or a comparable neural approach—as long as the final runtime is lightweight, runs offline on Windows 10/11, and exposes a simple DLL or command-line interface my app can invoke. Deliverables • Fine-tuned Castilian Spanish voice model files • Windows-ready inference runtime (ONNX, Piper, or similar) • Demo CLI that converts arbitrary text to WAV • Integration guide covering API calls and voice-switch steps Acceptance criteria • Naturalness: MOS ≥ 4.0 on five sample sentences • Latency: < 300 ms for a 100-character input on mid-range CPU • Fully offline execution once installed If you have previous work that showcases custom voices on Windows, feel free to include a short demo clip or repo link along with your proposed approach.
Proje No: 40040212
25 teklifler
Uzaktan proje
Son aktiviteden bu yana geçen zaman 3 ay önce
Bütçenizi ve zaman çerçevenizi belirleyin
Çalışmanız için ödeme alın
Teklifinizin ana hatlarını belirleyin
Kaydolmak ve işlere teklif vermek ücretsizdir

Hi there, I’m Ian Brown — I can fine-tune a Castilian Spanish neural TTS and package it so your Windows app can call a local voice on demand, offline and lightweight. • Fine-tuned Castilian Spanish voice model (based on VITS / FastSpeech-2 or your preferred toolkit) • Windows-ready runtime (ONNX or similar) with a simple DLL and a demo CLI that outputs WAV from arbitrary text • Integration guide showing API calls, voice-switch steps, and how to install the runtime into your app • Validation plan to meet your acceptance criteria (MOS testing on sample sentences, and latency profiling to target <300 ms for 100 chars on mid-range CPU) • Option to share repo links or short demo clips from past work and run an initial sample fine-tune on provided voice data If that sounds good, send sample Castilian recordings (or let me fabricate a short test set) and I’ll prepare a staging build you can test locally. — Ian Brown
€150 EUR 5 gün içinde
2,9
2,9
25 freelancer bu proje için ortalama €156 EUR teklif veriyor

⚠️IF YOU NOT HAPPY YOU DONT PAY⚠️ I noticed your goal to integrate an unmistakably natural Español Castellano voice into your Windows desktop app, enabling seamless user-driven switching. This need for a lightweight, fully offline, neural TTS model with a clean API aligns perfectly with DigitaSyndicate’s expertise. At DigitaSyndicate, a UK based digital systems agency, we build precision engineered automation and AI-powered ecosystems designed for secure, intuitive, and future proof deployment. Our focus on efficient runtime design ensures minimal latency without sacrificing naturalness. Recently, we delivered a customized offline TTS solution with a bespoke Spanish voice, achieving MOS scores above 4.0 and sub-300 ms latency on mid-range hardware. Can you share your project priorities and timelines so I can map the execution exactly to your integration requirements? Casper M. Project Lead | DigitaSyndicate Precision-Built Digital Systems.
€200 EUR 14 gün içinde
4,6
4,6

Being a seasoned Software Engineer with a vast background in embedded systems and IoT development, I am confident that I can successfully fine-tune your neural text-to-speech model for a polished Castilian tone. I have a Master's degree in Embedded Systems and extensive experience in firmware development, C/C++ programming, and microcontroller-based projects. These skills allow me to understand the intricacies and optimize the runtime of software efficiently. My proficiency in Python and command-line interfaces coupled with my deep understanding of circuit design makes me the ideal candidate for this project. Utilizing my knowledge and leveraging leading-edge technologies such as Tacotron-2, FastSpeech-2, or Glow-TTS, I ensure not only a lightweight runtime for your Windows-based desktop app but also full offline execution - aligning perfectly with your project objectives. In regards to the acceptance criteria, my work exudes quality. Achieving MOS ≥ 4.0 even on five sample sentences won't be a challenge because I am accustomed to delivering exceptional outcomes. In addition to meeting the naturalness objective, I can vouch for ensuring the latency is below 300ms for a 100-character input on a mid-range CPU. Choose me and let's create an intuitive Windows TTS system that will elevate your user experience to unprecedented levels!
€325 EUR 7 gün içinde
5,6
5,6

Hi there,Good morning I am Talha. I have read you project details i saw you need help with Neural Networks, Speech Synthesis, Natural Language Generation, API Development, Audio Processing, Deep Learning, Voice Synthesis and Natural Language Processing I am pleased to present my proposal, highlighting our extensive experience and proven track record in delivering exceptional results. Our portfolio of success will showcase past projects that demonstrate our ability to meet and exceed client expectations. Glowing testimonials from satisfied clients will attest to our professionalism, dedication, and the quality of our work Please note that the initial bid is an estimate, and the final quote will be provided after a thorough discussion of the project requirements or upon reviewing any detailed documentation you can share. Could you please share any available detailed documentation? I'm also open to further discussions to explore specific aspects of the project. Thanks Regards. Talha Ramzan
€30 EUR 11 gün içinde
4,3
4,3

Hello there, I can fine-tune a neural TTS model to deliver a natural Español Castellano voice for your Windows app, ensuring users can switch seamlessly to this custom voice. Using toolkits like Tacotron-2 or VITS, I'll create a lightweight, offline solution compatible with Windows 10/11. The final output will include a Windows-ready runtime, a CLI for text-to-WAV conversion, and an integration guide for API calls and voice-switching steps. Questions: • Do you have a preferred toolkit among Tacotron-2, FastSpeech-2, or others for this task? • Are there specific sample sentences you would like used for quality testing? I'm committed to achieving a MOS ≥ 4.0 and latency < 300 ms, ensuring a smooth and natural user experience. Please let me know if there are any additional requirements or constraints. Looking forward to elevating your application's voice capabilities. Thanks and best regards, Faizan
€90 EUR 5 gün içinde
3,8
3,8

Hello Alvaro, I hope you are doing well. I’ve read your Windows TTS project and I’m confident I can deliver a natural Castilian Spanish voice that runs offline on Windows 10/11 and can be swapped on demand. I’ll fine-tune a neural TTS model (Glow-TTS/VITS/FastSpeech-2 options) using Castilian data, then package a lightweight runtime (ONNX/Piper) and expose a simple DLL or CLI your app can call to switch voices on the fly. Deliverables include the fine-tuned model files, a Windows-ready runtime, a demo CLI that converts text to WAV, and a clear integration guide. Acceptance criteria like MOS ≥4.0, latency <300 ms for 100 chars on mid-range CPUs, and fully offline execution will be met by careful data prep, efficient model pruning, and offline runtimes. If you have existing voice data or a reference sample, I’ll incorporate it to match your tone. Please feel free to contact me so we can discuss more details. I am looking forward to the chance of working together.
€250 EUR 3 gün içinde
3,3
3,3

I understand your project on Windows TTS Voice Fine-Tuning, and I am excited to assist you in enhancing the quality and performance of your text-to-speech system. My approach will involve analyzing the current voice model, identifying areas for improvement, and applying fine-tuning techniques to optimize the voice output for clarity and naturalness. Specifically, I will utilize tools such as Microsoft Azure's Speech Service and custom datasets to train the TTS model. This includes adjusting parameters, enhancing phonetic accuracy, and implementing prosody adjustments to ensure a more human-like delivery. I will also conduct thorough testing and provide iterative feedback to fine-tune the voice characteristics to meet your requirements. With over five years of experience in speech synthesis and natural language processing, I have successfully completed similar projects, resulting in improved user engagement and satisfaction. My background in software development and familiarity with machine learning frameworks will ensure a smooth execution of the project. I am committed to delivering high-quality results and look forward to collaborating on this project to achieve the best possible outcomes for your TTS application. Best regards, Bilal.
€140 EUR 5 gün içinde
2,8
2,8

Hi Alvaro, I’m Robert, a Senior Full-Stack & AI Engineer with over 10 years of experience architecting and delivering SaaS platforms, automation systems, and intelligent applications, specializing in AI and voice synthesis. I have successfully fine-tuned neural text-to-speech models, including developing a custom Spanish voice model that operates efficiently on Windows. My expertise in deep learning and audio processing aligns perfectly with your goal to create a natural Castilian Spanish voice for your application. I can complete this project perfectly and deliver scalable, production-ready results. I am committed to clean architecture, structured documentation, CI/CD automation, and OWASP-based security practices. The final deliverables will ensure full offline execution with a smooth integration into your existing system. Let’s connect to refine your requirements and begin building a solution that exceeds expectations. What specific considerations do you have for the voice's emotional tone or speaking style?
€250 EUR 10 gün içinde
2,4
2,4

Hi there, I’d love to help you create a natural Castilian Spanish voice for your Windows desktop app. I have experience fine-tuning neural TTS models and deploying them for offline Windows use with minimal latency. My approach: Fine-tune an existing neural TTS model (Tacotron-2, FastSpeech-2, VITS, or similar) to achieve a polished Castilian Spanish tone. Package the model with a lightweight Windows runtime (ONNX, Piper, or comparable) for offline execution. Provide a demo CLI that converts text to WAV and a simple integration guide for your app, including voice-switch instructions. Ensure MOS ≥ 4.0 for naturalness and <300 ms latency for 100-character input on mid-range CPUs. Deliverables: Fine-tuned Castilian Spanish voice model Windows-ready offline inference runtime CLI demo and integration guide I can also share previous demos of custom voice implementations on Windows to showcase quality. Looking forward to helping you bring this natural voice to your app!
€200 EUR 7 gün içinde
1,5
1,5

Hi Alvaro, Getting a truly natural Español Castellano voice for your Windows desktop application, completely offline, is a fantastic goal, and we're ready to make it happen. Your detailed requirements for fine-tuning a neural text-to-speech model and packaging it into a lightweight, callable DLL or CLI are exactly what our team excels at. We've deep experience in neural TTS fine-tuning using approaches like VITS and FastSpeech-2, specifically for delivering high-quality, custom voices that run locally on Windows 10/11. We fully understand the need for meeting those MOS ≥ 4.0 naturalness and sub-300ms latency targets while ensuring a robust, offline runtime. Our focus will be on delivering that polished Castilian tone and providing all the necessary model files, a Windows-ready inference runtime, a demo CLI, and a clear integration guide. We've done similar work recently where precise voice control and offline performance were critical. Let's jump on a quick call to discuss your project specifics and how we'll ensure your application speaks with an unmistakably natural Castilian accent.
€250 EUR 7 gün içinde
1,7
1,7

Hello Alvaro, I understand that you are looking to fine-tune an existing neural text-to-speech model for a Windows desktop application to deliver a natural Español Castellano voice that can be seamlessly integrated and run offline. My proposed approach involves utilizing a neural text-to-speech model such as Tacotron-2 or FastSpeech-2, fine-tuning it for the desired Castilian tone, and creating a lightweight runtime for Windows 10/11 with a user-friendly DLL or command-line interface. I would like to inquire if there are specific nuances or characteristics you would like the Castilian Spanish voice to exhibit to ensure it meets your expectations and resonates with your target audience. Best regards, Abdullah
€140 EUR 7 gün içinde
0,0
0,0

Hello, As a seasoned AI and Full-Stack Developer, I have uniquely tailored my skills to the finance and real estate industries where I've successfully designed intelligent platforms that combine advanced machine learning with robust application engineering. My technical focus on Natural Language Processing is particularly relevant to your project, having previously developed smart contracts and reports systems that utilized NLP. Beyond just focusing on code, I adopt a business-driven approach in my work. This means not only understanding the underlying technology but also translating it into tangible value for your users. With this in mind, I am confident in my ability to fine-tune your Castilian Spanish voice model to make it sound unmistakably natural. Furthermore, my solutions are always built with scalability, maintainability and long-term growth in mind, ensuring that you have a lightweight runtime, easy-to-use CLI and offline execution capability for your app on Windows 10/11. I believe that my unique skillset combined with my ability to bridge business needs with technical execution makes me the perfect fit for this project. Thanks!
€30 EUR 1 gün içinde
0,0
0,0

Drawing on my extensive background spanning both front-end and back-end development and my deep familiarity with Windows, I'm uniquely well-positioned to handle the challenge you've set out for an offline, Castilian Spanish voice fine-tuning project. Not only am I experienced with numerous frameworks, including Python native libraries, but I'm also fluent in C#, which is instrumental in creating the simple DLL or command-line interface you're seeking for your app's invocation. Beyond fashioning a bespoke linguistic model tailored specifically to deliver refined Castilian Spanish audio using toolkits like Tacotron-2 or FastSpeech-2, I'll provide adaptable solutions by leveraging my expertise in multiple OSs. Having previously worked on similar tasks where lightweight runtime was crucial, you can trust me to bring the latency for this desktop application well under 300ms while maintaining full offline functionality. To demonstrate the caliber of custom voices I deliver and their offline capabilities, I'll be sharing a relevant demo clip and corresponding repository link from my past work. Through constant communication in this undertaking, I'll ensure that every step— from the API calls to voice-switching procedures— is meticulously detailed in the integration guide. Choose me as your go-to freelancer for this project and turn your Windows TTS into an evocatively natural and efficient español Castellano experience!
€140 EUR 5 gün içinde
0,0
0,0

With your permission, I would like to offer my unique set of skills and tenacious work ethic to fine-tune your Castilian Spanish voice model for your Windows-based desktop application. My extensive experience in API Development has helped me provide seamless integration and optimal functionality on countless projects, and I believe your project will benefit greatly from my expertise. Having worked with various neural approaches like Tacotron-2, FastSpeech-2, Glow-TTS, VITS, and similar models in the past, I am well-versed in their capabilities and limitations. This positions me advantageously to choose the best-suited model for your needs while ensuring a lightweight and offline executable runtime for optimum user convenience. Furthermore, my MOS tracking methodology ensures high-quality deliverables meeting or exceeding industry-standard scores. While maintaining a MOS ≥ 4.0 and a latency of < 300 ms on mid-range CPU seems demanding, it excites me to resolve such challenges efficiently. In addition, I will provide a detailed integration guide alongside the voice-switching steps - making sure the entire process is easy and accessible for you and your users. By working with me, you can be confident that I'll give my all to ensure the success of this project by meeting all your specific requirements.
€100 EUR 3 gün içinde
0,0
0,0

I'm confident I'm exactly who you need for this project. Your focus on a clean, professional, user-friendly, seamless, integrated, and automated Castilian Spanish TTS voice that runs locally on Windows aligns perfectly with my expertise. I specialize in fine-tuning neural TTS models such as Tacotron-2 and FastSpeech-2, delivering lightweight, offline runtimes exposing simple DLL or CLI interfaces. Even though I'm new to this platform, I bring extensive hands-on experience and have successfully completed many projects outside of Freelancer involving custom voice models and Windows integration. I'd be excited to discuss your project in more detail! Regards, Nicla Janse van Rensburg
€50 EUR 14 gün içinde
0,0
0,0

Hi I can fine-tune a neural text-to-speech model to produce a natural Castilian Spanish voice for your Windows desktop application. The solution will allow users to switch between voices on demand and run entirely offline without relying on any cloud service. I will optimize the model for low latency and high naturalness, targeting a MOS ≥ 4.0 on sample sentences and execution under 300 ms for 100-character inputs on mid-range CPUs. The deliverables will include the fine-tuned voice model, a Windows-ready inference runtime (ONNX or equivalent), a demo CLI for text-to-WAV conversion, and a comprehensive integration guide detailing API calls and voice-switch implementation. The approach can use Tacotron-2, FastSpeech-2, VITS, or a comparable architecture to achieve polished Castilian pronunciation while keeping runtime lightweight. Best, Justin
€140 EUR 7 gün içinde
0,0
0,0

Hello, I’d be excited to fine-tune a neural text-to-speech model to deliver a natural, Castilian Spanish voice for your Windows application. I have experience working with Tacotron-2, FastSpeech-2, and VITS models, and can optimize them to run fully offline with low latency. The final package will include a lightweight inference runtime (ONNX/Piper or similar) that exposes a DLL or CLI interface, allowing your app to switch voices on demand. I will ensure the voice achieves high naturalness (MOS ≥ 4.0) and fast synthesis (<300 ms for 100 characters on a mid-range CPU). Deliverables will include fine-tuned Castilian Spanish model files, a Windows-ready inference runtime, a demo CLI for text-to-WAV conversion, and an integration guide covering API usage and voice-switch implementation. I can start immediately and provide a fully offline solution that is easy to integrate, well-documented, and tested for natural, expressive speech in Español Castellano. Best,
€140 EUR 7 gün içinde
0,0
0,0

can fine-tune a natural-sounding Castilian Spanish voice and deliver a fully offline TTS solution that your Windows app can call instantly. What I will deliver Fine-tuned Castilian Spanish neural TTS model ONNX/Piper Windows runtime (CPU-optimized) DLL + CLI tool for text → WAV Full integration guide + voice-switch instructions MOS ≥ 4.0 tests and latency optimization (<300 ms) Approach I’ll fine-tune a compact VITS / FastSpeech-2 model, export to ONNX, apply quantization, and package it in a lightweight Windows runtime—no cloud dependencies. Why me Experience building offline TTS voices, Windows DLL runtimes, and optimized ONNX pipelines. I can share demos on request. Ready to start as soon as you’re ready.
€150 EUR 5 gün içinde
0,0
0,0

Hi there, I'd love to help you bring your site across the finish line. I've completed similar projects where I fine-tuned neural text-to-speech models like Tacotron-2 and VITS to produce professional, user-friendly, and unique voice outputs. I understand the importance of integrated, automated solutions that run offline, delivering natural and polished tones just like you need for your Castilian Spanish voice. I’m a returning freelancer who stepped away when the market was smaller, but now that digital demand is stronger than ever, I’m eager to bring my expertise in neural TTS modeling, Windows runtime packaging, and user-friendly integration to your project. Would love to chat to you regarding your requirement. Regards, Dane Petersen
€200 EUR 14 gün içinde
0,0
0,0

Hi there, I understand that your main goal is to fine-tune a Windows TTS voice to enhance its performance and user experience. In my previous role, I successfully implemented a serverless architecture using Google Cloud Run, which improved system efficiency by 30% while managing large datasets for AI-driven projects. Additionally, I developed automated workflows with Google Apps Script that reduced processing time by 25%, directly aligning with the need for scalable and optimized solutions. To address your requirements, I will leverage my expertise in backend engineering to fine-tune the TTS voice, ensuring it meets your quality standards while optimizing for performance and cost. I will also implement robust data handling processes to enhance the voice's responsiveness and accuracy. I would be happy to discuss your needs and get started right away. Best regards, Artem
€140 EUR 7 gün içinde
0,0
0,0

Dear Client, Good morning . How are you? I hope this proposal finds you well. I'M A CERTIFIED & EXPERIENCED EXPERT This is to inform you that I have KEENLY gone through your project description, CLEARLY understood all the project requirements as instructed in your project proposal and this is to let you know that I will perfectly deliver as desired. Being in possession of all stated required skills, (Voice Synthesis, Deep Learning, API Development, Audio Processing, Speech Synthesis, Natural Language Processing, Natural Language Generation and Neural Networks), as this is my field of professional specialization having completed all certifications and developed adequate experience in the respective field, I hereby humbly request you to consider my bid for professional, quality and affordable services that meet all your requirements. I always guarantee timely delivery and unlimited revisions where necessary hence you are assured of utmost satisfaction when working with me. Please send me a message so that we can discuss more and seal the project. THANK-YOU & WELCOME.
€250 EUR 1 gün içinde
0,0
0,0

Málaga, Spain
Ödeme yöntemi onaylandı
Ağu 26, 2025 tarihinden bu yana üye
€8-30 EUR
€8-100 EUR
€8-30 EUR
€30-250 EUR
$30-250 CAD
₹600-1500 INR
₹75000-150000 INR
£20-250 GBP
₹12500-37500 INR
$2-8 USD / saat
₹600-1500 INR
$30-250 USD
$250-750 USD
₹600-1500 INR
₹12500-37500 INR
$30-250 USD
$15-25 USD / saat
minimum $50 CAD / saat
₹12500-37500 INR
₹12500-37500 INR
₹12500-37500 INR
₹400-750 INR / saat
₹100-400 INR / saat
₹600-1500 INR