Opportunity for MLOPS Engineer: Serve a wav2vec Speech Recognition Model through Triton Server

Job Description:

We are looking for a talented MLOPS engineer to work on a challenging speech recognition project. The project has a tight deadline of 5 days. The tasks involved are:

Based on the wav2vec2 model available in the repository lgris/wav2vec2-large-xlsr-open-brazilian-portuguese-v2, convert it to ONNX and TensorRT

Evaluate the WER of the model in TensorRT compared to the original model in Hugginface

Create a Dockerfile with the Triton server configured with an endpoint to consume the model in TensorRT

Create a Dockerfile with a Python server using [login to view URL] to send audio to the Triton server for inference

Create a Dockerfile with a JavaScript client sending audio from the microphone to the Python server, from Python to the Triton server through GRPC, and back to the browser with the transcription

Create a Docker Compose file with the three services communicating with each other and ready for testing

Compare the inference times of the PyTorch model served directly from Python, the TensorRT model served directly from Python, and the model served through the TensorRT server

Evaluate the latency of the communication between the Python server and the TensorRT server

The goal is to perform audio inference captured from the user's microphone in browser through [login to view URL] communication with the Python server and then from this to the Triton server to be able to receive multiple concurrent requests from different users

Attention should be paid in the Python server to have a session for each user, so that the streaming audio can be returned to the user who sent the audio.

If you have the skills and experience to tackle this project, we would love to hear from you. Please apply with your portfolio and relevant experience. Time is of the essence, so apply as soon as possible.

Beceriler: Python, Yazılım Mimarisi, JavaScript, NLP, DevOps

Müşteri Hakkında:
( 0 değerlendirme ) Petrópolis, Brazil

Proje NO: #35917970

Bu iş için 13 freelancer ortalamada $681 teklif veriyor


Hello Good evening , I just finished reading the job description . I see you are looking for someone experienced in developing products using NLP, Python, DevOps, Software Architecture and JavaScript. This is something Daha Fazla

$750 USD in 18 gün içinde
(117 Değerlendirme)

Nice to talk you felipeniren, After reading in detail the requirements of your project and concluding that they match my areas of knowledge and skills, I would like to introduce myself. My name is Anthony Muñoz and I Daha Fazla

$624 USD in 7 gün içinde
(6 Değerlendirme)

Hello, I read your project details and really interested in your mentioned job. I have 5+ years’ experience doing similar jobs related to these skills NLP, Python, DevOps, Software Architecture and JavaScript. I think Daha Fazla

$750 USD in 6 gün içinde
(22 Değerlendirme)

Hello. As a Professional NLP Engineer, I have strong knowledge and rich experience with Python, Pytorch, Tensorflow, NLP, ChatBot, OpenAI ChatGPT, Fine-tuning the OpenAI API model, ASR(Automatic Speech Recognition usin Daha Fazla

$500 USD in 7 gün içinde
(0 Değerlendirme)