
In Progress
Posted
Paid on delivery
We need a freelancer to: 1. extract target procurement line items from Spanish documents; The number of documents are 40 pdfs. 2. clean and normalize descriptions, quantities, units, brands, models, and technical attributes; 3. identify comparable supplier products using the approved source environment; 4. extract prices, currencies, package sizes, and source information; 5. convert prices into standardized unit prices; 6. flag uncertain matches and cases where no reliable match is available; 7. submit a reproducible data file and workflow documentation. Deliverables are: • [login to view URL]: one row per procurement item and candidate supplier match; • [login to view URL]: a machine-readable version of the same data; • [login to view URL], [login to view URL], or equivalent reproducible workflow, when automation is used; • [login to view URL]: tools, assumptions, source-use rules, match criteria, and reproduction steps. Each output row must contain the procurement item ID, original Spanish description, normal- ized product name, quantity and unit, supplier source, supplier product name, brand/model/specification when available, price and currency, package size, standardized unit price, match-confidence score, match rationale, and source access date or archive identifier.
Project ID: 40435289
16 proposals
Remote project
Active 5 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs

Hello, I can accurately process all 40 Spanish PDFs and deliver structured, reproducible outputs including Excel, JSON, workflow files, and documentation. The work will include procurement item extraction, data normalization, supplier-product matching, price standardization, confidence scoring, and QA validation. I focus on accuracy, traceability, and clean, audit-ready outputs. Available to start immediately. Best Regards Akash Kumar Dubey
$15 USD in 5 days
0.0
0.0

As an AI/ML engineer with profound expertise in data processing, I am confident in my ability to effectively and accurately tackle the tasks described in your project. Extracting data from 40 Spanish PDFs, cleaning and normalizing them, and subsequently identifying comparable supplier products are all within my sphere of expertise. My experience extends to linguistic processing using tools like Tesseract, EasyOCR, and PaddleOCR, which will prove invaluable for normalizing the Spanish descriptions. What distingushes me from other freelancers is my zeal for delivery. I won't just submit an output file; instead I will provide you with a comprehensive [login to view URL] file detailing every necessary aspect of the project including tools used, assumptions made, and steps to reproduce the work Not only do I bring aboard years of experience in NLP and Computer vision fields, and possess state-of-the-art techniques unexplored by many such as YOLOv8 for shoplifting detection on live CCTV feeds, but I also prioritize reliability in production. My work on VisionAssist Android app that assists visually impaired users by announcing nearby objects with distance demonstrates my commitment to not only developing cutting-edge technology but deploying them for real-life applications. Choosing me means choosing efficiency, accuracy, and reliability--traits crucial for a project like yours.
$30 USD in 7 days
1.4
1.4
16 freelancers are bidding on average $25 USD for this job

Hello, I have thoroughly reviewed the project requirements for Spanish Procurement Data Extraction and Supplier Price Matching. I understand the need to extract, clean, normalize, identify comparable products, extract prices, and submit reproducible data files. Let's chat and discuss it further. To handle your project, I will start with extracting procurement line items from the 40 Spanish PDF documents. Then, I will clean and normalize the data, identify comparable supplier products, extract prices and relevant information, and convert prices into standardized unit prices. I will also flag uncertain matches and provide a reproducible workflow documentation. The deliverables will include output files in both Excel and JSON formats, along with a reproducible workflow document and README file detailing tools, assumptions, and match criteria. Before signing-off my bid, I would like to ask a question, i.e., "What is the preferred format for the source information and archive identifiers?" Best Regards, Aneesa.
$10 USD in 7 days
6.9
6.9

Hello there! I will extract data from spanish 40 pdf right now with 100% accuracy. I am offering a free sample before awarding the project. Can you please open a chat box to discuss the project in detail? I am 24/7 available for discussing the project. Sir, please check my profile link for your satisfaction and of my client reviews: https://www.freelancer.com/u/bktk Regards: Muhammad Bilal KTK
$15 USD in 1 day
6.2
6.2

I’ll build a reproducible Python workflow to extract and normalize procurement items from the 40 Spanish PDFs, match supplier products with confidence scoring, standardize pricing, and deliver clean XLSX/JSON outputs with full documentation.
$50 USD in 1 day
5.2
5.2

Hi, I can help with extracting procurement items from the Spanish PDFs, cleaning and normalizing the data, identifying comparable supplier products, and preparing the final Excel/JSON outputs with proper documentation. I’m detail-oriented and comfortable handling research, data organization, and accuracy checks for this type of project. Regards, Himanshu Bisht
$30 USD in 4 days
2.4
2.4

I understand the importance of precise data extraction and normalization for your Spanish procurement project. With a strong background in data processing and analysis, I can efficiently extract and clean procurement line items from the 40 PDFs you have. My approach will ensure that each data point—descriptions, quantities, brands, and prices—are accurately captured and standardized. I will leverage automation tools to streamline the extraction and normalization processes while maintaining high data quality. Additionally, I will identify comparable supplier products and flag any uncertain matches to ensure transparency in the results. You will receive comprehensive deliverables, including the output files in both Excel and JSON formats, along with detailed workflow documentation to facilitate reproducibility. Communication will be a priority throughout the project to address any questions or adjustments needed. I am committed to delivering the final outputs within 14 days to meet your timeline.
$20 USD in 14 days
0.6
0.6

Hello. I'm Spanish native speaker and specialist in Excel and VBA. I can do this job. I put 7 days because I can't work sabbaths nor sundays. Hope to hear from you soon. Have a nice day.
$10 USD in 7 days
0.0
0.0

Hi there, I can help with extract target procurement line items from spanish documents. Here's how I'll approach it: 1) Understand requirements and confirm scope 2) Implement with clean code + tests 3) Deploy with documentation Timeline: 3 day(s) | Bid: $22 I'll do a small sample/demo first — if you like it, we proceed. No risk for you. Ready to start now. Message me and let's get going.
$22 USD in 3 days
0.0
0.0

I am expert in data field. I have 24 years of experience in dealing with structured and unstructured all kind of data from any source to any destination.
$30 USD in 3 days
0.0
0.0

Pully, Switzerland
Payment method verified
Member since Dec 9, 2025
$10-50 USD
$10-50 USD
$10-50 USD
$10-50 USD
$10-50 USD
₹100-400 INR / hour
$250-750 USD
₹750-1250 INR / hour
$30-250 USD
$10-85 USD
$8-15 USD / hour
£250-750 GBP
€8-30 EUR
£20-30 GBP
₹400-750 INR / hour
$30-250 USD
$250-750 CAD
₹1500-12500 INR
£20-250 GBP
₹100-400 INR / hour
₹1500-12500 INR
₹1500-12500 INR
₹600-1500 INR
₹1250-2500 INR / hour
$15-25 USD / hour