    895 pyspark jobs found, pricing in EUR

    The Data Engineer contractor role will be a project based role focused on migrating data pipelines from legacy infrastructure and frameworks such as Scalding to more modern infrastructure we support such as Spark Scala. This role will be responsible for: Analyzing existing data pipelines to understand their architecture, dependenci...Requirements The ideal candidate is a Data Engineer with considerable experience in migrations and Big Data frameworks. Must-Haves Scala programming language expertise Spark framework expertise Experience working with BigQuery Familiarity scheduling jobs in Airflow Fluency with Google Cloud Platform, in particular GCS and Dataproc Python programming language fluency Scalding framework fluency Pyspark framework fluency Dataflow(Apache Beam) framewor...

    €284 (Avg Bid)
    8 bids

    Hi. Individuals only, please; agencies may apply, but the budget must not exceed the amount stated. Role: GCP Engineer (OTP). Experience: 7+ years. Shift: IST. Skills: Cloud Storage buckets, BigQuery (SQL, data transformations and movement), Airflow (Python, DAGs), dbt, IAM policies, PyCharm, Databricks (PySpark), Azure DevOps. Clear and confident communication.

    €1337 (Avg Bid)
    8 bids

    I'm in need of a Machine Learning Engineer who can migrate our existing notebooks from RStudio and PySpark to AWS SageMaker. Your task will be to: - Understand two models I have running locally. One is an RStudio logistic regression model, and the other is a PySpark XGBoost model, also running locally. - Migrate these two models to AWS SageMaker. Data will be on S3. - Prepare the models to run entirely on SageMaker, so that we can do training and testing 100% on SageMaker. - The models are already running on a local computer, but I need to move them to SageMaker 100%. Data is on S3 already. - You need to configure and prepare SageMaker from end to end, and teach me how you did it, since I need to replicate it in another system. - I will give you the data and access to AWS. Ideal Skills and...

    €375 (Avg Bid)
    NDA
    3 bids

    The Data Engineer contractor role will be a project based role focused on migrating data pipelines from legacy infrastructure and frameworks such as Scalding to more modern infrastructure we support such as Spark Scala. This role will be responsible for: Analyzing existing data pipelines to understand their architecture, dependenci...Requirements The ideal candidate is a Data Engineer with considerable experience in migrations and Big Data frameworks. Must-Haves Scala programming language expertise Spark framework expertise Experience working with BigQuery Familiarity scheduling jobs in Airflow Fluency with Google Cloud Platform, in particular GCS and Dataproc Python programming language fluency Scalding framework fluency Pyspark framework fluency Dataflow(Apache Beam) framewor...

    €286 (Avg Bid)
    10 bids

    I am looking for a dedicated specialist well-versed in using Databricks and PySpark for data processing tasks, with a primary focus on data transformation. With the provision of JSON format files, you will perform following tasks: - Carry out complex data transformations - Implement unique algorithms to ensure efficient data processing - Test results against required benchmarks Ideal Skills: - Proficient in Databricks and PySpark. - Must possess a solid background in data transformation. - Experience handling large JSON datasets. The end goal is to achieve seamless data transformation leveraging the power of Databricks and PySpark, enhancing our ability to make informed business decisions. Please provide your completed projects, and the strategies you've used ...

    €44 / hr (Avg Bid)
    29 bids

    ...functions to handle data quality and validation. -Should have good understanding on S3,Cloud Formation, Cloud Watch, Service Catalog and IAM Roles -Perform data validation and ensure data accuracy and completeness by creating automated tests and implementing data validation processes. -Should have good knowledge about Tableau, with creating Tableau Published Datasets and managing access. -Write PySpark scripts to process data and perform transformations.(Good to have) -Run Spark jobs on AWS EMR cluster using Airflow DAGs.(Good to have)...

    €1396 (Avg Bid)
    22 bids

    ...Stay current with new technology options and vendor products, evaluating which ones would be a good fit for the company Troubleshoot the system and solve problems across all platform and application domains Oversee pre-production acceptance testing to ensure the high quality of a company’s services and products Skill Sets: Strong development experience in AWS Step Functions, Glue, Python, S3, Pyspark Good understanding of data warehousing, Large-scale data management issues, and concepts. Good experience in Data Analytics & Reporting and Modernization project Expertise in at least one high-level programming language such as Java, Python Skills for developing, deploying & debugging cloud applications Skills in AWS API, CLI and SDKs for writing applications Knowledge...

    €674 (Avg Bid)
    27 bids

    I am in need of a proficient PySpark coder to aid in debugging errors present within my current code. The main focus of this project is optimization and troubleshooting. Unfortunately, I can't specify the type of errors– I need a professional to help identify and rectify them. If you are an experienced PySpark coder with a keen eye for bug identification and problem solving, I'd appreciate your expertise.

    €6 - €19
    Sealed
    10 bids

    I am in need of a proficient PySpark coder to aid in debugging errors present within my current code. The main focus of this project is optimization and troubleshooting. Unfortunately, I can't specify the type of errors– I need a professional to help identify and rectify them. If you are an experienced PySpark coder with a keen eye for bug identification and problem solving, I'd appreciate your expertise.

    €6 - €18
    Sealed
    6 bids

    I'm searching for a PySpark expert who can provide assistance on optimizing and debugging current PySpark scripts. I am specifically focused on PySpark, so expertise in this area is crucial for the successful completion of this project. Key Responsibilities: - Optimizing PySpark scripts to improve efficiency - Debugging current PySpark scripts to resolve existing issues Ideal Candidate: - Proficient with PySpark - Experience in big data management, data ingestion, processing, analysis, visualization, and reporting - Strong problem-solving skills to identify and resolve issues effectively - Knowledgeable in performance tuning within PySpark.

    €94 (Avg Bid)
    67 bids

    I'm looking for a skilled freelancer to create a Spark script that transfers data from a Hive metastore to an S3 bucket. The goal of this project is to enable backup and recovery. Skills and Experience: - Proficiency in Spark and Hive - Extensive experience with S3 buckets - Understanding of data backup strategies Project Details: - The script needs to read the schema and perform metadata transfer for selected schema to s3 bucket. - Only bid if you have work experience with spark, hive, s3 - 4 schemas needs to be migrated - I have already got access to s3 configured - I have local instance of netapp s3 available and bucket created. - Server is Ubuntu

    €92 (Avg Bid)
    10 bids

    I am looking for an experienced data analyst who is well-versed in PySpark to clean up a medium-sized dataset in a CSV file format. The file contains between 10k-100k rows, and your primary role will be to: - Remove duplicate data entries - Deduplicate the dataset - Handle missing values - Aggregate the resultant data Your proficiency in using PySpark to automate these processes efficiently will be critical to the success of this project. Therefore, prior experience in handling and cleaning similar large datasets would be beneficial. Please note, this project requires precision, meticulousness, and a good understanding of data aggregation principles.

    €24 (Avg Bid)
    9 bids

    This vital task entails cleaning and sorting two CSV files of approximately 100,000 rows and second one of about 1.5million rows using pyspark (Python) in Jupyter Notebook(s). The project consists of several key tasks: Read in both datasets and then: - Standardizing data to ensure consistency - Removal of duplicate entries - Filtering columns we need - Handling and filling missing values - Aggregating data on certain groupings as output Important requirement: I also need unit tests to be written for the code at the end. Ideal Skills: Candidates applying for this project should be adept with Pyspark in Python and have experience in data cleaning and manipulation. Experience with working on datasets of similar size would also be preferable. Attention to detail in ensuring ...

    €170 (Avg Bid)
    57 bids

    I'm seeking an experienced Data Engineer with proficiency in SQL and PySpark. Key Responsibilities: - Develop and optimize our ETL processes. - Enhance our data pipeline for smoother operations. The ideal candidate should deliver efficient extraction, transformation, and loading of data, which is critical to our project's success. Skills and Experience: - Proficient in SQL and PySpark - Proven experience in ETL process development - Previous experience in data pipeline optimization Your expertise will significantly improve our data management systems, and your ability to deliver effectively and promptly will be highly appreciated.

    €86 (Avg Bid)
    17 bids

    - Conversion of the entire Python code into PySpark. Skills and experience required: - Proficient knowledge in Python.

    €24 (Avg Bid)
    26 bids

    ...competent in either PySpark or RDD, using Python to create versatile code fitting for several scenarios. Your main task will be to write code to compare rows using Python in line with the clear set of rules I provide. These rules are detailed in an attached Word document and are based on comparisons encompassing specific columns, presence or absence of particular data, and multiple criteria comparisons. The expected output is a reversal logic for claim_opened_timestamp_utc. I need output that are in right side. I need either in pyspark or in rdd to compare rows. spark - spark-3.3.0-bin-hadoop3 py4j-0.10.9.5 I am using I need your support till I execute it in my office computer I need it in 3 days. Ideal Skills and Experience: - Proficiency in Python - Experience with...

    €146 (Avg Bid)
    Urgent
    17 bids

    I'm a beginner user of Azure Databricks and PySpark. I'm looking to boost my skills to the next level and need an expert to guide me through advanced techniques. Ideal freelancers should have extensive experience and in-depth knowledge of data manipulation using PySpark, Azure Databricks, data pipeline construction, and data analysis and visualization. If you've previously tutored or mentored in these areas, it'll be a plus.

    €11 / hr (Avg Bid)
    4 bids

    I need complete 2 small projects done. The data needs to be pulled from API using python. The pulled data needs to be unnested, then transformed to answer some insights with medallion architecture. Here, you need to showcase SCD-type 2 ingestions, incremental joins,...to be pulled from API using python. The pulled data needs to be unnested, then transformed to answer some insights with medallion architecture. Here, you need to showcase SCD-type 2 ingestions, incremental joins, managing PII information, aggregation. Final deliverable needed for 1st project (databricks): Data model designed and architecture overview Notebooks of transformations in Python and PySpark/Spark Scala Final deliverable needed for 2nd project (dbt): Data model designed and architecture overview dbt sql and ...

    €259 (Avg Bid)
    16 bids

    Looking for someone with good skills in Airflow, Pyspark and SQL.

    €236 (Avg Bid)
    13 bids

    I am looking for a skilled professional in Python, with a comprehensive understanding of PySpark, Databricks, and GCP. A primary focus of the project is to build a data pipeline and apply time series forecasting techniques for revenue projection, using historical sales data. Key tasks will include: - Constructing a robust data pipeline using Python, PySpark, and Databricks. - Applying time series forecasting to produce revenue predictions. - Using Mean Squared Error (MSE) to measure model accuracy. The ideal candidate for this project would have: - Proven experience with Python, PySpark, Databricks, and GCP. - Expertise in time series forecasting models. - Practical understanding and use of Mean Squared Error (MSE) for model accuracy. - Experience with large scale ...

    €10 / hr (Avg Bid)
    14 bids

    I am looking to develop a sophisticated and efficient data pipeline for revenue forecasting. This pipeline will be implemented using Python, pyspark, databrics, and gcp Big Data. Here is what you need to know about this task: - Data Source: The data originates from Google Cloud Platform's Big Data service. As such, the freelancer should have solid experience and understanding of working with Big Data services on GCP. - Data Update Frequency: The frequency of data updates will be confirmed during the project, but suffice to say frequency could be high. Prior experience with real-time or near-real-time data processing will be highly beneficial. - Performance Metrics: The key performance metric I'm focusing on is data processing speed. The freelancer should have a strong kn...

    €17 / hr (Avg Bid)
    13 bids

    I'm in need of a specialist, ideally with experience in data science, Python, PySpark, and Databricks, to undertake a project encompassing data pipeline creation, time series forecasting and revenue forecasting. #### Goal: * Be able to extract data from GCP BigData efficiently. * Develop a data pipeline to automate this process. * Implement time series forecasting techniques on the extracted data. * Use the time series forecasting models for accurate revenue forecasting. #### Deadline: * The project needs to be completed ASAP, hence a freelancer with a good turnaround time is preferred. #### Key Skill Sets: * Data Science * Python, PySpark, Databricks * BigData on GCP * Time series forecasting * Revenue forecasting * Data Extraction and Automation Qualification in...

    €17 / hr (Avg Bid)
    15 bids

    I am looking for a developer to create an AWS Glue and Pyspark script that will strengthen the data management of my project. The task involves moving more than 100GB of text data from a MySQL RDS table to my S3 storage account, on a weekly basis. Additionally, the procured data needs to be written on parquet files, for easy referencing. The developer will also need to send scripts to deploy the AWS Glue pipelines on Terraform, fitting all parameters. Skilled expertise in AWS Glue, PySpark, Terraform, MySQL and experience in handling large data is required. There is no compromise on the quality and completion timeline. Effective performance on this project will open doors to more work opportunities on my various projects.

    €38 (Avg Bid)
    15 bids

    I am seeking a skilled professional proficient in managing big data tasks with Hadoop, Hive, and PySpark. The primary aim of this project involves processing and analyzing structured data. Key Tasks: - Implementing Hadoop, Hive, and PySpark for my project to analyze large volumes of structured data. - Use Hive and PySpark for sophisticated data analysis and processing techniques. Ideal Skills: - Proficiency in Hadoop ecosystem - Experience with Hive and PySpark - Strong background in working with structured data - Expertise in big data processing and data analysis - Excellent problem-solving and communication skills Deliverables: - Converting raw data into useful information using Hive and Visualizing the results of queries into the graphical representation...

    €16 / hr (Avg Bid)
    15 bids

    ...currently searching for an experienced AWS Glue expert, proficient in PYsPARK with data frames and Kafka development. The ideal candidate will have: • Expertise in data frame manipulation. • Experience with Kafka integration. • Strong PYsPARK development skills. The purpose of this project is data integration, and we will be primarily processing data from structured databases. The selected freelancer should be able to work with these databases seamlessly, ensuring efficient and effective data integration using AWS Glue. The required work would involve converting structured databases to fit into a data pipeline, setting up data processing, and integrating APIs using Kafka. This project requires a strong background in AWS Glue, PYSPARK, data frame ...

    €221 (Avg Bid)
    24 bids

    I'm seeking assistance to develop a Python-based solution utilizing PySpark for efficient data processing using the Chord Protocol. This project demands an intermediate level of expertise in Apache Spark or PySpark, combining distributed computing knowledge with specific focus on Python programming. Key Requirements: - Proficiency in Python programming and PySpark framework. - Solid understanding of the Chord Protocol and its application in data processing. - Capable of implementing robust data processing solutions in a distributed environment. Ideal Skills and Experience: - Intermediate to advanced knowledge in Apache Spark or PySpark. - Experience in implementing distributed file sharing or data processing systems. - Familiarity with network communicati...

    €512 (Avg Bid)
    38 bids

    ...Professional with strong expertise in Pyspark for a multi-faceted project. Your responsibilities will extend to but not limited to: - Data analysis: You'll be working with diverse datasets including customer data, sales data and sensor data. Your role will involve deciphering this data, identifying key patterns and drawing out impactful insights. - Data processing: A major part of this role will be processing the mentioned datasets, and preparing them effectively for analysis. - Performance optimization: The ultimate aim is to enhance our customer targeting, boost sales revenue and identify patterns in sensor data. Utilizing your skills to optimize performance in these sectors will be highly appreciated. The ideal candidate will be skilled in Hadoop and Pyspark wi...

    €435 (Avg Bid)
    25 bids

    Build a Glue ETL job using PySpark to transfer data from MySQL to Postgres. I am facing challenges with column mappings between the two sources: the target database has enum and text-array data types. You should resolve the errors in the column mappings and have prior experience ingesting data into the Postgres enum data type.

    €21 / hr (Avg Bid)
    54 bids

    I am in need of an experienced data engineer with specific expertise in PySpark. This project involves the integration and migration of data from structured databases currently housed in AWS. Here's a rundown of your key responsibilities: - Data integration from various existing structured databases - Migration of the combined data to a single, more efficacious database Ideal Candidate: - Proven experience in data migration and integration projects - Expertise in PySpark is indispensable - Proficiency in manipulating AWS databases - A solid understanding of structured databases and various data formats is mandatory This project is more than just technical skills- I'm looking for someone who can understand the bigger picture and contribute to the overarching str...

    €620 (Avg Bid)
    13 bids

    I'm looking for a professional with a strong understanding of PySpark to help transform a dataframe into JSON following a specific schema. This project's main task is data transformation to aid in data interchange. The project requires: - Expertise in PySpark - Proficiency in data transformation techniques - Specific experience in data aggregation For the transformation, I require the application of an aggregation method. In this case, we will be sorting the data. It's crucial that you are skilled in various aggregation methods, especially sorting. Your knowledge in handling critical PySpark operations is crucial for this job's success. Experience in similar projects will be highly regarded.

    €23 (Avg Bid)
    18 bids

    Looking for an expert Azure Data Engineer to assist with multiple tasks. Your responsibilities will include: - Implementing and managing Azure Data Lake and Data Ingestion. - Developing visual reports...platforms to achieve three main objectives: - Perform sophisticated data analysis and visualization. - Enable advanced data integration and transformation. - Build custom applications to meet specific needs. Candidates should have an advanced understanding of Azure Data Lake, Power BI, and Powerapps, bringing a minimum of 6 years experience as Databricks. Proficiency in Python, SQL, PostGre SQL, and Pyspark is also required. Knowledge of GitHub and the CI/CD Process will be beneficial for this role. If you have the skills and expertise needed for this project, I'd love to...

    €32 / hr (Avg Bid)
    28 bids

    ...need to be pushed swiftly to Elasticsearch using Pyspark. Your expertise will help push all data columns from this file into Elasticsearch, establishing a more actionable access to a significant amount of data. Given the project's urgency, I'm expecting a rapid, reliable transition. While the structure for the documents remains undecided due to the project's intricacies, I'm open to suggestions that will make this process more efficient and effective. Anyone with experience in Pyspark, Elasticsearch, and vast data manipulation will have a substantial edge on this project, as these skills are highly necessary for success. A strong understanding of different data structures is also a plus. • Leading Skills Required: Proficiency in Pyspark ...

    €9 / hr (Avg Bid)
    3 bids

    ...Title: Pyspark Data Engineering Training Overview: I am a beginner/intermediate in Pyspark and I am looking for a training program that focuses on data processing. I prefer one on one and written guides as the format for the training. Skills and Experience Required: - Strong expertise in Pyspark and data engineering - Excellent knowledge of data processing techniques - Experience in creating and optimizing data pipelines - Familiarity with data manipulation and transformation using Pyspark - Ability to explain complex concepts in a clear and concise manner through written guides - Understanding of best practices for data processing in Pyspark Training Topics: The training should primarily focus on data processing. The following topics should be cov...

    €22 / hr (Avg Bid)
    72 bids

    ...training is expected to be spread across multiple days. The trainer must have the capability to provide an understanding of the major concepts and components of Apache Spark, with a focus on how to use Databricks and the Pyspark API to manipulate and visualize data. As the training progresses, the instructor should be able to explain how to develop applications using Pyspark and articulate different approaches that a data scientist would use to evaluate and test their models. The instructor should also be able to educate the users on how to deploy and maintain Pyspark applications and how to provide feedback and questions in order to improve their performance. We expect the trainer to be readily available to answer any questions and guide the users along the w...

    €95 (Avg Bid)
    75 bids

    I am seeking an expert in the field to provide remote training in the use of Databricks and Python with PySpark. This is important for developing data processing applications with a high degree of efficiency. The training should cover areas such as data wrangling, machine learning, and Spark streaming. In order to be successful, attendees must be well-versed in Databricks, Python and PySpark, as these skills will be essential for completing the course. The course should provide a good understanding of the concepts and practical application of these tools. This training will give attendees the skills they need to analyse and manipulate large datasets, develop effective data processing pipelines, design powerful machine learning models and build reliable applications that use...

    €101 (Avg Bid)
    74 bids

    ...S3, and RDS; Azure services; and Pyspark data processing and transformations. Essential Skills: - Proficient in AWS, specifically on EC2, S3, RDS with strong understanding of data storage and retrieval. - Expert in Azure services such as Azure SQL Database and Blob Storage. - Highly experienced in writing efficient data transformations using Pyspark. Ideal Experience: - Minimum 7 years in the field with solid experience in technical interviews and coaching. Your task will be to provide actionable insights, best practices, and expert advice to nail my upcoming technical interview. Having been on the other side of the interview table would be an added advantage. - Proven track record of performing successful data processing and transformations using Pyspark. - Prev...

    €14 / hr (Avg Bid)
    8 bids

    Experienced Python + SQL + AWS + Azure data engineer (7+ years) needed during evening IST hours to guide interview preparation, specifically for data engineering. Requirements: Should have good knowledge of PySpark, SQL, and pandas. Should have written multiple ETL pipelines in AWS and Azure. Note: The freelancer must be available during evening IST hours.

    €9 / hr (Avg Bid)
    12 bids

    ...structured data such as SQL databases. Skills and experience required: - Expertise in AWS migration, specifically from another cloud provider - Strong knowledge and experience with structured data, particularly SQL databases - Familiarity with AWS Glue and Athena for data processing and analysis - Ability to work with a combination of different AWS services for optimal performance and efficiency Pyspark ,sql,python Cdk Typescript Aws glue ,Emr and andes Currently Migrating from teradata to aws. Responsibilities: - Migrate data from another cloud provider to AWS, ensuring a smooth transition and minimal downtime - Design and develop applications that utilize AWS Glue and Athena for data processing and analysis - Optimize data storage and retrieval using AWS S3 and R...

    €8 / hr (Avg Bid)
    14 bids

    ...am looking for a skilled and experienced developer to work on a personal project involving the use of CNN by pyspark for analyzing brain and lung cancer. Skills and Experience: - Proficient in using pyspark and CNN - Intermediate understanding of convolutional neural networks - Familiarity with analyzing medical data - Experience in working with cancer-related datasets - Strong problem-solving skills and attention to detail The project requires the use of specific datasets, which I already have. However, any additional assistance in acquiring relevant datasets would be appreciated. The ideal candidate should have a good understanding of CNN and be able to apply it using pyspark. Experience in analyzing medical data and working with cancer-related datasets would ...

    €34 (Avg Bid)
    10 bids

    I am looking for a skilled professional who can help me with a project titled "synapse pyspark delta lake merge scd type2 without primary key". The ideal candidate should have experience and expertise in the following areas: Desired Outcome: - The desired outcome of the merge process is to update existing records and insert new records. Data Quality: - The level of data quality required for the outcome is high integrity, with no duplicates and full accuracy. Handling Historical Data: - There is a specific requirement to keep track of historical changes to the data. Skills and Experience: - Proficiency in Synapse, Pyspark, Delta Lake - Experience with SCD Type 2 implementation - Strong understanding of data integrity and accuracy - Ability to handle historical da...

    €311 (Avg Bid)
    2 bids

    ...Senior Data Engineer who possesses extensive experience and proficiency in a range of key technologies and tools. The ideal candidate should have a strong background in Python, demonstrating skillful use of this programming language in data engineering contexts. Proficiency in Apache Spark is essential, as we rely heavily on this powerful analytics engine for big data processing. Experience with PySpark, the Python API for Spark, is also crucial. In addition to these core skills, we require expertise in AWS cloud services, particularly AWS Glue and Amazon Kinesis. Experience with AWS Glue will be vital for ETL operations and data integration tasks, while familiarity with Amazon Kinesis is important for real-time data processing applications. Furthermore, the candidate should hav...

    €10 / hr (Avg Bid)
    11 bids

    I am looking for an Airflow, GCP, and Python expert to assist me with my project. Candidate should have a good knowledge of DAG, GIT, pandas, agile, pyspark and Airflow.

    €275 (Avg Bid)
    19 bids

    I am looking for a freelancer who can assist me with a Pyspark AWS ML project. The main goal of the project is data processing and transformation. I already have all the data needed for the project. The preferred timeline for this project is flexible. Skills and Experience: - Strong experience with Pyspark and AWS ML - Proficient in data processing and transformation techniques - Familiarity with machine learning model development - Ability to work within a flexible timeline

    €14 / hr (Avg Bid)
    28 bids

    Years of experience: 7+ Location: Remote - India Contract Tenure - 03-06 Months Notice Period - Immediate -15/20 Days Timings : 12pm - 9pm IST M - F AWS Data Engineer Requirements • Collaborate with business an...functions to handle data quality and validation. • Should have good understanding on S3,Cloud Formation, Cloud Watch, Service Catalog and IAM Roles • Perform data validation and ensure data accuracy and completeness by creating automated tests and implementing data validation processes. • Should have good knowledge about Tableau, with creating Tableau Published Datasets and managing access. • Write PySpark scripts to process data and perform transformations.(Good to have) • Run Spark jobs on AWS EMR cluster using Airflow DAGs.(Good to have) &...

    €3092 (Avg Bid)
    16 bids

    Looking for someone who has a good knowledge of Pyspark, Airflow DAGs, GitHub, Pandas and Agile Framework. Overall candidate should be well aware of the data ingestion approach. Knowledge of Google cloud platform is a Bonus

    €281 (Avg Bid)
    25 bids

    I am looking for a skilled AWS Cloud + PySpark developer to create a Glue Streaming WordCount program. The program should be able to perform word count analysis on streaming data. I have the pyspark streaming code ready which works in my Jupyter notebook. So need help in integrating Kinesis/MSK --> Glue --> RDS/S3

    €8 (Avg Bid)
    2 bids

    ...Specific Letters (Using Spark) 5. Top Selling Countries (Using Spark) 6. Item Costs (Using Spark) 7. Sales Yearwise (Using PySpark) 8. Orders per Item (Using PySpark) 9. Country with Highest Sales (Using PySpark) 10. Customer Segmentation: Use clustering algorithms to identify different customer segments. 11. Time Series Forecasting: Predict future sales using ARIMA or LSTM. 12. Anomaly Detection: Identify any anomalies or outliers that could indicate fraudulent activity. 13. Association Rule Mining: Find associations between different products in the data (Using Spark). 14. Price Elasticity: Understand how the demand for a product changes with a change in its price (Using PySpark). 15. Correlation Between Priority and Profit: Analyze if 'Order Priority&...

    €76 (Avg Bid)
    10 bids

    ...Specific Letters (Using Spark) 5. Top Selling Countries (Using Spark) 6. Item Costs (Using Spark) 7. Sales Yearwise (Using PySpark) 8. Orders per Item (Using PySpark) 9. Country with Highest Sales (Using PySpark) 10. Customer Segmentation: Use clustering algorithms to identify different customer segments. 11. Time Series Forecasting: Predict future sales using ARIMA or LSTM. 12. Anomaly Detection: Identify any anomalies or outliers that could indicate fraudulent activity. 13. Association Rule Mining: Find associations between different products in the data (Using Spark). 14. Price Elasticity: Understand how the demand for a product changes with a change in its price (Using PySpark). 15. Correlation Between Priority and Profit: Analyze if 'Order Priority&...

    €67 (Avg Bid)
    4 bids

    It's a simple dataset and I have already analysed it using pandas. I want to analyse it using PySpark and the Koalas API.

    €165 (Avg Bid)
    6 bids

    Project Description: I am looking for a PySpark trainer who has advanced experience and expertise in data processing. The ideal candidate should be able to provide a scheduled training course. Skills and Experience: - Advanced level of experience with PySpark - Strong knowledge and expertise in tools like DataBricks, Pycharm, transformation & Actions. - Ability to provide a scheduled training course

    €288 (Avg Bid)
    3 bids