Time-series clustering of job performance behaviors using Spark/Hadoop

I am working with a very large number of time series that describe job performance behavior.

The goal of my work is to cluster a large number of job performance behaviors stored as time series, identify the optimal number of clusters, and compare three techniques (K-means, hierarchical clustering, and Dynamic Time Warping) using PySpark.
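For reference, Dynamic Time Warping (DTW) is a distance measure between two series that allows them to be stretched or compressed along the time axis. Below is a minimal pure-Python sketch of the classic dynamic-programming formulation, using absolute difference as the point-wise cost. The function name `dtw_distance` is hypothetical, and this is a single-machine illustration rather than a PySpark implementation.

```python
from math import inf


def dtw_distance(a, b):
    """DTW distance between two numeric sequences, O(len(a) * len(b)) time."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # point-wise cost (an assumption)
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = d + min(cost[i - 1][j],      # a[i-1] repeats a match
                                 cost[i][j - 1],      # b[j-1] repeats a match
                                 cost[i - 1][j - 1])  # both series advance
    return cost[n][m]


# Two series with the same shape but different speeds align at zero cost.
print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
```

A pairwise matrix of such distances is one possible input to a hierarchical clustering step. The quadratic cost per pair is worth keeping in mind for a very large number of series.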

I need someone with solid PySpark experience, especially with the DataFrame API, to do the following tasks:

1- Data preprocessing

2- Dimensionality reduction

3- Apply the clustering algorithms (K-means, hierarchical clustering, and Dynamic Time Warping)

4- Identify the optimal number of clusters

5- Compare the three techniques
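As an illustration of task 4, one common way to pick the number of clusters is to run K-means for several values of k and look for the "elbow" where the within-cluster sum of squares (inertia) stops dropping sharply. The sketch below is a minimal pure-Python version under stated assumptions: the helper names (`sq_dist`, `assign`, `kmeans_inertia`), the deterministic farthest-point initialisation, and the toy 2-D points are all made up for illustration, and a PySpark version would be structured differently.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(p, q))


def assign(points, centers):
    """Index of the nearest center for every point."""
    return [min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))
            for p in points]


def kmeans_inertia(points, k, max_iters=100):
    """Run Lloyd's K-means and return the final within-cluster sum of squares."""
    # Deterministic farthest-point initialisation (an assumption, chosen so
    # the example is reproducible without a random seed).
    centers = [tuple(points[0])]
    while len(centers) < k:
        centers.append(tuple(max(
            points, key=lambda p: min(sq_dist(p, c) for c in centers))))
    for _ in range(max_iters):
        labels = assign(points, centers)
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                # New center is the coordinate-wise mean of the members.
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[j])  # keep an empty cluster's center
        if new_centers == centers:  # converged
            break
        centers = new_centers
    labels = assign(points, centers)
    return sum(sq_dist(p, centers[lab]) for p, lab in zip(points, labels))


# Toy feature vectors (made up): three well-separated groups.
points = [(0, 0), (0, 1), (1, 0),
          (10, 10), (10, 11), (11, 10),
          (20, 0), (20, 1), (21, 0)]

inertias = {k: kmeans_inertia(points, k) for k in range(1, 5)}
for k, value in inertias.items():
    print(k, round(value, 2))
```

On data like this the inertia falls steeply up to k = 3 and flattens afterwards, which is the pattern the elbow heuristic looks for. A silhouette score is a common alternative criterion.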

You need to be familiar with the Hadoop ecosystem, Spark, and YARN.

Skills: Machine Learning (ML), Spark, Hadoop, Python, Cloud Computing


About the Employer:
(0 reviews) Cairo, Egypt

Project ID: #22356884

10 freelancers are bidding an average of $173 for this job


Hi, I am a very experienced statistician, data scientist, and academic writer. I have completed several PhD-level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp...

$100 USD in 7 days
(26 reviews)

Kindly reach out to me over chat and we can get started on the task. Also, if there is any additional information, you can share it with me in a zip file.

$166 USD in 5 days
(33 reviews)

Hi, I meet your requirements and have good experience with Hadoop and [login to view URL] seems to be a huge project. Contact me and give me more details. After that we will fix the final price.

$1111 USD in 2 days
(4 reviews)

Hi, I am an expert in time-series clustering. We can discuss over chat. If you have a really large number of observations, to my understanding all three techniques you mentioned here will be irrelevant. I hold a Ph.D. in...

$50 USD in 3 days
(2 reviews)

Hello! I am very interested in the project you posted. I have been looking for this kind of project on Freelancer for a long time, since I have rich experience with it. I think this project is very suitable for me and I am su...

(amount not shown) USD in 1 day
(2 reviews)
$25 USD in 2 days
(0 reviews)

Hi, I have 3 years of experience in the Big Data ecosystem: Spark, HDFS, YARN, Flume, Kafka, HBase. Please reach out to discuss further. Thanks.

$30 USD in 5 days
(0 reviews)

Hi, this is Clark from Shanghai, China. I have more than two years of Spark and Scala application development experience. I have worked at PayPal and Google Corp as a full-time employee and graduated from a top-3 Chinese uni...

$166 USD in 5 days
(0 reviews)

Dear Sir, I am capable of developing time series models. Thanks in advance.

(amount not shown) USD in 1 day
(0 reviews)

I am a Spark expert. I have delivered several talks on the topic and am an active speaker in the Spark community. A link to one of my talks is below: [login to view URL] Having wo...

$50 USD in 7 days
(0 reviews)