Hi, I have some code that does matrix multiplication using CUDA and OpenMP. I want to run three copies of the matrix multiplication (same inputs) in parallel on three kernels, then compare the results of the three kernels.
The three versions of the multiplication are as follows:
One kernel uses shared memory, another does not use shared memory, and the last one works with the transpose of B:
A * B (using the transpose of B) = C.
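To make the difference between the variants concrete, here is a minimal host-side C++ sketch of the plain product and of the variant that reads B through its transpose. It is not the project's CUDA code: the names `Matrix`, `multiply_plain`, `transpose` and `multiply_with_transposed_b` are hypothetical, square row-major matrices are assumed, and the reading "same product C = A * B, computed from a transposed copy of B" is an assumption about what the brief means. The shared-memory variant is GPU-specific and is not shown.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical type: a square, row-major matrix of size width x width.
using Matrix = std::vector<float>;

// Plain product: C[row][col] = sum over e of A[row][e] * B[e][col].
Matrix multiply_plain(const Matrix& a, const Matrix& b, std::size_t width) {
    assert(a.size() == width * width && b.size() == width * width);
    Matrix c(width * width, 0.0f);
    for (std::size_t row = 0; row < width; ++row) {
        for (std::size_t col = 0; col < width; ++col) {
            float cvalue = 0.0f;
            for (std::size_t e = 0; e < width; ++e) {
                cvalue += a[row * width + e] * b[e * width + col];
            }
            c[row * width + col] = cvalue;
        }
    }
    return c;
}

// Returns the transpose of a square row-major matrix.
Matrix transpose(const Matrix& m, std::size_t width) {
    assert(m.size() == width * width);
    Matrix t(width * width, 0.0f);
    for (std::size_t row = 0; row < width; ++row) {
        for (std::size_t col = 0; col < width; ++col) {
            t[col * width + row] = m[row * width + col];
        }
    }
    return t;
}

// Same product, but B is supplied already transposed (bt), so the inner
// loop walks a row of A and a row of bt instead of a column of B.
Matrix multiply_with_transposed_b(const Matrix& a, const Matrix& bt, std::size_t width) {
    assert(a.size() == width * width && bt.size() == width * width);
    Matrix c(width * width, 0.0f);
    for (std::size_t row = 0; row < width; ++row) {
        for (std::size_t col = 0; col < width; ++col) {
            float cvalue = 0.0f;
            for (std::size_t e = 0; e < width; ++e) {
                cvalue += a[row * width + e] * bt[col * width + e];
            }
            c[row * width + col] = cvalue;
        }
    }
    return c;
}
```

Under these assumptions both functions should return the same C for the same A and B, which is what makes the variants comparable against each other.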
- Review the existing code.
- Run the three copies in parallel using OpenMP: run each kernel on a separate thread, then compare the three results and show them in the command window. Verify these preliminary results before moving on to the next step.
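The "one kernel per thread, then compare" step could be organized roughly as follows. This is a host-side C++ sketch only: `Variant`, `run_three` and `nearly_equal` are hypothetical names, each `Variant` stands in for a wrapper that would launch one CUDA kernel and copy its result back, and the tolerance of 1e-4 is an assumed value (floating-point sums computed in a different order rarely match bit for bit).

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <functional>
#include <vector>

using Matrix = std::vector<float>;
// Hypothetical wrapper type: one callable per kernel version.
using Variant = std::function<Matrix(const Matrix&, const Matrix&, std::size_t)>;

// Runs the three variants on the same inputs, one OpenMP thread each.
// Without OpenMP enabled the pragma is ignored and the loop runs serially.
std::array<Matrix, 3> run_three(const std::array<Variant, 3>& variants,
                                const Matrix& a, const Matrix& b,
                                std::size_t width) {
    assert(a.size() == width * width && b.size() == width * width);
    std::array<Matrix, 3> results;
    #pragma omp parallel for num_threads(3)
    for (int i = 0; i < 3; ++i) {
        results[i] = variants[i](a, b, width);
    }
    return results;
}

// Element-wise comparison with an absolute tolerance.
bool nearly_equal(const Matrix& x, const Matrix& y, float tolerance = 1e-4f) {
    if (x.size() != y.size()) return false;
    for (std::size_t i = 0; i < x.size(); ++i) {
        if (std::fabs(x[i] - y[i]) > tolerance) return false;
    }
    return true;
}
```

Comparing the three entries of the returned array pairwise with `nearly_equal` gives the values to print in the command window.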
Create an error in one of the kernels like this:
Change the addition to a subtraction:
Cvalue += ([url removed, login to view] [row * width + e]) * ([url removed, login to view] [e * width + col]);
To ensure that the program can detect the error, change the addition sign to a subtraction sign:
Cvalue -= ([url removed, login to view] [row * width + e]) * ([url removed, login to view] [e * width + col]);
Then determine in which kernel the error occurred: one, two, or three.
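One possible way to find which kernel holds the error is a two-out-of-three vote over the results: the kernel that disagrees with the other two is the suspect. The sketch below is host-side C++ with hypothetical names (`locate_faulty_kernel`, `nearly_equal`) and an assumed tolerance; it only works if at most one kernel is wrong.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

using Matrix = std::vector<float>;

// Element-wise comparison with an absolute tolerance.
bool nearly_equal(const Matrix& x, const Matrix& y, float tolerance) {
    if (x.size() != y.size()) return false;
    for (std::size_t i = 0; i < x.size(); ++i) {
        if (std::fabs(x[i] - y[i]) > tolerance) return false;
    }
    return true;
}

// Returns 0 when the results agree, the 1-based number of the kernel that
// disagrees with the other two, or -1 when no two results agree (voting
// cannot locate the fault in that case).
int locate_faulty_kernel(const std::array<Matrix, 3>& results,
                         float tolerance = 1e-4f) {
    assert(tolerance >= 0.0f);
    const bool eq12 = nearly_equal(results[0], results[1], tolerance);
    const bool eq13 = nearly_equal(results[0], results[2], tolerance);
    const bool eq23 = nearly_equal(results[1], results[2], tolerance);
    // Two or more agreeing pairs means every result is tied to the others
    // within the tolerance, so no single kernel stands out.
    if (static_cast<int>(eq12) + eq13 + eq23 >= 2) return 0;
    if (eq12) return 3;  // kernels 1 and 2 agree, kernel 3 is the odd one
    if (eq13) return 2;  // kernels 1 and 3 agree, kernel 2 is the odd one
    if (eq23) return 1;  // kernels 2 and 3 agree, kernel 1 is the odd one
    return -1;
}
```

With the sign flipped from addition to subtraction in one kernel, that kernel's output is the negated product, so for non-zero inputs it should be the one reported here.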
Create multiple threads from a correct kernel, run them for the kernel where the error is located, and compare the results with the previous results for that same kernel, so that we can determine the type of error.
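The brief does not say which error types should be distinguished. Assuming the goal is to tell a permanent error (reproduced on every run) from a transient one (not reproduced on every run), a rerun-and-compare check might look like the host-side C++ sketch below. `ErrorType`, `classify_error`, the number of reruns and the tolerance are all hypothetical choices, and the `Variant` callables stand in for wrappers around the real kernel launches.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <functional>
#include <vector>

using Matrix = std::vector<float>;
// Hypothetical wrapper type: one callable per kernel version.
using Variant = std::function<Matrix(const Matrix&, const Matrix&, std::size_t)>;

// Assumed classification; the brief does not define the error types.
enum class ErrorType { Transient, Permanent };

// Element-wise comparison with an absolute tolerance.
bool nearly_equal(const Matrix& x, const Matrix& y, float tolerance) {
    if (x.size() != y.size()) return false;
    for (std::size_t i = 0; i < x.size(); ++i) {
        if (std::fabs(x[i] - y[i]) > tolerance) return false;
    }
    return true;
}

// Reruns the suspect kernel several times (spread over OpenMP threads when
// OpenMP is enabled) and compares each run with a known-correct kernel.
// Permanent: every rerun mismatches. Transient: at least one rerun matches.
ErrorType classify_error(const Variant& suspect, const Variant& reference,
                         const Matrix& a, const Matrix& b, std::size_t width,
                         int reruns = 3, float tolerance = 1e-4f) {
    assert(reruns > 0);
    const Matrix expected = reference(a, b, width);
    int mismatches = 0;
    #pragma omp parallel for reduction(+ : mismatches)
    for (int i = 0; i < reruns; ++i) {
        if (!nearly_equal(suspect(a, b, width), expected, tolerance)) {
            ++mismatches;
        }
    }
    return mismatches == reruns ? ErrorType::Permanent : ErrorType::Transient;
}
```

A deliberately injected sign change would be expected to show up as permanent under this scheme, since it is reproduced on every rerun.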
7 freelancers are bidding on average $285 for this job