Annulleret

Matrix Multiplication using hybrid of OpenMp and CUDA

Hi , I have a part of code to do matrix multiplication by using Cuda & Openmp . I want run three copies of matrix multiplication (same inputs) at parallel on three kernel . Then comparing the results for three kernels .

Three versions of multiplication arrays are as follows:

One of kernel do with shared memory, other without shared memory and last one do like this equation:

A * B "transpose of B" = C.

- Review the previous code

- Run the three copies in parallel "use Openmp". Run each kernels on a separate threads then compare three results and show me the results in Commands windows .After verification of the preliminary results of the program

Create an error in one of the kernels like this:

Change the addition process to subtract:

Cvalue + = ([url removed, login to view] [row * width + e]) * ([url removed, login to view] [e * width + col]);

To ensure that the program can detect the error , change the addition sign to subtract __>

Cvalue - = ([url removed, login to view] [row * width + e]) * ([url removed, login to view] [e * width + col]);

And determine where the error occurred in which kernels one or two or three .

Create multithreading from correct kernel and run it on the kernel where the error located and compared the results with previous results for the same kernel .So that we can determine the type of error .

Færdigheder: C programmering, CUDA, GPGPU

Se mere: implement multithreaded matrix multiplication using pthreads, implement matrix multiplication using pthreads, matrix matrix multiplication using openmp mpi hybrid, matrix multiplication using mpi library, language program implement multithreaded matrix multiplication using pthreads, multithreaded matrix multiplication using pthreads, matrix multiplication using openmp, matrix matrix multiplication using openmp, matrix multiplication using pthread, openmp matrix multiplication code, matrix power using cuda, code matrix multiplication using pthreads, matrix multiplication using multithreading, matrix multiplication using pthreads, matrix multiplication using mips

Om arbejdsgiveren:
( 0 bedømmelser ) Saudi Arabia

Projekt-ID: #14840821

Tildelt til:

profvipabutaleb

I'm computer engineering TA with 10+ years of experience. I'm experienced with GPU programming over CUDA - Linear algebra over C/C++ / Java/Python Relevant Skills and Experience I'm experienced with data structure Mere

$250 USD in 10 dage
(13 bedømmelser)
3.5

7 freelancere byder i gennemsnit $285 for dette job

ThanassisKalv

hello I have strong background and experience with CUDA C development, you can see many completed projects in my profile. I can solve this with clear and effective code. You can ask me anything ofc Relevant Skills and Mere

$250 USD in 4 dage
(38 bedømmelser)
4.9
$220 USD in 4 dage
(16 bedømmelser)
3.6
eliasdphiri217

Coming from a computational physics background I have lots of numerical analysis and programming experience. I have done such work before when I was working on a research team building a computing cluster using Nvidia Mere

$333 USD in 7 dage
(0 bedømmelser)
0.0
MetaoriginLab

We are a Team of Technical Consultants and Data Engineers having healthy experience into Big Data technologies,IOT/Cloud/AWS and Python/AI+Machine Learning. The Dynamic force has qualified engineers having expertise in Mere

$319 USD in 5 dage
(0 bedømmelser)
0.0
$305 USD in 10 dage
(0 bedømmelser)
0.0
josator

I have experience developing CUDA software during my PhD with multithreading (pthreads) in the host machine to paralellise IO and GPGPU calculations. This helped in the development of a bioinformatics application. I Mere

$315 USD in 10 dage
(0 bedømmelser)
0.0