HYPERDRIVE

A fast, parallel linear solver

In our quest for the implementation and analysis of a high performance, fast Conjugate Gradient solver, so far we have achieved the following goals:

Studied the Conjugate-Gradient algorithm and experimented with the results of running the code in the available library.
Wrote our implementation of the algorithm in C++ and MATLAB.
Tested the algorithm against our custom matrix that was carefully created to be a symmetric, positive definite matrix which is a requirement for the CG algorithm to converge to acceptable results.
The algorithm was tested against varying sizes of the custom matrix, starting with a 50x50, all the way upto a matrix of size of 14000x14000.
The execution times for each of the matrices was noted and plotted to study how the algorithm behaves with increasing sizes of the matrices and determine the breakeven point, after which it would be necessary to parallelize or kernalize the algorithm to obtain results by convergence faster.
Once we achieved the correct implementation and execution times against our custom matrix, we tested the algorithms against available test matrices of the order of 1089 and profiled the code to obtain the regions with the most scope for parallelization.
After implementing the OpenMP primitives, the new execution time for the 1089x1089 test matrix was noted.

Sr. No.	Execution Time(s)
1	189.83

Thus, we observed a speedup of 6.6x of the OpenMP implementation over the serial code.