Commit
Slightly different AVX512F, a bit better 2.06E6 (remove -mprefer-vector-width=512)

./check.exe -p 16384 32 1
***********************************************************************
NumBlocksPerGrid            = 16384
NumThreadsPerBlock          = 32
NumIterations               = 1
-----------------------------------------------------------------------
FP precision                = DOUBLE (nan=0)
Complex type                = STD::COMPLEX
RanNumb memory layout       = AOSOA[8]
Momenta memory layout       = AOSOA[8]
Internal loops fptype_sv    = VECTOR[8] (AVX512F)
Random number generation    = CURAND (C++ code)
-----------------------------------------------------------------------
NumberOfEntries             = 1
TotalTime[Rnd+Rmb+ME] (123) = ( 3.753826e-01 ) sec
TotalTime[Rambo+ME]    (23) = ( 3.477140e-01 ) sec
TotalTime[RndNumGen]    (1) = ( 2.766860e-02 ) sec
TotalTime[Rambo]        (2) = ( 9.360017e-02 ) sec
TotalTime[MatrixElems]  (3) = ( 2.541139e-01 ) sec
MeanTimeInMatrixElems       = ( 2.541139e-01 ) sec
[Min,Max]TimeInMatrixElems  = [ 2.541139e-01 , 2.541139e-01 ] sec
-----------------------------------------------------------------------
TotalEventsComputed         = 524288
EvtsPerSec[Rnd+Rmb+ME](123) = ( 1.396676e+06 ) sec^-1
EvtsPerSec[Rmb+ME]     (23) = ( 1.507814e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.063201e+06 ) sec^-1
***********************************************************************
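
The throughput figures in the log above can be cross-checked from the reported timings. The sketch below is not part of check.exe. It assumes that the event count is blocks x threads x iterations (which matches the logged 524288) and that each EvtsPerSec value is that count divided by the matching TotalTime. All variable and function names are hypothetical.

```python
# Values copied from the benchmark log; names are illustrative only.
num_blocks = 16384          # NumBlocksPerGrid
threads_per_block = 32      # NumThreadsPerBlock
iterations = 1              # NumIterations

time_total = 3.753826e-01         # TotalTime[Rnd+Rmb+ME], seconds
time_rambo_me = 3.477140e-01      # TotalTime[Rambo+ME], seconds
time_matrix_elems = 2.541139e-01  # TotalTime[MatrixElems], seconds


def events_per_sec(n_events: int, seconds: float) -> float:
    """Throughput as events divided by elapsed wall time (assumed relation)."""
    return n_events / seconds


# Assumed relation: one event per thread per iteration.
total_events = num_blocks * threads_per_block * iterations

rate_total = events_per_sec(total_events, time_total)
rate_rambo_me = events_per_sec(total_events, time_rambo_me)
rate_matrix_elems = events_per_sec(total_events, time_matrix_elems)

print(f"TotalEventsComputed     = {total_events}")
print(f"EvtsPerSec[Rnd+Rmb+ME]  = {rate_total:.6e}")
print(f"EvtsPerSec[Rmb+ME]      = {rate_rambo_me:.6e}")
print(f"EvtsPerSec[MatrixElems] = {rate_matrix_elems:.6e}")
```

The matrix-element rate is the figure the commit title rounds to 2.06E6. Because the logged times carry seven significant digits, the recomputed rates should only be expected to agree with the logged ones to about that precision.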