Skip to content

Commit

Permalink
Small optimization for BoxReciprocalSumsGPU
Browse files Browse the repository at this point in the history
  • Loading branch information
LSchwiebert committed Sep 7, 2024
1 parent c396711 commit c8efcd0
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/GPU/CalculateEwaldCUDAKernel.cu
Original file line number Diff line number Diff line change
Expand Up @@ -140,7 +140,7 @@ __global__ void BoxReciprocalSumsGPU(double *gpu_x, double *gpu_y,
double *gpu_sumRnew, double *gpu_sumInew) {
int image = blockIdx.x;
double sumR = 0.0, sumI = 0.0;
#pragma unroll 16
#pragma unroll 8
for (int particleID = threadIdx.x; particleID < atomNumber; particleID += THREADS_PER_BLOCK_SM) {
double dot = DotProductGPU(gpu_kx[image], gpu_ky[image],
gpu_kz[image], gpu_x[particleID],
Expand Down

0 comments on commit c8efcd0

Please sign in to comment.