-
I'm trying to run some queries on big data. I've taken a portion of our data (only 43GB) and test some query with 15 fields in two scenarios: 24 CPU cores with 200 files, up to 400MB per file I'm using the following command to run the GPU code: Changing maxPartitionBytes or concurrentGpuTasks or any other parameter, doesn't seem to have any effect. Any idea would be highly appereciated. |
Beta Was this translation helpful? Give feedback.
Replies: 7 comments
-
With the GPU being mostly idle, I'm wondering about two possibilities:
To answer the first question, you could run with the config If the query is dominated by filesystem access, then running the query with less than half of the CPU cores (10 vs. 24) could significantly slowdown the GPU run. Fetching the raw data (as opposed to decoding the data) is still processed by the CPU, so this could be a significant contributor of the slowdown in comparison. To help answer this question, you could try running with more CPU cores for your GPU-configured setup and see how it impacts the query. Separately, you could use the Spark SQL web UI to examine the graphical query plan and see if the |
Beta Was this translation helpful? Give feedback.
-
To help on 1st possibility mentioned by @jlowe , we have a workload qualification doc here: After setting |
Beta Was this translation helpful? Give feedback.
-
Wow, that a ton of information. Thank you both! Thanks! |
Beta Was this translation helpful? Give feedback.
-
@viadea Thanks for the input, very helpful :) I've added the following as per the comments in the explain output As far as I can say, these are the remaning issues preventing the query to run entirely on the GPU:
` Is there anything further I can try to make it run on the GPU? thanks |
Beta Was this translation helpful? Give feedback.
-
The RAPIDS Accelerator does not currently support hash partitioning on |
Beta Was this translation helpful? Give feedback.
-
@jlowe I've updated #4900 with all the missing ops I've encountered so far. |
Beta Was this translation helpful? Give feedback.
-
Are there further questions for this issue, or is it covered by the other issues? |
Beta Was this translation helpful? Give feedback.
With the GPU being mostly idle, I'm wondering about two possibilities:
To answer the first question, you could run with the config
spark.rapids.sql.explain
set totrue
, and then you should see log messages for any portions of queries that are not on the GPU (and why they're not on the GPU). Depending on how many rows are being processed by nodes not on the GPU, it could contribute substantially to the slowdown you're seeing. Also if there are portions of the query not running on the G…