From 3b25a4af1c86b5da45046a54a6b6d773e091e344 Mon Sep 17 00:00:00 2001 From: jameswillis Date: Mon, 6 Jan 2025 19:32:23 -0800 Subject: [PATCH] add note about setting checkpoint dir for DBSCAN --- docs/tutorial/sql.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/docs/tutorial/sql.md b/docs/tutorial/sql.md index da2a0f52bb..cd81ce5d5f 100644 --- a/docs/tutorial/sql.md +++ b/docs/tutorial/sql.md @@ -858,6 +858,10 @@ The algorithm is available as a Scala and Python function called on a spatial da The first parameter is the dataframe, the next two are the epsilon and min_points parameters of the DBSCAN algorithm. +!!!Note + The sparkContext's checkpoint directory must be set to use DBSCAN. Sedona's DBSCAN implementation uses Graphframes + which requires a checkpoint directory to be set. This can be done by calling `sparkContext.setCheckpointDir("path/to/checkpoint")`. + === "Scala" ```scala