Support GeoJSON for geo_point #85120

craigtaverner · 2022-03-18T17:02:03Z

Support GeoJSON for points when the mapper specifies geo_point.

iverase · 2022-03-21T07:08:25Z

server/src/main/java/org/elasticsearch/common/geo/GeoUtils.java

+                            subParser.nextToken();
+                            if (subParser.currentToken() == Token.START_ARRAY) {
+                                coordinates = new ArrayList<>();
+                                subParser.nextToken();


I wonder if we should stricter here. We expect a two / three dimensional double array. WE should only accept that else throw an error?

We do that on line 527. I noticed that the code here from before does a little validation during the parsing and then more validation on the results further down. I think we could do even more validation, like enforcing that you do not mix and match multiple types of data.

I've added a lot more validation now

iverase · 2022-03-21T07:15:06Z

server/src/main/java/org/elasticsearch/common/geo/GeoUtils.java

@@ -492,6 +519,14 @@ public static GeoPoint parseGeoPoint(XContentParser parser, GeoPoint point, fina
                } else {
                    return point.parseGeoHash(geohash, effectivePoint);
                }
+            } else if (coordinates != null) {
+                if (geojsonType == null || geojsonType.toLowerCase(Locale.ROOT).equals("point") == false) {


We need to check that we have no other elements in GeoJson (lat, lon or geohash)

Yes, I've generalized the validation now to cover all types.

craigtaverner · 2022-03-21T09:20:09Z

test/framework/src/main/java/org/elasticsearch/search/geo/GeoPointShapeQueryTestCase.java

+        }
+    }
+
+    public void testQueryPointFromMultiPoint() throws Exception {


This test is a duplicate of the test testQueryPointFromMultiPointFormats but with the number of supported formats reduced to those supported by the GeoJSON parser (so removing point as double[] and lat,lon string). This is because the current implementation added point geojson to the point parser, instead of adding point parsing to the geojson parser. The question is whether we should do it the other way round. The consequence of adding point support to the GeoJSON parser is that geo_shape mappers will start understanding the alternative versions of point (double[] and lat,lon string). Is that a good or a bad consequence?

I've generalized the test, keeping only test data separate (so geo_point tests all four formats, while geo_shape tests only WKT and GeoJSON).

craigtaverner · 2022-03-22T10:40:29Z

server/src/main/java/org/elasticsearch/common/geo/GeoUtils.java

-        NumberFormatException numberFormatException = null;
+        String geojsonType = null;
+        ArrayList<Double> coordinates = null;
+        class NumberFormatExceptionHandler {


This class exists only to keep the old behaviour that NumberFormatException is only handled at the end of parsing. However, there is no obvious reason why we would need that. It seems to me perfectly sufficient to throw the exception at the point that NumberFormatException arises, right? We have a number of other exceptions being thrown immediately during parsing, so why should this particular exception be treated differently. If we allow it to be thrown during parsing, this inner class can get replaced by a simple utility method.

On theory I have is that it could be that when throwing exceptions during parsing, it is impossible to continue parsing (because Objects are not closed, or fully parsed), so the entire import (or entire query) is failed. This can be fixed by catching exceptions, finishing parsing the inner object and then re-throw the exception (which is what we see happening with NumberFormatException) and then the outer code can decide whether to catch and continue, or re-throw. I remember seeing some comment somewhere about this case. So perhaps NumberFormatException is considered a case we want to just log and continue, while other exceptions are considered serious enough to break the entire import (or query)?

OK, after discussions with @iverase we decided to remove this. The work done in #40447 should cover the requirements to finish parsing objects, so we don't need to capture exceptions anymore.

elasticmachine · 2022-03-22T10:52:03Z

Pinging @elastic/es-analytics-geo (Team:Analytics)

elasticsearchmachine · 2022-03-22T10:53:31Z

Hi @craigtaverner, I've created a changelog YAML for you.

iverase

Just left a small nit, otherwise LGTM

server/src/main/java/org/elasticsearch/common/geo/GeoUtils.java

craigtaverner · 2022-03-22T14:52:19Z

@elasticmachine run elasticsearch-ci/part-2

craigtaverner · 2022-04-25T17:12:24Z

This work was enhanced further in #85442

craigtaverner added the :Analytics/Geo Indexing, search aggregations of geo points and shapes label Mar 18, 2022

craigtaverner requested a review from iverase March 18, 2022 17:02

elasticsearchmachine added the v8.2.0 label Mar 18, 2022

craigtaverner force-pushed the geojson_for_geo_point branch from b21e5ce to 43f446a Compare March 18, 2022 17:23

Support GeoJSON for geo_point

bbc61bd

craigtaverner force-pushed the geojson_for_geo_point branch from 43f446a to bbc61bd Compare March 18, 2022 17:25

iverase reviewed Mar 21, 2022

View reviewed changes

craigtaverner commented Mar 21, 2022

View reviewed changes

craigtaverner added 5 commits March 21, 2022 17:20

Better error handling for GeoJSON parsing

a4cb6de

Fixed failing tests

318850e

Generalized test with only test data specialized

ee6f1b0

More point parsing tests

f254ccd

Remove outdated TODOs

f654224

craigtaverner commented Mar 22, 2022

View reviewed changes

craigtaverner marked this pull request as ready for review March 22, 2022 10:52

elasticmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Mar 22, 2022

craigtaverner added the >enhancement label Mar 22, 2022

craigtaverner added 2 commits March 22, 2022 11:53

Update docs/changelog/85120.yaml

425f34b

Simplify exception handling for parsing doubles

e6e9573

iverase approved these changes Mar 22, 2022

View reviewed changes

server/src/main/java/org/elasticsearch/common/geo/GeoUtils.java Show resolved Hide resolved

craigtaverner mentioned this pull request Mar 23, 2022

Add GeoJSON Point format support for geo_point field value formats #47815

Closed

craigtaverner merged commit 042b964 into elastic:master Mar 23, 2022

This was referenced Apr 21, 2022

Added documentation on GeoJSON format for points and geo-points #86066

Merged

Support 'GeoJSON' in CartesianPoint for 'point' #85442

Merged

craigtaverner mentioned this pull request May 17, 2022

geo_distance does not support GeoJSON for points #86834

Open

craigtaverner deleted the geojson_for_geo_point branch October 20, 2023 10:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support GeoJSON for geo_point #85120

Support GeoJSON for geo_point #85120

craigtaverner commented Mar 18, 2022 •

edited

Loading

iverase Mar 21, 2022

craigtaverner Mar 21, 2022

craigtaverner Mar 22, 2022

iverase Mar 21, 2022

craigtaverner Mar 21, 2022

craigtaverner Mar 22, 2022

craigtaverner Mar 21, 2022

craigtaverner Mar 22, 2022

craigtaverner Mar 22, 2022

craigtaverner Mar 22, 2022

craigtaverner Mar 22, 2022

elasticmachine commented Mar 22, 2022

elasticsearchmachine commented Mar 22, 2022

iverase left a comment

craigtaverner commented Mar 22, 2022

craigtaverner commented Apr 25, 2022

Support GeoJSON for geo_point #85120

Support GeoJSON for geo_point #85120

Conversation

craigtaverner commented Mar 18, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

elasticmachine commented Mar 22, 2022

elasticsearchmachine commented Mar 22, 2022

iverase left a comment

Choose a reason for hiding this comment

craigtaverner commented Mar 22, 2022

craigtaverner commented Apr 25, 2022

craigtaverner commented Mar 18, 2022 •

edited

Loading