Mapping a GeoPoint field
Elasticsearch natively supports the use of geolocation types – special types that allow you to localize your document in geographic coordinates (latitude and longitude) around the world.
Two main types are used in the geographic world: the point and the shape. In this recipe, we'll look at GeoPoint – the base element of geolocation.
Getting ready
You will need an up-and-running Elasticsearch installation, as we described in the Downloading and installing Elasticsearch recipe of Chapter 1, Getting Started.
To execute the commands in this recipe, you can use any HTTP client, such as curl (https://curl.haxx.se/), Postman (https://www.getpostman.com/), or similar. I suggest using the Kibana console, which provides code completion and better character escaping for Elasticsearch.
How to do it…
The type of the field must be set to geo_point
to define a GeoPoint.
We can extend the order example by adding a new field that stores the location of a customer. This will result in the following output:
PUT test/_mapping { "properties": { "id": {"type": "keyword",}, "date": {"type": "date"}, "customer_id": {"type": "keyword"}, "customer_ip": {"type": "ip"}, "customer_location": {"type": "geo_point"}, "sent": {"type": "boolean"} } }
How it works…
When Elasticsearch indexes a document with a GeoPoint field (lat_lon
), it processes the latitude and longitude coordinates and creates special accessory field data to provide faster query capabilities on these coordinates. This is because a special data structure is created to internally manage latitude and longitude.
Depending on the properties, given the latitude and longitude, it's possible to compute the geohash
value (for details, I suggest reading https://www.pubnub.com/learn/glossary/what-is-geohashing/). The indexing process also optimizes these values for special computation, such as distance, ranges, and shape match.
GeoPoint has special parameters that allow you to store additional geographic data:
lat_lon
(the default isfalse
): This allows you to store the latitude and longitude as the.lat
and.lon
fields. Storing these values improves the performance of many memory algorithms that are used in distance and shape calculus.
It makes sense to set lat_lon
to true
so that you store them if there is a single point value for a field. This speeds up searches and reduces memory usage during computation.
geohash
(the default isfalse
): This allows you to store the computedgeohash
value.geohash_precision
(the default is12
): This defines the precision to be used ingeohash
calculus.
For example, given a geo point value, [45.61752, 9.08363]
, it can be stored using one of the following syntaxes:
customer_location
= [45.61752, 9.08363]
customer_location.lat
=45.61752
customer_location.lon
=9.08363
customer_location.geohash
=u0n7w8qmrfj
There's more...
GeoPoint is a special type and can accept several formats as input:
lat
andlon
as properties, as shown here:{ "customer_location": { "lat": 45.61752, "lon": 9.08363 },
lan
andlon
as strings, as follows:"customer_location": "45.61752,9.08363",
geohash
as a string, as shown here:"customer_location": "u0n7w8qmrfj",
- As a
GeoJSON
array (note that here,lat
andlon
are reversed), as shown in the following code snippet:"customer_location": [9.08363, 45.61752]