Remove index pattern mapping cache #6498

rashidkpc · 2016-03-10T17:29:03Z

Currently we cache a normalized view of the Elasticsearch mapping because large mappings are expensive to parse. This causes lots of other problems, for example when a user adds a field we don't see it unless they manually refresh the mapping cache, which is a non-obvious task. Basically any time the mapping changes, it becomes painful for the user: #2236, #6362

And its not just mapping changes, we still have issues with parsing the large mappings in the first place, the more indices the user has, the longer it takes. Thats why we do stuff like restrict the parsing to 5 indices by default. #1540, #2928

The crux of the issue goes back to the pre-beta1 days when Kibana 4 didn't have a server to offload this stuff to, so we do it in the browser. There's 3 things that would help this:

Move index parsing to the server. The work on the ingest API is a good first step.
Don't cache the mappings forever. Retrieving these from Elasticsearch is cheap, we can do it regularly on the server. We could cache in memory for a short period, but we don't need to keep them forever
Get normalized mappings from Elasticsearch: Return an aggregated view of all mappings/properties of all types elasticsearch#15728

The first 2 we can do immediately, the last one would be an amazing optimization that would make everyone's life a lot easier and remove a lot of load and code from the Kibana backend.

Bargs · 2016-03-11T15:07:50Z

I'll just leave this here: #5575

There's some extra cruft in there, but that PR already has most of 1 and 2.

rfarley3 · 2016-04-12T19:15:16Z

+1

Evertras · 2016-06-29T01:27:18Z

Pre-baking Kibana instances becomes ugly with these caches. Being able to do it on the fly, even as an option for smaller instances, would be fantastic. +1

hannayurkevich · 2016-08-30T21:41:45Z

+1

droberts195 · 2016-10-10T13:00:40Z

+1

It would make life a lot easier for Prelert if Kibana just used mappings direct from Elasticsearch rather than having its own mappings.

sophiec20 · 2016-10-10T15:44:58Z

+1

tbuching · 2017-07-05T16:20:35Z

+1
Do you have any ideas, when something like Solution 2 could land in the final product?

JulienCarnec · 2018-05-29T08:38:30Z

+1
Refreshing mapping automatically or expose some API to refresh index patterns would be nice too.

fakenine · 2018-06-06T12:51:17Z

+1
Do you have an ETA for a solution like an automatical refresh or API endpoint ?

Hariharan-Gandhi · 2018-06-06T12:58:06Z

+1 API for refresh

tarraschk · 2018-06-06T13:01:12Z

+1

fwininger · 2018-06-06T13:06:39Z

Please, someone can give us some help ?

AustinBGibbons · 2018-06-07T17:29:31Z

+1 for API Refresh - also preserving the "popularity"

Or current approach is going to be to directly call

GET _plugin/kibana/api/index_patterns/_fields_for_wildcard?pattern=...

PUT _plugin/kibana/api/saved_objects/index-pattern/

in imitation of the network requests that we see in the refresh icon

sgarg7 · 2019-03-28T18:22:12Z

Upgrading our users from from Kibana 3 to Kibana 6 and Kibana hangs when it tries to load an index with ~22K fields, even after the mappings have been cached - #32153
That's a big difference between how the new Kibana handles large indexes. Moving forward on this would be great, thanks.

elasticmachine · 2019-04-29T16:48:57Z

Pinging @elastic/kibana-app-arch

sgarg7 · 2019-07-19T18:44:58Z

Are there any suggested workarounds for this error?

akshayurdh · 2019-10-17T08:49:09Z

+1
Need Index pattern refresh API.

AndrewMcQuerry · 2019-11-05T18:34:06Z

+1

alexios-y · 2019-11-27T11:50:04Z

+1. Just realized it's been 5 years since the first issue was raised.

fabrei · 2020-01-15T14:23:30Z

+1 for API Refresh - also preserving the "popularity"

Or current approach is going to be to directly call
GET _plugin/kibana/api/index_patterns/_fields_for_wildcard?pattern=...
PUT _plugin/kibana/api/saved_objects/index-pattern/
in imitation of the network requests that we see in the refresh icon

That is fine if it would work. I wrote this two lines of bash script to do exactly the same requests as the browser sends to kibana backend (replace with your specific one).

refresh_payload=$(curl -X GET 'localhost:5601/api/index_patterns/_fields_for_wildcard?pattern=packets*&meta_fields=_source&meta_fields=_id&meta_fields=_type&meta_fields=_index&meta_fields=_score' | jq '.fields[] | . + {count: 0} | . + {scripted: false}' | jq -s '. | tostring | {"attributes": {"title": "packets*", "timeFieldName": "timestamp", "fields": . }}')

curl -X PUT 'localhost:5601/api/saved_objects/index-pattern/<index-id>' -H 'kbn-xsrf: true' -H 'Content-Type: application/json' -d "$refresh_payload"

I added a new field to my index template and updated all existing indices as well. After the curl requests, I get an answer that the index pattern was updated. But the pattern was not updated in my kibana dashboard. I checked if the added field is in the response of the first curl; it is. Also it is not possible to filter by the added field or create a chart. So I think the code for refreshing a pattern makes another request which is not tracked by my developer tool..But after a day passed, the pattern was successfully refreshed and I had access to the added field.

If you take a look at the refresh button in kibana settings (with a developer tool), you see that the button calls the function refreshFields(). I took a look into the code and found that you need an IndexPattern-object. This object has this specific method. In my case it would be nice to call refreshFields() manually from my plugin which I wrote. Actually I am experimenting, how I can initiate an IndexPattern-object. But does anyone already have an idea?

mmguero · 2020-01-15T14:30:25Z

UPDATE: Thanks to @fabrei in idaholab/Malcolm#100, he suggested something to make the script I pasted more robust. I've updated the link and the code here to reflect that (using a _find to get the index ID based on the index pattern name vs. just assuming they're the same):

Here's a python script I wrote to refresh my index pattern fields in my project:

#!/usr/bin/env python
# -*- coding: utf-8 -*-

from __future__ import print_function

import argparse
import json
import requests
import os
import sys

GET_STATUS_API = 'api/status'
GET_INDEX_PATTERN_INFO_URI = 'api/saved_objects/_find'
GET_FIELDS_URI = 'api/index_patterns/_fields_for_wildcard'
PUT_INDEX_PATTERN_URI = 'api/saved_objects/index-pattern'

###################################################################################################
debug = False
PY3 = (sys.version_info.major >= 3)
scriptName = os.path.basename(__file__)
scriptPath = os.path.dirname(os.path.realpath(__file__))
origPath = os.getcwd()

###################################################################################################
if not PY3:
  if hasattr(__builtins__, 'raw_input'): input = raw_input

try:
  FileNotFoundError
except NameError:
  FileNotFoundError = IOError

###################################################################################################
# print to stderr
def eprint(*args, **kwargs):
  print(*args, file=sys.stderr, **kwargs)

###################################################################################################
# convenient boolean argument parsing
def str2bool(v):
  if v.lower() in ('yes', 'true', 't', 'y', '1'):
    return True
  elif v.lower() in ('no', 'false', 'f', 'n', '0'):
    return False
  else:
    raise argparse.ArgumentTypeError('Boolean value expected.')

###################################################################################################
# main
def main():
  global debug

  parser = argparse.ArgumentParser(description=scriptName, add_help=False, usage='{} <arguments>'.format(scriptName))
  parser.add_argument('-v', '--verbose', dest='debug', type=str2bool, nargs='?', const=True, default=False, help="Verbose output")
  parser.add_argument('-i', '--index', dest='index', metavar='<str>', type=str, default='sessions2-*', help='Index Pattern Name')
  parser.add_argument('-k', '--kibana', dest='url', metavar='<protocol://host:port>', type=str, default='http://localhost:5601/kibana', help='Kibana URL')
  parser.add_argument('-n', '--dry-run', dest='dryrun', type=str2bool, nargs='?', const=True, default=False, help="Dry run (no PUT)")
  try:
    parser.error = parser.exit
    args = parser.parse_args()
  except SystemExit:
    parser.print_help()
    exit(2)

  debug = args.debug
  if debug:
    eprint(os.path.join(scriptPath, scriptName))
    eprint("Arguments: {}".format(sys.argv[1:]))
    eprint("Arguments: {}".format(args))
  else:
    sys.tracebacklimit = 0

  # get version number so kibana doesn't think we're doing a XSRF when we do the PUT
  statusInfoResponse = requests.get('{}/{}'.format(args.url, GET_STATUS_API))
  statusInfoResponse.raise_for_status()
  statusInfo = statusInfoResponse.json()
  kibanaVersion = statusInfo['version']['number']
  if debug:
    eprint('Kibana version is {}'.format(kibanaVersion))

  # find the ID of the index name (probably will be the same as the name)
  getIndexInfoResponse = requests.get(
    '{}/{}'.format(args.url, GET_INDEX_PATTERN_INFO_URI),
    params={
      'type': 'index-pattern',
      'fields': 'id',
      'search': f'"{args.index}"'
    }
  )
  getIndexInfoResponse.raise_for_status()
  getIndexInfo = getIndexInfoResponse.json()
  indexId = getIndexInfo['saved_objects'][0]['id'] if (len(getIndexInfo['saved_objects']) > 0) else None
  if debug:
    eprint('Index ID for {} is {}'.format(args.index, indexId))

  if indexId is not None:

    # get the fields list
    getFieldsResponse = requests.get('{}/{}'.format(args.url, GET_FIELDS_URI),
                                     params={ 'pattern': args.index,
                                              'meta_fields': ["_source","_id","_type","_index","_score"] })
    getFieldsResponse.raise_for_status()
    getFieldsList = getFieldsResponse.json()['fields']
    if debug:
      eprint('{} would have {} fields'.format(args.index, len(getFieldsList)))

    # set the index pattern with our complete list of fields
    if not args.dryrun:
      putIndexInfo = {}
      putIndexInfo['attributes'] = {}
      putIndexInfo['attributes']['title'] = args.index
      putIndexInfo['attributes']['fields'] = json.dumps(getFieldsList)

      putResponse = requests.put('{}/{}/{}'.format(args.url, PUT_INDEX_PATTERN_URI, indexId),
                                 headers={ 'Content-Type': 'application/json',
                                           'kbn-xsrf': 'true',
                                           'kbn-version': kibanaVersion, },
                                 data=json.dumps(putIndexInfo))
      putResponse.raise_for_status()

    # if we got this far, it probably worked!
    if args.dryrun:
      print("success (dry run only, no write performed)")
    else:
      print("success")

  else:
    print("failure (could not find Index ID for {})".format(args.index))

if __name__ == '__main__':
  main()

fabrei · 2020-01-16T13:12:49Z

Thanks for sharing :) I don't know what I did wrong yesterday, but today my script works as well. I took a look into the management view for index patterns in my dashboard and the number of indexed fields had not been updated yesterday. Maybe the reload did not went totally right yesterday. As I know now which requests I have to do, I will integrate it in my plugin. But if someone knows how the already existing function refreshFields() can be used in a plugin and she/he shares this information, I would feel really happy about it =)

mbudge · 2020-02-18T11:34:55Z

+1
This would be very useful!

slmingol · 2020-03-06T20:25:14Z

A co-worker recently wrote a Golang app which does this for us:

https://github.com/Jmainguy/kibanaRefreshFields

ikawalec · 2020-07-03T19:55:07Z

+1

skmizuho · 2020-08-06T09:00:24Z

+1

debu99 · 2020-08-24T11:29:37Z

+1

Nilubkal · 2020-10-13T15:54:40Z

+1

mattkime · 2020-12-02T20:21:22Z

No longer needed as field list is no longer cached - #82223 - will be released in 7.11

rashidkpc added release_note:enhancement Feature:http labels Mar 10, 2016

spalger added the P1 label Mar 10, 2016

Bargs mentioned this issue Mar 11, 2016

Missing field mappings - Increase the default lookBack setting? #6362

Closed

Bargs self-assigned this Mar 24, 2016

Bargs mentioned this issue Mar 30, 2016

Config service server side #5317

Closed

Bargs mentioned this issue Apr 7, 2016

Include metaFields when creating an index pattern via the Ingest api #6823

Closed

rashidkpc mentioned this issue Apr 12, 2016

Refresh mappings automatically #2236

Closed

Bargs mentioned this issue Apr 19, 2016

Doc fields added by Filebeat aren't included in index pattern field list #6983

Closed

jimmyjones2 mentioned this issue May 4, 2016

Mapping limited to last 5 indices #2928

Closed

PhaedrusTheGreek mentioned this issue Sep 8, 2016

FLS restricted fields should not be visible #8192

Closed

Bargs mentioned this issue Nov 4, 2016

add top_hit metric #7302

Merged

3 tasks

Bargs mentioned this issue Dec 20, 2016

[Management] Field properties don't seem to match es API response #9466

Open

tbragin added the Team:Core Core services & architecture: plugins, logging, config, saved objects, http, ES client, i18n, etc label Feb 7, 2017

epixa removed the P1 label Apr 25, 2017

epixa added enhancement New value added to drive a business result and removed release_note:enhancement labels May 7, 2018

sgarg7 mentioned this issue Mar 28, 2019

10 $digest() iterations reached. Aborting! for an index with 22K fields #32153

Closed

lukeelmers added Feature:Data Views Data Views code and UI - index patterns before 8.0 :AppArch labels Apr 29, 2019

mattkime mentioned this issue May 3, 2019

[DISCUSS] Rethinking index patterns; improvements #35481

Closed

Bargs removed their assignment Sep 18, 2019

fabrei mentioned this issue Jan 16, 2020

"HTTPError: Not Found for url" while index pattern refresh for Kibana cisagov/Malcolm#100

Closed

lukeelmers mentioned this issue Jun 4, 2020

Add API call to refresh index fields #57699

Closed

mattkime mentioned this issue Jul 14, 2020

Plan to remove index pattern field mapping cache #71787

Closed

mattkime removed the Team:Visualizations Visualization editors, elastic-charts and infrastructure label Oct 2, 2020

mattkime closed this as completed Dec 2, 2020

teebu mentioned this issue Nov 5, 2021

Index Patterns Refresh field list automatically update for new fields opensearch-project/OpenSearch-Dashboards#913

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove index pattern mapping cache #6498

Remove index pattern mapping cache #6498

rashidkpc commented Mar 10, 2016

Bargs commented Mar 11, 2016

rfarley3 commented Apr 12, 2016

Evertras commented Jun 29, 2016

hannayurkevich commented Aug 30, 2016

droberts195 commented Oct 10, 2016

sophiec20 commented Oct 10, 2016

tbuching commented Jul 5, 2017

JulienCarnec commented May 29, 2018

fakenine commented Jun 6, 2018

Hariharan-Gandhi commented Jun 6, 2018 •

edited

Loading

tarraschk commented Jun 6, 2018

fwininger commented Jun 6, 2018

AustinBGibbons commented Jun 7, 2018

sgarg7 commented Mar 28, 2019

elasticmachine commented Apr 29, 2019

sgarg7 commented Jul 19, 2019

akshayurdh commented Oct 17, 2019 •

edited

Loading

AndrewMcQuerry commented Nov 5, 2019

alexios-y commented Nov 27, 2019

fabrei commented Jan 15, 2020

mmguero commented Jan 15, 2020 •

edited

Loading

fabrei commented Jan 16, 2020

mbudge commented Feb 18, 2020

slmingol commented Mar 6, 2020

ikawalec commented Jul 3, 2020

skmizuho commented Aug 6, 2020

debu99 commented Aug 24, 2020

Nilubkal commented Oct 13, 2020

mattkime commented Dec 2, 2020

Remove index pattern mapping cache #6498

Remove index pattern mapping cache #6498

Comments

rashidkpc commented Mar 10, 2016

Bargs commented Mar 11, 2016

rfarley3 commented Apr 12, 2016

Evertras commented Jun 29, 2016

hannayurkevich commented Aug 30, 2016

droberts195 commented Oct 10, 2016

sophiec20 commented Oct 10, 2016

tbuching commented Jul 5, 2017

JulienCarnec commented May 29, 2018

fakenine commented Jun 6, 2018

Hariharan-Gandhi commented Jun 6, 2018 • edited Loading

tarraschk commented Jun 6, 2018

fwininger commented Jun 6, 2018

AustinBGibbons commented Jun 7, 2018

sgarg7 commented Mar 28, 2019

elasticmachine commented Apr 29, 2019

sgarg7 commented Jul 19, 2019

akshayurdh commented Oct 17, 2019 • edited Loading

AndrewMcQuerry commented Nov 5, 2019

alexios-y commented Nov 27, 2019

fabrei commented Jan 15, 2020

mmguero commented Jan 15, 2020 • edited Loading

fabrei commented Jan 16, 2020

mbudge commented Feb 18, 2020

slmingol commented Mar 6, 2020

ikawalec commented Jul 3, 2020

skmizuho commented Aug 6, 2020

debu99 commented Aug 24, 2020

Nilubkal commented Oct 13, 2020

mattkime commented Dec 2, 2020

Hariharan-Gandhi commented Jun 6, 2018 •

edited

Loading

akshayurdh commented Oct 17, 2019 •

edited

Loading

mmguero commented Jan 15, 2020 •

edited

Loading