Special Characters in generated _id field

We have LogStash sending GELF to OpenSearch. It was sending to Elastic previously, and we are trialling OpenSearch.

We are not specifying the _id field in the logstash-opensearch-output plugin, so I believe OpenSearch is generating the ID. I'm not sure, though. Maybe it's already in the GELF data?

The ID comes through with a URL-encoded 1:1: prefix:

E.g.: id: 1%3A1%3Am7X0UJIBRBPGnq12qh3C

This is causing problems in AWS OpenSearch, such as "Failed to load the anchor document" when I try to View Surrounding Documents. Searching suggests it is due to special characters, such as the percent signs.

%3A is the URL-encoded form of a colon ":"
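To confirm the decoding, a quick check with Python's standard library (the document ID string is the one from my screenshot above):

```python
from urllib.parse import unquote

doc_id = "1%3A1%3Am7X0UJIBRBPGnq12qh3C"
# unquote reverses percent-encoding: %3A -> ":"
print(unquote(doc_id))  # 1:1:m7X0UJIBRBPGnq12qh3C
```

So the stored _id appears to actually be `1:1:m7X0UJIBRBPGnq12qh3C`, with the colons percent-encoded somewhere along the way.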

Hi @BrockHenry ,

Could you please share your pipelines.yml for Logstash?

Where did you find id: 1%3A1%3Am7X0UJIBRBPGnq12qh3C ? There are a few id parameters in the ingestion pipeline from GELF to OpenSearch.

It is recommended to define the GELF id parameter:

In the OpenSearch cluster, the ID parameter is the unique identifier for a new document. For the logstash-opensearch-output plugin, I think it's an auto-generated value:

input {
  gelf {
    host => "172.20.20.81"
    use_tcp => true
    port_tcp => 12201
  }
}

output {
  opensearch {
    hosts => "https://xxxxxxxxxxxxxx.ap-southeast-2.aoss.amazonaws.com:443"
    ecs_compatibility => 'disabled'
    index => 'graylog'
    auth_type => {
      type => 'aws_iam'
      aws_access_key_id => 'xxxxxxxxxxxxxxxxxxx'
      aws_secret_access_key => 'xxxxxxxxxxxxxxxxxx'
      region => 'ap-southeast-2'
      service_name => 'aoss'
    }
    default_server_major_version => 2
    legacy_template => false
  }
}
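If you want to rule out auto-generation, you could try setting the ID explicitly. The opensearch output plugin supports a `document_id` option (like the elasticsearch output it is based on) — a sketch only, and I haven't verified this against a serverless collection; the `[@metadata][uuid]` field here is a hypothetical value you would populate in a filter (e.g. with the uuid filter plugin):

```
output {
  opensearch {
    # ... hosts / auth as in your config above ...
    index => 'graylog'
    document_id => "%{[@metadata][uuid]}"   # hypothetical field set earlier in the pipeline
  }
}
```

If the 1:1: prefix still appears with an explicit `document_id`, that would point at the cluster rather than the plugin.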

_id: 1%3A1%3Am7X0UJIBRBPGnq12qh3C is in the OpenSearch document itself.


When I click "View Single Document" or "View Surrounding Documents", the ID is in the URL, but the page fails to open correctly:

Cannot find document
No documents match that ID.

I thought it would be auto-generated by OpenSearch on ingestion, but why would it be generated with this invalid 1:1: prefix?

Thanks for your reply.

Soo… long story short.

We changed the collection type from time series to search, and it's all working correctly now.

Thanks for your help.