Aggregation on similarity

fisherijus · March 14, 2024, 2:45pm

Describe the issue:
I am feeding documents into OpenSearch that contain a field ‘MessageText’, and I want to aggregate on it to show the most common groups of similar message texts. I would consider two messages similar if their similarity score is high enough.

For example:

text #1 → hello my name is Jhon
text #2 → Hello my name is Lisa
text #3 → Helo i want pizza.

and I would get:

Hello my name is Jhon → Count = 2
Hello i want pizza → Count = 1
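
To make the grouping I have in mind concrete, here is a minimal client-side sketch in Python. It is not an OpenSearch aggregation, only an illustration of the expected result. The function name `group_similar`, the use of `difflib.SequenceMatcher` as the similarity score, and the 0.6 threshold are all assumptions made for this example.

```python
from difflib import SequenceMatcher


def group_similar(messages, threshold=0.6):
    """Greedily group messages whose similarity to a group's first
    member is at least `threshold` (hypothetical value, tune as needed)."""
    groups = []  # each entry: [representative_text, count]
    for text in messages:
        for group in groups:
            # Case-insensitive character-level similarity in [0, 1].
            score = SequenceMatcher(None, text.lower(), group[0].lower()).ratio()
            if score >= threshold:
                group[1] += 1
                break
        else:
            # No existing group was similar enough: start a new one.
            groups.append([text, 1])
    # Most common groups first.
    groups.sort(key=lambda g: g[1], reverse=True)
    return [(rep, count) for rep, count in groups]


messages = [
    "hello my name is Jhon",
    "Hello my name is Lisa",
    "Helo i want pizza.",
]
result = group_similar(messages)
for rep, count in result:
    print(f"{rep} → Count = {count}")
```

With the three sample texts, the first two fall into one group and the third stays on its own. What I am looking for is a way to get this kind of result from OpenSearch itself.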
