To convert RAG in OpenSearch to return results in a streaming manner, how do you do it?

Does the RAG feature in OpenSearch support streaming responses, similar to ChatGPT? Otherwise, the user has to wait too long for each query.

Each query takes several seconds.
It feels like the more relevant documents there are in the index, the slower the RAG queries become.

How to solve this problem

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.