Looking for Open-Source Models for Text-to-DSL Query Generation in OpenSearch

Arun · August 12, 2025, 4:06pm

I’m building a system where users enter a natural language query, and an LLM converts it into an OpenSearch DSL query. My workflow looks like this:

Provide the model with an OpenSearch index schema and 1–2 sample documents.
Pass the user’s natural language query.
Model generates the corresponding DSL.
Validate the DSL in OpenSearch.
Store valid query–DSL pairs for future fine-tuning.

I’m specifically looking for open-source models that:

Handle structured JSON output well (e.g., match, bool, filter, sort, aggregations).
Are easy to fine-tune with domain-specific DSL patterns.
Can run locally for testing but also scale in production.
Work well with schema + sample doc prompts.

If you’ve successfully fine-tuned any open-source model for OpenSearch DSL generation

pablo · October 22, 2025, 2:57pm

@Arun I think this could fit your needs.

jaspher · October 31, 2025, 5:32am

I’m building a similar setup where user prompts should be translated into OpenSearch DSL automatically.

Could anyone share which model(s) worked best for you — especially smaller or offline models

Topic		Replies	Views
Best AI/NLP model for converting natural language to OpenSearch queries Machine Learning	1	102	December 30, 2025
The DSL query is being generated in a different format while executing the agent OpenSearch	16	254	November 4, 2025
[Feedback] OpenSearch Assistant General Feedback discuss , feature-request	4	1698	January 1, 2024
How does opensearch fit into other models Machine Learning	2	251	July 30, 2024
[Feedback] Conversational Search and Retrieval Augmented Generation Using Search Pipeline - Experimental Release General Feedback discuss	12	1801	March 30, 2024

Looking for Open-Source Models for Text-to-DSL Query Generation in OpenSearch

Related topics