Connector for vertex ai models in conversational search

smawarsi · January 8, 2024, 10:54am

Hello All,

I am trying to create a connector Google Vertex AI models like chat-bison and text-bison. There are connector blueprints for OpenAI, Amazon BedRock etc. Can I invoke Google Vertex AI models using the connector? I tried to create one but the conversational search fails with the following error.

“Error from remote service: {\n "error": {\n "code": 400,\n "message": "1 instance(s) is allowed per prediction. Actual: 6",\n "status": "INVALID_ARGUMENT"\n }\n}\n”

The connector payload is given below:

{
    "name": "Vertex AI Chat Connector",
    "description": "The connector to Google Vertex AI",
    "version": 2,
    "protocol": "http",
    "parameters": {
        "endpoint": "<ENDPOINT>",
        "project": "<PROJECT>",
        "location" : "<LOCATION>",
        "model": "text-bison@002",
        "temperature": 0.2
    },
    "credential": {
        "VertexAI_Key": "<VERTEX_AI_KEY>"
    },
    "actions": [
        {
            "action_type": "predict",
            "method": "POST",
            "url": "https://${parameters.endpoint}/v1/projects/${parameters.project}/locations/${parameters.location}/publishers/google/models/${parameters.model}:predict",
            "headers": {
                "Authorization": "Bearer ${credential.VertexAI_Key}"
            },
            "request_body": "{\"instances\":${parameters.messages},\"parameters\":{\"temperature\":${parameters.temperature},\"maxOutputTokens\":256,\"topK\":40,\"topP\":0.95}}"
        }
    ]
}

Any help would be highly appreciated.

Thanks.

austinlee · January 8, 2024, 10:14pm

The error message says that you passed 6 messages (instances), but you can only pass 1 at a time. How are you testing your connector? Using the CLI (curl)?

smawarsi · January 9, 2024, 6:47am

I am using conversational search feature of OpenSearch. Followed the below steps:

Created a connector to Google Vertex AI PaLM 2 for text foundation model
Registered a new model in OpenSearch with the connector id
Deployed the model in OpenSearch
Created a new search pipeline with response processor retrieval_augmented_generation
Performed a conversational search by specifying the search pipeline created above with the below ext object

"ext": {
        "generative_qa_parameters": {
            "llm_model": "text-bison@002",
            "llm_question": "which modules were loaded?",
            "context_size": 1,
            "timeout": 30
        }
    }

I explicitly specified the context size = 1 so only one document is sent to the model. The conversational search API returns the error below.

“Error from remote service: {\n "error": {\n "code": 400,\n "message": "1 instance(s) is allowed per prediction. Actual: 6",\n "status": "INVALID_ARGUMENT"\n }\n}\n”

dhrubo · February 20, 2024, 9:58pm

Can you share your search payload in details?

system · April 20, 2024, 9:59pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Create a connector in order to Connecting to remote models OpenSearch	7	985	April 27, 2024
Create connector is failing with 502 and permission error for openai embedding OpenSearch configure	1	546	March 14, 2024
Conversational Search w/ Bedrock Claude Sonnet 3 OpenSearch discuss , troubleshoot , configure	0	62	November 25, 2024
Neural search text_embedding pipeline error Machine Learning	1	42	April 12, 2025
Unable to register a new model - error 'Invalid arguments in credential body' Machine Learning troubleshoot	2	95	January 12, 2025

Connector for vertex ai models in conversational search

Related topics