I have recently started using OpenSearch after I abandoned Elastic some years ago due to license changes. I am using both OpenSearch 3.0 and 2.17 and was using 2.17 (2 Node deployment) for the TORCH_FORMAT and encountered the issues using Python as it would download but not deploy. I have solved the issue for me by doing the following. But before I do that just some background on the tech stack used so you can see if your conditions approximate mine. I am running a host on hypervisor (in my case proxmox) enough RAM 20G, 8 Xeon cores, SSD disks but old tech. There is no GPU installed, OS Ubuntu 22.04LTS. My steps I took was as follows. The data frames API issue was my first problem and moved to “pandas==1.5.3”, Second I don’t have a GPU so installed the CPU version for torch “pip install torch==1.13.1 --index-url https://download.pytorch.org/whl/cpu” then the transformer version had to be changed as well “pip install transformers==4.37.2”. You will notice these are kinda old version of these packages and the reason for this is to try and match the version that were used for the version of OpenSearch you are using. Having said that I have not yet tested this for OpenSearch 3.0 and this will happen later for me once done with the training and when I want to use it in production. Hope this helps somebody with the same issues.
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Pretrained Model Download/Register Fails for TorchScript | 0 | 316 | July 29, 2024 | |
| Further Discussion from the issue written by @jdomkline | 1 | 83 | February 5, 2025 | |
| Error when deploying pretrained model: Unknown builtin op: aten::scaled_dot_product_attention | 3 | 286 | April 15, 2025 | |
| Unable to deploy the custom model in Opensearch | 4 | 149 | March 7, 2025 | |
| How to register local custom model? | 3 | 563 | November 5, 2024 |