Sure, @dhrubo! I hope it will help others who run into the same issue.
Have you read my comment about working around the firewall issue by pre-downloading the pytorch and tokenizers folders and copying them to the opensearch/data/ml_cache path on an ML node (and then either building a Docker image that includes them for new pods, or using docker cp for existing pods)?
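For anyone who wants to script the copy step, here is a minimal sketch of it in Python. It assumes the pytorch and tokenizers folders have already been downloaded into a local staging directory and that the node's ml_cache directory is reachable as a normal filesystem path (for example through a mounted volume). The function name `stage_ml_cache` and the example paths are hypothetical, not part of ml-commons.

```python
import shutil
from pathlib import Path


def stage_ml_cache(source_dir, ml_cache_dir, folders=("pytorch", "tokenizers")):
    """Copy pre-downloaded folders into an ML node's ml_cache directory.

    source_dir   -- staging directory that already contains the folders
    ml_cache_dir -- destination, e.g. <opensearch home>/data/ml_cache (assumed layout)
    Returns the list of destination paths that were written.
    """
    source = Path(source_dir)
    target = Path(ml_cache_dir)
    target.mkdir(parents=True, exist_ok=True)

    copied = []
    for name in folders:
        src = source / name
        if not src.is_dir():
            # Fail early instead of leaving a half-populated cache.
            raise FileNotFoundError(f"expected pre-downloaded folder: {src}")
        dst = target / name
        # dirs_exist_ok (Python 3.8+) merges into a cache that already has content.
        shutil.copytree(src, dst, dirs_exist_ok=True)
        copied.append(dst)
    return copied
```

A call would look like `stage_ml_cache("/tmp/predownloaded", "/path/to/opensearch/data/ml_cache")`, with both paths adjusted to your own environment. The same copy could equally be done in a Dockerfile or with docker cp, as described above; this only shows the file layout being assembled.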
I think there might be a better way than the above, because it still requires either a proxy CDN or downloading the files from the public internet and moving them into your air-gapped environment. In my view, that is quite complicated for developers who just want to follow a guide.
@dhrubo If you think the above approach is still valuable, I will create a PR with as much detail as the existing tutorials in ml-commons/docs/tutorials at main · opensearch-project/ml-commons · GitHub