Using an older version of OpenSearch, 2.17, and creating an identical index without knn-derived-source (it didn't exist yet) gives me basically the same index size as OpenSearch 3.1 with knn-derived-source.
That furthers my suspicion that I should be seeing some difference.
I'm using the no-auth configuration and Docker Compose to run the clusters, but that could be something I can verify! Can you point me to how I can verify this?
You can open a shell in the running container using docker exec with the -it option and check the size of the segment files directly.
The vector graph data is ultimately stored in the Lucene segment files in the OpenSearch data directory, typically under /usr/share/opensearch/data/nodes/0/indices/{indexId}/{shardId}.
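
To compare the two clusters file type by file type, something like the following rough sketch could help. It assumes Python is available wherever the shard directory is readable (for example against a mounted or copied data volume, since the OpenSearch image may not ship Python), and the function names are made up for illustration. It only sums file sizes per extension, so you can see which kind of segment file accounts for the space.

```python
import os
from collections import defaultdict


def segment_sizes_by_extension(shard_dir):
    """Return {extension: total_bytes} for every file under shard_dir."""
    totals = defaultdict(int)
    for root, _dirs, files in os.walk(shard_dir):
        for name in files:
            # Files without an extension (e.g. segments_N) are grouped together.
            ext = os.path.splitext(name)[1] or "(none)"
            totals[ext] += os.path.getsize(os.path.join(root, name))
    return dict(totals)


def print_report(shard_dir):
    """Print per-extension totals, largest first, in MiB."""
    totals = segment_sizes_by_extension(shard_dir)
    for ext, size in sorted(totals.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{ext:<10} {size / (1024 * 1024):10.2f} MiB")
```

Calling print_report on the shard path of each cluster and putting the two outputs side by side should show where the sizes match or differ. As far as I know, Lucene keeps stored fields (which is where _source lives) in the .fdt files, so that would be the first extension to compare, but treat that as a starting point rather than a guarantee.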