Alternative to fscrawler in opensearch

Sorry about the late answer but here goes:

  • Building from a git version before 7.13.4 does not work (for me at least), since the Docker image contains a bug that was only fixed in a later version.

I’ve looked over the presentation you linked about the Tika fetchers. SMB/NFS support would definitely be needed for it to fit our needs :slight_smile:

To “fix” fscrawler so that it works with OpenSearch again, I would assume that adding the Java REST client of OpenSearch as another option in fscrawler would be “enough”, since right now the following modules are present:


elasticsearch-client-base
elasticsearch-client-v7
elasticsearch-client-v6
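
To illustrate what I mean by "a different option", here is a rough Java sketch of a pluggable client setup. This is not taken from the fscrawler code base: the `DocumentClient` interface, the classes, the `forName` lookup and the `opensearch-client` module name are all hypothetical, and the real modules may well be wired together differently.

```java
import java.util.Map;
import java.util.function.Supplier;

// Hypothetical sketch only: none of these types come from fscrawler itself.
public class ClientSelection {

    /** Minimal contract each search-backend client module would implement. */
    interface DocumentClient {
        String backend();
    }

    static class ElasticsearchV6Client implements DocumentClient {
        @Override
        public String backend() { return "elasticsearch-6"; }
    }

    static class ElasticsearchV7Client implements DocumentClient {
        @Override
        public String backend() { return "elasticsearch-7"; }
    }

    /** Assumed new module; a real one would wrap the OpenSearch Java REST client. */
    static class OpenSearchClient implements DocumentClient {
        @Override
        public String backend() { return "opensearch"; }
    }

    // Registry mapping a module name to a factory for its client.
    private static final Map<String, Supplier<DocumentClient>> CLIENTS = Map.of(
            "elasticsearch-client-v6", ElasticsearchV6Client::new,
            "elasticsearch-client-v7", ElasticsearchV7Client::new,
            "opensearch-client", OpenSearchClient::new); // hypothetical name

    /** Returns a client for the given module name, or fails on an unknown name. */
    static DocumentClient forName(String name) {
        Supplier<DocumentClient> factory = CLIENTS.get(name);
        if (factory == null) {
            throw new IllegalArgumentException("Unknown client module: " + name);
        }
        return factory.get();
    }

    public static void main(String[] args) {
        System.out.println(forName("opensearch-client").backend());
    }
}
```

The idea is simply that the crawler code only talks to the shared interface, so supporting OpenSearch would mean adding one more implementation next to the existing v6 and v7 ones.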

But then again, I’m not very familiar with fscrawler or the code behind it, so I might be wrong.
I’ll update this topic again when I see some progress on this front!