I’m looking for a duplicate/similarity checker against a custom set of documents. This is possibly like a plagiarism checker, but with a custom reference (instead of everything that exists).

But I could not find a solution that can be selfhosted, and have some simple UI and capabilities like Turnitin. Any suggestions?

Thanks’