| ▲ | Ask HN: Better approach for plagiarism detection in self-hosted LMS? | |
| 1 points by pigon1002 8 hours ago | ||
I'm building an open-source LMS and added plagiarism detection using OpenSearch's more_like_this query plus character n-grams for similarity scoring. Basically when a student submits an answer, I search for similar answers from other students on the same question. Works decently but feels a bit hacky - just reusing the search engine I already had. Current setup:
Constraints:
- Self-hosted only, no external APIs
- Few thousand students
- Want simple operations, already running OpenSearch anywayQuestions: - Is this approach reasonable or am I missing something obvious? - What do other self-hosted systems use? Checked Moodle docs but their plagiarism plugins mostly call external services - Anyone tried lightweight ML models for this that don't need GPU? The search engine approach works but curious if there's a better way that fits our constraints. | ||