Almost every article on Scots Wikipedia is written by one American teenager, who does not speak Scots and is just writing English in an "accent".
If you have a multilingual language model, this fakery might be your _entire training data_ for Scots https://www.reddit.com/r/Scotland/comments/ig9jia/ive_discovered_that_almost_every_single_article/">https://www.reddit.com/r/Scotlan...
If you have a multilingual language model, this fakery might be your _entire training data_ for Scots https://www.reddit.com/r/Scotland/comments/ig9jia/ive_discovered_that_almost_every_single_article/">https://www.reddit.com/r/Scotlan...
note: I am not meaningfully Scottish and don& #39;t know Scots either, I& #39;m just passing this on, particularly because this should be a crisis of confidence in multilingual #NLProc
It& #39;s likely that this isn& #39;t the only Wikipedia edition that& #39;s faked by one person
It& #39;s likely that this isn& #39;t the only Wikipedia edition that& #39;s faked by one person
I believe that the cld2, cld3, and fastText language detectors all have Scots (sco) as one of the languages they claim to detect, and all of them are getting their belief about what Scots is from Wikipedia