The arXiv-vs-snarXiv challenge can be machine-learned quite easily!
Check: