I'm not talking about a summarizer, I'm talking about a classifier. It just needs to identify which parts of the page are advertising and which are not.
The point of such a tool is that it would read the web page in exactly the same way that a human would, so using trickery like pre-rendered images of text or funky unicode wouldn't really change anything. If a human can read it then so can the AI.
In my experience the vast majority of posts about Elon Musk are from people who hate him and are tired of hearing about him.