An external service that uses XML RPC to publish some posts on my site adds some unwanted CSS styles and HTML to the post content.
I would like to clean it up using a HTML cleaner that I’ve built based on DOMDocument and DOMXPath.
My only question is how can I access the post_content before it is actually saved to the databse so I can run my cleaner on it? Perhaps there is a filter/action that I can use?