suggestion: URL_REWRITE Type #6
Loading…
x
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Currently I try to make a good xpath extract for a local newspaper website, but their style has pretty many unnecessary stuff inside and no single div tag or something for the pure article text.
My suggestion for cases like this would be some url rewrite feature to fetch the print version instead of the normal article version.
A Simple regex rewrite for the url and it could fetch a very slime and clean version of the article.
This sounds good, I'll look into it when I find some time.
You could probably use ff_FeedCleaner for this though?