the guardian feed #41

Open
opened 2014-07-30 08:57:26 +01:00 by four2six · 1 comment
four2six commented 2014-07-30 08:57:26 +01:00 (Migrated from github.com)

I seem to be unable to pull the Guardian's full article.
Example:
feed: http://www.theguardian.com/world/rss
article: http://www.theguardian.com/world/2014/jul/30/wikileaks-australia-super-injunction-bribery-allegations

This is the XPath needed:
/html[@id='js-context']/body[@id='top']/div[@class='l-side-margins l-side-margins--layout-content']/article[@id='article']/div[@class='gs-container']/div[@class='content__main-column content__main-column--article']/div[@class='from-content-api js-article__body']

But even using this config,
"theguardian": {
    "type": "xpath",
    "xpath": "div[@class='gs-container']"
},
it doesn't pull anything from their website.
Any idea what I'm doing wrong?
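
One thing worth checking for a config like this is how the XPath expression itself behaves. The sketch below is not the plugin's code: it uses Python's `xml.etree.ElementTree` (which supports only a subset of XPath) on a hypothetical, heavily simplified stand-in for the article markup. It illustrates two general XPath behaviours: a bare `div[...]` only matches direct children of the context node, and `@class='...'` is an exact string comparison, so a partial class name does not match. Whether the plugin searches descendants on its own is an assumption to verify against its source.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified stand-in for the article page; the real markup is far larger.
SAMPLE = """
<html id="js-context">
  <body id="top">
    <article id="article">
      <div class="gs-container">
        <div class="content__main-column content__main-column--article">
          <div class="from-content-api js-article__body"><p>Article text</p></div>
        </div>
      </div>
    </article>
  </body>
</html>
""".strip()

root = ET.fromstring(SAMPLE)


def count(expr):
    """Number of elements matched by expr, evaluated from the <html> root."""
    return len(root.findall(expr))


# Bare step: only looks at direct children of <html>, so nothing is found.
relative_hits = count("div[@class='gs-container']")

# Descendant search: finds the div wherever it sits in the tree.
descendant_hits = count(".//div[@class='gs-container']")

# @class is compared as a whole string, so one class out of two does not match.
partial_class_hits = count(".//div[@class='js-article__body']")
full_class_hits = count(".//div[@class='from-content-api js-article__body']")

print(relative_hits, descendant_hits, partial_class_hits, full_class_hits)
```

If the selector matches in a test like this but the plugin still returns nothing, the problem is more likely in how the page is fetched or which config entry is chosen than in the XPath.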

aronchi commented 2014-10-27 13:57:46 +00:00 (Migrated from github.com)

I have a very similar problem with www.lifehacker.co.uk and with other sites.

 "www.lifehacker.co.uk": {
    "type": "xpath",
    "xpath": "div[@class='single-article__content']"
},

The feed link is http://feeds.feedburner.com/LifehackerUK-UKPostsOnly?format=xml

Inside the RSS, the links are in this format:

<link>http://www.lifehacker.co.uk/2014/10/25/make-simple-things-life</link>

But it doesn't fetch the complete article content of the page.
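
Another thing that can be ruled out quickly is whether the config key is found in the article link at all. The sketch below assumes the key is matched as a plain substring of the article's link; that rule is an assumption, not something confirmed in this thread, and `matching_keys` and `redirect_link` are hypothetical names and values used only for illustration.

```python
# Config keys copied from this thread.
config = {
    "theguardian": {"type": "xpath", "xpath": "div[@class='gs-container']"},
    "www.lifehacker.co.uk": {
        "type": "xpath",
        "xpath": "div[@class='single-article__content']",
    },
}


def matching_keys(link):
    """Return config keys that occur as a substring of link (assumed matching rule)."""
    return [key for key in config if key in link]


guardian_link = (
    "http://www.theguardian.com/world/2014/jul/30/"
    "wikileaks-australia-super-injunction-bribery-allegations"
)
lifehacker_link = "http://www.lifehacker.co.uk/2014/10/25/make-simple-things-life"
# Hypothetical redirect-style link, included only to show a non-match.
redirect_link = "http://feeds.example.invalid/~r/LifehackerUK/~3/make-simple-things-life"

for link in (guardian_link, lifehacker_link, redirect_link):
    print(link, "->", matching_keys(link))
```

Under that assumption, both links quoted in this thread would select their config entries, so a key mismatch would only be the cause if the feed actually delivers redirect-style links rather than the direct article URLs.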

Reference: mbirth/ttrss_plugin-af_feedmod#41