1
0

XPath explanation in README, some new mods

This commit is contained in:
Markus Birth 2013-04-11 15:56:46 +02:00
parent 58fc79ea37
commit 5a5728ef4d
6 changed files with 81 additions and 3 deletions

View File

@ -66,3 +66,54 @@ The **xpath** value is the actual Xpath-element to fetch from the linked page. O
If you get an error about "Invalid JSON!", you can use [JSONLint](http://jsonlint.com/) to locate the erroneous part.
XPath
-----
### Tools
To test your XPath expressions, you can use these Chrome extensions:
* [XPath Helper](https://chrome.google.com/webstore/detail/xpath-helper/hgimnogjllphhhkhlmebbmlgjoejdpjl)
* [xPath Viewer](https://chrome.google.com/webstore/detail/xpath-viewer/oemacabgcknpcikelclomjajcdpbilpf)
* [xpathOnClick](https://chrome.google.com/webstore/detail/xpathonclick/ikbfbhbdjpjnalaooidkdbgjknhghhbo)
### Examples
Some XPath expressions you could need (the `//` is automatically prepended and must be omitted in the FeedMod configuration):
##### HTML5 <article> tag
```html
<article>…article…</article>
```
```xslt
//article
```
##### DIV inside DIV
```html
<div id="content"><div class="box_content">…article…</div></div>`
```
```xslt
//div[@id='content']/div[@class='box_content']
```
##### Multiple classes
```html
<div class="post-body entry-content xh-highlight">…article…</div>
```
```xslt
//div[starts-with(@class ,'post-body')]
```
or
```xslt
//div[contains(@class, 'entry-content')]
```

9
mods/der-postillon.json Normal file
View File

@ -0,0 +1,9 @@
{
"name": "Der Postillon",
"feed": "http://www.der-postillon.com/feeds/posts/default",
"match": "blogspot/rkEL",
"config": {
"type": "xpath",
"xpath": "div[@itemprop='articleBody']"
}
}

9
mods/kojote-magazin.json Normal file
View File

@ -0,0 +1,9 @@
{
"name": "Der Kojote",
"feed": "http://www.kojote-magazin.de/feed/rss2",
"match": "kojote-magazin.de",
"config": {
"type": "xpath",
"xpath": "div[@class='post']"
}
}

View File

@ -4,6 +4,6 @@
"match": "n24.de",
"config": {
"type": "xpath",
"xpath": "div[@class='news']"
"xpath": "div[@class='c2a']"
}
}

View File

@ -0,0 +1,9 @@
{
"name": "Polizei Havelland",
"feed": "http://www.internetwache.brandenburg.de/sixcms/list.php?page=rss_hvl",
"match": "internetwache.brandenburg.de",
"config": {
"type": "xpath",
"xpath": "div[@id='content']/div[@class='box_content']"
}
}

View File

@ -2,11 +2,11 @@
$config = array(
'type' => 'xpath',
'xpath' => 'div[@class="bacontent"]',
'xpath' => 'div[@itemprop="articleBody"]',
);
$article = array(
'link' => 'http://www.berlin.de/polizei/presse-fahndung/archiv/383117/index.html',
'link' => 'http://www.der-postillon.com/2013/04/nordkoreas-armee-nach-wochenlangem.html',
'content' => 'This is the feed content',
'plugin_data' => '',
);