You are here

klack.de returns dummy titles

1 post / 0 new
lazlev
Offline
Joined: 5 years
Last seen: 4 months
klack.de returns dummy titles

klack.de seems to detect the scraper and returns dummy titles (Sie wurden gesperrt) after several successful grabs.

channels.xml:
<channel update="i" site="klack.de" site_id="22/cnn" xmltv_id="CNN">CNN</channel>

guide.xml:
<programme start="20190101140000 +0100" stop="20190101150000 +0100" channel="CNN">
<title lang="de">News Stream (with World Sport)</title>
<desc lang="de">Hosted by Kristie Lu Stout, News Stream is an hour-long news program broadcast from CNN's studio in Hong Kong(n)</desc>
<date>2019</date>
<category lang="de">Nachrichten</category>
<icon src="https://www.klack.de//templates/klack/images/default_epg/default.jpg" />
<country>USA</country>
</programme>
<programme start="20190101150000 +0100" stop="20190101160000 +0100" channel="CNN">
<title lang="de">Sie wurden gesperrt</title>
</programme>
<programme start="20190101160000 +0100" stop="20190101170000 +0100" channel="CNN">
<title lang="de">Sie wurden gesperrt</title>
</programme>

It's probably necessary to detect this kind of dummy title and slow down and retry. Second option would be to slow down from the beginning.

- wg++ v2.1
- Arch Linux
- Mono 5.16.0

Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl