You are here

superguidatv.it - index_date scrub problems

17 posts / 0 new
Last post
doglover
Offline
Joined: 11 years
Last seen: 3 years
superguidatv.it - index_date scrub problems

I made a SiteIni for https://www.superguidatv.it  (attached).

However the webpages have to be requested with oggi, domani, dopodomani.  (today, tomorrow)

On the webpage the exact date is mentioned. so an index_date scrub should be possible, in order to have a bit more security on the correct date being scrubbed.  However I am not able to get this working.  Can somebody help please.

 

Willy

 

PS:  This site is not particlarly interesting, but it list some channels, not listed anywhere else.

 

 

Attachments: 
doglover
Offline
Joined: 11 years
Last seen: 3 years

Does not work.

Try delete in the subpage the oggi.  So skipping today.

The the date scrubbed shouldbe 19-05-2018.  It is not.

[  Debug ] Element:  INDEX_DATE
[  Debug ] Modify
[  Debug ]      command & arguments : calculate(debug format=date,dd-MM-yyyy)
[  Debug ]      Element value:
[  Debug ] Domani, 19 maggio 2018
[  Debug ]      Element value after operation:
[  Debug ] 18-05-2018
[  Debug ] Debugging information SiteIni

 

Willy

doglover
Offline
Joined: 11 years
Last seen: 3 years

I cannot duplicate this result.

See the log

Willy

doglover
Offline
Joined: 11 years
Last seen: 3 years

I cannot duplicate this result.

See the log

Willy

doglover
Offline
Joined: 11 years
Last seen: 3 years

Oh shit.  Thanks for the spotting the error.

 

Willy

 

Marcusio26
Offline
Donator
Joined: 5 years
Last seen: 1 week

Hi guys, I need help. I am using the attached superguidatv.it__0.ini file but once the program has started it freezes everything showing me this message:

[ Info ] Group (0) :
[ Info ] update requested for - 288 - out of - 288 - channels for 3 day(s)
[ Debug ]
[ Info ] ( 1/288 ) SUPERGUIDATV.IT__0 -- chan. (xmltv_id=Spike) -- mode Force
[ ] Job finished at 06/08/2019 10:19:44 done in 1m 2s
[Critical] Unhandled Exception
[Critical]
Indice oltre i limiti della matrice.
[Critical]
in WGconsole.F.1(String[] 0)
in WGconsole.F.0(String[] 0)
[Critical] For detailed info, see log file C:\Users\marcy\AppData\Local\WebGrab+Plus\WebGrab++.log.txt
[Critical] Execution stopped.

can you help me?

thank you

mat8861
Offline
WG++ Team memberDonator
Joined: 9 years
Last seen: 3 hours

webgrab version?

Marcusio26
Offline
Donator
Joined: 5 years
Last seen: 1 week

WebGrab+Plus/w MDB & REX Postprocess -- version V2.1.9

Running on: Microsoft Windows NT 6.2.9200.0
Environment: 4.0.30319.42000

mat8861
Offline
WG++ Team memberDonator
Joined: 9 years
Last seen: 3 hours

this seems ok, please rename and test.

Attachments: 
Marcusio26
Offline
Donator
Joined: 5 years
Last seen: 1 week

Thanks a lot. but now I have this problem:

Group (0) :
update requested for - 288 - out of - 288 - channels for 3 day(s)
( 1/288 ) SUPERG -- chan. (xmltv_id=Paramount Network) -- mode Force
iiinnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 1 of 6 times for 10 seconds before re-try.
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 2 of 6 times for 20 seconds before re-try.
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 3 of 6 times for 30 seconds before re-try.
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 4 of 6 times for 40 seconds before re-try.
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 5 of 6 times for 50 seconds before re-try.
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 6 of 6 times for 60 seconds before re-try.
n
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 1 of 6 times for 10 seconds before re-try.
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 2 of 6 times for 20 seconds before re-try.
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 3 of 6 times for 30 seconds before re-try.
error downloading page: Richiesta annullata: Chiusura imprevista della connessione..
pausing 4 of 6 times for 40 seconds before re-try.

Marcusio26
Offline
Donator
Joined: 5 years
Last seen: 1 week

hello.I would like to correct my previous post. The error is relative only to the following channel:

Paramount Network

some advice?

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 6 min

my guess its a time out problem,try raising your timeout in your wg config.xml from 10(currently) to 30(what I use).

Group (0) :
update requested for - 1 - out of - 1 - channels for 3 day(s)
( 1/1 ) SUPERGUIDATV.IT -- chan. (xmltv_id=Paramount Channel) -- mode Force
iiinnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
1.33 sec/update

Marcusio26
Offline
Donator
Joined: 5 years
Last seen: 1 week

hello and thanks but also setting the timeout to 30 gives me the same problem

Marcusio26
Offline
Donator
Joined: 5 years
Last seen: 1 week
Goran wrote:

this happening because some shows have broken link to urlshow page as show Le inchieste di Padre Dowling
at Paramount Network channel
set retry to 1 or 2x

hello and thanks but I did not understand what parameters I have to change

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 6 min

that shouldn't cause the retry,i just did a 3 day grab again and no interuptions.
the link still returns a page,even though its the incorrect page with error 404,webgrab doesn't check for this(it don't know the difference).
if the page returns any data at all webgrab assumes it good.
that's why webgrab compares the index_title and details_title.
this should add the (?) to the title,easy way around it is to not scrub the details title and copy the index_title.

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 6 min

did u just try that now?

here's what I get..

Group (0) :
update requested for - 1 - out of - 1 - channels for 4 day(s)
( 1/1 ) SUPERGUIDATV.IT -- chan. (xmltv_id=Paramount Channel) -- mode Force
iiinnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
1.70 sec/update

if u check the xml the show is there but only index elements are scrubbed for the shows with bad details links,other have details elements also.

Attachments: 
Marcusio26
Offline
Donator
Joined: 5 years
Last seen: 1 week

thank you all for your help

Log in or register to post comments

Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl