site -- TV.COM not working

1NSdbZVbpZDX

TV.COM gives me an error when grabbing the times:

[  Info  ] channel (xmltv_id=HBO HDTV (East) (HBOHD)) site -- TV.COM -- mode incremental
[  Error ] time parsing error : String was not recognized as a valid DateTime.
[  Error ] stoptime time scrubbed : 1446741000
[  Error ] computer date/time format: 05/11/2015 05:24:01 p.m.
[  Error ] execution stopped

It has something to do with:

index_start.scrub {regex||data-start="(\d{10})">||}

Disable that line and it works (but with no programme start/end times).
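
For context, the values in the log (1446741000 here) are 10-digit numbers, which look like Unix timestamps in seconds. A parser that expects a formatted date string would reject them with an error like the one above. The Python sketch below only illustrates that reading of the scrubbed value. The function name `epoch_to_utc` is made up for this example, and it says nothing about how WebGrab+Plus handles times internally.

```python
from datetime import datetime, timezone

def epoch_to_utc(scrubbed: str) -> datetime:
    """Interpret a scrubbed 10-digit string as Unix epoch seconds (UTC)."""
    if not (scrubbed.isdigit() and len(scrubbed) == 10):
        raise ValueError(f"not a 10-digit epoch value: {scrubbed!r}")
    return datetime.fromtimestamp(int(scrubbed), tz=timezone.utc)

# The stoptime value from the log in the first post
print(epoch_to_utc("1446741000").isoformat())  # 2015-11-05T16:30:00+00:00
```

Read this way, the value falls on the same day as the computer date in the log (05/11/2015), so the scrub itself appears to find the right data.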

1NSdbZVbpZDX

I didn't realize the ini needs the latest beta to work: >= (* @MinSWversion: 1.1.1/55.27)

It's all OK now.

My bad :(

markdark

Same issue here. I do have the correct WebGrab.exe version running, though.

EDIT:

It all works with a version higher than the minimum software version. The latest beta works: V1.56.8

Mariano

Hi, I'm not able to grab EPG data from TV.com. I'm getting the following error message (I have the latest update installed):

[  Debug ] WebGrab+Plus/w MDB & REX Postprocess -- version 1.1.1/55.27 -- Jan van Straaten
[  Debug ] --------------------------------------------------------------------------------
[  Debug ] job started at 03/12/2015 06:47:08 a.m.
[  Debug ]
[  Debug ] reading config file: C:\ProgramData\ServerCare\WebGrab\WebGrab++.config.xml
[  Debug ] loading timezone data
[  Debug ] embedded timezones source: WGconsole.WG.Common.timezonesdata.txt
[  Debug ] found: C:\ProgramData\ServerCare\WebGrab\tv.com.ini -- Revision 0
[  Debug ] reading cookie file : C:\ProgramData\ServerCare\WebGrab\tv.com.cookies.txt
[  Debug ]
[  Debug ] running  on: Microsoft Windows NT 6.1.7601 Service Pack 1
[  Debug ] environment: 4.0.30319.1
[  Debug ] xmltv input file - C:\ProgramData\ServerCare\WebGrab\guide.xml - found
[  Debug ] 0 superfluous shows removed
[  Info  ]
[  Info  ] update requested for - 1 - out of - 1 - channels for 2 day(s)
[  Info  ] update mode - set per individual channel
[  Info  ]
[  Info  ]
[  Info  ]       i=index  .=same  c=change  g=gab  r=replace  n=new
[  Info  ]
[  Debug ]
[  Info  ] channel (xmltv_id=Spike TV) site -- TV.COM -- mode incremental
[  Error ] time parsing error : String was not recognized as a valid DateTime.
[  Error ] stoptime time scrubbed : 1449118800
[  Error ] computer date/time format: 12/03/2015 06:47:09 a.m.
[  Error ] execution stopped

Please let me know if there is anything I may be doing wrong here.

Thank you.

Mariano

It works now with no issues, except that all the movie and TV show titles come out with slashes in them, e.g. "/Back\/to\/the\/\Future/\"

Do you know why this could be happening? It would be nice to avoid having these slashes in the middle of the movie or TV show title.

Thanks in advance.

Mariano

I don't know how to work with regular expressions, but I'm fairly sure that this issue has to do with one of the following lines from the ini file:

index_start.scrub {regex||data-start="(\d{10})">||}
index_title.scrub {regex||<div class="title">(.+?)</div>||}
index_description.scrub {regex||<div class="desc">(.*?)</div>||}
*
index_temp_1.scrub {regex||data-tmsid="rvp:(\d+?)"||} * id
index_urlshow.modify {set('index_temp_1' not "")|http://www.tv.com/listings/event/?EventTmsId=rvp%3A'index_temp_1'}
http://www.tv.com/listings/event/?EventTmsId=rvp%3A1952005171

index_urlshow.headers {customheader=Accept-Encoding=gzip,deflate}* to speedup the downloading of the detail pages

title.scrub {regex||<h1>(?:<a href=.+?>)?(.+?)(?:</a>)?</h1>||}
subtitle.scrub {regex||<h2>(?:<a href=.+?>)?(.+?)(?:</a>)?</h2>||}
description.scrub {regex||<div class=\\"description\\">(.*?)</div>||}
description.modify {remove|\}
showicon.scrub {regex||data-image=\\"(http://.+?\.jpg)\\"||}
category.modify {substring(type=regex)|'title' "<label>(.+?):\s?</label>"}
title.modify {remove(type=regex)|"(<label>.+?</label>)"}
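
For readers who, like the poster, are not familiar with regular expressions: the last two lines above appear to take a `<label>...</label>` prefix out of the scrubbed title, use its text as the category, and then delete the label from the title. The Python sketch below mirrors those two patterns. The sample string and the function name `split_label` are assumptions made for illustration, not actual TV.com markup or WebGrab+Plus behaviour.

```python
import re

# Same patterns as in the two .modify lines above
CATEGORY_RE = re.compile(r'<label>(.+?):\s?</label>')
LABEL_RE = re.compile(r'<label>.+?</label>')

def split_label(raw_title: str):
    """Return (category, title) from a title that may start with a <label>."""
    m = CATEGORY_RE.search(raw_title)
    category = m.group(1) if m else None
    title = LABEL_RE.sub('', raw_title).strip()
    return category, title

# Hypothetical input, assumed for this example
print(split_label('<label>Movie: </label>Transformers'))
```

A title without a label passes through unchanged, with no category set.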

 

 

1NSdbZVbpZDX

It works well here.

There's a difference between my ini and yours:

yours: index_start.scrub {regex||data-start="(\d{10})">||}

mine: index_start.scrub {regex||data-start="(\d{10})"||}
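
To make the difference concrete: the first pattern only matches when `>` comes directly after the closing quote, while the second stops at the quote and does not care what follows. The Python sketch below shows that on two made-up tags. What the TV.com markup actually looked like is not shown in this thread, so both sample strings and the helper name `scrub_start` are assumptions.

```python
import re

STRICT = re.compile(r'data-start="(\d{10})">')  # needs '>' right after the quote
LOOSE = re.compile(r'data-start="(\d{10})"')    # stops at the closing quote

def scrub_start(pattern, html):
    """Return the captured 10-digit value, or None when nothing matches."""
    m = pattern.search(html)
    return m.group(1) if m else None

# Hypothetical tags, assumed for this example
tag_closed = '<li data-start="1446741000">'
tag_more_attrs = '<li data-start="1446741000" data-end="1446744600">'

for tag in (tag_closed, tag_more_attrs):
    print(scrub_start(STRICT, tag), scrub_start(LOOSE, tag))
```

If the site adds another attribute after `data-start`, the strict pattern finds nothing, while the loose one keeps working.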

Mariano

Thanks for your comment. It did improve things for a few channels, but I'm still seeing the following characters before the actual name of the movie. This only happens on movie channels such as HBO, Showtime, Starz, etc.

I've run WG++ many times, but the outcome has always been the same: "<a href=\"/movies/transformers/\">Transformers" as the title.

I would appreciate it if anyone could share their thoughts on how to fix this issue.

Thanks.

1NSdbZVbpZDX

I made a dirty fix.

The problem is that the title comes in two different forms, one for movies and another for regular shows. A regex would come in handy here, but I'm not good at that, so regex experts, do your best to fix it.

In the meantime you can use this ini. There are a lot of (?) in the titles (it varies). I read about a fix for that somewhere in this forum (something like: title.modify prefer blah blah...).
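
As a rough illustration of the "two forms" problem: going by the title reported earlier in the thread, movie titles seem to be wrapped in a link with backslash-escaped quotes, while regular shows are plain text. The Python sketch below handles both by undoing the escaping first and then treating the link as optional. The sample inputs and the function name `extract_title` are assumptions, and this is not the fix that ended up in the siteini.

```python
import re

# Optional <a ...> wrapper around the title text inside <h1>
TITLE_RE = re.compile(r'<h1>(?:<a\s[^>]*>)?(.+?)(?:</a>)?</h1>', re.S)

def extract_title(fragment: str):
    """Return the bare title from an <h1>, linked or not, or None."""
    # Assumption: the fragment may carry JSON-style escapes (\" and \/)
    html = fragment.replace('\\/', '/').replace('\\"', '"')
    m = TITLE_RE.search(html)
    if m is None:
        return None
    # Drop any tags left inside the captured text
    return re.sub(r'<[^>]+>', '', m.group(1)).strip()

# Hypothetical inputs: a regular show and an escaped, linked movie title
print(extract_title('<h1>The Simpsons</h1>'))
print(extract_title('<h1><a href=\\"/movies/transformers/\\">Transformers<\\/a><\\/h1>'))
```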

Attachments: 
francis
WG++ Team member, Donator

I've adjusted the siteini. Should be fine now.

1NSdbZVbpZDX

Mighty Francis did it.

Here's another regex solution (just for study).

Attachments: 
Mariano

Thank you both. I will try this today and let you know the outcome.

