You are here

Update tv-programme.com

7 posts / 0 new
Last post
2P2M
Offline
Donator
Joined: 5 years
Last seen: 1 week
Update tv-programme.com

Hi,

Please add grab for :

- date (of production)

- categories

- <star-rating>

- <previously-shown> or <premiere>

Many thanks !!

mat8861
Offline
WG++ Team memberDonator
Joined: 10 years
Last seen: 5 hours

New version in siteini.pack. Not sure for previously shown and premiere, i see a lot of "inedit" and "direct" that would not be correct, for example even meteo program has "inedit"

Can you tell me an example of previously-shown and premiere? Thanks

Attachments: 
2P2M
Offline
Donator
Joined: 5 years
Last seen: 1 week

Hi,

For me, it's "Direct" wich refer to live sport event... but I can't test it because ip ban !

I try to modify delay : site {retry=<retry time-out="20" channel-delay="20" index-delay="20">5</retry>} but same issue.

I look for a good french site to grab apart of "diverto.com", "programme-tv.net" and "programme-tv.nouvelobs.com".

Can you add same grab items for "tv.free.fr" ? (I can open new subject if you want).

Thanks for your effort.

Best regards

mat8861
Offline
WG++ Team memberDonator
Joined: 10 years
Last seen: 5 hours

Unfortunately cannot be added

LicNotRequired
Offline
Donator
Joined: 3 months
Last seen: 2 hours

the only that will work correctly is the api version of the telerama ini.
you need repeat(previousshown) or premiere flag with ,Inédit and Direct info.
if premiere flag isnt available,then you need to set premiere like this..

this site has the repeat,Inédit and Direct info so we dont need to guess for repeats,just need to set the premiere tag based on all the data..
repeat  => not a premiere
not a repeat and is live and new => premiere

example 1,news show,its a repeat,Inédit and Direct
<programme start="20260512200000 +0200" stop="20260512204500 +0200" channel="TF1">
<title lang="fr">JT 20h</title>
<desc lang="fr">Le journal télévisé présente les faits marquants et les événements essentiels de l'actualité nationale et internationale, tout en mettant en valeur l'actualité régionale.(Inédit)(Direct).</desc>
<credits>
<presenter>Gilles Bouleau</presenter>
</credits>
<category lang="fr">Journal</category>
<icon src="https://webgrabplus.com/focus.telerama.fr/720x640/2025/01/07/1aa30573e233449f83b97c9bfc1785ce.jpg" /
<country lang="fr">France</country>
<previously-shown />
</programme>

example 2,sports program,not a repeat but its Inédit and Direct.
set the preimiere flag.
<programme start="20260512203000 +0200" stop="20260512223000 +0200" channel="beIN SPORTS 1">
<title lang="fr">Football : Ligue 2 BKT</title>
<desc lang="fr">A l'issue de la dernière journée du championnat de Ligue 2, les 3e, 4e et 5e du classement se disputent le droit d'affronter le 16e de la saison de Ligue 1 et d'espérer accéder à l'élite du football français la saison prochaine. Ce premier match de barrage envoie le 5e du classement sur la pelouse du 4e.(Inédit)(Direct).</desc>
<category lang="fr">Football</category>
<icon src="https://webgrabplus.com/focus.telerama.fr/720x640/2026/05/07/c4f7275dd81045e69489b4260f323ff6.jpg" />
<country lang="fr">France</country>
<premiere lang="fr" />
</programme>

your getting ip ban on tv-programme.com because your retry settings are incorrect.
show detail pages(show-delay) are the main reason for getting banned,they are the most requested url's.
you have no delay set for those,try this
<retry index-delay="2" channel-delay="1" show-delay="2" time-out="30">4</retry>
 

2P2M
Offline
Donator
Joined: 5 years
Last seen: 1 week
mat8861 wrote:

Unfortunately cannot be added

Ok but please can you extract the description on program page instead of index page (tab Résumé in html) ?

Thanks

LicNotRequired
Offline
Donator
Joined: 3 months
Last seen: 2 hours

that would be a bad idea,least from where i see it.
we already know they use a rate limit(ip ban).
i am not sure about the files in the pack but my personal files use the /casting url for the details page.
so if u want the details description u need to grab 2 additional pages,
the original which has the description and the /casting one.
see where i am going?
its will 99% lead to a ip ban as ur now grabbing 2 additional pages instead of one(/casting) per show..

 

Log in or register to post comments

Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl