Hi,
Please add grab for :
- date (of production)
- categories
- <star-rating>
- <previously-shown> or <premiere>
Many thanks !!
Brought to you by Jan van Straaten
Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl
New version in siteini.pack. Not sure for previously shown and premiere, i see a lot of "inedit" and "direct" that would not be correct, for example even meteo program has "inedit"
Can you tell me an example of previously-shown and premiere? Thanks
Hi,
For me, it's "Direct" wich refer to live sport event... but I can't test it because ip ban !
I try to modify delay : site {retry=<retry time-out="20" channel-delay="20" index-delay="20">5</retry>} but same issue.
I look for a good french site to grab apart of "diverto.com", "programme-tv.net" and "programme-tv.nouvelobs.com".
Can you add same grab items for "tv.free.fr" ? (I can open new subject if you want).
Thanks for your effort.
Best regards
Unfortunately cannot be added
the only that will work correctly is the api version of the telerama ini.
you need repeat(previousshown) or premiere flag with ,Inédit and Direct info.
if premiere flag isnt available,then you need to set premiere like this..
this site has the repeat,Inédit and Direct info so we dont need to guess for repeats,just need to set the premiere tag based on all the data..
repeat => not a premiere
not a repeat and is live and new => premiere
example 1,news show,its a repeat,Inédit and Direct
<programme start="20260512200000 +0200" stop="20260512204500 +0200" channel="TF1">
<title lang="fr">JT 20h</title>
<desc lang="fr">Le journal télévisé présente les faits marquants et les événements essentiels de l'actualité nationale et internationale, tout en mettant en valeur l'actualité régionale.(Inédit)(Direct).</desc>
<credits>
<presenter>Gilles Bouleau</presenter>
</credits>
<category lang="fr">Journal</category>
<icon src="https://webgrabplus.com/focus.telerama.fr/720x640/2025/01/07/1aa30573e233449f83b97c9bfc1785ce.jpg" /
<country lang="fr">France</country>
<previously-shown />
</programme>
example 2,sports program,not a repeat but its Inédit and Direct.
set the preimiere flag.
<programme start="20260512203000 +0200" stop="20260512223000 +0200" channel="beIN SPORTS 1">
<title lang="fr">Football : Ligue 2 BKT</title>
<desc lang="fr">A l'issue de la dernière journée du championnat de Ligue 2, les 3e, 4e et 5e du classement se disputent le droit d'affronter le 16e de la saison de Ligue 1 et d'espérer accéder à l'élite du football français la saison prochaine. Ce premier match de barrage envoie le 5e du classement sur la pelouse du 4e.(Inédit)(Direct).</desc>
<category lang="fr">Football</category>
<icon src="https://webgrabplus.com/focus.telerama.fr/720x640/2026/05/07/c4f7275dd81045e69489b4260f323ff6.jpg" />
<country lang="fr">France</country>
<premiere lang="fr" />
</programme>
your getting ip ban on tv-programme.com because your retry settings are incorrect.
show detail pages(show-delay) are the main reason for getting banned,they are the most requested url's.
you have no delay set for those,try this
<retry index-delay="2" channel-delay="1" show-delay="2" time-out="30">4</retry>
Ok but please can you extract the description on program page instead of index page (tab Résumé in html) ?
Thanks
that would be a bad idea,least from where i see it.
we already know they use a rate limit(ip ban).
i am not sure about the files in the pack but my personal files use the /casting url for the details page.
so if u want the details description u need to grab 2 additional pages,
the original which has the description and the /casting one.
see where i am going?
its will 99% lead to a ip ban as ur now grabbing 2 additional pages instead of one(/casting) per show..