Please, can you add grab of http://www.tv-programme.com french site ?
I have tried to write an ini file but it's too difficult for me...
Thanks !
Please, can you add grab of http://www.tv-programme.com french site ?
I have tried to write an ini file but it's too difficult for me...
Thanks !
Brought to you by Jan van Straaten
Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl
Added, will add more elements asap, please note siteini remarks.
Hi,
Many thanks but returns error even if I modify retry and delay...
Please see log file
Well this looks like a dns issue. Make sure your linux environment is using dns google or opendns (8.8.8.8 and 208.67.222.222)
Runs fine here. I made 2 versions one full ( a bit slower but with all details ) and regular one
Thanks for your work.
I change my DNS but same error (I can access website with Firefox without issues)
Just noticed
You need version 5.1
Anyway 404 is a network issue, this could also be due to dotnet, i just tested on my linux and asked to upgrade first to dotnet 8
All done but error persist.
Anyway, thanks for all !
Is this vps or local pc ?
Ok i am facing same issue, let me see why it works in windows and not in linux. Found out that date format is not translated to french basically https://tv-programme.com/tf1/lundi-2-juin-2025/ in linux is https://tv-programme.com/tf1/monday-2-june-2025/ so can't find the link and return 404
Will report the issue to the author as i am not sure is webgrabplus 5.x or ubuntu 24.04 or maybe dotnet
Well, date is optional for current day if you can implement it.
Others ini files are OK for me, I post 2 requests for update, thanks !
maybe dont do it the wrong way?
all you need is urldate.format {datestring|dddd-d-MMMM-yyyy}, u have dd(i assume it still worked).
and use |urldate| on the url_index line.
u also must have a magic windows pc because your global temp_1 doesnt convert the urldate correctly on winidows or linux.
calculate with date calculations will always result in the default culture info which is en-GB
besides you dont need all the replaces,ect your doing as there is no accented characters.
Edit:
I had checked the weekday names but not the month names.
Seems there are accented e and u in month names.
Easiest is to just modify the urlindex and replace these as accent characters are usually urlencoded.
If ur real paranoid substring the urldate part from the urlindex and replace the accented characters and then replace modified urldate in url index.
Or use the method u have but you would need to replace all 7 weekday names and all 12 month names with correct French values used by the site.
Only feb,Aug and Dec use accented characters.
All are far away from being used so pretty much have to wait and see what the site actually uses.
A good guess it what ur trying to do by replacing the accented characters.
another alternative..see screenshot.
Now it works in linux, download version dated 1 june 2025
@Blackbear199 no Idea why in linux culture info goes in language "en" it was working fine 4 years ago, i think there is a bug, not sure is linux or wg++ or most probably dotnet. Change of letters was applied anyway :
global_temp_1.modify{replace|é|e}
global_temp_1.modify{replace|û|u}
@Blackbear199, thanks the problem was in 'urldate_formatted'
once changed
global_temp_1.modify {calculate(format=date,dddd-dd-MMMM-yyyy)|'urldate'} into
global_temp_1.modify {set|'urldate_formatted'}
all works fine... forgot about new way
Many many thanks for all !!
I have downloaded the lastest files (.ini et .channels.xml) but I still have the error:
I'm a little bit lost because the latest downloaded .ini file does not contain the following lines which are mentioned in the previous posts above:
Please help me face to my missundunstood.
File is encrypted you don't see that. Please make sure you are using * @Revision 1 - [01/06/2025]
Here an extract of my tv-programme.com.ini file about the version:
* @Revision 1 - [01/06/2025] Mat8861 *fix elements
* @Revision 0 - [26/10/2021] Mat8861 *complete but ip ban after 6-7 chans, must use delay !!
I also attached the WebGrab++.log.txt file.
Can you please delete siteini and update or download siteini again? I tested and works fine both in windows and linux.
I have just deleted tv-programme.com.ini and downloaded again.
Same error is raised.
for info , same issue with full version
Please make sure your linux has google dns (8.8.8.8) and fr locales.
Fixed museum art please download rev.1
Still have the issue with the locale defined to fr_FR.UTF-8 and DNS resolver defined to 8.8.8.8.
Using my own DNS resolver, I can ping and resolve tv-programme.com.
Note: I use the docker image lscr.io/linuxserver/webgrabplus:latest.
Is could be due to the use of docker image ?
I have the same config , docker on dsm 6.2 image linuxserver/webgrabplus:latest. This night i try with fr_FR.UTF-8 ... but it seems that not solve the problem
please use the ini attached to check if date is correct, add in config:
<update>f</update>
<channel update="i" site="test_fr" site_id="tf1" xmltv_id="dummy">dummy</channel>
Then check webgrab log.txt it should return:
[ Info ] ( 1/1 ) TEST_FR -- chan. (xmltv_id=dummy) -- mode Force
[ Debug ] debugging information siteini; urlindex builder
[ Debug ] siteini entry :
[ Debug ] urldate format type: datestring, value: |dddd-dd-MMMM-yyyy
[ Debug ] https://tv-programme.com/|channel|/##date##/
[ Debug ] url_index created:
[ Debug ] https://tv-programme.com/tf1/mercredi-11-juin-2025/
Here the result using test_fr.ini.
Even if the locale of the Docker container is set to fr_FR.UTF-8, there is a english format for the date in the url: https://tv-programme.com/tf1/Wed-11-M06-2025/
As a result, still have the issue: [Warning ] error downloading page: Response status code does not indicate success: 404 (Not Found).
I specify , the TZ for my docker is Europe/Paris ...
maybe with regex , it possible to transform the date
Ok guy , i can confirm , problem with the date format
this is an issue with locale i guess, you can check with locale -a (attached my output) or reconfigure them with dpkg-reconfigure locales
You can follow this article https://poweradm.com/change-system-language-locale-linux/
Maybe an issue due to the Docker container configuration lscr.io/linuxserver/webgrabplus:latest.
From an attached shell of the Docker container, i installed the following packages:
and configured the french locale into the following file:
Nethertheless, something is wrong because the date command still returns an english format:
The webgrabplus issue still be present.
Maybe it must be necessary to rebuilt the Docker image including the installation of locale packages and configuration into the Dockerfile...
you should be able to do : dpkg-reconfigure locales
and mark fr
root@ubuntu:~# locale -v
LANG=en_GB.UTF-8
LANGUAGE=
LC_CTYPE="en_GB.UTF-8"
LC_NUMERIC="en_GB.UTF-8"
LC_TIME="en_GB.UTF-8"
LC_COLLATE="en_GB.UTF-8"
LC_MONETARY="en_GB.UTF-8"
LC_MESSAGES="en_GB.UTF-8"
LC_PAPER="en_GB.UTF-8"
LC_NAME="en_GB.UTF-8"
LC_ADDRESS="en_GB.UTF-8"
LC_TELEPHONE="en_GB.UTF-8"
LC_MEASUREMENT="en_GB.UTF-8"
LC_IDENTIFICATION="en_GB.UTF-8"
LC_ALL=
root@ubuntu:~#
this is what i get with locale -a, as i test different languages i set them all.
this is what i did typing : dpkg-reconfigure locales
Think he is using alpine Linux
See..
https://stackoverflow.com/questions/74516643/localization-not-working-on...
Either of u guys have any update?
I am not a dotnet guru but from what I searched icu libraries are used for date conversion,ect(globalization)
Locales are for user keyboard,menu language,ect.