I WANT TO SCRUB THE TITLE "CASANOVA" FROM THIS HTML CODE:
<td class="channel_program-table--program"><a href="https://......programchannel">Casanova</a> </td>
I TESTED THAT:
index_title.scrub {single|<a href="|">|</a>}
AND THE RESULT IS:
<title lang="en">Casanova (?)</title>
WHAT AM I DOING WRONG? :(
-------------------------------------------------------------------------------------------------------------
THE HTML FROM TITLE(title.srub) IS:
<img class="epg_close_up-logo" alt="channel" title="channel" src="/portal/image/journal/article?img_id=41707954&t=1524412293078"> </div> <h1>Casanova</h1> <div class="epg-closeup-info">
HOW CAN I SCRUB CORRECT THE TITLE FROM THIS PIECE?
THE HTML FROM TITLE(title.scrub) IS IN THE "TEST.ZIP" FILE
HOW CAN I SCRUB CORRECT THE TITLE FROM THIS PIECE?
DO YOU KNOW HOW? PLEASE TELL ME :(
THE RESULT IN .XML IS:
title lang="en">{{videoTitle}}
log.txt
[ Debug ] suspicious title in index page = Casanova
[ Debug ] differs from title in showdetails = Casanova (?)
????????? WTF??? :(
Maybe one of the 2 has a NON-BREAKABLE-SPACE (or nbsp for short) and this is causing a mismatch between title and index_title for WebGrab.