Feature #4873
eit Scrape title/summary title/summary
100%
Description
I have been trying to scrape the titles of show to remove the "New:" at the start of the title as when the file eventualy feed through to kodi the artwork cannot be found as New: is in the file name. Using $q$n.$x as the formatting string.
Using Over-the-air: EIT DVB Grabber with uk config.
I was hoping to remove it via the config file for the eit scraper.
However a combined title/summary text is made by joining the title, a space,
and the summary. The combined text is matched against the scrape_title regex
list. On a match, the EPG title is set to the match result.
I cannot see a way of knowing where the original title ended as broadcast.
If it was a combined title/summary text is made by joining the title, TWO spaces,
and the summary. I would then know whre the original title ended.
This would need a bit of rework to remove the extra space but at this point you have somenting to remove.
Or use some other caracter to join the title/summary before scraping ? " --- "
Will relate to https://tvheadend.org/issues/4801
Thanks
derek
History
Updated by Jim Hague almost 7 years ago
Derek Keith wrote:
I have been trying to scrape the titles of show to remove the "New:" at the start of the title as when the file eventualy feed through to kodi the artwork cannot be found as New: is in the file name. Using $q$n.$x as the formatting string.
Using Over-the-air: EIT DVB Grabber with uk config.
I was hoping to remove it via the config file for the eit scraper.
However a combined title/summary text is made by joining the title, a space,
and the summary. The combined text is matched against the scrape_title regex
list. On a match, the EPG title is set to the match result.I cannot see a way of knowing where the original title ended as broadcast.
If it was a combined title/summary text is made by joining the title, TWO spaces,
and the summary. I would then know whre the original title ended.
This would need a bit of rework to remove the extra space but at this point you have somenting to remove.
Or use some other caracter to join the title/summary before scraping ? " --- "
Ah. Good point. When I did this I didn't think of a use case for just getting the title and so needing to know where it ends. I'll try changing to '| ' maybe?
If you want to experiment, the single space is the space between the two %s in the following line:
snprintf(title_summary, sizeof(title_summary), "%s %s", se->str, lang_str_get(ev->summary, se->lang));
in /src/epggrab/module/eit.c.
Updated by Anonymous almost 7 years ago
- Status changed from New to Fixed
- % Done changed from 0 to 100
Applied in changeset commit:tvheadend|e6a01316020037e4d030cc55a10427e1d7f6000b.