Feature #4873: eit Scrape title/summary title/summary - Tvheadend

Actions

Copy link

Feature #4873

closed

eit Scrape title/summary title/summary

Added by Derek Keith over 7 years ago. Updated over 7 years ago.

Status:

Fixed

Priority:

Normal

Assignee:

Adam Sutton

Category:

EPG - Grabbers

Target version:

Start date:

2018-01-19

Due date:

% Done:

100%

Estimated time:

Description

I have been trying to scrape the titles of show to remove the "New:" at the start of the title as when the file eventualy feed through to kodi the artwork cannot be found as New: is in the file name. Using $q$n.$x as the formatting string.

Using Over-the-air: EIT DVB Grabber with uk config.

I was hoping to remove it via the config file for the eit scraper.
However a combined title/summary text is made by joining the title, a space,
and the summary. The combined text is matched against the scrape_title regex
list. On a match, the EPG title is set to the match result.

I cannot see a way of knowing where the original title ended as broadcast.

If it was a combined title/summary text is made by joining the title, TWO spaces,
and the summary. I would then know whre the original title ended.
This would need a bit of rework to remove the extra space but at this point you have somenting to remove.
Or use some other caracter to join the title/summary before scraping ? " --- "

Will relate to https://tvheadend.org/issues/4801

Thanks
derek

Actions

Copy link

Updated by Jim Hague over 7 years ago

Derek Keith wrote:

I have been trying to scrape the titles of show to remove the "New:" at the start of the title as when the file eventualy feed through to kodi the artwork cannot be found as New: is in the file name. Using $q$n.$x as the formatting string.

Using Over-the-air: EIT DVB Grabber with uk config.

I was hoping to remove it via the config file for the eit scraper.
However a combined title/summary text is made by joining the title, a space,
and the summary. The combined text is matched against the scrape_title regex
list. On a match, the EPG title is set to the match result.

I cannot see a way of knowing where the original title ended as broadcast.

If it was a combined title/summary text is made by joining the title, TWO spaces,
and the summary. I would then know whre the original title ended.
This would need a bit of rework to remove the extra space but at this point you have somenting to remove.
Or use some other caracter to join the title/summary before scraping ? " --- "

Ah. Good point. When I did this I didn't think of a use case for just getting the title and so needing to know where it ends. I'll try changing to '| ' maybe?

If you want to experiment, the single space is the space between the two %s in the following line:

      snprintf(title_summary, sizeof(title_summary), "%s %s",
               se->str, lang_str_get(ev->summary, se->lang));

in /src/epggrab/module/eit.c.

Actions

Copy link

Updated by Anonymous over 7 years ago

Status changed from New to Fixed
% Done changed from 0 to 100

Applied in changeset commit:tvheadend|e6a01316020037e4d030cc55a10427e1d7f6000b.

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

Tvheadend

Custom queries

Feature #4873

eit Scrape title/summary title/summary

Updated by Jim Hague over 7 years ago

Updated by Anonymous over 7 years ago