Feature #4873

eit Scrape title/summary

Added by Derek Keith almost 7 years ago. Updated over 6 years ago.

Status: Fixed
Priority: Normal
Assignee:
Category: EPG - Grabbers
Target version: -
Start date: 2018-01-19
Due date:
% Done: 100%
Estimated time:
Description

I have been trying to scrape show titles to remove the "New:" at the start of the title, because when the file eventually feeds through to Kodi the artwork cannot be found while "New:" is in the file name. I am using $q$n.$x as the formatting string.

Using Over-the-air: EIT DVB Grabber with uk config.

I was hoping to remove it via the config file for the eit scraper.
However a combined title/summary text is made by joining the title, a space,
and the summary. The combined text is matched against the scrape_title regex
list. On a match, the EPG title is set to the match result.

I cannot see a way of knowing where the original title ended as broadcast.
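
To illustrate the ambiguity, here is a minimal sketch. The helper names and the programme data are made up for this example; only the "%s %s" join comes from the behaviour described above. Two different title/summary splits give exactly the same combined text, so no regex applied afterwards can tell them apart.

```c
#include <stdio.h>
#include <string.h>

/* Join title and summary with a single space, as described above.
 * join_with_space() is a hypothetical helper, not a tvheadend function. */
static int join_with_space(char *out, size_t outlen,
                           const char *title, const char *summary)
{
  return snprintf(out, outlen, "%s %s", title, summary);
}

/* Returns 1 when two different title/summary splits produce
 * byte-identical combined text. The broadcast data is invented. */
static int splits_are_indistinguishable(void)
{
  char a[256], b[256];

  /* Split 1: short title, summary starts with "Special." */
  join_with_space(a, sizeof(a), "New: Top Gear",
                  "Special. The team drive across Africa.");
  /* Split 2: longer title, shorter summary */
  join_with_space(b, sizeof(b), "New: Top Gear Special.",
                  "The team drive across Africa.");

  return strcmp(a, b) == 0;
}
```

Both calls produce the same string, so the point where the broadcast title ended is lost once the two fields are joined.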

If the combined title/summary text were instead made by joining the title, TWO spaces,
and the summary, I would then know where the original title ended.
This would need a bit of rework to remove the extra space, but at that point you have something to remove.
Or use some other characters to join the title/summary before scraping? " --- "

Will relate to https://tvheadend.org/issues/4801

Thanks
derek

History

#1

Updated by Jim Hague almost 7 years ago

Ah. Good point. When I did this I didn't think of a use case for just getting the title and so needing to know where it ends. I'll try changing to '| ' maybe?

If you want to experiment, the single space is the space between the two %s in the following line:

      snprintf(title_summary, sizeof(title_summary), "%s %s",
               se->str, lang_str_get(ev->summary, se->lang));

in /src/epggrab/module/eit.c.
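
For anyone experimenting, here is a rough sketch of the separator idea. It is not the change that was eventually committed (that is not shown in this ticket). TITLE_SUMMARY_SEP, NEW_PREFIX, join_title_summary() and extract_title() are all hypothetical names for illustration, and plain string functions stand in for the scrape_title regex list.

```c
#include <stdio.h>
#include <string.h>

#define TITLE_SUMMARY_SEP " --- "  /* hypothetical separator, as suggested above */
#define NEW_PREFIX        "New: "  /* prefix to strip from the title */

/* Join title and summary with an explicit separator instead of one space. */
static int join_title_summary(char *out, size_t outlen,
                              const char *title, const char *summary)
{
  return snprintf(out, outlen, "%s%s%s", title, TITLE_SUMMARY_SEP, summary);
}

/* Copy the title part of the combined text into out, dropping a leading
 * "New: " if present. Returns 0 on success, -1 if the separator is
 * missing or out is too small. */
static int extract_title(const char *combined, char *out, size_t outlen)
{
  const char *end = strstr(combined, TITLE_SUMMARY_SEP);
  size_t plen = strlen(NEW_PREFIX);
  size_t len;

  if (end == NULL)
    return -1;

  /* Strip the prefix only when it lies wholly before the separator. */
  if (strncmp(combined, NEW_PREFIX, plen) == 0 && combined + plen <= end)
    combined += plen;

  len = (size_t)(end - combined);
  if (len >= outlen)
    return -1;

  memcpy(out, combined, len);
  out[len] = '\0';
  return 0;
}
```

With the combined text "New: Doctor Who --- The Doctor lands in Victorian London.", extract_title() yields "Doctor Who". The first occurrence of the separator is taken as the end of the title, so a title that itself contains " --- " would be cut short. That is one reason to choose a separator unlikely to appear in broadcast data.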

#2

Updated by Anonymous over 6 years ago

  • Status changed from New to Fixed
  • % Done changed from 0 to 100

Applied in changeset commit:tvheadend|e6a01316020037e4d030cc55a10427e1d7f6000b.
