Project

General

Profile

Stops responding after 2-12 hours

Added by Bob Odenkirk almost 5 years ago

Hello,
I've tried to find others with a similar problem and cannot figure this out. This has happened on two different computers, and I'm at a loss..

Running latest Archlinux, latest tvh compiled from the AUR. Input sources are 2 hdhomeruns, one an hdhr4-2us and the other an hdhr5-4us (hdhr signal level mod applied for the hdhr5, but it also does it without the patch), as well as 4 IPTV services.

I've messed with the default PKGBUILD configure options to try to tune it or alter the problem to isolate the cause, still not having any luck even finding where the problem is.

What happens is it hangs after a seemingly random amount of time. The log says nothing, CPU/RAM usage all seems normal, as though nothing is wrong. The admin interface will not load, and Kodi clients say the response isn't recognized.

Generally, a kill command doesn't do anything and requires -9... It's acting like a full-on zombie, but it doesn't look like one in the system monitor.

I honestly can't figure out where to start looking. Im not a total noob, but I've also never seen anything like this. It's one of the odder things I've tried to diagnose.

If anyone can even point me in a direction of where to look to try to figure out what is responsible, I'd be very grateful and can likely figure it out from there.. but this one has me very frustrated and very confused so far. But it feels OS- related, especially because both of these computers have been Archlinux. I'm guessing I'm missing something obvious, but I can't see the forest from the trees right now.. this has me so confused right now.


Replies (4)

RE: Stops responding after 2-12 hours - Added by Joe User almost 5 years ago

You can try running from the command line with the "--trace all" option and see if the last lines give you any hints.
(or use the Configuration->debugging tab to enable trace all and specify a log file.)

Other ways is to use gdb or valgrind.

RE: Stops responding after 2-12 hours - Added by Bob Odenkirk almost 5 years ago

This is exactly what I needed! I started seeing some issues with epoll after a bit, and some googling got me all squared away.

If anyone else is having any issues like this (and finds this thread while Googling), if you see issues with epoll or have odd issues with random freezing-but-not-crashing, check your open files limit by running
pidof tvheadend
Take the number and
cat /proc/[pid]/limits

Look for "max open files", if it's at or less than 1024, up it with
prlimit -n4096:4096 -p [pid]

It's per run, so if it works, you can let user hts open more files by modifying
/etc/security/limits.conf

Hope this helps.

RE: Stops responding after 2-12 hours - Added by Joe User almost 5 years ago

Glad you figured it out - and more glad you shared your findings AND solution - many people do not bother to report back...

RE: Stops responding after 2-12 hours - Added by Bob Odenkirk almost 5 years ago

Well, there is nothing I personally hate more than finding a dozen+ people posting an exact same exact problem in various places, and either no follow-up or evasive "I figured it out, nevermind" replies... so if/when I do get desperate enough to post something, and there ends up being a solution, I always try to make it a point to share how it got resolved.. That's what FOSS is supposed to be about, right? :)

    (1-4/4)