Search found 44 matches

by keyframe
Wed Jul 15, 2020 8:47 pm
Forum: Houdini
Topic: PDG Scheduler?
Replies: 1
Views: 380

PDG Scheduler?

Heya Timur,

Wondering if you are considering adding a PDG scheduler for Afanasy? Is that something that's on your radar?

Best,

Gene
by keyframe
Fri Jan 24, 2020 6:27 pm
Forum: General
Topic: tasks failing to stop
Replies: 15
Views: 4096

Re: tasks failing to stop

For posterity, It's the 'umask 022' portion of the command string. For some reason when I do that, the process spawned AFTER the ';' does not terminate as it should. What does seem to work is 'umask 022 && hou2rsp parms here' instead of 'umask 022; hou2rsp parms here' Phew. @Timur, thank you for hel...
by keyframe
Fri Jan 24, 2020 3:18 pm
Forum: General
Topic: tasks failing to stop
Replies: 15
Views: 4096

Re: tasks failing to stop

The plot thickens! I found a flame machine in the building that was still running centos7, and added it to the afanasy pool. I was surprised to see it behaving the same way (the child processes fails to terminate), which suggests that whatever changed was not related to the centos7->8 change, but be...
by keyframe
Thu Jan 23, 2020 10:55 pm
Forum: General
Topic: tasks failing to stop
Replies: 15
Views: 4096

Re: tasks failing to stop

Heya Timur, Investigating this further, here's what I turned up so far: PPID PID PGID 5015 19530 19530 19530 ? -1 SNs 0 0:00 su -m keyframe -c umask 022;redshiftCmdLine /dump/.farm/my_rs.1064.rs 19530 19531 19531 19531 ? -1 SNs 1000 0:00 bash -c umask 022;redshiftCmdLine /dump/.farm/my_rs.1064.rs 19...
by keyframe
Thu Jan 23, 2020 4:28 pm
Forum: General
Topic: machines fails to come back online after reboot
Replies: 4
Views: 1203

Re: machines fails to come back online after reboot

Thanks Timur,

Much appreciated. Do any files with the .json extension get loaded? or is it specific to config.json and farm.json?

G
by keyframe
Thu Jan 23, 2020 3:28 pm
Forum: General
Topic: machines fails to come back online after reboot
Replies: 4
Views: 1203

machines fails to come back online after reboot

... from the first world problems category ... I've noticed several of the newer machines reboot faster then they become deregistered from afserver. When they come back online and attempt to register, they get rejected because afserver hasn't realized that they've rebooted. There are several timing ...
by keyframe
Tue Jan 21, 2020 2:53 pm
Forum: General
Topic: tasks failing to stop
Replies: 15
Views: 4096

Re: tasks failing to stop

I will investigate this option further, but i'm not optimistic (unless the change occured between centos7 and centos8) since I've been doing this for a few years now, and the issue only surfaced recently... ... of course, I was warned about upgrade the OS, python2.7 -> python3, and Afanasy at the sa...
by keyframe
Mon Jan 20, 2020 9:10 pm
Forum: General
Topic: tasks failing to stop
Replies: 15
Views: 4096

Re: tasks failing to stop

The process tree looks like this, and I've noticed that su behaves a little differently between centos 7 and 8 regarding permissions -- perhaps there's more going on there that i'm unaware of. systemd(1)─┬─ModemManager(1097)─┬─{ModemManager}(1149) │ └─{ModemManager}(1165) ├─NetworkManager(1123)─┬─{N...
by keyframe
Mon Jan 20, 2020 6:43 pm
Forum: General
Topic: tasks failing to stop
Replies: 15
Views: 4096

Re: tasks failing to stop

Thanks for the insight. I'll debug further. I wonder whether this is related to me switching users as part of the task. I've added a su - <submitting_user_name> - c "<command to execute>" to the command in order to the resulting frames to be owned by the same person that submitted the render, rather...
by keyframe
Mon Jan 20, 2020 5:07 pm
Forum: General
Topic: tasks failing to stop
Replies: 15
Views: 4096

Re: tasks failing to stop

Here's something interesting.

The PID of the task running on tws12 is 12377, NOT 12369 like the afrender log seems to suggest.

G