Problem with Job

General discussions and questions.
ibaf72
Posts: 3
Joined: Tue Aug 04, 2020 5:01 pm

Problem with Job

Post by ibaf72 »

I use version 2.2.1 on CentOS 7.
I created a Meshroom job with af.py and sent it to the server.
The job starts, and after a while nothing happens anymore.

I got the following from Output, Log and ErrorHosts:

Output:
ERROR: Can't get file:
/var/tmp/afanasy/jobs/0/2.CameraInit_1/tasks/0.0/output/1.txt
No such file or directory

Log:
Tue 01 Sep 20:48.48 : SESSION #1: Starting on "renderNode"
Tue 01 Sep 20:49.19 : Task update timeout.
Tue 01 Sep 20:49.50 : Task stop timeout.

ErrorHosts:
Task[0] error hosts:
renderNode: 2 at Tue 01 Sep 20:49.50

What did I do wrong, or what have I forgotten?

Sorry for my English.
timurhai
Site Admin
Posts: 911
Joined: Sun Jan 15, 2017 8:40 pm
Location: Russia, Korolev

Re: Problem with Job

Post by timurhai »

Hi!
It looks like a problem with the afrender -> afserver connection.
Check the render node log (from the GUI) and the logs of the afserver and afrender processes (services) for details.

Task update timeout.
- afrender has not sent the task process progress for some time (it cannot send it), or it knows nothing about the task it should start (the task was not sent).

Output:
ERROR: Can't get file:
/var/tmp/afanasy/jobs/0/2.CameraInit_1/tasks/0.0/output/1.txt
No such file or directory

- after the task finished, afrender did not send its process output to afserver

Maybe a firewall?
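
A quick way to rule the network in or out is to try opening a TCP connection from the render node to the server. The sketch below is only an illustration: the function name is made up, the host name "myserver" is a placeholder, and port 51000 is assumed to be the afserver port (check af_serverport in your own CGRU configuration). None of these values come from this thread.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution failures.
        return False


# Example call (placeholder values, replace with your own server and port):
# print(can_connect("myserver", 51000))
```

A False result only says that the connection could not be opened; it does not tell a firewall apart from a stopped service or a wrong host name.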
Timur Hairulin
CGRU 3.3.1, Ubuntu 20.04, 22.04, MS Windows 10 (clients only).

Re: Problem with Job

Post by ibaf72 »

Thanks for the tip.
The firewall and SELinux are both disabled.

I looked into the render node log through the GUI and on the node itself,
and found out where the problem is.

It produces a Python error, "Bad magic number", when it tries to execute the command,
but only when CGRU is the one executing the command.

Traceback (most recent call last):
  File "/opt/cgru/afanasy/python/services/generic.py", line 2, in <module>
    from services import service
  File "/opt/cgru/afanasy/python/services/service.py", line 11, in <module>
    import cgruconfig
  File "/opt/cgru/lib/python/cgruconfig.py", line 4, in <module>
    import collections
ImportError: Bad magic number in /opt/Meshroom/lib/collections/__init__.pyc
Traceback (most recent call last):
  File "/opt/cgru/afanasy/python/services/service.py", line 11, in <module>
    import cgruconfig
  File "/opt/cgru/lib/python/cgruconfig.py", line 6, in <module>
    import json
ImportError: Bad magic number in /opt/Meshroom/lib/json/__init__.pyc

Have I understood correctly that CGRU and Meshroom use the same *.pyc files (json.pyc? and collections.pyc?), and that
one program compiles them and the other one sees that the magic number is not correct and throws the error?
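
For reference, the magic number is the first four bytes of a .pyc file and identifies the bytecode version of the Python that compiled it; an interpreter refuses bytecode whose magic number differs from its own. A minimal sketch of that comparison, assuming Python 3.4 or newer, where importlib.util.MAGIC_NUMBER is available (on Python 2 the equivalent value comes from imp.get_magic()). The function name is made up for illustration.

```python
import importlib.util


def pyc_matches_interpreter(pyc_path):
    """Return True if the .pyc header carries this interpreter's magic number."""
    with open(pyc_path, "rb") as f:
        header = f.read(4)  # the magic number is the first 4 bytes
    return header == importlib.util.MAGIC_NUMBER


# Example call, using the path from the traceback above:
# print(pyc_matches_interpreter("/opt/Meshroom/lib/collections/__init__.pyc"))
```

Run with the system Python, a False result for a Meshroom .pyc file would be consistent with the ImportError shown above.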

Re: Problem with Job

Post by timurhai »

Yes.

CGRU uses the system Python.
When it tries to

import collections

it picks the module up from Meshroom's Python instead, and the result is:

ImportError: Bad magic number in /opt/Meshroom/lib/collections/__init__.pyc

So afrender should not run inside the Meshroom environment.
How did that happen? There are definitely other ways to use Meshroom.

Afanasy works every day with Houdini, Maya, Nuke and so on, and all of that software ships its own internal Python, just like Meshroom.
It all works with afrender without any software-specific environment setup.
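
To see which file an import would actually resolve to in a given environment, one option is to ask the import machinery directly. The sketch below assumes Python 3 and uses a made-up helper name; it would need to be run with the system Python, inside the same environment afrender is started in.

```python
import importlib.util
import os
import sysconfig


def modules_outside_stdlib(names, stdlib_dir=None):
    """Return {name: file} for modules that resolve outside the stdlib directory."""
    if stdlib_dir is None:
        stdlib_dir = sysconfig.get_paths()["stdlib"]
    stdlib_dir = os.path.realpath(stdlib_dir)
    foreign = {}
    for name in names:
        spec = importlib.util.find_spec(name)
        if spec is None or not spec.has_location:
            continue  # not found, or built-in/frozen with no file on disk
        origin = os.path.realpath(spec.origin)
        if not origin.startswith(stdlib_dir + os.sep):
            foreign[name] = origin
    return foreign


# Example call for the two modules named in the traceback:
# print(modules_outside_stdlib(["collections", "json"]))
```

If collections or json are reported with a path under /opt/Meshroom, the Meshroom environment is leaking into the afrender process.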

Re: Problem with Job

Post by ibaf72 »

OK, thanks.
Then I have to find out where I introduced the error.
Maybe some of the environment variables for CGRU and/or Meshroom are
not set correctly for the non-interactive shell.
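
One way to compare the interactive and non-interactive environments is to list every variable that has a path entry inside the Meshroom install. A sketch, assuming the /opt/Meshroom prefix seen in the traceback above; the helper name is hypothetical.

```python
import os


def env_entries_under(root, env=None):
    """Return {variable: [entries]} for path-like values with entries under root."""
    if env is None:
        env = os.environ
    root = os.path.normpath(root)
    hits = {}
    for name, value in env.items():
        # Treat every value as a path list; plain values yield a single entry.
        entries = [e for e in value.split(os.pathsep) if e]
        matching = [
            e for e in entries
            if os.path.normpath(e) == root
            or os.path.normpath(e).startswith(root + os.sep)
        ]
        if matching:
            hits[name] = matching
    return hits


# Example call against the current process environment:
# print(env_entries_under("/opt/Meshroom"))
```

Running this once from a login shell and once from the environment the render service starts in should show which variables (PYTHONPATH or PYTHONHOME, for example) differ between the two.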