AFrender quitting.

seven11
Posts: 20
Joined: Tue Feb 21, 2017 8:03 pm

AFrender quitting.

Post by seven11 »

Hi Timur,
We upgraded to 2.2.1 and noticed that AFrender sometimes quits on our remote render machines, which are in another office.
To give you an idea of the topology: we have a main office and a remote office connected by a tunnel over the internet. The tunnel can and does go down at any time because of outages. Most of the time, the AFrender in the remote office reconnects to the AFserver in the main office.
But sometimes the AFrender just quits. Here's the output from the AFrender log:

A good reconnect looks like this:

AFERROR: msgsendtoaddress: connect failure for msgType 'TRenderUpdate':
10.0.0.26:51000: Operation now in progress

Thu 23 Feb 05:00.44: INFO Connection lost count = 1 of 3
AFERROR: msgsendtoaddress: connect failure for msgType 'TRenderUpdate':
10.0.0.26:51000: Operation now in progress

Thu 23 Feb 05:00.57: INFO Connection lost count = 2 of 3
AFERROR: msgsendtoaddress: connect failure for msgType 'TRenderUpdate':
10.0.0.26:51000: No route to host

Thu 23 Feb 05:01.04: INFO Connection lost count = 3 of 3
AFERROR: msgsendtoaddress: connect failure for msgType 'TRenderUpdate':
10.0.0.26:51000: No route to host

Thu 23 Feb 05:01.07: INFO Connection lost count = 4 of 3
Thu 23 Feb 05:01.07: WARNING Render connection lost, trying to reconnect...
Thu 23 Feb 05:21.37: INFO Reconnected to the server
Thu 23 Feb 05:21.37: INFO Render registered.

####################
A bad reconnect looks like this:
AFERROR: msgsendtoaddress: connect failure for msgType 'TRenderUpdate':
10.0.0.26:51000: Operation now in progress

Sat 25 Feb 23:17.58: INFO Connection lost count = 1 of 3
AFERROR: msgsendtoaddress: connect failure for msgType 'TRenderUpdate':
10.0.0.26:51000: Operation now in progress

Sat 25 Feb 23:18.11: INFO Connection lost count = 2 of 3
AFERROR: msgsendtoaddress: connect failure for msgType 'TRenderUpdate':
10.0.0.26:51000: Operation now in progress

Sat 25 Feb 23:18.24: INFO Connection lost count = 3 of 3
AFERROR: msgsendtoaddress: connect failure for msgType 'TRenderUpdate':
10.0.0.26:51000: No route to host

Sat 25 Feb 23:18.28: INFO Connection lost count = 4 of 3
Sat 25 Feb 23:18.28: WARNING Render connection lost, trying to reconnect...
Sat 25 Feb 23:18.37: INFO Reconnected to the server
Sat 25 Feb 23:18.37: ERROR Render with this hostname 'efile01' already registered.
Sat 25 Feb 23:18.37: INFO Exiting render.

Why is it saying "Render with this hostname 'efile01' already registered."?

Thanks,
Scott
timurhai
Site Admin
Posts: 911
Joined: Sun Jan 15, 2017 8:40 pm
Location: Russia, Korolev

Re: AFrender quitting.

Post by timurhai »

Hi.

Maybe afserver "thinks" that the render is still online.
By default, the server marks an afrender client as offline after 60 seconds; this can be adjusted in the config:
"af_render_zombietime":60
If a render tries to connect (or reconnect) and the server already has an online render with the same name, the server sends the render a signal to exit.
Try setting "af_render_zombietime":10 seconds and "af_render_connectretries":9 (the same value or a little less).
Maybe that will work better in your situation.
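
The timestamps in the two logs above fit this explanation. The sketch below is only an illustration, not Afanasy code: it measures the time from the first logged connect failure to the "Reconnected" line and compares it with the zombie time. The function names are hypothetical. Using the first logged failure as the start of the outage is an assumption, because the log does not show when the server last heard from the render, so the real gap is somewhat longer.

```python
def seconds_of_day(stamp: str) -> int:
    """Convert a log stamp such as 'Sat 25 Feb 23:18.37' (HH:MM.SS) to seconds since midnight."""
    clock = stamp.split()[-1]                    # e.g. "23:18.37"
    hours_minutes, seconds = clock.split(".")
    hours, minutes = hours_minutes.split(":")
    return int(hours) * 3600 + int(minutes) * 60 + int(seconds)


def outage_seconds(first_failure: str, reconnected: str) -> int:
    """Seconds between two stamps; assumes both fall on the same day."""
    return seconds_of_day(reconnected) - seconds_of_day(first_failure)


def server_still_sees_render(outage: int, zombie_time: int) -> bool:
    """Assumed rule: the server keeps the render 'online' until zombie_time seconds pass."""
    return outage < zombie_time


# Stamps copied from the two logs in the first post.
good = outage_seconds("Thu 23 Feb 05:00.44", "Thu 23 Feb 05:21.37")   # 1253 seconds
bad = outage_seconds("Sat 25 Feb 23:17.58", "Sat 25 Feb 23:18.37")    # 39 seconds

for label, outage in (("good", good), ("bad", bad)):
    for zombie_time in (60, 10):
        state = "still online" if server_still_sees_render(outage, zombie_time) else "offline"
        print(f"{label} reconnect: outage {outage}s, zombie time {zombie_time}s -> server sees render as {state}")
```

Under these assumptions the bad reconnect arrives after about 39 seconds, inside the default 60-second window, while the good reconnect arrives about 20 minutes later, long after the server has dropped the old entry.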
Timur Hairulin
CGRU 3.3.1, Ubuntu 20.04, 22.04, MS Windows 10 (clients only).
timurhai
Site Admin
Posts: 911
Joined: Sun Jan 15, 2017 8:40 pm
Location: Russia, Korolev

Re: AFrender quitting.

Post by timurhai »

Sorry.
Try setting "af_render_zombietime":10 seconds and "af_render_connectretries":11 (the same value or a little more).
This way the server will treat the render as offline before the render sends its register message.
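
For reference, a minimal sketch of what the two overrides could look like as a JSON fragment. The parameter names and values are the ones given above. The top-level "cgru_config" wrapper and the helper name are assumptions for illustration; check them against your own config file before copying anything.

```python
import json


def render_timeout_overrides(zombie_time: int, connect_retries: int) -> dict:
    """Build the override fragment; the 'cgru_config' wrapper is an assumed layout."""
    return {
        "cgru_config": {
            "af_render_zombietime": zombie_time,
            "af_render_connectretries": connect_retries,
        }
    }


# Values suggested in the post above.
overrides = render_timeout_overrides(zombie_time=10, connect_retries=11)
print(json.dumps(overrides, indent=4))
```

Printing the fragment rather than writing it to disk keeps the sketch safe to run; merge the two keys into your existing configuration by hand.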
seven11
Posts: 20
Joined: Tue Feb 21, 2017 8:03 pm

Re: AFrender quitting.

Post by seven11 »

Just changed the two parameters. I'll see how they go.
Thanks Timur,
Scott
selsner
Posts: 47
Joined: Wed Jan 25, 2017 11:20 am

Re: AFrender quitting.

Post by selsner »

I am seeing the exact same problem right now with one of our remote offices.
CGRU 2.3.1 - CentOS 7.7

Sebastian Elsner - Pipeline Technical Director - RISE
www.risefx.com
timurhai
Site Admin
Posts: 911
Joined: Sun Jan 15, 2017 8:40 pm
Location: Russia, Korolev

Re: AFrender quitting.

Post by timurhai »

Have you tried this?
"Try to set "af_render_zombietime":10 seconds and "af_render_connectretries":11 to the same value or a little more"
selsner
Posts: 47
Joined: Wed Jan 25, 2017 11:20 am

Re: AFrender quitting.

Post by selsner »

Will try.