Hello Mr. Robin Shen,
We are currently facing an issue in production with the agent-master connection.
We cannot reproduce the issue, but here are the symptoms:
- A build request hung in the CHECKING_BUILD_CONDITION status.
- The build request could not be cancelled, and other requests were left pending in the queue.
- The build server was unauthorized, but the build was still not cancelled.
- We could not open the server or connect to it remotely at that time.
- The server's log stopped at "Active build agent {agentAddress} timed out..."
After restarting the server, the log continued with:
Job still exists on job node...
Unable to find job (job class:...
Error processing build request...
I think the cause of the problem is that the proxy cannot be created in the getNodeService function when the agent server is lagging.
So we would like to add a timeout to that function.
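To illustrate what we have in mind, here is a minimal sketch of the timeout idea. It is not the actual server code: the class TimedCall and the method callWithTimeout are hypothetical names of ours, and we are assuming that the proxy creation inside getNodeService can be wrapped in a Callable.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical helper, not part of the product: bounds how long a blocking call may take.
public class TimedCall {
    // Daemon threads, so a stuck call cannot keep the JVM alive on shutdown.
    private static final ExecutorService EXECUTOR = Executors.newCachedThreadPool(r -> {
        Thread t = new Thread(r, "timed-call");
        t.setDaemon(true);
        return t;
    });

    // Runs the task on a worker thread and gives up with a TimeoutException
    // if no result is available within the given time.
    public static <T> T callWithTimeout(Callable<T> task, long timeout, TimeUnit unit)
            throws TimeoutException, ExecutionException, InterruptedException {
        Future<T> future = EXECUTOR.submit(task);
        try {
            return future.get(timeout, unit);
        } catch (TimeoutException e) {
            // Best effort only: this interrupts the worker, but a call blocked
            // in socket I/O may not react to the interrupt.
            future.cancel(true);
            throw e;
        }
    }
}
```

Inside getNodeService, the proxy creation would then be passed to callWithTimeout with a limit such as 30 seconds (an example value only), and a TimeoutException would be handled as a failed agent connection, so the build request does not wait forever. The caller is released after the timeout even if the worker thread itself stays blocked.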
Please let me know your opinion.
Thank you.