Hi,
we keep getting warnings that the Log Insight cluster nodes get disconnected every once in a while. I checked the ESXi for issues with the VM or restarts, but the host has been running fine for weeks (last reboot was an update) and the VM has not restarted. I'm trying to find info in the Log Insight logs but I can't figure out where to start diagnosing this issue.
Does anyone have some hints? I would like to not open support requests every few days but I also can't find much info on this.
For the record, the issue is not permanent - the nodes reconnect on their own after a while, but every disconnect creates an internal support ticket.
Hello,
Please check the runtime.log file at /storage/var/loginsight, you will get more information there.
Also validate the NTP servers on the cluster nodes, follow KB article https://kb.vmware.com/s/article/2125130 for reference.
Regards,
Uday Mantri
Hello, thanks for your reply on this issue. I am also experiencing the issue with one of my worker nodes disconnected, but the NTP setting test is successful and I do not know how to check any of the logs in the storage path.
Any help would be appreciated.
Can you check/do the following:
- NTP and DNS configuration for all nodes
- Enough resources on ESXi hosts? Any resource contention?
- Backup using Snapshots running when nodes got disconnected?
- rebooted all the nodes?
Otherwise create a SR at GGS an upload support bundle.