Hi All,
I'm having troubles stabalizing time among ESX servers which all use the same time source. NTP is configured to use a network device as its time source (Juniper NetScreen) and this all works and syncs as aspected for a couple of days. After that time starts to drift slowly. I've tested several timesources, but every time the NTP client embedded in the ESX v5.1 build 799733 fails to update and poll its peer.
e.g. ntpd is set to sync with 11.22.33.44
using tcpdump-uw -c 5 -n -i vmk0 host 11.22.33.44 and port 123 shows good output
using watch "ntpq -p localhost" shows nice statistics like this
Every 2s: ntpq -p localhost 2013-07-23 12:24:04
remote refid st t when poll reach delay offset jitter
==============================================================================
*11.22.33.44 .LOCL. 1 u 121 128 377 6.265 -1008.1 378.536
But after a couple of days the when (last poll value) value keeps increasing without polling its peer
Every 2s: ntpq -p localhost 2013-07-23 12:11:50
remote refid st t when poll reach delay offset jitter
==============================================================================
11.22.33.44 .LOCL. 1 u 61d 1024 377 3.648 27994.3 925.134
Notice the when value which has increased up to 61 days since its last poll. It shows an offset of almost 30 seconds while for real the time is more than 15 minutes behind.
using ntpdc -c loopinfo shows a watchdog timer that is no longer increasing (it should), like the whole daemon is just freezing. The only way to correct this is restarting the NTP daemon.
This repeats itself every couple of days, give or take a few.
Any thoughts on this wierd behaviour? Anyone else experiencing the same issue?
Kind regards,
Arnold Veenema
Hi Arnold,
have exactly the same problem, so I like to make some movement in this thread.
The freezing NTP client occurs randomly on different ESX hosts.
I did not found any hints in NTP troubleshooting articles, so I would like to find a way to restart NTP service every day. How to do that ?
Kind regards
Rainer
Hi Rainer,
You could make a cron job to restart the ntpd daemon every day at 6.00AM e.g. or any other suitable time for you.
/bin/echo "0 6 * * * /etc/init.d/ntpd restart > /tmp/ntpdrestart.log 2>&1" >> /var/spool/cron/crontabs/root
After that restart the cron daemon to pickup your change
/bin/kill $(cat /var/run/crond.pid)
/bin/busybox crond
:smileyalert: Be aware that these changes will not survive a reboot. For a permanent cronjob in ESXi, you need to save the crontab file or recreate it after reboot. E.g. by means of editing the auto-backup.sh script to include your new crontab file to be saved. Links on how to do this could be found on several places like.... https://communities.vmware.com/thread/164134?start=150&tstart=0
Kind regards,
Arnold Veenema
Ah yes,
I believed, there is no cron on ESX because I found no cron process and no crontab command.
Thank you.
Hello
I have the same problem on our setup (2 x ESXi 5Update1).
It happens randomly maybe after days, maybe after hours.
We configured NTP acting as server as well (firewall settings)
After a while:
NTPd stops polling the configured peers.
NTPd stops writing loopstats
NTPd stops writing peerstats
no indication on syslog refering to NTP
Check vmware based article i hope it will help you to short out this issue.. http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=103583...