VMware Cloud Community
arnoldveenema
Contributor
Contributor

NTP client stops syncing time with its peer


Hi All,

I'm having troubles stabalizing time among ESX servers which all use the same time source. NTP is configured to use a network device as its time source (Juniper NetScreen) and this all works and syncs as aspected for a couple of days. After that time starts to drift slowly. I've tested several timesources, but every time the NTP client embedded in the ESX v5.1 build 799733 fails to update and poll its peer.

e.g. ntpd is set to sync with 11.22.33.44

using tcpdump-uw -c 5 -n -i vmk0 host 11.22.33.44 and port 123 shows good output

using watch "ntpq -p localhost" shows nice statistics like this

Every 2s: ntpq -p localhost                                 2013-07-23 12:24:04

     remote           refid      st t when poll reach   delay   offset  jitter
==============================================================================
*11.22.33.44     .LOCL.           1 u  121  128  377    6.265  -1008.1 378.536

But after a couple of days the when (last poll value) value keeps increasing without polling its peer

Every 2s: ntpq -p localhost                                 2013-07-23 12:11:50

     remote           refid      st t when poll reach   delay   offset  jitter
==============================================================================
11.22.33.44     .LOCL.           1 u  61d 1024  377    3.648  27994.3 925.134

Notice the when value which has increased up to 61 days since its last poll. It shows an offset of almost 30 seconds while for real the time is more than 15 minutes behind.

using ntpdc -c loopinfo shows a watchdog timer that is no longer increasing (it should), like the whole daemon is just freezing. The only way to correct this is restarting the NTP daemon.

This repeats itself every couple of days, give or take a few.

Any thoughts on this wierd behaviour? Anyone else experiencing the same issue?

Kind regards,

Arnold Veenema

5 Replies
sysman201110141
Contributor
Contributor

Hi Arnold,

have exactly the same problem, so I like to make some movement in this thread.

The freezing NTP client occurs randomly on different ESX hosts.

I did not found any hints in NTP troubleshooting articles, so I would like to find a way to restart NTP service every day. How to do that ?

Kind regards

Rainer

0 Kudos
arnoldveenema
Contributor
Contributor

Hi Rainer,

You could make a cron job to restart the ntpd daemon every day at 6.00AM e.g. or any other suitable time for you.

/bin/echo "0 6 * * * /etc/init.d/ntpd restart > /tmp/ntpdrestart.log 2>&1" >> /var/spool/cron/crontabs/root

After that restart the cron daemon to pickup your change

/bin/kill $(cat /var/run/crond.pid)

/bin/busybox crond

:smileyalert: Be aware that these changes will not survive a reboot. For a permanent cronjob in ESXi, you need to save the crontab file or recreate it after reboot. E.g. by means of editing the auto-backup.sh script to include your new crontab file to be saved. Links on how to do this could be found on several places like.... https://communities.vmware.com/thread/164134?start=150&tstart=0

Kind regards,

Arnold Veenema

sysman201110141
Contributor
Contributor

Ah yes,

I believed, there is no cron on ESX because I found no cron process and no crontab command.

Thank you.

0 Kudos
stephan_aichhol
Contributor
Contributor

Hello

I have the same problem on our setup (2 x ESXi 5Update1).

It happens randomly maybe after days, maybe after hours.

We configured NTP acting as server as well (firewall settings)

After a while:

NTPd stops polling the configured peers.

NTPd stops writing loopstats

NTPd stops writing peerstats

no indication on syslog refering to NTP

0 Kudos
admin
Immortal
Immortal

Check vmware based article i hope it will help you to short out this issue.. http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=103583...

0 Kudos