help troubleshooting an intermittent network problem
Posted: Mon Aug 10, 2015 4:17 am
Hi, I've got a problem with my Amahi box that I am having trouble identifying. The symptom is that every so often (say every few days, sometimes a bit longer) the internet stops working for my connected machines. I use the Amahi DNS and DHCP server, and I think that the problem is somewhere in that lot.
I have not been able to get to the network troubleshooter (the link from the wiki seems dead?), but my network currently passes all the other steps: I am able to ping hda, the router, 8.8.8.8 and yahoo.com, and I can access http://hda in a browser window. Sorry this post is vague; please let me know what I should do to narrow down the problem! I should note that I can't remember making any recent changes to the machine that could have caused this problem.
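One way to narrow this down the next time the outage happens might be to check, from an affected client, whether raw IP traffic still works while name lookups fail. The following is only a rough sketch under assumptions: it reuses 8.8.8.8 and yahoo.com from the checks above, assumes a TCP connection to port 53 is an acceptable stand-in for ping, and the function names are made up for illustration.

```python
import socket


def can_reach(ip, port=53, timeout=3.0):
    """Return True if a TCP connection to ip:port succeeds (stand-in for ping)."""
    try:
        with socket.create_connection((ip, port), timeout=timeout):
            return True
    except OSError:
        return False


def can_resolve(name):
    """Return True if the system resolver can look up name."""
    try:
        socket.getaddrinfo(name, None)
        return True
    except OSError:
        return False


def classify(ip_ok, dns_ok):
    """Turn the two check results into a rough diagnosis label."""
    if ip_ok and dns_ok:
        return "ok"
    if ip_ok:
        return "dns"      # traffic by IP works, name lookups fail
    if dns_ok:
        return "ip-only"  # names resolve (possibly locally) but no route out
    return "down"         # neither works: link, DHCP lease or routing


def diagnose(ip="8.8.8.8", name="yahoo.com"):
    """Run both checks and return the diagnosis label."""
    return classify(can_reach(ip), can_resolve(name))
```

Calling `diagnose()` during an outage and getting `"dns"` would point at the DNS side, while `"down"` would suggest looking at the DHCP lease or routing instead.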
My current solution is simple and effective - I just reboot the server when I lose internet connection. But I'd rather work out what's going wrong and solve it. Can someone please point me to the appropriate logs to review so that I can get to the bottom of this problem?
Here's a log extract that might be relevant... /var/log/messages is filled with nmbd and smbd error messages. Typical extract:
Aug 10 13:07:07 localhost dnsmasq-dhcp[1558]: DHCPREQUEST(p5p1) 192.168.1.52 00:04:20:22:de:29
Aug 10 13:07:07 localhost dnsmasq-dhcp[1558]: DHCPACK(p5p1) 192.168.1.52 00:04:20:22:de:29 squeezeboxtouch
Aug 10 13:07:07 localhost dnsmasq-dhcp[1558]: DHCPREQUEST(p5p1) 192.168.1.52 00:04:20:22:de:29
Aug 10 13:07:07 localhost dnsmasq-dhcp[1558]: DHCPACK(p5p1) 192.168.1.52 00:04:20:22:de:29 squeezeboxtouch
Aug 10 13:07:45 localhost nmbd[997]: [2015/08/10 13:07:45.819182, 0] ../source3/nmbd/nmbd_browsesync.c:354(find_domain_master_name_query_fail)
Aug 10 13:07:45 localhost nmbd[997]: find_domain_master_name_query_fail:
Aug 10 13:07:45 localhost nmbd[997]: Unable to find the Domain Master Browser name WORKGROUP<1b> for the workgroup WORKGROUP.
Aug 10 13:07:45 localhost nmbd[997]: Unable to sync browse lists in this workgroup.
Aug 10 13:08:30 localhost kernel: [513606.695203] CIFS VFS: Server 127.0.0.1 has not responded in 120 seconds. Reconnecting...
Aug 10 13:08:30 localhost smbd[13704]: [2015/08/10 13:08:30.448965, 0] ../source3/smbd/process.c:2655(keepalive_fn)
Aug 10 13:08:30 localhost smbd[13704]: send_keepalive failed for client 0.0.0.0. Error Broken pipe - exiting
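Since the log is dominated by a few daemons, a quick tally per process can help show which service is noisy around the time of an outage. This is a minimal sketch, assuming the traditional syslog line layout seen in the extract above; the regular expression and the function name `count_by_process` are illustrative, not part of Amahi or any existing tool.

```python
import re
from collections import Counter

# Matches lines like: "Aug 10 13:07:45 localhost nmbd[997]: message"
# Groups: timestamp, host, process name, message (the [pid] part is optional).
LINE_RE = re.compile(
    r"^(\w{3}\s+\d+\s[\d:]{8})\s(\S+)\s([^\s:\[]+)(?:\[\d+\])?:\s?(.*)$"
)

# A few lines copied from the extract above, used as sample input.
SAMPLE = [
    "Aug 10 13:07:07 localhost dnsmasq-dhcp[1558]: DHCPREQUEST(p5p1) 192.168.1.52 00:04:20:22:de:29",
    "Aug 10 13:07:45 localhost nmbd[997]: find_domain_master_name_query_fail:",
    "Aug 10 13:07:45 localhost nmbd[997]: Unable to sync browse lists in this workgroup.",
    "Aug 10 13:08:30 localhost kernel: [513606.695203] CIFS VFS: Server 127.0.0.1 has not responded in 120 seconds. Reconnecting...",
    "Aug 10 13:08:30 localhost smbd[13704]: send_keepalive failed for client 0.0.0.0. Error Broken pipe - exiting",
]


def count_by_process(lines):
    """Count log lines per process name, skipping lines that do not parse."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            counts[match.group(3)] += 1
    return counts


if __name__ == "__main__":
    for process, count in count_by_process(SAMPLE).most_common():
        print(process, count)
```

To run it against the real log, pass the open file instead of `SAMPLE`, for example `count_by_process(open("/var/log/messages"))`; reading that file usually requires root.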