Resolved
0 votes
I’ve had to reset my server twice in the past week to restore internet access, even though I can still ping Google from the server itself.
Saturday, July 13 2019, 02:20 PM
Responses (20)
  • Accepted Answer

    Friday, August 02 2019, 05:09 AM - #Permalink
    I haven’t had any issues since my last reboot 14 days ago. I’m starting to believe that accessing the admin page using Chrome on my iPad caused my issues by corrupting my DHCP settings.

    Chrome displays “root” for my DHCP netmask! Who knows what else it screws up, even without saving settings.
  • Accepted Answer

    pgriffith
    Friday, July 19 2019, 08:44 AM - #Permalink
    Embarrassingly, it appears my issues stemmed from a dodgy Ethernet cable. I've just moved house, and the problem started after I'd moved into my new place. These were the exact same cables I'd used previously, plugged into the exact same ports, and only 1 m CAT 6. When the ClearOS box next went down, I replaced both cables in and out of it and that fixed the problem immediately: the network came straight back up without needing a reboot, so it pretty much HAS to be cable related. Thanks to Nick for trying to figure out what was going on, and sorry for wasting your time.
  • Accepted Answer

    Friday, July 19 2019, 02:29 AM - #Permalink
    It seems to happen more often than I notice, as I randomly find my Logitech remote hubs offline; they don’t recover and need to be reset. I have also noticed messages from my SmartThings hub saying that it is now back online.
  • Accepted Answer

    Thursday, July 18 2019, 08:39 PM - #Permalink
    Eric Anderson wrote:
    Nick, can you reboot your system so you can be on the latest kernel, please?

    I've been running the new kernel since July 9th, a week before it was released to the Community. ;)
  • Accepted Answer

    Thursday, July 18 2019, 08:19 PM - #Permalink
    I agree. I tend to think it is more of a NAT problem. Have you found any way to cause it to occur? When it does occur, are only a few machines affected while others seem to work just fine?
  • Accepted Answer

    Thursday, July 18 2019, 03:35 AM - #Permalink
    If I look at the network map, I can see some devices have reverted to their default IP addresses, which really makes me think it is a DHCP or NAT problem. This is strange, as ClearOS has been running really well for a long time and I have experienced very long uptimes.

    Nick, can you reboot your system so you can be on the latest kernel, please?
  • Accepted Answer

    Thursday, July 18 2019, 03:31 AM - #Permalink
    I also have Node.js installed and running. About four weeks ago I wrote a service startup script to start my alarm integration server for SmartThings (Node), so naturally I thought that was the cause, but not so much anymore.

    # cat /var/clearos/configuration_backup/installed_apps.txt
    app-accounts
    app-accounts-core
    app-antiphishing
    app-antiphishing-core
    app-antivirus
    app-antivirus-core
    app-apcups
    app-apcups-core
    app-bandwidth-viewer
    app-bandwidth-viewer-core
    app-base
    app-base-core
    app-certificate-manager
    app-certificate-manager-core
    app-clearcenter
    app-clearcenter-core
    app-configuration-backup
    app-configuration-backup-core
    app-dashboard
    app-dashboard-core
    app-date
    app-date-core
    app-dhcp
    app-dhcp-core
    app-disk-usage
    app-disk-usage-core
    app-dns
    app-dns-core
    app-dynamic-dns
    app-dynamic-dns-core
    app-edition
    app-edition-core
    app-events
    app-events-core
    app-firewall
    app-firewall-core
    app-flexshare
    app-flexshare-core
    app-ftp
    app-ftp-core
    app-ftp-plugin-core
    app-graphical-console
    app-graphical-console-core
    app-groups
    app-groups-core
    app-incoming-firewall
    app-incoming-firewall-core
    app-intrusion-detection
    app-intrusion-detection-core
    app-intrusion-prevention
    app-intrusion-prevention-core
    app-language
    app-language-core
    app-ldap-core
    app-lets-encrypt
    app-lets-encrypt-core
    app-log-viewer
    app-log-viewer-core
    app-mail
    app-mail-core
    app-mail-notification-core
    app-marketplace
    app-marketplace-core
    app-miniupnpd
    app-miniupnpd-core
    app-mode-core
    app-network
    app-network-core
    app-network-map
    app-network-map-core
    app-ntp
    app-ntp-core
    app-openldap-core
    app-openldap-directory-core
    app-openvpn
    app-openvpn-core
    app-openvpn-plugin-core
    app-organization
    app-organization-core
    app-plex
    app-plex-core
    app-port-forwarding
    app-port-forwarding-core
    app-process-viewer
    app-process-viewer-core
    app-raid
    app-raid-core
    app-registration
    app-registration-core
    app-samba
    app-samba-common-core
    app-samba-core
    app-samba-extension-core
    app-simple-mode-core
    app-smtp-core
    app-software-updates
    app-software-updates-core
    app-ssh-server
    app-ssh-server-core
    app-storage-core
    app-support
    app-support-core
    app-suva-core
    app-tasks-core
    app-user-certificates
    app-user-certificates-core
    app-user-certificates-plugin-core
    app-user-profile
    app-user-profile-core
    app-users
    app-users-core
  • Accepted Answer

    Wednesday, July 17 2019, 11:32 PM - #Permalink
    Sometimes I fake out my system so that it thinks there are multiple NICs so that I can use the MultiWAN. I do this if I know that I'm going to add a NIC later.

    Set up a bridge and have it be external but don't put any interfaces on it. You can also do that with a VLAN.
  • Accepted Answer

    Tuesday, July 16 2019, 07:15 PM - #Permalink
    Ah yes and it shows no IP and only 2 NICs in lspci, because that is the underlying ppp0 NIC.
  • Accepted Answer

    Tuesday, July 16 2019, 06:59 PM - #Permalink
    He probably means 'enp0s25'
  • Accepted Answer

    Tuesday, July 16 2019, 04:06 PM - #Permalink
    There is nothing obvious there. From your output, you don't have an RTL8168, which can cause issues. Your DHCP server config is OK, although you have narrowed the IP range right down. Your router IP is in a not-so-standard place, but that is not an issue.

    I've no idea what your np0s25 interface is. I've never seen one with this sort of name.

    I started replying to the post before Dave replied, but for Dave: if it is a firewall issue and connection tracking is stopping a firewall restart from having an immediate effect, you can flush all the firewall connection tracking with:
    yum install conntrack
    conntrack -F
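
    Before flushing, it may help to check how full the connection tracking table is, since a full table is one way conntrack can end up dropping new connections. A minimal sketch, assuming the nf_conntrack module is loaded so that the two /proc files below exist; the function name conntrack_usage is made up for illustration:

```shell
# conntrack_usage is a hypothetical helper, not a ClearOS or conntrack command.
# It reads the kernel's current and maximum tracked-connection counts.
# Optional arguments override the /proc paths (useful for testing).
conntrack_usage() {
    count_file="${1:-/proc/sys/net/netfilter/nf_conntrack_count}"
    max_file="${2:-/proc/sys/net/netfilter/nf_conntrack_max}"
    count=$(cat "$count_file") || return 1
    max=$(cat "$max_file") || return 1
    echo "$count of $max tracked connections ($(( count * 100 / max ))% full)"
}
```

    Running conntrack_usage during an outage, and again after conntrack -F, would show whether the flush actually changed anything.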
  • Accepted Answer

    Tuesday, July 16 2019, 03:33 PM - #Permalink
    Eric,

    I have a similar problem. Up until your post I was just attributing the failure to the experimental motherboard of the server, but your description is similar to some behavior I've been seeing. The symptoms are:

    - the network appears to go offline
    - pings don't work from workstations to any external IP except the server's public IP
    - the server itself can ping everything and DNS is working
    - restarting the firewall doesn't seem to work immediately but after a few minutes it does come back online

    I do not experience the DHCP issue you or PGriffith report. The biggest problem I have with this bug is that I cannot reproduce it at will but have to wait for it and up until now it seemed to be a hardware issue. So thanks for your report.

    Can you send me a list of the ClearOS modules that you have installed? The list is in the text file /var/clearos/configuration_backup/installed_apps.txt

    That can help us narrow down a potential interaction. I know that some machines are not affected by this issue so I don't know what the outlier element is. When I talked to Nick he indicated that conntrack can behave poorly. Next time I have this issue I will be snapshotting my iptables and nat tables to compare against a working set.
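
    The snapshot-and-compare idea could be sketched as below. It assumes root access and the standard iptables-save command; the function names and the /root/fw-snapshots directory are made up for illustration:

```shell
# Hypothetical helpers; run as root on the gateway.

# Strip the parts of iptables-save output that change on every run
# (comment lines with timestamps, and the [packets:bytes] counters on
# chain policy lines) so that a diff shows only real rule changes.
normalize_rules() {
    sed -e '/^#/d' -e 's/ \[[0-9]*:[0-9]*]$//'
}

# Save the filter and nat tables under a label such as "working" or "broken".
# SNAP_DIR is an assumed override for where snapshots are stored.
snapshot_rules() {
    label="$1"
    out="${SNAP_DIR:-/root/fw-snapshots}/${label}.rules"
    mkdir -p "$(dirname "$out")"
    { iptables-save -t filter; iptables-save -t nat; } | normalize_rules > "$out"
    echo "$out"
}

# Show what differs between two snapshots; no output means the rule sets match.
compare_rules() {
    diff -u "$1" "$2"
}
```

    The intended use would be snapshot_rules working while everything is healthy, snapshot_rules broken during an outage, then compare_rules on the two files.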
  • Accepted Answer

    pgriffith
    Tuesday, July 16 2019, 02:26 PM - #Permalink
    Thanks Nick, here is everything you requested. Let me know if there is anything else you'd like to see.

    yum.log
    Jul 01 01:54:01 Updated: python-libs-2.7.5-80.el7_6.x86_64
    Jul 01 01:54:01 Updated: 1:app-configuration-backup-core-2.5.2-1.v7.noarch
    Jul 01 01:54:01 Updated: 1:app-configuration-backup-2.5.2-1.v7.noarch
    Jul 01 01:54:01 Updated: python-2.7.5-80.el7_6.x86_64
    Jul 01 01:54:02 Updated: 1:app-storage-core-2.6.15-1.v7.noarch
    Jul 01 01:54:02 Updated: snort-2.9.6.2-9.v7.x86_64
    Jul 07 11:57:12 Updated: glibc-common-2.17-260.el7_6.6.x86_64
    Jul 07 11:57:13 Updated: glibc-2.17-260.el7_6.6.x86_64
    Jul 07 11:57:13 Updated: systemd-libs-219-62.el7_6.7.x86_64
    Jul 07 11:57:15 Updated: systemd-219-62.el7_6.7.x86_64
    Jul 07 11:57:15 Updated: 7:device-mapper-1.02.149-10.el7_6.8.x86_64
    Jul 07 11:57:15 Updated: 7:device-mapper-libs-1.02.149-10.el7_6.8.x86_64
    Jul 07 11:57:15 Updated: 7:device-mapper-event-libs-1.02.149-10.el7_6.8.x86_64
    Jul 07 11:57:15 Updated: 7:device-mapper-event-1.02.149-10.el7_6.8.x86_64
    Jul 07 11:57:15 Updated: 7:lvm2-libs-2.02.180-10.el7_6.8.x86_64
    Jul 07 11:57:15 Updated: libteam-1.27-6.el7_6.1.x86_64
    Jul 07 11:57:17 Updated: 1:app-base-core-2.7.4-1.v7.noarch
    Jul 07 11:57:17 Updated: 1:app-base-2.7.4-1.v7.noarch
    Jul 07 11:57:17 Updated: teamd-1.27-6.el7_6.1.x86_64
    Jul 07 11:57:18 Updated: 7:lvm2-2.02.180-10.el7_6.8.x86_64
    Jul 07 11:57:18 Updated: systemd-sysv-219-62.el7_6.7.x86_64
    Jul 07 11:57:18 Updated: nscd-2.17-260.el7_6.6.x86_64
    Jul 07 11:57:29 Updated: 2:microcode_ctl-2.1-47.5.el7_6.x86_64
    Jul 07 11:57:29 Updated: systemd-python-219-62.el7_6.7.x86_64
    Jul 07 11:57:29 Updated: 2:vim-minimal-7.4.160-6.el7_6.x86_64
    Jul 07 11:57:41 Updated: gconsole-60.3.0-1.v7.2.x86_64
    Jul 10 18:22:44 Updated: 1:app-configuration-backup-core-2.5.3-1.v7.noarch
    Jul 10 18:22:44 Updated: 1:app-configuration-backup-2.5.3-1.v7.noarch
    Jul 10 18:22:44 Updated: 1:app-storage-core-2.6.16-1.v7.noarch


    network.conf
    # Network mode
    MODE="gateway"

    # Network interface roles
    EXTIF="ppp0"
    LANIF="ens2"
    DMZIF=""
    HOTIF=""

    # Domain and Internet Hostname
    DEFAULT_DOMAIN="griff.com"
    INTERNET_HOSTNAME="gateway.griff.com"

    # Extra LANS
    EXTRALANS=""

    # ISP Maximum Speeds
    PPP0_MAX_DOWNSTREAM=46310
    PPP0_MAX_UPSTREAM=17530
    ENP0S25_MAX_DOWNSTREAM=0
    ENP0S25_MAX_UPSTREAM=0


    dhcp.conf
    # This file is managed by the API.  Please add custom options in dnsmasq.conf.
    dhcp-option=ens2,1,255.255.255.0
    dhcp-option=ens2,28,10.0.0.255
    dhcp-option=ens2,3,10.0.0.139
    dhcp-option=ens2,6,10.0.0.139
    dhcp-range=ens2,10.0.0.100,10.0.0.130,24h


    ifconfig output
    np0s25: flags=4163<UP,BROADCAST,RUNNING,MULTICAST>  mtu 1500
    inet6 fd79:1f40:b55b:0:dad3:85ff:fe00:a6a8 prefixlen 64 scopeid 0x0<global>
    inet6 fe80::dad3:85ff:fe00:a6a8 prefixlen 64 scopeid 0x20<link>
    ether d8:d3:85:00:a6:a8 txqueuelen 1000 (Ethernet)
    RX packets 2785005 bytes 2428057977 (2.2 GiB)
    RX errors 0 dropped 0 overruns 0 frame 0
    TX packets 2028018 bytes 869865354 (829.5 MiB)
    TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
    device interrupt 22 memory 0xd4800000-d4820000

    ens2: flags=4163<UP,BROADCAST,RUNNING,MULTICAST> mtu 1500
    inet 10.0.0.139 netmask 255.255.255.0 broadcast 10.0.0.255
    inet6 fe80::211:22ff:fe33:476e prefixlen 64 scopeid 0x20<link>
    ether 00:11:22:33:47:6e txqueuelen 1000 (Ethernet)
    RX packets 2184868 bytes 869251946 (828.9 MiB)
    RX errors 0 dropped 0 overruns 0 frame 0
    TX packets 2771234 bytes 2396051585 (2.2 GiB)
    TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0
    device interrupt 18

    lo: flags=73<UP,LOOPBACK,RUNNING> mtu 65536
    inet 127.0.0.1 netmask 255.0.0.0
    inet6 ::1 prefixlen 128 scopeid 0x10<host>
    loop txqueuelen 1000 (Local Loopback)
    RX packets 2518 bytes 296894 (289.9 KiB)
    RX errors 0 dropped 0 overruns 0 frame 0
    TX packets 2518 bytes 296894 (289.9 KiB)
    TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0

    ppp0: flags=4305<UP,POINTOPOINT,RUNNING,NOARP,MULTICAST> mtu 1492
    inet 118.208.205.182 netmask 255.255.255.255 destination 150.101.32.142
    ppp txqueuelen 3 (Point-to-Point Protocol)
    RX packets 2782859 bytes 2355523608 (2.1 GiB)
    RX errors 0 dropped 0 overruns 0 frame 0
    TX packets 2026123 bytes 816893239 (779.0 MiB)
    TX errors 0 dropped 0 overruns 0 carrier 0 collisions 0


    lspci -k | grep Eth -A 3
    00:19.0 Ethernet controller: Intel Corporation 82567LM Gigabit Network Connection (rev 03)
    Subsystem: Hewlett-Packard Company Device 30db
    Kernel driver in use: e1000e
    Kernel modules: e1000e
    --
    02:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8053 PCI-E Gigabit Ethernet Controller (rev 22)
    Subsystem: Santa Cruz Operation Device 2222
    Kernel driver in use: sky2
    Kernel modules: sky2
  • Accepted Answer

    Tuesday, July 16 2019, 09:52 AM - #Permalink
    If you have not dug into ClearOS before, this could be a little tricky. Firstly, I am not sure we have released any apps which could cause this. There may have been some upstream releases, which we just push through, that are causing problems, but I am not aware of anything on my machines.

    If you are a Windows user, please can you get hold of a copy of WinSCP and PuTTY? WinSCP is a graphical file manager and text editor, and PuTTY is a remote console tool. You can copy from PuTTY just by selecting text with the mouse, and paste into it by right-clicking.

    In PuTTY, what is the output of:
    lspci -k | grep Eth -A 3


    From /var/log/yum.log, what updates have you received from about when you started noticing the issues? Copy and paste the section of log from when the problem began.
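
    One way to pull out just the relevant section, as a sketch: yum.log lines start with a month and a day (for example "Jul 07 11:57:12"), so awk can filter on those two fields. The function name is made up for illustration, and it only handles a range of days within a single month:

```shell
# yum_updates_between MONTH FIRST_DAY LAST_DAY [LOGFILE]
# Hypothetical helper: print yum.log entries for MONTH (e.g. Jul) whose day
# of the month lies between FIRST_DAY and LAST_DAY, inclusive.
yum_updates_between() {
    month="$1"; first="$2"; last="$3"; log="${4:-/var/log/yum.log}"
    # "$2 + 0" turns a zero-padded day such as "07" into the number 7.
    awk -v m="$month" -v a="$first" -v b="$last" \
        '$1 == m && $2 + 0 >= a + 0 && $2 + 0 <= b + 0' "$log"
}
```

    For example, yum_updates_between Jul 1 16 would list everything yum logged from 1 to 16 July.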

    Following from the previous posts, what are the contents of /etc/dnsmasq.d/dhcp.conf and /etc/clearos/network.conf, and the result of "ifconfig"?

    Please put all results between code tags (the piece of paper icon with a <> on it at the top of the reply box).
  • Accepted Answer

    pgriffith
    Tuesday, July 16 2019, 09:21 AM - #Permalink
    I've been having some weird issues as well over the last 2 weeks. My network just stops working: no machine on the network gets a response from the DHCP server if I try an ipconfig /release and /renew. I can log into the box locally and see that it still has an external IP, but I can't access the machine from the web console, and the only way to restore it is to restart the box. It had been running stable for literally years until the last 2-3 weeks. I'm having to do this roughly every day, though sometimes I'll get 2 or 3 days out of it. I thought it had rectified itself, as it had been a few days, but it just happened again, hence me looking in these forums to see if anyone else is reporting an issue. If anyone wants me to check anything, you will need to step me through it. I'm a total Linux noob and have never had to go 'under the hood', so to speak, only the web interface.
  • Accepted Answer

    Sunday, July 14 2019, 08:19 PM - #Permalink
    Really odd. I am not aware of any updates which could have done that. The ClearOS default DHCP range is .100-.254. I don't use that and use .128-.191 (an exact /26 subnet), and I have not seen any change to it. Also, the DHCP server has not been updated since at least November. There was a change and it is in updates-testing, but the change needs to be backed out before it can be released.
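
    To see why .128-.191 is an exact /26: a /26 holds 2^(32-26) = 64 addresses, .128 is a multiple of 64, and .128 through .191 is exactly 64 addresses. A small sketch of that check, with a made-up function name, working on last octets within a /24:

```shell
# is_exact_block START END PREFIX
# Succeeds when the last-octet range START..END is exactly one /PREFIX block.
# Hypothetical helper; only meaningful for prefixes 24 to 32 inside one /24.
is_exact_block() {
    start="$1"; end="$2"; prefix="$3"
    size=$(( 1 << (32 - prefix) ))   # number of addresses in one /PREFIX block
    # The range must start on a block boundary and span exactly one block.
    [ $(( start % size )) -eq 0 ] && [ $(( end - start + 1 )) -eq "$size" ]
}
```

    So is_exact_block 128 191 26 succeeds, while the default range checked with is_exact_block 100 254 26 does not.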
  • Accepted Answer

    Sunday, July 14 2019, 05:55 PM - #Permalink
    The update changed my DHCP start and end range for some reason. It seems to split the range of the netmask, with the first half for static and the second for DHCP.

    Also, Chrome on iPad doesn’t display all fields correctly, specifically the IP address fields; the bottom one shows “root” for the netmask.
  • Accepted Answer

    Saturday, July 13 2019, 09:37 PM - #Permalink
    You don't have the problem NIC I was looking for, so I don't know the fix. Are there any clues in the logs?
  • Accepted Answer

    Saturday, July 13 2019, 04:38 PM - #Permalink
    Last login: Sat Jul 13 09:11:56 2019 from erics-ipad.1scom.net
    [root@orion ~]# lspci -k | grep Eth -A 3
    00:19.0 Ethernet controller: Intel Corporation 82579V Gigabit Network Connection (rev 04)
    Subsystem: Gigabyte Technology Co., Ltd Device e000
    Kernel driver in use: e1000e
    Kernel modules: e1000e
    --
    07:00.0 Ethernet controller: Qualcomm Atheros AR8161 Gigabit Ethernet (rev 10)
    Subsystem: Gigabyte Technology Co., Ltd Device e000
    Kernel driver in use: alx
    Kernel modules: alx
  • Accepted Answer

    Saturday, July 13 2019, 04:11 PM - #Permalink
    No problems here. What is the output of:
    lspci -k | grep Eth -A 3