Skip to content

gluon-wan-dnsmasq crash after quick link downs+ups #2950

@T-X

Description

@T-X

Bug report

What is the problem?

  • What is not working as expected?
    • fastd is unable to resolve hostnames and therefore unable to connect
    • "gluon-wan ping google.com" is not working, while "ping 141.1.1.1" works fine over WAN
  • How is it misbehaving?
    • The dnsmasq instance that is supposed to listen on port 54 is not running:
root@nml-pa2200:~# netstat -tulpen | grep :54
udp        0      0 :::546                  :::*                                7382/odhcp6c
udp        0      0 :::546                  :::*                                1971/odhcp6c
root@nml-pa2200:~# ps | grep dnsmasq
  886 root      2396 S    {dnsmasq} /sbin/ujail -t 5 -n dnsmasq -u -l -r /bin/ubus -r /etc/TZ -r /etc/dnsmasq.conf -r /etc/ethers -r  
  939 dnsmasq   1280 S    /usr/sbin/dnsmasq -C /var/etc/dnsmasq.conf.cfg01411c -k -x /var/run/dnsmasq/dnsmasq.cfg01411c.pid
28289 root      1144 S    grep dnsmasq
root@nml-pa2200:~#
  • When did the problem first start showing up?
  • What were you doing when you first noticed the problem?
    • Using the Freifunk network
  • On which devices (vendor, model and revision) is it misbehaving?
  • Does the issue appear on multiple devices or targets?
    • untested
  • Workarounds?
    • the device works fine again after rebooting it, netstat then looks ok:
root@nml-pa2200:~# netstat -tulpen | grep :54
tcp        0      0 0.0.0.0:54              0.0.0.0:*               LISTEN      2448/dnsmasq
tcp        0      0 :::54                   :::*                    LISTEN      2448/dnsmasq
udp        0      0 0.0.0.0:54              0.0.0.0:*                           2448/dnsmasq
udp        0      0 :::546                  :::*                                2349/odhcp6c
udp        0      0 :::546                  :::*                                2010/odhcp6c
udp        0      0 :::54                   :::*                                2448/dnsmasq
root@nml-pa2200:~# ps | grep dnsmasq
  886 root      2396 S    {dnsmasq} /sbin/ujail -t 5 -n dnsmasq -u -l -r /bin/ubus -r /etc/TZ -r /etc/dnsmasq.conf -r /etc/ethers -r 
  932 dnsmasq   1268 S    /usr/sbin/dnsmasq -C /var/etc/dnsmasq.conf.cfg01411c -k -x /var/run/dnsmasq/dnsmasq.cfg01411c.pid
 2544 root      1268 S    /usr/sbin/dnsmasq -x /var/run/gluon-wan-dnsmasq.pid -u root -i lo -p 54 -h -r /var/gluon/wan-dnsmasq/resolv
 4538 root      1144 S    grep dnsmasq
root@nml-pa2200:~# 

What is the expected behaviour?

  • How do you think it should work instead?
    • after a link went down and up again then not only routing over WAN but also DNS over WAN should work again
  • Did it work like that before?
    • unknown

Gluon Version:

  • v2022.1

Site Configuration:

root@nml-pa2200:~# cat /lib/gluon/release 
0.16.3~exp20220914
root@nml-pa2200:~#

Custom patches:

  • none

update1:

  • added "ps" output

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions