Pre-release of Version 2.1.8

Bjørn Mork bjorn at mork.no
Tue Dec 8 10:10:11 CET 2009


Alan DeKok <aland at deployingradius.com> writes:
> Bjørn Mork wrote:
>> Bjørn Mork <bjorn at mork.no> writes:
>>> The server had been running for 45 hours when this happened.  I haven't
>>> got the faintest idea where to start looking for the bug.
>> 
>> I have to correct myself after looking over the logs:  The server
>> stopped answering authentication requsts, but it continued to answer
>> accounting requests.
>
>   Found, fixed, pushed to "v2.1.x" on github.

Yes, now it continues to answer both authentication and accounting
requests, but it still stops proxying after a while (where "a while"
might be something like 20+ hours and 1+ million auth requests - I have
no indication that these values are fixed).  

The symptoms are that all home servers are marked dead/zombie.  Typical
obfuscated home_server list in this state:

server(bjorn) ~ 71$ radmin -e "show home_server list"
192.168.8.120   1812    auth    alive   0
192.168.8.120   1813    acct    alive   0
192.168.8.246   1812    auth    alive   0
192.168.8.246   1813    acct    alive   0
192.168.8.132   1645    auth    dead    0
192.168.8.132   1646    acct    alive   0
192.168.8.132   1645    auth    dead    3
192.168.8.132   1646    acct    alive   0
192.168.8.14    1812    auth    alive   0
192.168.8.14    1813    acct    zombie  0
192.168.8.10    1812    auth    alive   0
192.168.8.10    1813    acct    zombie  0
192.168.8.210   1812    auth    alive   0
192.168.8.210   1813    acct    alive   0
192.168.8.50    1812    auth    zombie  0
192.168.8.50    1813    acct    alive   0
192.168.8.20    1812    auth    zombie  0
192.168.8.20    1813    acct    alive   0
192.168.8.40    1812    auth    zombie  0
192.168.8.40    1813    acct    alive   0
192.168.8.44    1812    auth    alive   0
192.168.8.44    1813    acct    alive   0
192.168.8.216   1812    auth    zombie  0
192.168.8.216   1813    acct    zombie  0
192.168.8.218   1812    auth    alive   0
192.168.8.218   1813    acct    zombie  0
192.168.8.1     1645    auth    zombie  0
192.168.8.1     1646    acct    zombie  4
192.168.8.137   1645    auth    alive   1
192.168.8.137   1646    acct    dead    0
192.168.8.150   1812    auth    zombie  0
192.168.8.150   1813    acct    alive   0
192.168.8.158   1812    auth    zombie  0
192.168.8.158   1813    acct    zombie  0
192.168.8.222   1812    auth    zombie  0
192.168.8.222   1813    acct    zombie  0
192.168.8.6     1812    auth    zombie  6
192.168.8.6     1813    acct    alive   0
192.168.8.27    1812    auth    zombie  2
192.168.8.27    1813    acct    zombie  0
192.168.8.158   1812    auth    zombie  0
192.168.8.158   1813    acct    zombie  0
192.168.8.4     1812    auth    alive   0
192.168.8.4     1813    acct    zombie  0
192.168.9.6     1812    auth    zombie  4
192.168.9.6     1813    acct    zombie  0



There are a number of servers marked "alive", but these are all servers
which have been revived after the fixed period.  When used, they will be
marked dead/zombie again.

I'm running the v2.1.x branch from github, with
cbbcb5232261c5b28093c3a97d6da2a16c9e06af being the last commit.


Now, I wish I could say than I was sure that some other version did not
have the same problem, but I'm not.  I'm afraid I haven't been running
any of them continuously for a long enough period to be completely sure.

But I will test that now, starting with the stable branch from
git.freeradius.org, commit d7b4f003477644978f3fefa694305dce9b5dc8bf,
which was the last point where things seemed to work



Bjørn




More information about the Freeradius-Users mailing list