[clue-tech] nfs frustrations

Angelo Bertolli angelo.bertolli at gmail.com
Wed Jun 3 17:24:39 MDT 2009


Nate Duehr wrote:
> On Wed, 03 Jun 2009 17:22 -0400, "Angelo Bertolli"
> <angelo.bertolli at gmail.com> wrote:
>   
>> Ok so none of the options on an NFS mount do what I want them to do.  
>> Maybe automount is the only solution, but for regular nfs...
>>     
>
> autofs will mount and unmount things as they're "used", but it adds some
> wait time when you first use the remote filesystem.  I also forget how
> to tell it how long to wait before unmounting... it's been years since I
> had to deal with developers' machines that used it to auto-mount the
> development playground server... 
>
> But if the problem really is network connectivity going away, it won't
> help... the NFS mount will still be "hung" by bad network connectivity. 
> NFS is from another era where it assumes networks are perfect.  When
> they're not, NFS becomes highly annoying.
>   

That's exactly what I was thinking when I was going through this:  it 
runs based on the assumption that the connectivity never goes away.

>> 2) There doesn't seem to be any way to tell NFS to fail within 1 
>> minute.  I know the maximum retrans timeout is supposed to be 60 
>> seconds, but after tweaking it
>>
>> When a mount is unavailable (I'm using ls to test) ...
>>     - soft/hard doesn't seem to make any difference (I'm using ro,noexec)
>>     - retrans, timeo, retry don't seem to make any difference no matter 
>> what settings I use
>>     
>
> timeo should work, but it requires that there actually be file access
> going on... if the mount is "quiet", it has no idea that there's
> something to "timeout", so to speak.  If you already have network issues
> going on, using soft would make your life a living hell.  I highly
> recommend against it, unless you enjoy I/O errors in your application
> level code.  (GRIN!)
>   

I only played with timeo a little bit.  But the default is already 
0.7 seconds, and the behavior has always been at least 3 minutes, which 
doesn't match the 60-second cap in the man page, unless it's climbing 
up to 60 seconds on each of the 3 retries (the retrans count).  But 
that's not what the man page says.  With the defaults, after the first 
transmission problem it should wait 0.7 seconds, then 1.4 seconds, then 
2.8 seconds...  unless of course it ONLY registers a minor timeout when 
TCP times out, which would make timeo almost pointless unless you wanted 
really long wait times.
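For what it's worth, the doubling behavior can be worked out directly.  A 
minimal sketch of the arithmetic, assuming the classic UDP behavior where 
the minor timeout doubles on each retransmission and timeo is given in 
tenths of a second:

```shell
# Total wait before a "major timeout" with the defaults timeo=7
# (0.7 s) and retrans=3, assuming the minor timeout doubles on each
# retransmission as nfs(5) describes for UDP mounts.
timeo_ds=7       # timeo is expressed in tenths of a second
retrans=3
total=0
t=$timeo_ds
i=0
while [ "$i" -lt "$retrans" ]; do
  total=$((total + t))   # wait this long before the next retry
  t=$((t * 2))           # back off: 0.7 -> 1.4 -> 2.8 seconds
  i=$((i + 1))
done
echo "major timeout after $((total / 10)).$((total % 10)) seconds"
```

That comes out to about 4.9 seconds, nowhere near 3 minutes, which hints 
the mounts may be using TCP, where retransmission is handled by the 
transport and timeo behaves differently.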

I'm ok with not caring about the mount if it's not being used.  It's 
just that when something goes to use it, I want it to return with some 
feedback within 60 seconds so I can do something like... write a script 
that tests all the mounts.  We have over 400 NFS mounts on one machine, 
so I really can't wait 3 minutes for each one.  What I've decided to do 
for the time being is to test only the first mount per host in fstab, 
which gets me down to under 60 mounts.
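A script along those lines might look something like this (names and the 
60-second limit are just my assumptions).  The trick is to run the ls in 
the background so the script itself can never get stuck in disk-wait on a 
wedged mount; it just polls and moves on:

```shell
#!/bin/sh
# Sketch of a per-mount liveness probe.  A fast error still counts as
# "answered" -- only a hang is reported as dead, which matches the goal
# of getting *some* feedback within the time limit.
LIMIT=60
check_mount() {
  ls "$1" >/dev/null 2>&1 &   # probe in the background, never blocking us
  pid=$!
  i=0
  while kill -0 "$pid" 2>/dev/null; do   # still running (or not yet reaped)?
    i=$((i + 1))
    if [ "$i" -gt "$LIMIT" ]; then
      echo "$1 DEAD (no answer within ${LIMIT}s)"
      return 1
    fi
    sleep 1
  done
  echo "$1 ok"
}
# e.g. probe every NFS mount point listed in /etc/fstab:
# awk '$3 == "nfs" {print $2}' /etc/fstab | while read -r m; do check_mount "$m"; done
```

The downside is that each hung probe leaves an ls stuck in disk-wait 
behind it, but at least the script finishes.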


>>     - I've tried at least 10 combinations of the above, and ls returns 
>> with an IO error within 3 - 5 minutes every time.
>>     
>
> I've also farted around with it in the past.  There were a number of
> implementation bugs in Linux NFS stacks over the years.  Those weren't
> very helpful at the time.  Maybe they've cleaned those up.  The
> strongest NFS implementation has always been the one in Solaris, but
> like many things Solaris, it traded robustness for lack of features...
> and you still couldn't really do anything about "hung" NFS mounts very
> well.
>
>   
>> Oh well.  We're using nfs3.  Should I expect different behavior from nfs4
>>     
>
> Doubt it.  NFSv4 really only dealt with authentication issues, and is
> kind of a "too little, too late" approach to fixing things with NFS.  
>
> I think other network filesystems, even the venerable and possibly hated
> CIFS ("Windows shares") handle network outages better.  But there's a
> whole new world of problems there... filenames, permissions,
> ownership... Samba can also drive someone mad given the wrong set of
> requirements for group access or other weird requests.
>
> I guess the only GOOD thing about NFS is that it certainly shows you if
> your network or servers aren't up to snuff.  If you can fix the
> root-cause connectivity problems, it's plenty fast and maps better to
> unix permissions and other things "Linuxy", but eventually NFS does
> drive one mad when network or server issues are happening.
>
> I've always wanted to try out OpenAFS, but I can't think of a good need
> I have for it right now...

Well actually, the network is pretty good, and machines rarely just 
crash.  Our biggest problem is storage hardware:  we have some 
problematic JBODs that were built poorly and are underpowered.  Once in 
a while they end up dropping all their devices.  Oops!  Then some user 
comes along and does an ls on one of these exports via ftp, the system 
hangs, so they try again, and so on.  Maybe the simple addition of intr 
will help with this problem, because it lets a hung I/O request be 
interrupted instead of blocking forever.
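If intr turns out to help, the fstab line might end up looking something 
like this (hostname and export path are invented, and soft comes with 
the I/O-error caveat Nate mentioned):

```
# /etc/fstab sketch -- server name and paths are made up
# soft: return EIO once the retries are exhausted, instead of retrying forever
# intr: allow a signal (e.g. Ctrl-C) to break a process out of a hung call
jbod01:/export/data  /mnt/data  nfs  ro,noexec,soft,intr,timeo=100,retrans=3  0 0
```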

I admit, I've been testing this by shutting off the NFS server, not by 
making the devices disappear.  But I figured that was a good test.

So far some kind of automounter sounds like the only possibility (if 
there's anything we can do besides just getting rid of these JBODs), but 
I've been told that the automounters aren't really able to unmount 
filesystems that are having problems either (as you mentioned above).
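For the record, the autofs knob for how long to wait before unmounting is 
the map timeout.  A sketch, with map names and paths invented:

```
# /etc/auto.master sketch -- unmount mounts that have been idle for 60 s
/mnt/jbods  /etc/auto.jbods  --timeout=60

# /etc/auto.jbods -- one key per export, mounted on first access
data01  -ro,soft,intr  jbod01:/export/data01
```

Though as discussed, autofs only unmounts idle mounts; it can't tear down 
one that's wedged in the middle of a request.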

Angelo
