[NTLUG:Discuss] "You Don't Exist: Go Away!"

Christopher Browne cbbrowne at hex.net
Fri Nov 3 00:08:53 CST 2000


On Thu, 02 Nov 2000 10:25:05 CST, the world broke into rejoicing as
MadHat <madhat at unspecific.com>  said:
> MadHat wrote:
> > 
> > Christopher Browne wrote:
> > >
> > > I am having an unfortunate situation where a machine periodically gets
> > > somewhat 'wedged up' such that:
> > >   a) Port services that check for user IDs die;
> > >   b) Permissions on files apparently "disappear";
> > >   c) Pretty much anything that checks IDs against /etc/passwd gets
> > >      hosed.
> > >
> > > This does _not_ appear to be the result of a hack; it seems moderately
> > > "time based," probably relating to some resource filling up thereby
> > > making {utmp|PAM} throw up.
> > >
> > > Other interesting facts:
> > > - It seems to happen _around_ once a day.  But not greatly predictable.
> > >    Oct 27, 03:38
> > >    Oct 27, 21:32
> > >    Oct 30, 08:02
> > >    Oct 31, about 1:56am
> > >    Nov 1, between 9:00 and 9:02 pm.
> > >
> > > - I don't need to reboot to get everything to "reset;" if I drop to
> > >   runlevel 1 via "init 1," and then head back to "init 3", this seems
> > >   to suffice to clear things up.
> > >
> > > - Debian Unstable Pretty Much Up To Date.
> > >   Linux knuth 2.2.14 #5 Sat May 6 07:29:45 CDT 2000 i586 unknown
> > >
> > > The two things I've seen looking on Google that match the symptoms are:
> > >
> > > a) "Oops.  You deleted /etc/passwd."
> > >
> > >    Not the case.
> > >
> > > b) Something vague involving utmp being "somehow messed up."
> > >
> > > Anyone run into this sort of thing before?
> > 
> > kind of...  My problem was bad nodes on the drive, but it took a fsck to
> > fix.  The drive was going bad and was losing data on the section of the
> > disk that held the /etc.
> > 
> > Because an 'init 1' & 'init 3' seem to be the p[roblem, that does point
> > more towards the software not hardware...  what Kernel you running?
> 
> This should have read because the init 1 and init 3 seem to _FIX_ the
> problem, that doesn't pont towards hardware, but more towards software.
> 
> Need more caffeine.

Need more blood with your caffeine level?  I follow that...

It certainly seems to be a software issue, and the fact that changing 
runlevels "fixes" it seems suggestive that the problem is not with the kernel. 
 (2.2.14, as mentioned up there somewhere...)
--
cbbrowne at ntlug.org - <http://www.hex.net/~cbbrowne/lsf.html>
"War is  a matter of  vital importance to  the State; the  province of
life or death;  the road to survival or ruin. It  is mandatory that it
be thoroughly studied."  -- Sun Tzu





More information about the Discuss mailing list