Topic: Debian is crashing! |
|
---|---|
Author | Thread |
Paranoid (IV) Inmate From: f(x) |
posted 01-16-2008 20:53
I'm running Debian on a machine and recently it has been crashing. I was never around to see it actually crash, but when I switch my KVM to it, my monitor gets no signal. I did manage to take a picture of one crash that happened about 10 minutes after bootup; I did not see it happen, though, I just switched my KVM and saw the screen like this. After this last crash, I looked at /var/log/messages to see what was there, but I don't know what to look for. I know it crashed sometime between 10:58 and 11:28, because I have an IRC bot that runs on the machine and it got disconnected at 11:28, and the log stopped at 10:58. This is what I see in /var/log/messages before the crash: quote:
|
Maniac (V) Inmate From: there...no..there..... |
posted 01-17-2008 01:25
well...after my wife saw the picture even she said..."Fatal is never a good sign, is it?" code: grep -E "signal|restart" /var/log/messages
|
Paranoid (IV) Inmate From: Florida |
posted 01-17-2008 02:27
I won't even put gconf on a desktop, let alone a server. It keeps a stranglehold on conf files on purpose? I'd like to smack the genius that thought that was a good idea. |
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-17-2008 09:57
yeah, there's nothing interesting in that log file. |
Paranoid (IV) Inmate From: f(x) |
posted 01-17-2008 10:51
Ok, it crashed again... rebooted at 4:05. |
Paranoid (IV) Inmate From: f(x) |
posted 01-17-2008 10:59
I couldn't disable linkwords when editing... It needs fixed apparently. code:
Jan 15 20:05:35 localhost syslogd 1.4.1#18: restart.
Jan 15 20:17:41 localhost exiting on signal 15
Jan 15 20:18:35 localhost syslogd 1.4.1#18: restart.
Jan 15 20:18:55 localhost papd[2719]: restart (2.0.3)
Jan 16 07:36:25 localhost syslogd 1.4.1#18: restart.
Jan 16 14:07:43 localhost syslogd 1.4.1#18: restart.
Jan 16 14:08:05 localhost papd[2761]: restart (2.0.3)
Jan 17 04:05:26 localhost syslogd 1.4.1#18: restart.
Jan 17 04:05:48 localhost papd[2749]: restart (2.0.3)
Jan 17 04:12:00 localhost syslogd 1.4.1#18: restart.
code:
Jan 16 14:17:01 localhost /USR/SBIN/CRON[3527]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 16 14:27:44 localhost -- MARK --
Jan 16 14:33:01 localhost /USR/SBIN/CRON[3918]: (nobody) CMD ([ -x /usr/share/sa-exim/greylistclean ] && /usr/share/sa-exim/greylistclean)
Jan 16 14:33:02 localhost sa-exim[3919]: Removed 0 of 0 greylist tuplets in 0 seconds
Jan 16 14:33:02 localhost sa-exim[3919]: Removed 0 of 0 greylist directories in 0 seconds
Jan 16 14:39:01 localhost /USR/SBIN/CRON[4068]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 15:02:01 localhost /USR/SBIN/CRON[4631]: (root) CMD (if [ -x /usr/sbin/pg_maintenance ]; then /usr/sbin/pg_maintenance --analyze >/dev/null; fi)
Jan 16 15:09:01 localhost /USR/SBIN/CRON[4807]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 15:17:01 localhost /USR/SBIN/CRON[5008]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 16 15:27:45 localhost -- MARK --
Jan 16 15:33:01 localhost /USR/SBIN/CRON[5399]: (nobody) CMD ([ -x /usr/share/sa-exim/greylistclean ] && /usr/share/sa-exim/greylistclean)
Jan 16 15:33:02 localhost sa-exim[5400]: Removed 0 of 0 greylist tuplets in 0 seconds
Jan 16 15:33:02 localhost sa-exim[5400]: Removed 0 of 0 greylist directories in 0 seconds
Jan 16 15:39:01 localhost /USR/SBIN/CRON[5548]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 16:07:46 localhost -- MARK --
Jan 16 16:09:01 localhost /USR/SBIN/CRON[6284]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 16:17:01 localhost /USR/SBIN/CRON[6486]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 16 16:27:46 localhost -- MARK --
Jan 16 16:33:01 localhost /USR/SBIN/CRON[6877]: (nobody) CMD ([ -x /usr/share/sa-exim/greylistclean ] && /usr/share/sa-exim/greylistclean)
Jan 16 16:33:02 localhost sa-exim[6878]: Removed 0 of 0 greylist tuplets in 0 seconds
Jan 16 16:33:02 localhost sa-exim[6878]: Removed 0 of 0 greylist directories in 0 seconds
Jan 16 16:39:01 localhost /USR/SBIN/CRON[7026]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 17:07:46 localhost -- MARK --
Jan 16 17:09:01 localhost /USR/SBIN/CRON[7762]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 17:17:01 localhost /USR/SBIN/CRON[7964]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 16 17:27:47 localhost -- MARK --
Jan 16 17:33:01 localhost /USR/SBIN/CRON[8355]: (nobody) CMD ([ -x /usr/share/sa-exim/greylistclean ] && /usr/share/sa-exim/greylistclean)
Jan 16 17:33:02 localhost sa-exim[8356]: Removed 0 of 0 greylist tuplets in 0 seconds
Jan 16 17:33:02 localhost sa-exim[8356]: Removed 0 of 0 greylist directories in 0 seconds
Jan 16 17:39:01 localhost /USR/SBIN/CRON[8504]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 17:42:35 localhost proftpd[8597]: connect from 201.221.146.73 (201.221.146.73)
Jan 16 17:43:03 localhost proftpd[8609]: connect from 201.221.146.73 (201.221.146.73)
Jan 16 17:43:22 localhost proftpd[8618]: connect from 201.221.146.73 (201.221.146.73)
Jan 16 17:43:44 localhost proftpd[8629]: connect from 201.221.146.73 (201.221.146.73)
Jan 16 18:07:47 localhost -- MARK --
Jan 16 18:09:01 localhost /USR/SBIN/CRON[9243]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 18:17:01 localhost /USR/SBIN/CRON[9444]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 16 18:27:47 localhost -- MARK --
Jan 16 18:33:01 localhost /USR/SBIN/CRON[9836]: (nobody) CMD ([ -x /usr/share/sa-exim/greylistclean ] && /usr/share/sa-exim/greylistclean)
Jan 16 18:33:01 localhost sa-exim[9837]: Removed 0 of 0 greylist tuplets in 0 seconds
Jan 16 18:33:01 localhost sa-exim[9837]: Removed 0 of 0 greylist directories in 0 seconds
Jan 16 18:39:01 localhost /USR/SBIN/CRON[9985]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 19:07:48 localhost -- MARK --
Jan 16 19:09:01 localhost /USR/SBIN/CRON[10721]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 16 19:17:01 localhost /USR/SBIN/CRON[10922]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 16 19:27:48 localhost -- MARK --
Jan 16 19:33:01 localhost /USR/SBIN/CRON[11314]: (nobody) CMD ([ -x /usr/share/sa-exim/greylistclean ] && /usr/share/sa-exim/greylistclean)
Jan 16 19:33:01 localhost sa-exim[11315]: Removed 0 of 0 greylist tuplets in 0 seconds
Jan 16 19:33:01 localhost sa-exim[11315]: Removed 0 of 0 greylist directories in 0 seconds
Jan 16 19:39:01 localhost /USR/SBIN/CRON[11463]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
|
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-17-2008 11:07
I'd say your syslog has 'rolled over' - it's all older than when the crash happened. |
Paranoid (IV) Inmate From: f(x) |
posted 01-17-2008 11:53
How do you know when the crash happened? I don't. The last I knew it was running was around 17:00; I came back around 4:00. The syslog has about an 8.5-hour gap between Jan 16 19:39:01 and Jan 17 04:05:26. Look: code:
...
Jan 16 19:33:01 localhost sa-exim[11315]: Removed 0 of 0 greylist directories in 0 seconds
Jan 16 19:39:01 localhost /USR/SBIN/CRON[11463]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jan 17 04:05:26 localhost syslogd 1.4.1#18: restart.
Jan 17 04:05:26 localhost kernel: klogd 1.4.1#18, log source = /proc/kmsg started.
... |
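For anyone who wants to find a gap like this without scrolling through the log by eye, here is a rough Python sketch. It is only an illustration: the function name `largest_gap` and the `year` parameter are made up for this example, and it assumes classic syslog lines that begin with "Mon DD HH:MM:SS". The year is not logged, so it has to be supplied, and a log that spans New Year would need extra handling.

```python
from datetime import datetime


def largest_gap(lines, year=2008):
    """Return (gap, before, after) for the longest pause between consecutive
    timestamped lines, or None if fewer than two timestamps were found.

    Hypothetical helper for illustration; assumes all lines fall in `year`.
    """
    stamps = []
    for line in lines:
        # The first three whitespace-separated fields are month, day, time.
        fields = line.split(None, 3)[:3]
        try:
            stamps.append(
                datetime.strptime(f"{year} " + " ".join(fields), "%Y %b %d %H:%M:%S")
            )
        except ValueError:
            continue  # not a timestamped line (e.g. "..."), skip it
    if len(stamps) < 2:
        return None
    # Compare each timestamp with the one that follows it and keep the widest pair.
    return max(
        ((after - before, before, after) for before, after in zip(stamps, stamps[1:])),
        key=lambda item: item[0],
    )


# The four lines quoted in the post above.
sample = [
    "Jan 16 19:33:01 localhost sa-exim[11315]: Removed 0 of 0 greylist directories in 0 seconds",
    "Jan 16 19:39:01 localhost /USR/SBIN/CRON[11463]: (root) CMD ( [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)",
    "Jan 17 04:05:26 localhost syslogd 1.4.1#18: restart.",
    "Jan 17 04:05:26 localhost kernel: klogd 1.4.1#18, log source = /proc/kmsg started.",
]

gap, before, after = largest_gap(sample)
print(f"{gap} between {before} and {after}")
# prints: 8:26:25 between 2008-01-16 19:39:01 and 2008-01-17 04:05:26
```

To run it against the real log, pass an open file instead of the sample list, e.g. `largest_gap(open("/var/log/messages"))`. The gap only brackets the crash: the machine died at some point after the last line written before it.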
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-17-2008 12:21
the crash happened just before the gap, obviously. |
Paranoid (IV) Inmate From: f(x) |
posted 01-17-2008 12:35
There is no new hardware from when I first installed Debian. I would guess some update screwed it up. I have been doing the same things I have been doing the past few months. I use it for backups (FTP from another server), my IRC bot (using sirc) as mentioned in my first post, and rendering (Bryce 5.5 through Wine, high quality 3300x2550 images). I update whenever it says there are new updates. |
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-17-2008 14:04
but you haven't done an upgrade that required you to reboot the machine (i.e. a new kernel)? |
Paranoid (IV) Inmate From: f(x) |
posted 01-18-2008 04:13 |
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-18-2008 08:46
Well, if it's been running smoothly the last few months, and now starts failing with kernel panics, |
Paranoid (IV) Inmate From: f(x) |
posted 01-18-2008 14:24
Yes, all the fans are spinning. |
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-18-2008 15:32
are there actually multiple processors in that machine? |
Paranoid (IV) Inmate From: f(x) |
posted 01-18-2008 17:04 |
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-18-2008 17:13
first off, you should run a memory tester. |
Maniac (V) Inmate From: there...no..there..... |
posted 01-18-2008 17:25
then you can probably look in /var/log/dmesg to see what went on during boot. |
Paranoid (IV) Inmate From: f(x) |
posted 01-19-2008 15:53
TP: Tested, memory is good. Can you point me somewhere that clearly tells how to replace the kernel? I never did it before. |
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-19-2008 18:45
This is Debian. |
Paranoid (IV) Inmate From: f(x) |
posted 01-19-2008 20:23
Bleh... Stupid question? |
Maniac (V) Mad Scientist with Finglongers From: Germany |
posted 01-20-2008 13:40
now you boot an Ubuntu CD. |
Paranoid (IV) Inmate From: f(x) |
posted 01-23-2008 03:49
I have reason to believe that my CD drive may have a bad connection somewhere, but I don't feel like confirming this right now. (I'm sleepy...) |
Paranoid (IV) Inmate From: f(x) |
posted 01-26-2008 23:46
I guess I should close this up... |
Maniac (V) Inmate From: there...no..there..... |
posted 01-27-2008 01:57
fsck checks and repairs a Linux filesystem. So...you might have just had some crud in there clogging up the works (that's pretty technical speak right there, isn't it?) |