I am not sure where to start tracking down this problem. After a while (different each time) the server stops responding. I am unable to ping it or shell into the box to see what is up. I hard reset the box and everything works OK, then a few hours or a few days later it stops responding again! Which logs should I look at, or how should I track this down to fix it? Please advise. Thanks in advance. The setup is the Ubuntu 10.10 "Perfect Server" with ISPConfig, installed about one month ago.
I suggest you install Munin. It might help you track down the problem. It sounds like a problem with your memory or swap.
Did you check /var/log/messages once the server was reachable again? Munin is a good way to get a broader look at trends in specific values. You could also check whether auditd can help you with this issue: http://www.cyberciti.biz/tips/linux-audit-files-to-see-who-made-changes-to-a-file.html
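To make the log check above concrete, here is a rough sketch that scans the usual log files for messages that often show up before a hang. The function name `scan_log` and the pattern list are my own assumptions, not anything specific to this server, so adjust them to what you actually see.

```shell
#!/bin/sh
# scan_log FILE: print (with line numbers) lines that often precede a hang.
# The patterns are illustrative guesses, not a complete list.
scan_log() {
    grep -inE 'oom-killer|out of memory|i/o error|ata[0-9]+.*(error|failed)|kernel panic|blocked for more than' "$1"
}

# Check whichever of the common log files exist and are readable
# (reading them may need root on some systems).
for f in /var/log/messages /var/log/syslog /var/log/kern.log; do
    if [ -r "$f" ]; then
        echo "== $f =="
        scan_log "$f" | tail -n 20
    fi
done
```

Run it right after the hard reset and compare the timestamps of any hits with the time the box stopped answering. If nothing was written at all before the hang, that in itself points toward disk or hardware trouble.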
Server stopped again (log from when it died to after the restart). Munin didn't really show anything, just that it went to sleep and stopped too. Could this be a power-saving issue? How do I turn that off? Thanks. PS: it doesn't appear to be using swap at all.
Physical server, not virtual. Here is a link to the Munin report: http://www.plasmapages.net/rpcserv1.rpc/web/monitoring/rpc/rpcserv1.rpc/index.html I am not really sure what I am seeing in it. Could you take a look? Thanks.
To me it looks as if this is related to your hard drive. You have spikes in the Disk Latency (= long I/O wait) and IO Service Time graphs, which (I guess) make your server hang.
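If you want a quick cross-check of the disk latency idea outside Munin, something like the sketch below might help. It reads the kernel counters in /proc/diskstats and prints the average milliseconds per completed read and write for each device. The function name `avg_io_latency` is made up for this example, and the numbers are averages since boot, so they will smooth over the short spikes Munin shows.

```shell
#!/bin/sh
# avg_io_latency [FILE]: average ms per completed read/write, per device.
# /proc/diskstats fields: $3 device name, $4 reads completed, $7 ms spent reading,
# $8 writes completed, $11 ms spent writing.
avg_io_latency() {
    awk '$4 + $8 > 0 {
        r = ($4 > 0) ? $7 / $4 : 0
        w = ($8 > 0) ? $11 / $8 : 0
        printf "%s read_ms=%.1f write_ms=%.1f\n", $3, r, w
    }' "${1:-/proc/diskstats}"
}

# Only run against the live counters if the file exists (Linux only).
if [ -r /proc/diskstats ]; then
    avg_io_latency
fi
```

A device whose averages are far higher than the others would be worth a closer look, for example with `smartctl -a` from the smartmontools package if you have it installed.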
New drive installed, swap still not working. OK, fingers crossed that the new drive does it for me. How do I get the swap file to work again? Thanks.
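As a first step for the swap question, it may help to confirm whether the kernel has any swap area active at all. This is only a diagnostic sketch; the function name `active_swap_count` is made up here, and the guess that /etc/fstab still points at the old drive is an assumption, not something confirmed in this thread.

```shell
#!/bin/sh
# active_swap_count [FILE]: number of active swap areas.
# /proc/swaps has one header line followed by one line per active area.
active_swap_count() {
    awk 'NR > 1 { n++ } END { print n + 0 }' "${1:-/proc/swaps}"
}

if [ -r /proc/swaps ]; then
    echo "active swap areas: $(active_swap_count)"
fi
```

If this prints 0, compare the swap line in /etc/fstab with the output of `blkid`, since a replaced drive usually has different UUIDs. Once the entry matches an existing swap partition or file, `sudo swapon -a` should activate everything listed in /etc/fstab.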