All new Registrations are manually reviewed and approved, so a short delay after registration may occur before your account becomes active.
Online.net - Server hangs after a few hours of running
Hi,
I have a server at Online.net and I am running proxmox on it. The problem is, after a few hours of running, it hangs. If I reboot it, it will work well but after a few hours, it happens again.
I have tried to reinstalled the server, check all the settings but cannot find anything that may cause the issue. I use the same setting for all of my 20 servers but only this server has this problem.
When I login to KVM over IP, I found a lot of this message on Event Log: Configuration Error (DMI) - Assertion.
I have also tried to open a support ticket but their supporter insists saying that my server is online. Actually, it's online but it hangs and ping down.
Anyone have any ideas with the error I got? Thank you in advance.
Comments
Have you tried running a memory test to make sure the ram isn't bad?
Wait until it hangs again and then tell them to look at the machine, maybe they will then take it serious, i find it strange that it takes a few hours, have you tried monitoring the temp, what diagnostics have you ran, check the HDD, check the ram, maybe stress the cpu for a bit see if any of this will result in a hang.
seems like something wrong with your hardware ...
install kdump and run the output it gives you through mcelog after the next crash, you will have your answer.
Thank you for all the suggestion. Actually, I have waited until it hangs and create a support ticket but they still said the server was online!
I have checked the SSD, and monitor the temperature but it seems all is normal (the CPU is about 68 degree celcius).
I will check the RAM ,install kdump and get back for reports. Thank you all.
Idle or under load?
had the same before on one server then i leave them ...
Either a motherboard / processor issue, DMI is the link between the Northbridge and Southbridge of a motherboard.
BIOS could need updating or the board could be bust.
PH
Notes: https://www.google.co.uk/search?q=Direct+Media+Interface&ie=utf-8&oe=utf-8&client=firefox-b&gfe_rd=cr&ei=YkuKWPPvFOLW8geMore4Bw
68C is a little high on idle but each CPU generation varies in temp.
Hi,
I have tested the RAM but no error found.
68C is the temperature on full load.
I have also installed kdump but it does not show any of kernel crash info. I think the reason is my server does not crash, it hangs so kdump cannot do anything here.
Another information, this node is always on full load (load = 1.5 per core), do you think 1.5 is too high? Actually, I have some servers with the same type and load = 2 and it still runs well.
Any further suggestion would be much appreciated. Thank you.
It's a dedi, you should be able to run full load 24/7, especially if it's a Xeon server.
Am running conversion (ffmpeg) 24/7 at 10.5 load on a 4c/8t server. Still works fine, also online.net deal.
Any solution for this problem?
I have the same problem.
How do you know that it hung, rather than just becoming inaccessible by network temporarily? Have you tried accessing from another machine in the same DC? Maybe their IPMI system?
After some tickets with a lot of evidence to convince them, they changed my server.
Thanks man.