Message boards :
Number crunching :
Server Outages
Message board moderation
Author | Message |
---|---|
![]() ![]() Send message Joined: 8 Oct 07 Posts: 289 Credit: 3,690,838 RAC: 0 ![]() ![]() |
I find it interesting that under 3 days of no load the server hasn't crashed....under load it was crashing at least once a day. |
![]() ![]() Send message Joined: 8 Oct 07 Posts: 289 Credit: 3,690,838 RAC: 0 ![]() ![]() |
Well there goes that theory ...server crashed this morning unless it was taken down on purpose.... |
zombie67 [MM] Send message Joined: 29 Aug 07 Posts: 115 Credit: 500,475,682 RAC: 60 ![]() ![]() |
|
![]() ![]() Send message Joined: 8 Oct 07 Posts: 289 Credit: 3,690,838 RAC: 0 ![]() ![]() |
Server crashes? I though they were due to power outages? When a computer reboots itself it could be both.....Travis never said all the computers in their computer room were having outages....it sounded like just this one.He also used the word crashed in the front page news ;) |
Emanuel Send message Joined: 18 Nov 07 Posts: 280 Credit: 2,442,757 RAC: 0 ![]() ![]() |
The server could even be crashing due to an unstable power supply :) |
![]() ![]() Send message Joined: 12 Nov 07 Posts: 31 Credit: 123,621 RAC: 0 ![]() ![]() |
What ever it is, it's obviously a hard ware error. If you turn off the "auto reboot", you should get a BSOD that gives you some more information what it could be that causing it. CPU error, not very likely, unless it's due to over heating, all BOINC projects create increased heating to the CPU, which could explain why it crashes more often when the project is on. How ever, over heating is a quite unusual reason for a crash, unless you have over clocked the CPU. The most usual error that causes the most vague and various symptoms is actually faulty graphic cards. The next most usual error is unfortunately the one that is the hardest to fix and that is mother board failures... ![]() |
![]() ![]() Send message Joined: 8 Oct 07 Posts: 289 Credit: 3,690,838 RAC: 0 ![]() ![]() |
What ever it is, it's obviously a hard ware error. Good info Crystallize.....however....I would "hope" Lab Staff knows all this :) |
![]() ![]() Send message Joined: 9 Sep 07 Posts: 22 Credit: 320,035 RAC: 0 ![]() ![]() |
Maybe if it ran Windows... But BOINC software runs under Linux. Kathryn :o) The BOINC FAQ Service The Unofficial BOINC Wiki The Trac System More BOINC information than you can shake a stick of RAM at. |
![]() ![]() Send message Joined: 12 Nov 07 Posts: 31 Credit: 123,621 RAC: 0 ![]() ![]() |
Perhaps, but since it takes so long and they don't seem to have a clue. I'm usually solving a hard ware problem in less than two days, how ever complicated it may seem. So I thought I just give some pointers... :o)
But there must be some debugging tools also for Linux ? right ? ![]() |
![]() Volunteer moderator Project administrator Project developer Project tester Project scientist Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 ![]() ![]() |
unfortunately, our labstaff isn't dedicated to just working with us - they handle all the systems administration for the entire computer science department at RPI. so getting things to work correctly often takes a bit longer than we'd all like. also unfortunately, we're also kind of locked into using them :P |
![]() Send message Joined: 27 Dec 07 Posts: 35 Credit: 1,432,926 RAC: 0 ![]() ![]() |
|
![]() ![]() Send message Joined: 28 Aug 07 Posts: 133 Credit: 29,423,179 RAC: 0 ![]() ![]() |
|
![]() Volunteer moderator Project administrator Project developer Project tester Project scientist Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 ![]() ![]() |
It looks like we are running again. i've been bugging labstaff as much as i can :) school is back in session now so hopefully they're all back from vacations and things like that. i haven't gotten any response from them in the last few days. |
![]() ![]() Send message Joined: 12 Nov 07 Posts: 2425 Credit: 524,164 RAC: 0 ![]() ![]() |
Why is this going down almost every day? |
![]() Volunteer moderator Project administrator Project developer Project tester Project scientist Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 ![]() ![]() |
Why is this going down almost every day? the guy i've been talkign to in labstaff thinks there might be a problem in the kernel. other than that i'm really not too sure. |
![]() ![]() Send message Joined: 8 Oct 07 Posts: 289 Credit: 3,690,838 RAC: 0 ![]() ![]() |
Is anyone looking into the recent rash of server crashes? |
![]() Volunteer moderator Project administrator Project developer Project tester Project scientist Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 ![]() ![]() |
Is anyone looking into the recent rash of server crashes? i'll let labstaff know about it. not quite sure whats causing them. there have been a lot of downtime for different servers on campus so that might be part of the problem. it was up and running smoothly for a good week or two there before the recent bout of outages. ![]() |
![]() ![]() Send message Joined: 27 Aug 07 Posts: 647 Credit: 27,592,547 RAC: 0 ![]() ![]() |
Phew, we're back again! :-)))) Lovely greetings, Cori ![]() ![]() |
![]() ![]() Send message Joined: 9 Nov 07 Posts: 131 Credit: 180,454 RAC: 0 ![]() ![]() |
|
![]() Send message Joined: 28 Aug 07 Posts: 146 Credit: 10,280,584 RAC: 0 ![]() ![]() |
|
©2023 Astroinformatics Group