Welcome to MilkyWay@home

Computer 'sploded

Message boards : Number crunching : Computer 'sploded
Message board moderation

To post messages, you must log in.

AuthorMessage
Emanuel

Send message
Joined: 18 Nov 07
Posts: 280
Credit: 2,442,757
RAC: 0
Message 3053 - Posted: 6 Apr 2008, 22:43:32 UTC
Last modified: 6 Apr 2008, 22:43:59 UTC

Hey guys, my computer died today, not sure what's wrong but it's probably either the motherboard, the GPU or a problem with the cooling of the CPU. Either way I don't really have time to look at it this week, and I leave for England this Thursday, so you might want to reassign the three In Progress WUs.
http://milkyway.cs.rpi.edu/milkyway/results.php?hostid=8764
I seem to be plagued by these problems lately o-o That's three out of three of the computers in this house breaking down in the past two weeks ...
ID: 3053 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile tahanko

Send message
Joined: 13 Dec 07
Posts: 11
Credit: 1,849,193
RAC: 0
Message 3057 - Posted: 7 Apr 2008, 4:51:04 UTC

maybe it is the electricity. try a surge protector> http://www.infosec-ups.com/electrical-protection/product-calatogue.htm

ID: 3057 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Reeltime

Send message
Joined: 31 Jan 08
Posts: 6
Credit: 6,776,875
RAC: 0
Message 3058 - Posted: 7 Apr 2008, 11:40:10 UTC

That sounds suspiciously like a power issue. As tahanko says, if you aren't using a surge protector, try one
ID: 3058 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
John McLeod VII
Avatar

Send message
Joined: 27 Aug 07
Posts: 85
Credit: 405,705
RAC: 0
Message 3060 - Posted: 7 Apr 2008, 12:31:34 UTC

You ought to be using a UPS, you need to be using at least a surge protector.


BOINC WIKI
ID: 3060 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Emanuel

Send message
Joined: 18 Nov 07
Posts: 280
Credit: 2,442,757
RAC: 0
Message 3061 - Posted: 7 Apr 2008, 17:44:34 UTC - in response to Message 3060.  

Thanks for the interest and the suggestions, guys, I'll look into getting some surge protection when I get back. On another note, my computer seems to have recovered somewhat (somehow the HDDs' SMART capability was making it balk? WTF?) so I'm letting it finish those WUs.
ID: 3061 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ChertseyAl
Avatar

Send message
Joined: 31 Aug 07
Posts: 66
Credit: 1,002,668
RAC: 0
Message 3062 - Posted: 7 Apr 2008, 18:46:11 UTC - in response to Message 3060.  

You ought to be using a UPS


Wow, that's *serious* BOINCing ;)

Random machine/power failures are why I now only crunch short WUs, or marginally longer ones with proper checkpointing.

Had a power failure this weekend. Probably lost an hour of science at worst across 3 hosts. Also revealed a weird characteristic of SIMAP - Sadly cannot prove my conjecture as I'm dry on that project ATM.

Al.
ID: 3062 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
John McLeod VII
Avatar

Send message
Joined: 27 Aug 07
Posts: 85
Credit: 405,705
RAC: 0
Message 3066 - Posted: 8 Apr 2008, 0:04:07 UTC - in response to Message 3062.  

You ought to be using a UPS


Wow, that's *serious* BOINCing ;)

Random machine/power failures are why I now only crunch short WUs, or marginally longer ones with proper checkpointing.

Had a power failure this weekend. Probably lost an hour of science at worst across 3 hosts. Also revealed a weird characteristic of SIMAP - Sadly cannot prove my conjecture as I'm dry on that project ATM.

Al.

Actually, using a UPS on every machine is sound advice. Most operating systems do not write immediately, but delay for a while in the hopes of reducing the number of writes. This includes the directory structure. A sudden loss of power can cause any file that has been recently modified to lose data, including the file system. If data in the file system is lost, entire files can disappear.


BOINC WIKI
ID: 3066 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Emanuel

Send message
Joined: 18 Nov 07
Posts: 280
Credit: 2,442,757
RAC: 0
Message 3069 - Posted: 8 Apr 2008, 7:19:49 UTC - in response to Message 3066.  

Most operating systems do not write immediately, but delay for a while in the hopes of reducing the number of writes. This includes the directory structure. A sudden loss of power can cause any file that has been recently modified to lose data, including the file system. If data in the file system is lost, entire files can disappear.


This is why it would be nice if everyone used managed writes (is that the term? I forget), but OS APIs tend to be much harder to figure out than they should be. Of course, that doesn't solve the problem that writes also take more time internally than the HDDs would have you think. Aren't UPS' expensive though? I can afford to get some surge protectors (and intend to do so today) but ...
ID: 3069 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Emanuel

Send message
Joined: 18 Nov 07
Posts: 280
Credit: 2,442,757
RAC: 0
Message 3074 - Posted: 8 Apr 2008, 17:04:46 UTC - in response to Message 3069.  

Surge protectors activated (it sounds so sci-fi) :) Now let's hope it helps.
ID: 3074 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Computer 'sploded

©2024 Astroinformatics Group