Welcome to MilkyWay@home

Compute Errors

The Gas Giant
Joined: 24 Dec 07
Posts: 1947
Credit: 240,884,648
RAC: 0
Message 24158 - Posted: 4 Jun 2009, 18:00:25 UTC

It appears there are some more faulty WUs out there...

ps_sgr_208_3s_1_20922_1244137807

Plus others...

[FVG] bax
Joined: 7 Mar 09
Posts: 8
Credit: 140,903,170
RAC: 0
Message 24160 - Posted: 4 Jun 2009, 18:09:07 UTC - in response to Message 24158.  
Last modified: 4 Jun 2009, 18:17:06 UTC

It seems to me that:

only the "3s" type is involved!

The GPU goes to "compute error" but the CPU works fine.

http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=72202438

Kalessin
Joined: 10 Nov 07
Posts: 42
Credit: 27,012,695
RAC: 0
Message 24161 - Posted: 4 Jun 2009, 18:25:49 UTC
Last modified: 4 Jun 2009, 18:31:54 UTC

At the moment I have only 2s running. The 1s and 3s bring both of my GPUs to a complete standstill.

OK, the 1s have started working again; they just took a while to recover after I terminated the 3s.

Grrrr.
Dragons can fly because they don't fit into pirate ships!

[XTBA>XTC] ZeuZ
Joined: 27 Dec 07
Posts: 14
Credit: 5,089,974
RAC: 0
Message 24162 - Posted: 4 Jun 2009, 18:31:12 UTC

Same problem here

[FVG] bax
Joined: 7 Mar 09
Posts: 8
Credit: 140,903,170
RAC: 0
Message 24163 - Posted: 4 Jun 2009, 18:36:44 UTC - in response to Message 24161.  

...1s ... complete standstill


Only the 1s are at a standstill here (GPU).

Kalessin
Joined: 10 Nov 07
Posts: 42
Credit: 27,012,695
RAC: 0
Message 24165 - Posted: 4 Jun 2009, 18:44:00 UTC

OK, I just had a closer look.
The standstill here is caused by the "3s", but once everything has stalled, the "1s" and "2s" need a complete restart of BOINC to get going again.
Dragons can fly because they don't fit into pirate ships!

[P3D] Crashtest
Joined: 8 Jan 09
Posts: 58
Credit: 53,161,741
RAC: 0
Message 24166 - Posted: 4 Jun 2009, 18:50:34 UTC - in response to Message 24165.  

Yes, I've got the "3s" problem too (Gipsel 0.19e GPU app).

"1s" and "2s" are OK.

borandi
Joined: 21 Feb 09
Posts: 180
Credit: 27,806,824
RAC: 0
Message 24167 - Posted: 4 Jun 2009, 18:51:53 UTC

I thought I had a haul when it said '48 new tasks'... =P

borandi
Joined: 21 Feb 09
Posts: 180
Credit: 27,806,824
RAC: 0
Message 24172 - Posted: 4 Jun 2009, 19:28:08 UTC

6/4/2009 7:58:26 PM|Milkyway@home|Scheduler request completed: got 5 new tasks
6/4/2009 8:07:51 PM|Milkyway@home|Scheduler request completed: got 19 new tasks
6/4/2009 8:14:51 PM|Milkyway@home|Scheduler request completed: got 21 new tasks
6/4/2009 8:16:03 PM|Milkyway@home|Scheduler request completed: got 16 new tasks
6/4/2009 8:20:46 PM|Milkyway@home|Scheduler request completed: got 16 new tasks
6/4/2009 8:24:14 PM|Milkyway@home|Scheduler request completed: got 7 new tasks

Seems like the s3 WUs are shutting down the automated machines? I'm getting only s1 and s2 at the minute after a small batch of s3

[FVG] bax
Joined: 7 Mar 09
Posts: 8
Credit: 140,903,170
RAC: 0
Message 24173 - Posted: 4 Jun 2009, 19:34:18 UTC - in response to Message 24172.  

..Seems like the s3 WUs are shutting down the automated machines?


we have the first WU virus ;-)

hi hi

Lord Tedric
Joined: 9 Nov 07
Posts: 151
Credit: 8,391,608
RAC: 0
Message 24174 - Posted: 4 Jun 2009, 19:36:52 UTC
Last modified: 4 Jun 2009, 19:48:42 UTC

Same here: WUs fail to run and then cause errors.

ps_sgr_208_3s_2_9411_1244143134_0
ps_sgr_208_3s_2_9410_1244143134_0
ps_sgr_208_3s_2_9409_1244143134_0
ps_sgr_208_3s_2_9408_1244143134_0
ps_sgr_208_3s_2_9407_1244143134_0
ps_sgr_208_3s_2_9386_1244143134_0
ps_sgr_208_3s_2_9385_1244143134_0
ps_sgr_208_3s_2_9384_1244143134_0

<core_client_version>6.6.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>

They also seem to use a lot of processor power for a WU running on an ATI GPU, and once downloaded they fail to start processing; shutting down and restarting BOINC does not remedy this either.
I've started to abort these units, but the next batch of downloads just sends more of the same!

localizer
Joined: 28 Jan 08
Posts: 40
Credit: 379,931,801
RAC: 0
Message 24176 - Posted: 4 Jun 2009, 19:51:00 UTC - in response to Message 24174.  

........... just seems to be hanging WUs with a side order of computation errors at the moment on the ps_sgr_208_xs series of WUs.

borandi
Joined: 21 Feb 09
Posts: 180
Credit: 27,806,824
RAC: 0
Message 24179 - Posted: 4 Jun 2009, 20:04:16 UTC

Now,

On one of my ATi clients, I have to cancel the s3s, but s1 and s2 work fine.
On another ATi client, s3 WUs come out as comp errors, but s1 and s2 process and are marked invalid.

Lord Tedric
Joined: 9 Nov 07
Posts: 151
Credit: 8,391,608
RAC: 0
Message 24181 - Posted: 4 Jun 2009, 20:08:30 UTC - in response to Message 24179.  

I'm getting errors on s1, s2 and s3 units.

I'm having to micromanage at the moment, as none are completing and therefore none are being returned, so unless I abort them I don't get any new units.
The circle of life...

Berserk_Tux
Joined: 2 Jan 08
Posts: 79
Credit: 365,471,675
RAC: 0
Message 24184 - Posted: 4 Jun 2009, 20:12:36 UTC - in response to Message 24176.  
Last modified: 4 Jun 2009, 20:16:44 UTC

I have the same problems. All my ATIs hang and error out :-(

boosted
Joined: 4 Feb 08
Posts: 116
Credit: 17,263,566
RAC: 0
Message 24185 - Posted: 4 Jun 2009, 20:13:40 UTC

I have had about 60 or so of the 3s units error out or freeze.

Labbie
Joined: 29 Aug 07
Posts: 327
Credit: 116,463,193
RAC: 0
Message 24186 - Posted: 4 Jun 2009, 20:18:14 UTC

My 2 ATI machines error immediately on the 3s WUs, my CPU machine finishes these just fine.

Calm Chaos Forum...Join Calm Chaos Now

[FVG] bax
Joined: 7 Mar 09
Posts: 8
Credit: 140,903,170
RAC: 0
Message 24188 - Posted: 4 Jun 2009, 20:23:28 UTC - in response to Message 24165.  

OK, I just had a closer look.
The standstill here is caused by the "3s", but once everything has stalled, the "1s" and "2s" need a complete restart of BOINC to get going again.


I think this is the correct diagnosis for ATI GPU clients.

If I'm lucky, I delete all the "3s" WUs before BOINC starts crunching them. That way I save the "1s" and "2s" without restarting BOINC.
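
For anyone who wants to spot the "3s" WUs in a long task list rather than by eye, here is a minimal sketch. It assumes the type is always the `<N>s` token in the task name, which is inferred only from the names posted in this thread (for example ps_sgr_208_3s_2_9411_1244143134_0). The function names and the "1s" sample name are hypothetical, and the script only picks out names; it does not talk to BOINC or abort anything.

```python
import re

# Assumed name layout, inferred only from names posted in this thread:
# ps_sgr_208_<N>s_..., where <N>s is the "1s" / "2s" / "3s" type.
TYPE_TOKEN = re.compile(r"_(\d+)s_")


def wu_type(task_name):
    """Return the type token such as '3s', or None if the name has none."""
    match = TYPE_TOKEN.search(task_name)
    return f"{match.group(1)}s" if match else None


def tasks_of_type(task_names, wanted="3s"):
    """Keep only the task names whose type token equals `wanted`."""
    return [name for name in task_names if wu_type(name) == wanted]


queued = [
    "ps_sgr_208_3s_2_9411_1244143134_0",   # posted in this thread
    "ps_sgr_208_3s_1_20922_1244137807",    # posted in this thread
    "ps_sgr_208_1s_2_0001_1244143134_0",   # hypothetical name, for illustration
]

# Names that would be candidates for a manual abort in BOINC Manager.
to_abort = tasks_of_type(queued)
for name in to_abort:
    print(name)
```

The names it prints would still have to be aborted by hand, and only before BOINC starts crunching them, as described above.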

Lord Tedric
Joined: 9 Nov 07
Posts: 151
Credit: 8,391,608
RAC: 0
Message 24190 - Posted: 4 Jun 2009, 20:33:21 UTC - in response to Message 24188.  
Last modified: 4 Jun 2009, 20:45:56 UTC

OK, I just had a closer look.
The standstill here is caused by the "3s", but once everything has stalled, the "1s" and "2s" need a complete restart of BOINC to get going again.


I think this is the correct diagnosis for ATI GPU clients.

If I'm lucky, I delete all the "3s" WUs before BOINC starts crunching them. That way I save the "1s" and "2s" without restarting BOINC.


I tried this approach; it does not work for me...

I just checked Task Manager: although I have no WUs currently downloaded or running, it shows multiple instances of astronomy_0.19_ATI_SSE2e.exe running in the background?

[FVG] bax
Joined: 7 Mar 09
Posts: 8
Credit: 140,903,170
RAC: 0
Message 24191 - Posted: 4 Jun 2009, 20:47:02 UTC - in response to Message 24190.  

I just checked Task Manager: although I have no WUs currently downloaded or running, it shows astronomy_0.19_ATI_SSE2e.exe running in the background?


My Task Manager does not show this ;-)

©2024 Astroinformatics Group