Welcome to MilkyWay@home

Compute Errors


Advanced search

Message boards : Number crunching : Compute Errors
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next

AuthorMessage
Profile[KWSN]John Galt 007
Avatar

Send message
Joined: 12 Dec 08
Posts: 56
Credit: 269,889,439
RAC: 0
200 million credit badge10 year member badge
Message 24400 - Posted: 6 Jun 2009, 20:29:07 UTC

I have had a few over 4 different 3850 cards, but am not really worrying about it...Probably over 99% have completed sucessully...
Click to help Seti City.




ID: 24400 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
John Clark

Send message
Joined: 4 Oct 08
Posts: 1734
Credit: 64,228,409
RAC: 0
50 million credit badge10 year member badge
Message 24415 - Posted: 6 Jun 2009, 23:51:57 UTC

CPUs are having invalid results as well. My old dual P3 has 1 in 4 results returned here.
Go away, I was asleep


ID: 24415 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileKWSN imcrazynow
Avatar

Send message
Joined: 22 Nov 08
Posts: 136
Credit: 319,414,799
RAC: 0
300 million credit badge10 year member badge
Message 24417 - Posted: 6 Jun 2009, 23:56:53 UTC
Last modified: 6 Jun 2009, 23:59:33 UTC

I'm still getting quite a few. Maybe around 5%, that's still alot as far as the project goes. Somebody may want to look at why it's happening. That could be quite alot of work on the grand scale that will be missing if not corrected. Insta purge isn't helping any. If they could see why the're failing it would help.

4870 GPU
4870 GPU
ID: 24417 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileLabbie
Avatar

Send message
Joined: 29 Aug 07
Posts: 327
Credit: 116,463,193
RAC: 0
100 million credit badge10 year member badge
Message 24423 - Posted: 7 Jun 2009, 1:14:12 UTC

I'm now showing 7 on my CPU machine that were not there this morning.



Calm Chaos Forum...Join Calm Chaos Now
ID: 24423 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Divide Overflow
Avatar

Send message
Joined: 16 Feb 09
Posts: 109
Credit: 11,089,510
RAC: 0
10 million credit badge10 year member badge
Message 24438 - Posted: 7 Jun 2009, 3:50:23 UTC
Last modified: 7 Jun 2009, 3:50:55 UTC

If the rare invalid result is occurring on both CPU and GPU crunched tasks, perhaps there is still a fundamental issue with some of the new work we're getting.

I see a small percentage of my work report as invalid as well.
ID: 24438 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileThe Gas Giant
Avatar

Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,884,648
RAC: 0
200 million credit badge10 year member badge
Message 24439 - Posted: 7 Jun 2009, 5:10:14 UTC

I'm seeing approx 10% failure rate on my 4870.
ID: 24439 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileNeil Polson
Avatar

Send message
Joined: 31 Dec 08
Posts: 9
Credit: 1,338,590
RAC: 0
1 million credit badge10 year member badge
Message 24442 - Posted: 7 Jun 2009, 5:39:23 UTC
Last modified: 7 Jun 2009, 5:44:15 UTC

It seems overnight all my 2s_2 have been marked as invalid (Have had 5 of them). All on cpu. No problems with the other searches.
ID: 24442 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileThe Gas Giant
Avatar

Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,884,648
RAC: 0
200 million credit badge10 year member badge
Message 24444 - Posted: 7 Jun 2009, 6:33:57 UTC

My work E6850 (cpu) has about a 5% to 10% error rate.
ID: 24444 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
John Clark

Send message
Joined: 4 Oct 08
Posts: 1734
Credit: 64,228,409
RAC: 0
50 million credit badge10 year member badge
Message 24449 - Posted: 7 Jun 2009, 8:23:36 UTC

The error rate on my 3850 seems to have disappeared, but risen on the CPUs to about 5%.
Go away, I was asleep


ID: 24449 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Chris S
Avatar

Send message
Joined: 20 Sep 08
Posts: 1387
Credit: 188,981,825
RAC: 25,568
100 million credit badge10 year member badge
Message 24451 - Posted: 7 Jun 2009, 8:41:39 UTC

My card is not getting any errors since updating to 0.19f, but my CPU's are hardly getting any work at all so I can't comment on them. The last 2s unit that errored out resulted in 3 screens worth of debugging information in the output file. If anyone wants it I'll pass it on. Hopefully when the GPU project is able to start up all this will go away.
ID: 24451 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileThe Gas Giant
Avatar

Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,884,648
RAC: 0
200 million credit badge10 year member badge
Message 24457 - Posted: 7 Jun 2009, 9:40:14 UTC
Last modified: 7 Jun 2009, 10:01:54 UTC

Just a few of the invalid wu's from my 4870

http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73584677
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73571669
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73544740
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73544745
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73571652
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73571666
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73571667
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73544733
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73501652
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73501661
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73544733
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73544740
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73544745
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73571652
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73571667
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73571666
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73571669


And a couple from my cpu

http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73422247
http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=73422257
ID: 24457 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileBruce
Avatar

Send message
Joined: 28 Apr 08
Posts: 1415
Credit: 2,716,428
RAC: 0
2 million credit badge10 year member badge
Message 24458 - Posted: 7 Jun 2009, 9:48:53 UTC
Last modified: 7 Jun 2009, 10:05:43 UTC

Yep I've got a few invalid Wu's too, all are 2_2's.
;-(
ID: 24458 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Seejay
Avatar

Send message
Joined: 22 Dec 07
Posts: 51
Credit: 2,405,016
RAC: 0
2 million credit badge10 year member badge
Message 24462 - Posted: 7 Jun 2009, 11:14:57 UTC - in response to Message 24458.  

Yep I've got a few invalid Wu's too, all are 2_2's.
;-(


Me too - all CPU - all 2_2s.

Seejay **Proud Member and Founder of BOINC Team Allprojectstats.com**
ID: 24462 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
John Vickers
Volunteer moderator
Project developer
Project scientist
Avatar

Send message
Joined: 11 May 09
Posts: 30
Credit: 81,093
RAC: 0
10 thousand credit badge10 year member badge
Message 24563 - Posted: 8 Jun 2009, 14:05:58 UTC
Last modified: 8 Jun 2009, 14:50:52 UTC

Hello MW@Home,

Can you please tell me if these _2s_2 runs that were returning errors are all "ps_sgr_208_2s_2", all "ps_sgr_235_2s_1" or a mix of both?

Thank You,
John Vickers

Edit: typo: "ps_sgr_235_2s_2" - > "ps_sgr_235_2s_1"
ID: 24563 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Brian Silvers

Send message
Joined: 21 Aug 08
Posts: 625
Credit: 558,425
RAC: 0
500 thousand credit badge10 year member badge
Message 24564 - Posted: 8 Jun 2009, 14:12:47 UTC - in response to Message 24563.  

Hello MW@Home,

Can you please tell me if these _2s_2 runs that were returning errors are all "ps_sgr_208_2s_2", all "ps_sgr_235_2s_2" or a mix of both?

Thank You,
John Vickers


It would help somewhat if the purge delay could be increased some more, that way there would be more results for people to look through to find an answer to your question. Alternatively, you could write a SQL query to run against the results database periodically to gather the data from your side.
ID: 24564 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileCrunch3r
Volunteer developer
Avatar

Send message
Joined: 17 Feb 08
Posts: 363
Credit: 258,227,990
RAC: 0
200 million credit badge10 year member badge
Message 24566 - Posted: 8 Jun 2009, 14:18:43 UTC - in response to Message 24563.  

Hello MW@Home,

Can you please tell me if these _2s_2 runs that were returning errors are all "ps_sgr_208_2s_2", all "ps_sgr_235_2s_2" or a mix of both?

Thank You,
John Vickers


From what i've seen only "ps_sgr_208_2s_2" cause errors.

Join Support science! Joinc Team BOINC United now!
ID: 24566 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Cluster Physik

Send message
Joined: 26 Jul 08
Posts: 627
Credit: 94,940,203
RAC: 0
50 million credit badge10 year member badgeextraordinary contributions badge
Message 24568 - Posted: 8 Jun 2009, 14:25:38 UTC - in response to Message 24563.  
Last modified: 8 Jun 2009, 14:27:44 UTC

Can you please tell me if these _2s_2 runs that were returning errors are all "ps_sgr_208_2s_2", all "ps_sgr_235_2s_2" or a mix of both?

Most of the invalid WUs are ps_sgr_208_2s_2 ones, with a few ps_sgr_235_2s_1 amongst them (ratio 15:1 or so). I haven't found a single failed ps_sgr_235_2s_2 WU (in about 900 I looked at).

I would think the outlier detection may be too picky for some of the WUs, that means also some correct results get rejected.
ID: 24568 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profilekrahulik

Send message
Joined: 7 Nov 08
Posts: 14
Credit: 180,768,799
RAC: 0
100 million credit badge10 year member badge
Message 24584 - Posted: 8 Jun 2009, 16:51:53 UTC

ID: 24584 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileKWSN imcrazynow
Avatar

Send message
Joined: 22 Nov 08
Posts: 136
Credit: 319,414,799
RAC: 0
300 million credit badge10 year member badge
Message 24587 - Posted: 8 Jun 2009, 17:43:47 UTC

These are the ones I could get before insta purge took care of them.

Host 39176 GPU
ps_sgr_208_2s_2_1637698_1244467419
ps_sgr_208_2s_2_1637697_1244467419
ps_sgr_208_2s_2_1637695_1244467419_0
ps_sgr_208_2s_2_1624691_1244465356_0
ps_sgr_208_2s_2_1615716_1244463950
ps_sgr_208_2s_2_1615702_1244463950_0

Host 60779 GPU
ps_sgr_208_2s_2_1641829_1244468066
ps_sgr_208_2s_2_1628695_1244465990
ps_sgr_208_2s_2_1628692_1244465990
ps_sgr_208_2s_2_1623219_1244465117
ps_sgr_235_2s_1_1572923_1244457187
ps_sgr_208_2s_2_308822_1244201732

Host 39247 CPU
ps_sgr_208_2s_2_1598226_1244461206

4870 GPU
4870 GPU
ID: 24587 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile[KWSN]John Galt 007
Avatar

Send message
Joined: 12 Dec 08
Posts: 56
Credit: 269,889,439
RAC: 0
200 million credit badge10 year member badge
Message 24591 - Posted: 8 Jun 2009, 18:37:54 UTC - in response to Message 24587.  

These are the ones I could get before insta purge took care of them.

Host 39176 GPU
ps_sgr_208_2s_2_1637698_1244467419
ps_sgr_208_2s_2_1637697_1244467419
ps_sgr_208_2s_2_1637695_1244467419_0
ps_sgr_208_2s_2_1624691_1244465356_0
ps_sgr_208_2s_2_1615716_1244463950
ps_sgr_208_2s_2_1615702_1244463950_0

Host 60779 GPU
ps_sgr_208_2s_2_1641829_1244468066
ps_sgr_208_2s_2_1628695_1244465990
ps_sgr_208_2s_2_1628692_1244465990
ps_sgr_208_2s_2_1623219_1244465117
ps_sgr_235_2s_1_1572923_1244457187
ps_sgr_208_2s_2_308822_1244201732

Host 39247 CPU
ps_sgr_208_2s_2_1598226_1244461206


All but one of my 0 credits have come on the 208_2s_2 WUs as well...too bad instapurge will get them shortly...I didn't see any in the most recent results...

Click to help Seti City.




ID: 24591 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next

Message boards : Number crunching : Compute Errors

©2020 Astroinformatics Group