Welcome to MilkyWay@home

fix to the invalid workunit problem


Advanced search

Message boards : News : fix to the invalid workunit problem
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
ProfileTravis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 30 Aug 07
Posts: 2046
Credit: 26,480
RAC: 0
10 thousand credit badge10 year member badge
Message 48525 - Posted: 7 May 2011, 7:52:11 UTC
Last modified: 7 May 2011, 7:52:24 UTC

I think I've fixed the problem with workunits all being marked invalid. Let me know if newly reported workunits are validating ok.
ID: 48525 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bent Vangli

Send message
Joined: 22 Jan 11
Posts: 2
Credit: 3,590,635
RAC: 4,303
3 million credit badge8 year member badge
Message 48526 - Posted: 7 May 2011, 8:02:18 UTC - in response to Message 48525.  

The one cuda work unit I just reported validated. Good work.

Are those units already marked as invalid lost, or will you do a re-validating?

Best regards Bent
ID: 48526 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileTravis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 30 Aug 07
Posts: 2046
Credit: 26,480
RAC: 0
10 thousand credit badge10 year member badge
Message 48527 - Posted: 7 May 2011, 8:04:14 UTC - in response to Message 48526.  

The one cuda work unit I just reported validated. Good work.

Are those units already marked as invalid lost, or will you do a re-validating?

Best regards Bent


Theres a chance they might get revalidated, but i'm not quite sure.
ID: 48527 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bent Vangli

Send message
Joined: 22 Jan 11
Posts: 2
Credit: 3,590,635
RAC: 4,303
3 million credit badge8 year member badge
Message 48528 - Posted: 7 May 2011, 8:06:43 UTC - in response to Message 48527.  

You have my feelings. Let us just pray :-) Bent
ID: 48528 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 4 Feb 11
Posts: 83
Credit: 42,757,668
RAC: 13,335
30 million credit badge8 year member badge
Message 48529 - Posted: 7 May 2011, 8:56:24 UTC

I noticed that many of the work units now only allow a maximum of one error before the work unit is rejected. That seems to be a bad idea due to the number of people with outdated graphics drivers (causing some ATI/AMD work units to fail) and those with optimized applications that will generate validate errors because they still upload results with file uploads, where the validator no longer looks.
ID: 48529 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileToppie*

Send message
Joined: 28 Mar 09
Posts: 68
Credit: 1,003,982,681
RAC: 0
1 billion credit badge10 year member badge
Message 48530 - Posted: 7 May 2011, 9:07:46 UTC - in response to Message 48525.  

As from 07:33 UTC no more invalids.
Thank you!
ID: 48530 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileThe Gas Giant
Avatar

Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,884,648
RAC: 0
200 million credit badge10 year member badge
Message 48531 - Posted: 7 May 2011, 10:32:16 UTC

Still have wu's marked as invalid.
ID: 48531 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
vandiesel

Send message
Joined: 10 May 10
Posts: 27
Credit: 43,104,187
RAC: 0
30 million credit badge9 year member badge
Message 48533 - Posted: 7 May 2011, 11:01:59 UTC

I have had a load just "aborted by project"
ID: 48533 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Avatar

Send message
Joined: 1 Sep 08
Posts: 204
Credit: 219,354,537
RAC: 0
200 million credit badge10 year member badge
Message 48534 - Posted: 7 May 2011, 12:04:01 UTC - in response to Message 48529.  

I agree with Jesse, the current situation could use some improvement.

While looking at recent results I noticed you became quite generous with "initial replication". I've seen 4 and 6. Out of these only 1 error (due to an old app or whatever) already triggers the WU being marked invalid. Sorry, but this is stupid if you've got 5 perfectly fine and agreeing results together with that error. Take the results which are good, give them credit and never mind the error.

MrS
Scanning for our furry friends since Jan 2002
ID: 48534 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profiledskagcommunity
Avatar

Send message
Joined: 26 Feb 11
Posts: 170
Credit: 183,085,176
RAC: 0
100 million credit badge8 year member badge
Message 48535 - Posted: 7 May 2011, 12:05:03 UTC
Last modified: 7 May 2011, 12:16:11 UTC

I got @~50% per WU an aborting computing error with all the WUs from today :/
There must be something wrong :(

Edit:
I saw now in errorreports on website from two of the WUs:

Maximum elapsed time exceeded

hum? after ~4 Minutes? Timeline is 15.5. not today in 4 minutes.. ^^ Dont have a 5990 or something to get this finished in this time ^^ Need 7-13 Minutes depending on Type of WU
DSKAG Austria Research Team: http://www.research.dskag.at



ID: 48535 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dan

Send message
Joined: 17 May 09
Posts: 5
Credit: 25,350,789
RAC: 0
20 million credit badge10 year member badge
Message 48538 - Posted: 7 May 2011, 12:16:59 UTC - in response to Message 48525.  

Don't know if this is related, but all wu on my dual 5870s error now.
ID: 48538 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[HWU]Flotta Stellare - Starfleet

Send message
Joined: 22 Feb 09
Posts: 6
Credit: 25,439,032
RAC: 0
20 million credit badge10 year member badge
Message 48540 - Posted: 7 May 2011, 12:38:44 UTC

I have a 6950 and the same problem, every WU end with Computation error.

ID: 48540 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Eux

Send message
Joined: 19 Apr 11
Posts: 1
Credit: 208,466
RAC: 0
100 thousand credit badge8 year member badge
Message 48541 - Posted: 7 May 2011, 12:44:33 UTC
Last modified: 7 May 2011, 13:00:05 UTC

-every tasks from my dual ATI 4890 GPU are in error today.
-my CPU Tasks are ok.


best regards
ID: 48541 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profilefischju

Send message
Joined: 28 Apr 11
Posts: 3
Credit: 8,487,173
RAC: 0
5 million credit badge8 year member badge
Message 48543 - Posted: 7 May 2011, 14:14:46 UTC

Same here, all my GPU WUs (4850) end in Computation Error. Just watched one and they only get about half way.
ID: 48543 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ivk

Send message
Joined: 10 Feb 10
Posts: 6
Credit: 157,756,946
RAC: 0
100 million credit badge9 year member badge
Message 48544 - Posted: 7 May 2011, 15:12:10 UTC
Last modified: 7 May 2011, 15:41:13 UTC

All of my workunits running on the GPU are currently being aborted - automatically, with diagnostics similar to the following:

07/05/2011 16:07:31 Milkyway@home Starting de_separation_10_3s_fix20_1_142549_1304780731_0
07/05/2011 16:07:31 Milkyway@home Starting task de_separation_10_3s_fix20_1_142549_1304780731_0 using milkyway version 62
07/05/2011 16:09:09 Milkyway@home Aborting task de_separation_10_3s_fix20_1_142549_1304780731_0: exceeded elapsed time limit 96.990841
07/05/2011 16:09:10 Milkyway@home Computation for task de_separation_10_3s_fix20_1_142549_1304780731_0 finished

I guess this is another screw-up?

As you can see, the unit was aborted after about a minute of the astronomic time, due to "exceeded elapsed time limit". Evidently the "elapsed time limit" is being set wrongly.
ID: 48544 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 12 Aug 09
Posts: 262
Credit: 92,574,414
RAC: 0
50 million credit badge10 year member badge
Message 48545 - Posted: 7 May 2011, 15:57:01 UTC

I have lots of error while computing now, just powerd the rig.
Wingman have almost all errors with same WU's.
Greetings from,
TJ
ID: 48545 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileArif Mert Kapicioglu

Send message
Joined: 14 Dec 09
Posts: 161
Credit: 589,318,064
RAC: 10,014
500 million credit badge9 year member badge
Message 48555 - Posted: 7 May 2011, 18:57:53 UTC

Guess I'm one of the lucky ones. I have very few of these invalid or error wus. Most of them are cancelled by server.

I don't use any app_info, cpu and only have one backup project which doesn't use cpu as well. BOINC Version 6.10.60, catalyst 11.3, 7 Pro X64SP1.

Also, I downclock the memory within official limits unlike the unofficial one in MSI AB. If i downclock more than 1/2 of the initial memory amount, than at some point, the driver causes BSOD.

Hope the info helps.
ID: 48555 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 12 Aug 09
Posts: 262
Credit: 92,574,414
RAC: 0
50 million credit badge10 year member badge
Message 48556 - Posted: 7 May 2011, 19:50:55 UTC - in response to Message 48555.  

I think you are lucky indeed. All my wingman have errors so a lot of people have them.
I haven't made any changes to settings of cards, cpus and so on. No tweaks no overclocks or downclocks, and till today all was running like a train, a fast train.
Greetings from,
TJ
ID: 48556 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 1 Sep 08
Posts: 519
Credit: 282,930,159
RAC: 8,236
200 million credit badge10 year member badgeextraordinary contributions badge
Message 48563 - Posted: 8 May 2011, 5:43:46 UTC

Travis, not clear that this is fixed yet. I did a project reset and the first unit went to a computation error.

What I will do from this side is update to the most recent (11.4) ATI drivers -- been running 11.2. After that I'll try again.


ID: 48563 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BarryAZ

Send message
Joined: 1 Sep 08
Posts: 519
Credit: 282,930,159
RAC: 8,236
200 million credit badge10 year member badgeextraordinary contributions badge
Message 48564 - Posted: 8 May 2011, 6:04:04 UTC

OK - looks like the problem is not resolved.

I'm running 6.10.58

I installed the current 11.4 ATI drivers

I then did a project reset.

I got computation errors (after about 33% complete) -- on a Windows XP as well as a Win 7 - 64 bit, both workstations have HD 4850's -- not over clocked.

So for now, I figure the issue which showed up in the past day, is on the MW side of things.

I figure to watch the news here but for now, I will push over to Collatz.

ID: 48564 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : News : fix to the invalid workunit problem

©2019 Astroinformatics Group