fix to the invalid workunit problem
Message boards : News : fix to the invalid workunit problem

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 30 Aug 07
Posts: 2046
Credit: 26,480
RAC: 0

Message 48525 - Posted: 7 May 2011, 7:52:11 UTC
Last modified: 7 May 2011, 7:52:24 UTC

I think I've fixed the problem with workunits all being marked invalid. Let me know if newly reported workunits are validating ok.
____________

Bent Vangli
Joined: 22 Jan 11
Posts: 2
Credit: 3,324,891
RAC: 0

Message 48526 - Posted: 7 May 2011, 8:02:18 UTC - in response to Message 48525.

The one CUDA work unit I just reported validated. Good work.

Are the units already marked as invalid lost, or will you revalidate them?

Best regards Bent

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 30 Aug 07
Posts: 2046
Credit: 26,480
RAC: 0

Message 48527 - Posted: 7 May 2011, 8:04:14 UTC - in response to Message 48526.

Bent Vangli wrote:
"The one CUDA work unit I just reported validated. Good work. Are the units already marked as invalid lost, or will you revalidate them? Best regards Bent"

There's a chance they might get revalidated, but I'm not quite sure.
____________

Bent Vangli
Joined: 22 Jan 11
Posts: 2
Credit: 3,324,891
RAC: 0

Message 48528 - Posted: 7 May 2011, 8:06:43 UTC - in response to Message 48527.

You have my feelings. Let us just pray :-) Bent

Jesse Viviano
Joined: 4 Feb 11
Posts: 82
Credit: 35,686,578
RAC: 12,753

Message 48529 - Posted: 7 May 2011, 8:56:24 UTC

I noticed that many of the work units now only allow a maximum of one error before the work unit is rejected. That seems to be a bad idea due to the number of people with outdated graphics drivers (causing some ATI/AMD work units to fail) and those with optimized applications that will generate validate errors because they still upload results with file uploads, where the validator no longer looks.

Profile Toppie*
Joined: 28 Mar 09
Posts: 68
Credit: 1,003,982,681
RAC: 0

Message 48530 - Posted: 7 May 2011, 9:07:46 UTC - in response to Message 48525.

As of 07:33 UTC, no more invalids.
Thank you!

Profile The Gas Giant
Joined: 24 Dec 07
Posts: 1947
Credit: 240,884,648
RAC: 26

Message 48531 - Posted: 7 May 2011, 10:32:16 UTC

I still have WUs marked as invalid.

vandiesel
Joined: 10 May 10
Posts: 27
Credit: 43,104,187
RAC: 0

Message 48533 - Posted: 7 May 2011, 11:01:59 UTC

I have had a load just "aborted by project"

ExtraTerrestrial Apes
Joined: 1 Sep 08
Posts: 204
Credit: 219,354,537
RAC: 0

Message 48534 - Posted: 7 May 2011, 12:04:01 UTC - in response to Message 48529.

I agree with Jesse, the current situation could use some improvement.

While looking at recent results I noticed you became quite generous with "initial replication". I've seen 4 and 6. Out of these only 1 error (due to an old app or whatever) already triggers the WU being marked invalid. Sorry, but this is stupid if you've got 5 perfectly fine and agreeing results together with that error. Take the results which are good, give them credit and never mind the error.
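A minimal sketch of the policy suggested here, not the project's actual validator: errored results are set aside, and the workunit is accepted as long as enough of the remaining results agree. The names `validate_workunit`, `ERROR`, and `min_quorum`, the host names, and the string result values are all hypothetical stand-ins. A real validator would compare numeric outputs within a tolerance rather than by exact equality.

```python
from collections import Counter

ERROR = None  # hypothetical marker for a result that ended in an error


def validate_workunit(outcomes, min_quorum=2):
    """Pick a canonical result while ignoring errored results.

    outcomes maps a host name to the value it reported, or ERROR if the
    task failed on that host. Returns (canonical_value, valid_hosts), or
    (None, []) when fewer than min_quorum successful results agree.
    """
    # Drop errored results instead of letting them invalidate the workunit.
    successes = {host: value for host, value in outcomes.items()
                 if value is not ERROR}
    if not successes:
        return None, []

    # The most common successful value is the candidate canonical result.
    value, count = Counter(successes.values()).most_common(1)[0]
    if count < min_quorum:
        return None, []

    valid_hosts = sorted(h for h, v in successes.items() if v == value)
    return value, valid_hosts


# Six replicas: five agree, one errored (for example, an outdated app).
outcomes = {
    "host_a": "0.8421", "host_b": "0.8421", "host_c": "0.8421",
    "host_d": "0.8421", "host_e": "0.8421", "host_f": ERROR,
}
canonical, credited = validate_workunit(outcomes)
print(canonical, credited)
```

Under this policy the five agreeing hosts get credit and the single error is simply not counted, which is the behaviour asked for above.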

MrS
____________
Scanning for our furry friends since Jan 2002

Profile dskagcommunity
Joined: 26 Feb 11
Posts: 170
Credit: 183,085,176
RAC: 0

Message 48535 - Posted: 7 May 2011, 12:05:03 UTC
Last modified: 7 May 2011, 12:16:11 UTC

About 50% of the way through each WU, I get a computation error and the task aborts. This happens with all of today's WUs :/
There must be something wrong :(

Edit:
I now see this in the error reports on the website for two of the WUs:

Maximum elapsed time exceeded

Hm? After ~4 minutes? The deadline is 15.5., not four minutes from now. ^^ I don't have a 5990 or anything like it that could finish in that time. ^^ I need 7-13 minutes depending on the type of WU.
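A rough sketch of why a "Maximum elapsed time exceeded" abort can hit long before the deadline. It assumes the client derives the limit by dividing the workunit's upper bound on floating-point operations (`rsc_fpops_bound`) by the estimated speed of the application on this host. That formula is my understanding of the BOINC client, not something confirmed in this thread, and every number below is made up for illustration.

```python
def max_elapsed_seconds(fpops_bound, estimated_flops):
    """Assumed limit: operation bound divided by estimated device speed."""
    if estimated_flops <= 0:
        raise ValueError("estimated_flops must be positive")
    return fpops_bound / estimated_flops


# Hypothetical values, chosen only to show the effect.
fpops_bound = 1.0e15      # upper bound on operations for one workunit
realistic_flops = 1.0e12  # speed estimate that matches the real GPU
inflated_flops = 1.0e13   # speed estimate that is 10x too optimistic

print(max_elapsed_seconds(fpops_bound, realistic_flops))  # 1000.0 seconds
print(max_elapsed_seconds(fpops_bound, inflated_flops))   # 100.0 seconds
```

Under that assumption, a bound set too low or a speed estimate set too high shrinks the limit, so a task that really needs 7-13 minutes would be killed partway through regardless of the report deadline.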
____________
DSKAG Austria Research Team: http://www.research.dskag.at



Dan
Joined: 17 May 09
Posts: 5
Credit: 25,313,792
RAC: 0

Message 48538 - Posted: 7 May 2011, 12:16:59 UTC - in response to Message 48525.

I don't know if this is related, but all WUs on my dual 5870s error out now.

[HWU]Flotta Stellare - Starfleet
Joined: 22 Feb 09
Posts: 6
Credit: 25,439,032
RAC: 0

Message 48540 - Posted: 7 May 2011, 12:38:44 UTC

I have a 6950 and the same problem: every WU ends with a computation error.

Eux
Joined: 19 Apr 11
Posts: 1
Credit: 208,466
RAC: 0

Message 48541 - Posted: 7 May 2011, 12:44:33 UTC
Last modified: 7 May 2011, 13:00:05 UTC

- Every task from my dual ATI 4890 GPUs has ended in an error today.
- My CPU tasks are OK.


best regards

Profile fischju
Joined: 28 Apr 11
Posts: 3
Credit: 8,487,173
RAC: 0

Message 48543 - Posted: 7 May 2011, 14:14:46 UTC

Same here: all my GPU WUs (4850) end in a computation error. I just watched one, and it only got about halfway.

ivk
Joined: 10 Feb 10
Posts: 6
Credit: 157,756,946
RAC: 0

Message 48544 - Posted: 7 May 2011, 15:12:10 UTC
Last modified: 7 May 2011, 15:41:13 UTC

All of my workunits running on the GPU are currently being aborted automatically, with diagnostics similar to the following:

07/05/2011 16:07:31 Milkyway@home Starting de_separation_10_3s_fix20_1_142549_1304780731_0
07/05/2011 16:07:31 Milkyway@home Starting task de_separation_10_3s_fix20_1_142549_1304780731_0 using milkyway version 62
07/05/2011 16:09:09 Milkyway@home Aborting task de_separation_10_3s_fix20_1_142549_1304780731_0: exceeded elapsed time limit 96.990841
07/05/2011 16:09:10 Milkyway@home Computation for task de_separation_10_3s_fix20_1_142549_1304780731_0 finished

I guess this is another screw-up?

As you can see, the unit was aborted after roughly a minute and a half of wall-clock time, due to "exceeded elapsed time limit". Evidently the "elapsed time limit" is being set wrongly.
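To check the timing claim against the log, here is a small sketch that parses two of the log lines quoted above and compares the wall-clock time between start and abort with the limit the client reported. The helper names `parse_timestamp` and `elapsed_and_limit` are mine, and the DD/MM/YYYY date order is an assumption based on the post date (7 May 2011).

```python
from datetime import datetime

# Two of the client log lines quoted in the post, copied verbatim.
LOG = """\
07/05/2011 16:07:31 Milkyway@home Starting task de_separation_10_3s_fix20_1_142549_1304780731_0 using milkyway version 62
07/05/2011 16:09:09 Milkyway@home Aborting task de_separation_10_3s_fix20_1_142549_1304780731_0: exceeded elapsed time limit 96.990841
"""


def parse_timestamp(line):
    """Read the leading date and time fields (DD/MM/YYYY assumed)."""
    date_part, time_part = line.split()[:2]
    return datetime.strptime(f"{date_part} {time_part}", "%d/%m/%Y %H:%M:%S")


def elapsed_and_limit(log_text):
    """Return (seconds between start and abort, reported limit in seconds)."""
    start = abort = limit = None
    for line in log_text.splitlines():
        if "Starting task" in line:
            start = parse_timestamp(line)
        elif "exceeded elapsed time limit" in line:
            abort = parse_timestamp(line)
            # The limit is the last whitespace-separated field on the line.
            limit = float(line.rsplit(" ", 1)[1])
    return (abort - start).total_seconds(), limit


elapsed, limit = elapsed_and_limit(LOG)
print(f"ran for {elapsed:.0f} s, reported limit {limit:.2f} s")
```

The task ran for 98 seconds against a reported limit of about 97 seconds, so the client aborted it almost as soon as the limit was crossed. The limit itself, not the host, looks like the problem.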
____________

TJ
Joined: 12 Aug 09
Posts: 262
Credit: 91,881,498
RAC: 24

Message 48545 - Posted: 7 May 2011, 15:57:01 UTC

I have lots of computation errors now; I just powered up the rig.
My wingmen have errors on almost all of the same WUs.
____________
Greetings from,
TJ

Profile Arif Mert Kapicioglu
Joined: 14 Dec 09
Posts: 159
Credit: 573,720,351
RAC: 0

Message 48555 - Posted: 7 May 2011, 18:57:53 UTC

I guess I'm one of the lucky ones. I have very few of these invalid or error WUs. Most of them are cancelled by the server.

I don't use an app_info file or the CPU, and I have only one backup project, which doesn't use the CPU either. BOINC version 6.10.60, Catalyst 11.3, Windows 7 Pro x64 SP1.

Also, I downclock the memory within the official limits, unlike the unofficial limit in MSI AB. If I downclock the memory by more than half of its initial value, then at some point the driver causes a BSOD.

Hope the info helps.

TJ
Joined: 12 Aug 09
Posts: 262
Credit: 91,881,498
RAC: 24

Message 48556 - Posted: 7 May 2011, 19:50:55 UTC - in response to Message 48555.

I think you are lucky indeed. All my wingmen have errors, so a lot of people have them.
I haven't made any changes to the settings of my cards, CPUs and so on. No tweaks, no overclocks or downclocks, and until today everything was running like a train, a fast train.
____________
Greetings from,
TJ

BarryAZ
Joined: 1 Sep 08
Posts: 519
Credit: 281,480,125
RAC: 185

Message 48563 - Posted: 8 May 2011, 5:43:46 UTC

Travis, it's not clear that this is fixed yet. I did a project reset and the first unit ended in a computation error.

What I will do from this side is update to the most recent (11.4) ATI drivers -- been running 11.2. After that I'll try again.


____________

BarryAZ
Joined: 1 Sep 08
Posts: 519
Credit: 281,480,125
RAC: 185

Message 48564 - Posted: 8 May 2011, 6:04:04 UTC

OK - looks like the problem is not resolved.

I'm running 6.10.58

I installed the current 11.4 ATI drivers

I then did a project reset.

I got computation errors (after about 33% complete) on a Windows XP machine as well as a 64-bit Windows 7 machine. Both workstations have HD 4850s, not overclocked.

So for now, I figure the issue that showed up in the past day is on the MW side of things.

I figure to watch the news here but for now, I will push over to Collatz.

____________


Copyright © 2018 AstroInformatics Group