Welcome to MilkyWay@home

Thousands of validation errors, no good work?

Donald Qualls

Joined: 13 Apr 11
Posts: 33
Credit: 29,420,441
RAC: 6,488
Message 66238 - Posted: 23 Mar 2017, 10:40:44 UTC

I noticed recently that my credit average on Milkyway has dropped. On investigation, I see I have thousands of work units with validation errors, and nothing (recently) that has validated successfully.

Given I don't see the message boards exploding, I presume this is specific to my computer -- what should I look for?
Vortac

Joined: 22 Apr 09
Posts: 95
Credit: 4,808,181,963
RAC: 0
Message 66240 - Posted: 23 Mar 2017, 17:38:33 UTC - in response to Message 66238.  

I noticed recently that my credit average on Milkyway has dropped. On investigation, I see I have thousands of work units with validation errors, and nothing (recently) that has validated successfully.

Given I don't see the message boards exploding, I presume this is specific to my computer -- what should I look for?

Check the GPU temperatures - maybe the fan has failed and the card is overheating?

If that's not the case, it is possible that the card is dying. Try lowering the clocks (even below factory settings) and then watch whether there's any improvement.
Dunx

Joined: 13 Feb 11
Posts: 31
Credit: 1,403,524,537
RAC: 0
Message 66241 - Posted: 23 Mar 2017, 20:53:39 UTC
Last modified: 23 Mar 2017, 20:55:36 UTC

If it has failed on four attempts,

On four different systems,

then it's... garbage in = garbage out!

IMHO

dunx

P.S. It is NOT just your system.
alanb1951

Joined: 16 Mar 10
Posts: 208
Credit: 105,462,642
RAC: 36,107
Message 66242 - Posted: 24 Mar 2017, 2:57:46 UTC

Donald,

I notice that all your Invalid tasks report the application as "MilkyWay@Home Anonymous platform (CPU)"

I would expect a Linux system running CPU tasks to report as "MilkyWay@Home 1.40" (GPU tasks would have the GPU plan class listed in brackets after that...)

So the obvious question is: why is your application reporting as Anonymous? It might be worth resetting Milkyway on that machine to see if kick-starting it again fetches the current application from the server - you won't be any worse off than you are now!

Good luck sorting it out.

Cheers - Al.
Brickhead

Joined: 20 Mar 08
Posts: 108
Credit: 2,607,924,860
RAC: 0
Message 66243 - Posted: 24 Mar 2017, 12:37:36 UTC

Clients using Anonymous platform, where the application and settings are specified locally, are not subject to automatic updates from the MW servers.

Have you checked whether the MW application on this host is the current release? My guess is that it might be obsolete.

There are a few other hosts with this problem (all of them use Anonymous platform AFAICT), but the vast majority are OK.
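
One rough way to check this is to look at the app_info.xml file that an anonymous-platform setup keeps in the project's directory under the BOINC data directory. The sketch below is only an illustration in Python: the function name list_app_versions and the sample XML (including the file name and the version number 82) are made up for the example, and it assumes the usual layout where each <app_version> element carries an <app_name> and a <version_num>. As far as I know, BOINC stores version_num as major*100 + minor, so the current 1.40 application would show up as 140.

```python
import xml.etree.ElementTree as ET


def list_app_versions(app_info_xml: str) -> list[tuple[str, int]]:
    """Return (app_name, version_num) for every <app_version> in the XML text."""
    root = ET.fromstring(app_info_xml)
    versions = []
    for app_version in root.iter("app_version"):
        name = (app_version.findtext("app_name") or "").strip()
        # Treat a missing or empty <version_num> as 0 rather than failing.
        number = (app_version.findtext("version_num") or "0").strip()
        versions.append((name, int(number)))
    return versions


# Hypothetical sample, NOT copied from any real host.
SAMPLE = """<app_info>
  <app><name>milkyway</name></app>
  <file_info><name>milkyway_example_binary</name><executable/></file_info>
  <app_version>
    <app_name>milkyway</app_name>
    <version_num>82</version_num>
    <file_ref><file_name>milkyway_example_binary</file_name><main_program/></file_ref>
  </app_version>
</app_info>"""

for app_name, version_num in list_app_versions(SAMPLE):
    # Print in the familiar major.minor form, e.g. 82 becomes 0.82.
    print(f"{app_name}: {version_num // 100}.{version_num % 100:02d}")
```

To try it on a real host, read the actual app_info.xml into a string and pass that in instead of SAMPLE. On a Debian or Ubuntu package install the data directory is usually /var/lib/boinc-client, but check your own setup before relying on that path.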
Donald Qualls

Joined: 13 Apr 11
Posts: 33
Credit: 29,420,441
RAC: 6,488
Message 66259 - Posted: 1 Apr 2017, 18:43:43 UTC
Last modified: 1 Apr 2017, 19:02:16 UTC

@Vortac: I'm not running GPU on Milkyway. I am running GPU on Einstein, and it's ticking right along without problems.

I presume I'm running "anonymous" because that's what I get when I install BOINC from the Canonical repositories -- which Canonical always recommends over anything from a third party. I don't know how to check what the application release is, or how to reset it. More details would be helpful.

Edit: Okay, following up, I found the "reset" button in the project controls and tried it, but I was still getting tasks "completing" in 30+ seconds that should have taken an hour and a half, so I tried the "remove" button. That locked up my BOINC manager, but after I closed and reopened it, it came up showing Einstein and not Milkyway. I then used "add project" to add Milkyway again. After it downloaded a few tasks, I see a 4-CPU task with an original estimate of 15 minutes or so progressing at approximately 1:1 between incrementing elapsed time and decrementing remaining time. It'll take some time to see if validation goes through, and I still don't know what client version I have, but it seems I'm getting believable operation, at least.
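
For what it's worth, the symptom above (a task "finishing" in about 30 seconds against an estimate of an hour and a half) is easy to flag with a simple ratio check. This is only a sketch: the function name looks_suspiciously_fast and the 10% cutoff are my own assumptions, not anything BOINC or the project defines.

```python
def looks_suspiciously_fast(elapsed_s: float, estimated_s: float,
                            min_ratio: float = 0.1) -> bool:
    """Flag a task whose run time is a tiny fraction of its original estimate.

    min_ratio is an arbitrary cutoff chosen for illustration (10% of the estimate).
    """
    if estimated_s <= 0:
        # No usable estimate, so there is nothing to compare against.
        return False
    return elapsed_s / estimated_s < min_ratio


# The two cases described in the post, converted to seconds.
print(looks_suspiciously_fast(30, 90 * 60))       # 30 s against a 1.5 hour estimate
print(looks_suspiciously_fast(15 * 60, 15 * 60))  # 15 min task running at about 1:1
```

A short run time on its own doesn't prove the results are bad, so treat a flag like this as a reason to go look at the task's status on the project site, nothing more.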
Brickhead

Joined: 20 Mar 08
Posts: 108
Credit: 2,607,924,860
RAC: 0
Message 66260 - Posted: 1 Apr 2017, 23:53:30 UTC
Last modified: 1 Apr 2017, 23:54:11 UTC

The last (I think) task marked "MilkyWay@Home Anonymous platform (CPU)" was returned 1 Apr 2017, 18:51:57 UTC. Status "Abandoned" indicates that this was when you reattached to MW on this computer.

After that, all non-nbody tasks sent to that computer are marked "MilkyWay@Home v1.40", and the one returned 1 Apr 2017, 21:31:21 UTC does indeed say "Completed and validated".

Looks to me like you've sorted the problem :)


©2024 Astroinformatics Group