Welcome to MilkyWay@home

Help with validate errors on Sep 1.46 WU

Ron Kosinski
Joined: 15 Mar 20
Posts: 9
Credit: 250,085,312
RAC: 12,444
Message 70924 - Posted: 27 Jun 2021, 1:33:21 UTC

Two of my boxes have started getting a bunch of validate errors on Separation 1.46 work units (both CPU and GPU). Can someone look at them and let me know what could be causing the errors?
Thanks for any help.
alanb1951

Joined: 16 Mar 10
Posts: 213
Credit: 108,362,077
RAC: 4,510
Message 70925 - Posted: 27 Jun 2021, 4:02:23 UTC - in response to Message 70924.  
Last modified: 27 Jun 2021, 4:03:19 UTC

Validation compares returned results to within a certain degree of precision. Unfortunately, different hardware can in some cases produce slightly different calculation results (depending [mostly?] on rounding methods), and for certain data sets these discrepancies get bigger and bigger as the results approach optimization.
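
As a rough illustration of the rounding point above (not the project's actual validator), the Python sketch below adds the same three numbers in two different orders, much as two different machines might, and then compares the totals under a loose and a strict tolerance. The function names and tolerance values are assumptions made up for this example.

```python
import math

def sum_in_order(values):
    """Add the values left to right, rounding after every addition."""
    total = 0.0
    for v in values:
        total += v
    return total

def results_agree(a, b, rel_tol):
    """Hypothetical validator check: do two results match within rel_tol?"""
    return math.isclose(a, b, rel_tol=rel_tol)

values = [0.1, 0.2, 0.3]

# Two "hosts" that accumulate the same terms in a different order.
host_a = sum_in_order(values)
host_b = sum_in_order(list(reversed(values)))

# The totals differ in the last bits because floating-point
# addition is not associative.
print(host_a == host_b)

# A loose tolerance accepts the pair; a very strict one rejects it.
print(results_agree(host_a, host_b, rel_tol=1e-12))
print(results_agree(host_a, host_b, rel_tol=1e-17))
```

Whether a pair of results validates therefore depends on how large the accumulated difference is relative to the tolerance the validator uses, which is consistent with errors appearing only on some data sets.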

If you look at the most recent News item, "New Separation Runs 6/9/2021", Tom Donlon introduced the newest set of runs thus:
"I've just put some new separation runs up on the server. Remember those stripe 84 and 85 runs that would start to throw validate errors as they became more optimized? I've been testing and comparing runs on different builds and *hopefully* that problem has been resolved."

And (as Tom points out) this is not a new issue -- it seems to be inherent in the data!

Hope this helps...

Cheers - Al.
Ron Kosinski
Joined: 15 Mar 20
Posts: 9
Credit: 250,085,312
RAC: 12,444
Message 70927 - Posted: 27 Jun 2021, 12:44:29 UTC - in response to Message 70925.  
Last modified: 27 Jun 2021, 13:07:10 UTC

Hi Al,
Thanks for the info. I guess there is nothing I can do on my end to stop this.
Should I post this in Tom's thread too?
alanb1951

Joined: 16 Mar 10
Posts: 213
Credit: 108,362,077
RAC: 4,510
Message 70934 - Posted: 27 Jun 2021, 20:07:13 UTC - in response to Message 70927.  

> Hi Al,
> Thanks for the info. I guess there is nothing I can do on my end to stop this.
> Should I post this in Tom's thread too?

Looks like he's picked up on this, so I suspect he'll ask for more evidence if he needs it...

Cheers - Al.
Keith Myers
Joined: 24 Jan 11
Posts: 715
Credit: 555,441,958
RAC: 38,704
Message 70935 - Posted: 28 Jun 2021, 18:59:08 UTC - in response to Message 70934.  

He's going to keep trying to release these Stripe 84 and 85 tasks because they are the last bit of data he needs to finish his paper.
He thought he had fixed the problem with those datasets, but the validate errors have recurred. He needs to take another crack at it.


©2024 Astroinformatics Group