Welcome to MilkyWay@home

Odd Validation Result

Message boards : Number crunching : Odd Validation Result
Message board moderation

To post messages, you must log in.

AuthorMessage
Brian Priebe

Send message
Joined: 27 Nov 09
Posts: 108
Credit: 430,760,953
RAC: 0
Message 38361 - Posted: 8 Apr 2010, 5:38:18 UTC

With reference to Workunit 90777892:

Three results were apparently received from HD5xxx class ATI boards: one from a 32-bit platform running stock version 0.23, one from a 64-bit platform also running version 0.23, and one from an anonymous 64-bit platform running version 0.20. I would have thought the two 0.23 versions would form a quorum but they apparently did not. The first version 0.23 result (mine) was rejected while the second 0.23 and the 0.20 result were accepted.

Any idea why this would be?
ID: 38361 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Werkstatt

Send message
Joined: 19 Feb 08
Posts: 350
Credit: 141,284,369
RAC: 0
Message 38363 - Posted: 8 Apr 2010, 9:23:37 UTC - in response to Message 38361.  

Hi,
i checked my results right now. Since i run the stock app (0.23), only very few results were not validated (~ 2% or so), this seems to be acceptable in a phase, where big problem need to be solved. It looks like things are stabilizing now. Rejection-rate before 0.23 was over 30%.

Alexander
ID: 38363 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Zydor
Avatar

Send message
Joined: 24 Feb 09
Posts: 620
Credit: 100,587,625
RAC: 0
Message 38367 - Posted: 8 Apr 2010, 11:15:16 UTC - in response to Message 38361.  

With reference to Workunit 90777892:

Three results were apparently received from HD5xxx class ATI boards: one from a 32-bit platform running stock version 0.23, one from a 64-bit platform also running version 0.23, and one from an anonymous 64-bit platform running version 0.20. I would have thought the two 0.23 versions would form a quorum but they apparently did not. The first version 0.23 result (mine) was rejected while the second 0.23 and the 0.20 result were accepted.

Any idea why this would be?


I've had exactly the same scenario, no idea why. However I suspect all current efforts are focused on the new app and validation related to that, and I can appreciate why that would be, as amongst other things, the new app will resolve the perpetual server crashes due to it not being able to keep up with the required file deletions.

Having achieved a large measure of stability with 0.23 and the validator change, I doubt much else will change until the new app rolls out this week.

Regards
Zy
ID: 38367 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Crunch3r
Volunteer developer
Avatar

Send message
Joined: 17 Feb 08
Posts: 363
Credit: 258,227,990
RAC: 0
Message 38372 - Posted: 8 Apr 2010, 12:32:06 UTC - in response to Message 38367.  

With reference to Workunit 90777892:

Three results were apparently received from HD5xxx class ATI boards: one from a 32-bit platform running stock version 0.23, one from a 64-bit platform also running version 0.23, and one from an anonymous 64-bit platform running version 0.20. I would have thought the two 0.23 versions would form a quorum but they apparently did not. The first version 0.23 result (mine) was rejected while the second 0.23 and the 0.20 result were accepted.

Any idea why this would be?


I've had exactly the same scenario, no idea why. However I suspect all current efforts are focused on the new app and validation related to that, and I can appreciate why that would be, as amongst other things, the new app will resolve the perpetual server crashes due to it not being able to keep up with the required file deletions.

Having achieved a large measure of stability with 0.23 and the validator change, I doubt much else will change until the new app rolls out this week.

Regards
Zy


Same for me too.I got a whole bunch of tasks where two 58xx (0.20b and 0.22) got validated against each other (!) and my result (4870, 0.23) was marked as invalid ...

So there's definetly still something wrong with the validator...



Join Support science! Joinc Team BOINC United now!
ID: 38372 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Emanuel

Send message
Joined: 18 Nov 07
Posts: 280
Credit: 2,442,757
RAC: 0
Message 38374 - Posted: 8 Apr 2010, 13:12:48 UTC - in response to Message 38372.  

Can you really call that a problem with the validator? It seems to be doing its job correctly - it's the older applications that mess things up. Pushing out the new, not backwards compatible applications should fix this, but that doesn't mean the validator is broken. Unless I'm misunderstanding you.
ID: 38374 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Cluster Physik

Send message
Joined: 26 Jul 08
Posts: 627
Credit: 94,940,203
RAC: 0
Message 38377 - Posted: 8 Apr 2010, 13:30:11 UTC - in response to Message 38374.  

Can you really call that a problem with the validator? It seems to be doing its job correctly - it's the older applications that mess things up. Pushing out the new, not backwards compatible applications should fix this, but that doesn't mean the validator is broken. Unless I'm misunderstanding you.

I guess Crunch3r thinks the validator should also check the combination of GPU and application version. If a HD5800 GPU returns a result calculated with anything prior to 0.23, it should be rejected because it is a known fact that this result is not reliable.
It would greatly reduce the number of results wrongly declared as invalid (and wrongly declared as valid, too!) and at the same time increase the pressure on HD5800 owners to update to 0.23.
ID: 38377 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Crunch3r
Volunteer developer
Avatar

Send message
Joined: 17 Feb 08
Posts: 363
Credit: 258,227,990
RAC: 0
Message 38385 - Posted: 8 Apr 2010, 15:05:28 UTC - in response to Message 38377.  
Last modified: 8 Apr 2010, 15:31:13 UTC


I guess Crunch3r thinks the validator should also check the combination of GPU and application version. If a HD5800 GPU returns a result calculated with anything prior to 0.23, it should be rejected because it is a known fact that this result is not reliable.
It would greatly reduce the number of results wrongly declared as invalid (and wrongly declared as valid, too!) and at the same time increase the pressure on HD5800 owners to update to 0.23.


Yes, your're right about that. Should be easy to implement a app version check server side... since each work request no matther if "stock" or anonymous contains a version tag <version_num>22</version_num>.

FWIW, the best way to handle the sittuation would have been to introcude a new app like it is planed for milkyway3. That would have prevented sending out WUs to hosts runing the faulty apps in the first place since everyone would have been forced to upgrade...

Join Support science! Joinc Team BOINC United now!
ID: 38385 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Brian Priebe

Send message
Joined: 27 Nov 09
Posts: 108
Credit: 430,760,953
RAC: 0
Message 38402 - Posted: 8 Apr 2010, 18:13:05 UTC - in response to Message 38385.  

Yes, your're right about that. Should be easy to implement a app version check server side... since each work request no matther if "stock" or anonymous contains a version tag <version_num>22</version_num>.
In another post, Travis said he tried that and it just caused the server to crash more often.
ID: 38402 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Simplex0
Avatar

Send message
Joined: 11 Nov 07
Posts: 232
Credit: 178,229,009
RAC: 0
Message 38411 - Posted: 8 Apr 2010, 19:07:13 UTC - in response to Message 38385.  


I guess Crunch3r thinks the validator should also check the combination of GPU and application version. If a HD5800 GPU returns a result calculated with anything prior to 0.23, it should be rejected because it is a known fact that this result is not reliable.
It would greatly reduce the number of results wrongly declared as invalid (and wrongly declared as valid, too!) and at the same time increase the pressure on HD5800 owners to update to 0.23.


Yes, your're right about that. Should be easy to implement a app version check server side... since each work request no matther if "stock" or anonymous contains a version tag <version_num>22</version_num>.

FWIW, the best way to handle the sittuation would have been to introcude a new app like it is planed for milkyway3. That would have prevented sending out WUs to hosts runing the faulty apps in the first place since everyone would have been forced to upgrade...


Absoluthly. I think it shows a bad atetude to let people continue to produse good results that still will be marked as 'Invalid' and wasted.
I can't understand why it is more importent to let the validator running than that it is working correct.
ID: 38411 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 30 Aug 07
Posts: 2046
Credit: 26,480
RAC: 0
Message 38417 - Posted: 8 Apr 2010, 19:47:46 UTC - in response to Message 38411.  


Absoluthly. I think it shows a bad atetude to let people continue to produse good results that still will be marked as 'Invalid' and wasted.
I can't understand why it is more importent to let the validator running than that it is working correct.


Well if the validator is crashing every two hours, or unable to keep up with the stream of workunits; it's going to effect a lot more people (and a lot less work is going to be done). Right now the number of invalid results is < 5%; and I think we've reached critical mass so that the vast majority of the ones being flagged as invalid now are actually invalid.
ID: 38417 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Simplex0
Avatar

Send message
Joined: 11 Nov 07
Posts: 232
Credit: 178,229,009
RAC: 0
Message 38419 - Posted: 8 Apr 2010, 20:34:04 UTC - in response to Message 38417.  
Last modified: 8 Apr 2010, 20:52:51 UTC


Absoluthly. I think it shows a bad atetude to let people continue to produse good results that still will be marked as 'Invalid' and wasted.
I can't understand why it is more importent to let the validator running than that it is working correct.


Well if the validator is crashing every two hours, or unable to keep up with the stream of workunits; it's going to effect a lot more people (and a lot less work is going to be done). Right now the number of invalid results is < 5%; and I think we've reached critical mass so that the vast majority of the ones being flagged as invalid now are actually invalid.


Based on the information in this thread would you say that the results
that are marked as 'Valid' really IS valid?
Is the validator also checking that the correct apps has ben used?
ID: 38419 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Odd Validation Result

©2024 Astroinformatics Group