Welcome to MilkyWay@home

High number of validation inconclusive (pending)

HRFMguy
Joined: 12 Nov 21
Posts: 236
Credit: 575,038,236
RAC: 4,912
Message 71928 - Posted: 11 Mar 2022, 21:08:25 UTC - in response to Message 71927.  

@Jimbocous,

I too have had a bunch of WUs validated and credited with only me as a cruncher. I also agree with your assertion of one WU, with 2 or more computers. I have not asked MW about this, and Tom would know, but I suspect that they have reserved a few computers in-house to also crunch. If this is true, then only 1 external computer would be required. Tom?
ID: 71928 · Rating: 0
Keith Myers
Joined: 24 Jan 11
Posts: 708
Credit: 543,286,755
RAC: 140,281
Message 71930 - Posted: 11 Mar 2022, 22:54:22 UTC

You don't understand the system apparently. For a long while, the admins have used BOINC's "known reliable host" mechanism whereby a task returned by such a designated host is validated without a wingman task.
https://boinc.berkeley.edu/trac/wiki/ProjectOptions#Acceleratingretries

So if your host returns validated results consistently against a wingman task, it gets designated as reliable and can be validated with just the one task. BOINC then periodically checks the host by sending out another task to a different host to test whether the host is still reliable.

If the two tasks match and validate, the "known reliable host" designation is maintained. If the host suddenly errors out a large amount of work, it loses the designation and will need to build up its reputation again with a long run of results validated against wingmen.
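
To make the description above concrete, here is a minimal sketch of that kind of bookkeeping. It is not BOINC's actual code: the `Host` class, its method names, and both thresholds are hypothetical values picked only for illustration.

```python
from dataclasses import dataclass

# Hypothetical thresholds, chosen only for illustration.
VALID_STREAK_FOR_RELIABLE = 10   # validated results needed to earn the designation
ERRORS_TO_LOSE_STATUS = 3        # errors in a row that revoke it


@dataclass
class Host:
    consecutive_valid: int = 0
    recent_errors: int = 0
    reliable: bool = False

    def record_valid(self) -> None:
        """A result from this host matched its wingman and validated."""
        self.consecutive_valid += 1
        self.recent_errors = 0
        if self.consecutive_valid >= VALID_STREAK_FOR_RELIABLE:
            self.reliable = True

    def record_error(self) -> None:
        """A result from this host errored out or failed validation."""
        self.consecutive_valid = 0
        self.recent_errors += 1
        if self.recent_errors >= ERRORS_TO_LOSE_STATUS:
            self.reliable = False

    def needs_wingman(self, spot_check: bool = False) -> bool:
        """Unreliable hosts always get a wingman; reliable ones only on a spot check."""
        return spot_check or not self.reliable
```

With these made-up numbers, a new host needs ten validated results before its tasks can stand alone, and three errors in a row send it back to building up that streak.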
ID: 71930 · Rating: 0
mmonnin
Joined: 2 Oct 16
Posts: 167
Credit: 1,005,839,047
RAC: 48,035
Message 71931 - Posted: 11 Mar 2022, 23:09:10 UTC - in response to Message 71930.  

You don't understand the system apparently. For a long while, the admins have used BOINC's "known reliable host" mechanism whereby a task returned by such a designated host is validated without a wingman task.
https://boinc.berkeley.edu/trac/wiki/ProjectOptions#Acceleratingretries

So if your host returns validated results consistently against a wingman task, it gets designated as reliable and can be validated with just the one task. BOINC then periodically checks the host by sending out another task to a different host to test whether the host is still reliable.

If the two tasks match and validate, the "known reliable host" designation is maintained. If the host suddenly errors out a large amount of work, it loses the designation and will need to build up its reputation again with a long run of results validated against wingmen.


This isn't WCG and that is not used here at MW.
ID: 71931 · Rating: 0
PecosRiverM
Joined: 25 Aug 17
Posts: 12
Credit: 1,228,797,142
RAC: 9,550
Message 71932 - Posted: 12 Mar 2022, 2:22:26 UTC - in response to Message 71931.  

ID: 71932 · Rating: 0
Jimbocous
Joined: 7 Mar 20
Posts: 22
Credit: 104,988,122
RAC: 12,208
Message 71934 - Posted: 12 Mar 2022, 3:10:19 UTC

I may have said "initial replication=1". I meant to refer to "minimum quorum =1".
All the valid stuff I have has a minimum quorum >1.
ID: 71934 · Rating: 0
San-Fernando-Valley
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 71935 - Posted: 12 Mar 2022, 7:39:31 UTC - in response to Message 71930.  

Keith, please, WHO are you addressing with the following:
"You don't understand the system apparently ..."


We, here in Southern California, especially in L.A., are more relaxed and would have said "Here is an explanation of how I (hopefully) correctly understand the validation process at MW ..."

So, just move here, perhaps to San Diego - life is very pleasurable and less aggressive here.

Have a real nice weekend - the surf at Palisade Beach is good for a try tomorrow ...
ID: 71935 · Rating: 0
crashtech
Joined: 2 Oct 16
Posts: 4
Credit: 5,429,908,267
RAC: 110,775
Message 71938 - Posted: 12 Mar 2022, 17:15:39 UTC

I see that the server status page says 5,385,758 units are awaiting validation as of 12 March 2022, 17:00 UTC; I personally have 32,923 pending. Is it time to stop running the project to let the validator catch up? Is it just stuck on a bad WU again?
ID: 71938 · Rating: 0
Septimus
Joined: 8 Nov 11
Posts: 205
Credit: 2,894,249
RAC: 442
Message 71939 - Posted: 12 Mar 2022, 17:47:41 UTC - in response to Message 71938.  

I have nothing like the number of WUs outstanding that you have, and I have been running other stuff.

Not sure how long it will take to shift the backlog, but I would think it will be many days even at the current level, without any more work being done.
ID: 71939 · Rating: 0
Tom Donlon
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 10 Apr 19
Posts: 408
Credit: 120,203,200
RAC: 0
Message 71940 - Posted: 12 Mar 2022, 20:56:26 UTC

Not every WU is cross-validated, because doing so would cut our total crunching power to a half or a third. In short, the more your WUs are successfully validated, the less we cross-validate your WUs, because you're "trusted" more than someone who returns WUs that frequently fail validation. This is part of the TAO (Toolkit for Asynchronous Optimization) algorithm that MW@h uses for its backend.

We do have in-house machines that run MW@h, but they are treated the same as all of your machines. They are machines that we use for other things, but they run MW@h when we are not actively using them.
ID: 71940 · Rating: 0
HRFMguy
Joined: 12 Nov 21
Posts: 236
Credit: 575,038,236
RAC: 4,912
Message 71941 - Posted: 13 Mar 2022, 2:07:03 UTC - in response to Message 71940.  

Woohoo!! I'm trusted!! ;)
ID: 71941 · Rating: 0
Septimus
Joined: 8 Nov 11
Posts: 205
Credit: 2,894,249
RAC: 442
Message 71945 - Posted: 13 Mar 2022, 11:00:06 UTC - in response to Message 71941.  

It looks like any credits that we are getting are not being sent to BOINC stats; my total has stayed the same for at least two days.
ID: 71945 · Rating: 0
mikey
Joined: 8 May 09
Posts: 3319
Credit: 520,336,315
RAC: 21,640
Message 71946 - Posted: 13 Mar 2022, 11:38:10 UTC - in response to Message 71945.  

It looks like any credits that we are getting are not being sent to BOINC stats; my total has stayed the same for at least two days.


You don't have any tasks in progress on either PC, so you are just waiting for a wingman or the server to do its thing. Crunching tasks keeps the RAC more even than not crunching and waiting for the system to work its way through your already completed tasks.
ID: 71946 · Rating: 0
Septimus
Joined: 8 Nov 11
Posts: 205
Credit: 2,894,249
RAC: 442
Message 71947 - Posted: 13 Mar 2022, 12:31:46 UTC - in response to Message 71946.  

Thanks, the BOINC total was updated in the last hour or so.
ID: 71947 · Rating: 0
Septimus
Joined: 8 Nov 11
Posts: 205
Credit: 2,894,249
RAC: 442
Message 71960 - Posted: 15 Mar 2022, 14:28:55 UTC - in response to Message 71947.  

Is anything being validated? I have outstanding WUs going back to 8 March, and the last day anything got validated was 12 March.
ID: 71960 · Rating: 0
Max_Pirx
Joined: 13 Dec 17
Posts: 46
Credit: 2,421,362,376
RAC: 0
Message 71961 - Posted: 15 Mar 2022, 15:22:48 UTC

I have several validated tasks from the 13th and one from yesterday. Seems some validation goes on, but I still have 30+K pending tasks.
ID: 71961 · Rating: 0
San-Fernando-Valley
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 71962 - Posted: 15 Mar 2022, 16:06:59 UTC

Click on "Server Status" under "Computing" to see/check if validation is progressing at all ...
ID: 71962 · Rating: 0
Septimus
Joined: 8 Nov 11
Posts: 205
Credit: 2,894,249
RAC: 442
Message 71963 - Posted: 15 Mar 2022, 16:24:09 UTC - in response to Message 71962.  

It seems to be, but nothing has come my way yet; there is still a backlog of well over 5.6 million.
ID: 71963 · Rating: 0
San-Fernando-Valley
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 71964 - Posted: 15 Mar 2022, 18:35:20 UTC - in response to Message 71963.  

This morning it was approx. 6.5 million.
It dropped to 5,632,627 by 13:02 UTC because there was no work (WUs) available.
At 14:12 UTC it was 5,668,662.
So I was wondering why it was going up again.
It did so because there were suddenly WUs ready to send and the crunchers started, well, crunching!
Now, at 18:29 UTC, it is down to 5,455,986, so it is shrinking, but very slowly.
I guess we just have to be patient.
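
For a back-of-the-envelope feel of what those readings imply, here is a small sketch. It assumes all three readings were taken on 15 March 2022, that the rate between the last two readings stays constant, and that no new results join the queue, none of which is guaranteed. The helper name `drain_rate_per_hour` is made up for this example.

```python
from datetime import datetime

# (time in UTC, results awaiting validation), as read off the server status page.
samples = [
    (datetime(2022, 3, 15, 13, 2), 5_632_627),
    (datetime(2022, 3, 15, 14, 12), 5_668_662),
    (datetime(2022, 3, 15, 18, 29), 5_455_986),
]


def drain_rate_per_hour(earlier, later):
    """Net results validated per hour between two readings (negative if the backlog grew)."""
    (t0, n0), (t1, n1) = earlier, later
    hours = (t1 - t0).total_seconds() / 3600
    return (n0 - n1) / hours


# Rate over the last interval, where the backlog was shrinking.
rate = drain_rate_per_hour(samples[1], samples[2])

# Naive estimate: remaining backlog divided by that rate, converted to days.
days_to_clear = samples[2][1] / rate / 24

print(f"net drain: {rate:,.0f} per hour, naive time to clear: {days_to_clear:.1f} days")
```

Under those assumptions the backlog would take between four and five days to clear, and longer in practice, since crunchers keep returning new results the whole time.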
ID: 71964 · Rating: 0
Septimus
Joined: 8 Nov 11
Posts: 205
Credit: 2,894,249
RAC: 442
Message 71969 - Posted: 16 Mar 2022, 15:38:50 UTC - in response to Message 71964.  

Not sure how often the figures are updated; they have been the same for several hours. Currently 9 systems are down, unable to return WUs.
ID: 71969 · Rating: 0
San-Fernando-Valley
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 71970 - Posted: 16 Mar 2022, 16:24:59 UTC - in response to Message 71969.  

I guess they are having real troubles ...

I have no idea at what intervals or if at all they update at the moment.
Just a while ago the nbody validator was running, but just for a short time.

Would be nice to hear from them - maybe just a short note, well, I suppose no time for that.
ID: 71970 · Rating: 0