Message boards :
Number crunching :
Tasks Completed, but validation tasks remain Unsent
Message board moderation
Author | Message |
---|---|
Send message Joined: 5 Dec 15 Posts: 4 Credit: 875,220 RAC: 48 |
Two of my tasks are Completed, validation inconclusive: Not running any more tasks for now. LLP, PhD, Prof. Engr. I think => I THINK I am. My thinking is not the source of my being, nor does it prove my existence to you. The Living Word of God World Youth Day |
Send message Joined: 8 May 09 Posts: 3339 Credit: 524,010,781 RAC: 0 |
Two of my tasks are Completed, validation inconclusive: Validation Inconclusive just means 'waiting for a wingman', keep crunching and they will get validated in the end. MW only sends out the original task then if it needs a wingman task it generates it but it goes at the end of the list of available tasks, so they can take awhile to validate. |
Send message Joined: 18 Jan 22 Posts: 1 Credit: 1,009,684 RAC: 2,382 |
Everything ive completed in the last 10 days or so is "Validation Inconclusive"... I'm still getting new tasks (slowly) but its happening. |
Send message Joined: 19 Jul 10 Posts: 624 Credit: 19,297,971 RAC: 2,484 |
Yes, this is a known "issue", which should resolve itself in the next 2-4 weeks I guess. The more we crunch, the sooner this will happen, check the "Admin Updates Discussion" thread in the News section. |
Send message Joined: 5 Dec 15 Posts: 4 Credit: 875,220 RAC: 48 |
MW only sends out the original task then if it needs a wingman task it generates itboth WUs say minimum quorum 1 initial replication 2 ...not sure what's the difference between quorum and replication, but quite obviously a send task IS needed to complete validation. I've not heard the term 'wingman' task, but the task to complete the validation for BOTH WUs had already been generated, but neither has been sent to be run ... both have status Unsent. LLP, PhD, Prof. Engr. I think => I THINK I am. My thinking is not the source of my being, nor does it prove my existence to you. The Living Word of God World Youth Day |
Send message Joined: 8 May 09 Posts: 3339 Credit: 524,010,781 RAC: 0 |
MW only sends out the original task then if it needs a wingman task it generates itboth WUs sayminimum quorum 1 initial replication 2 No a wingman is not always needed, apparently if you return I think the number is 10 tasks in a row that are valid then the Server thinks your pc is trustworthy and it will only periodically send out a wingman task for that pc. BUT as soon as your wingman proves your pc is not trustworthy anymore then the process starts all over from zero again. Becoming non trustworthy can be from dust, overclocking, components wearing out etc etc. Link tried to explain WHY they haven't been sent out yet, the Server made a million tasks and all wingman tasks go at the end of the list, so in a couple of weeks we should be getting ALOT of _1 tasks, the initial tasks end in _0 and then everyone tasks should be valid or the Project will send out a 3rd task to try and figure out which of the first 2 pc's has the right answer. |
Send message Joined: 1 Jan 17 Posts: 37 Credit: 111,034,474 RAC: 35,537 |
Link wrote: Yes, this is a known "issue", which should resolve itself in the next 2-4 weeks I guess. The more we crunch, the sooner this will happen, check the "Admin Updates Discussion" thread in the News section.Here is an estimation of my own, according to which the pile of 'validation inconclusive' will last for about two months:
Does this make sense? |
Send message Joined: 19 Jul 10 Posts: 624 Credit: 19,297,971 RAC: 2,484 |
Does this make sense?Yes, but it doesn't take into account, that we have two types of tasks here: the long tasks, which need 96000-116000 CPU seconds on my computer and for which we get around 1000 credits and the short tasks, which need less than 10000 CPU seconds (and as you can see, they are the majority, at least on my list). The oldest _1 waiting for to be send out from my WU list is 935953561, created on the 17th January. The oldest _0 I got on the 16th is 932856512 The newest _0 I got today (29th) is 935473682 That means we processed 2,617,170 tasks in 13 days. That's 201,321 tasks per day. There are 479,879 tasks left between the 935473682 I got today and 935953561. At 201,321 tasks per day, 935953561 should be sent out in about 2.5 days. Does that make more sense? |
Send message Joined: 1 Jan 17 Posts: 37 Credit: 111,034,474 RAC: 35,537 |
Link wrote: Yes, but it doesn't take into account, that we have two types of tasks here: the long tasks, which need 96000-116000 CPU seconds on my computer and for which we get around 1000 credits and the short tasks, which need less than 10000 CPU seconds (and as you can see, they are the majority, at least on my list).Good point. I missed these because hardly any of them can be found among the valid tasks which are currently left in the database. I think these short tasks get ~100 credits. (source) The current top host has got 4000 inconclusive results by now. A couple of hours ago I copy+pasted 500 of its then most recent inconclusive results into a spreadsheet. Of these, 222 took 3,300...3,600 CPU seconds and 278 took 35,600...45,400 CPU seconds. I.e. this host had 44 % short tasks and 56 % long tasks recently. Let's say it's fifty-fifty long and short tasks, which gives ~600 average credits per result. If this was the same earlier this month, then the ~14,000,000 credits/day before January 17 mean ~23,000 valid results per day. And to get ~690,000 tasks returned validly would take 30 days (4 weeks) = until mid February if that rate remained constant. Edit: The average during 2023-12-21...2024-01-17 actually was 17,000,000 credits/day = almost 30,000 valid results/day, which would translate to 24 days for 690,000 tasks. Link wrote: The oldest _0 I got on the 16th is 932856512We need the rate of successfully computed results. Your figure also includes all aborted tasks, computation errors, and timeouts. I am not saying though that we have a huge ratio of error returns; I don't know why your figure is 7...9 times as much as mine. Edit 2: Oh wait. Your figure is possibly skewed a lot because during late January 16 ... mid January 24, there were 3 million tasks-ready-to-send on the server. 2.3 million of those no longer exist since January 24 because Kevin deleted them, but they may be included in the workunit numbers which you found. |
Send message Joined: 19 Jul 10 Posts: 624 Credit: 19,297,971 RAC: 2,484 |
We need the rate of successfully computed results. Your figure also includes all aborted tasks, computation errors, and timeouts.No, for the rate at which the tasks are sent it doesn't matter what happens with them later on the clients. Oh wait. Your figure is possibly skewed a lot because during late January 16 ... mid January 24, there were 3 million tasks-ready-to-send on the server. 2.3 million of those no longer exist since January 24 because Kevin deleted them, but they may be included in the workunit numbers which you found.Yes. |
Send message Joined: 19 Jul 10 Posts: 624 Credit: 19,297,971 RAC: 2,484 |
So let's try again... 932856512, sent 16 Jan 2024, 20:40:44 UTC 933039926, sent 23 Jan 2024, 11:27:26 UTC That are 183,414 tasks sent out in about 177 hours. 935342117, sent 24 Jan 2024, 13:12:42 UTC 935473011, sent 29 Jan 2024, 16:29:58 UTC That are 130,894 tasks sent out in about 123 hours. So total 314,308 tasks in 300 hours, or 1,047.7 tasks per hour. There are 480,550 tasks left between the 935473011 and 935953561 (apparently the server isn't sending the tasks exactly after their numbers, but close enough). At 1,047.7 tasks per hour, 935953561 should be sent out about 458 hours (about 19 days) after 935473011. So around the 17th February. I think I can't get it more exactly than that. Did I still miss something? |
Send message Joined: 9 Aug 21 Posts: 1 Credit: 3,140,897 RAC: 1,356 |
Everything ive completed in the last 10 days or so is "Validation Inconclusive"... I'm still getting new tasks (slowly) but its happening. I have been experiencing a similar situation on my computers since January 16, 2024 |
Send message Joined: 1 Jan 17 Posts: 37 Credit: 111,034,474 RAC: 35,537 |
xii5ku wrote: ~23,000 valid results per day [typically returned until January 17]Link wrote: 1,047.7 tasks per hour [being assigned to hosts on average during the last few days]I.e. our updated estimations are in the same ballpark. Meaning that we are currently on the way to get workunits validly completed again sometime in mid February. xii5ku wrote: We need the rate of successfully computed results.Link wrote: No, for the rate at which the tasks are sent it doesn't matter what happens with them later on the clients.There are two related, but not identical questions: When will the server start to assign _1 tasks to hosts? When will the server receive _1 results which match _0 results so that successful validations are happening again? I for one was more occupied by the latter than by the former question. However, concerning this latter question, I admit on second thought that intermittent occurrences of error returns don't actually defer the point in time at which validations start happening again (ignoring unrealistic corner cases). They merely reduce the rate of validations, after validations started happening again. -------- February Thunder wrote: I have been experiencing a similar situation on my computers since January 16, 2024The very same is happening on all hosts which are currently active, without exception. It is because there was accidentally a very unusual amount of new work queued on the server at once, combined with how MilkyWay@Home is implementing workunit validation. |
Send message Joined: 5 Sep 09 Posts: 9 Credit: 559,475,854 RAC: 65,758 |
Very good thread gentlemen. You validated many of my thoughts. So...... how hard would it be for the admin to tell us how many of the 690,000 tasks "ready to send" are initial tasks (_0) and how many are "validation" tasks (wingman)(_1) tasks. If these tasks are just files sitting in a UNIX/Linux directory they should be easy to count. If posted once a week, we could see our progress. |
Send message Joined: 19 Jul 10 Posts: 624 Credit: 19,297,971 RAC: 2,484 |
No, they are not just files, they are database entries. Not sure how hard it is to find them, but considering that they still have not been able to find and delete old separation WUs from 2021, I guess we don't need to hope for official numbers of _0 and _1 tasks. But if you assume, that the 480,550 tasks left between the 935473011 and 935953561 are nearly all _0 and everything else is _1, you will be pretty close to the truth. Or if we take my newest task 935501281, than there are about 452,280 _0s left, probably a bit less than that. |
©2024 Astroinformatics Group