Welcome to MilkyWay@home

Clearing the Queue

Message boards : News : Clearing the Queue
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Sidd
Project developer
Project tester
Project scientist

Send message
Joined: 19 May 14
Posts: 73
Credit: 356,131
RAC: 0
Message 66695 - Posted: 18 Oct 2017, 19:21:21 UTC

Hey All,

We will have to clear the workunit queue for both separation and nbody. This will cause many invalid workunits. We apologize for this, as this is the only way to clear the system for it to return to normal operation.


Thanks for your continued support,
Jake and Sidd
ID: 66695 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile TimeRanger

Send message
Joined: 31 Oct 10
Posts: 83
Credit: 38,632,375
RAC: 0
Message 66697 - Posted: 18 Oct 2017, 19:39:48 UTC - in response to Message 66695.  

I have quite a few that have resulted in ERROR due to being Cancelled By Server, apparently without running.
ID: 66697 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Hubbie

Send message
Joined: 30 Nov 16
Posts: 1
Credit: 5,245,286
RAC: 0
Message 66698 - Posted: 18 Oct 2017, 19:47:43 UTC - in response to Message 66695.  

Hi guys,

I don't know if you're aware/it's expected that already validated workunits are being cancelled now also. If so, please disregard my post.

For example http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1507357863

Kind regards,

Hubbie
ID: 66698 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mark W. Patton
Avatar

Send message
Joined: 24 Apr 09
Posts: 13
Credit: 66,932,064
RAC: 0
Message 66699 - Posted: 18 Oct 2017, 22:41:21 UTC - in response to Message 66695.  

I'll just be glad when you are up and running at full capacity.
ID: 66699 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile TimeRanger

Send message
Joined: 31 Oct 10
Posts: 83
Credit: 38,632,375
RAC: 0
Message 66700 - Posted: 19 Oct 2017, 2:47:24 UTC - in response to Message 66698.  
Last modified: 19 Oct 2017, 2:50:03 UTC

Same here ... 137 cancelled without running, over 40 completed, Validated, then cancelled ??? Not a single valid unit showing up since the crash - they are all running full length, then getting cancelled.
ID: 66700 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Alan Barnes

Send message
Joined: 30 Nov 13
Posts: 7
Credit: 2,147,568
RAC: 0
Message 66701 - Posted: 19 Oct 2017, 9:35:53 UTC

I have noticed that recently a number of completed work-units (MilkyWay@Homev1.46) have been marked as "Didn't need", only some of which have been allocated credit. Please explain!

Also I obtained a number of "Completed, can't validate" results. Again I suppose this is due to the recent Transitioner problems etc.

Alan Barnes
ID: 66701 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,449
RAC: 22,365
Message 66702 - Posted: 19 Oct 2017, 11:09:58 UTC - in response to Message 66701.  

I have noticed that recently a number of completed work-units (MilkyWay@Homev1.46) have been marked as "Didn't need", only some of which have been allocated credit. Please explain!

Also I obtained a number of "Completed, can't validate" results. Again I suppose this is due to the recent Transitioner problems etc.

Alan Barnes


If you look under your "didn't need" workunits you will see that they each say "errors WU cancelled", so they too were part of the clearing out process.
ID: 66702 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Alan Barnes

Send message
Joined: 30 Nov 13
Posts: 7
Credit: 2,147,568
RAC: 0
Message 66705 - Posted: 19 Oct 2017, 15:22:17 UTC - in response to Message 66702.  

But that doesn't entirely explain why some "Didn't need" WUs which were cancelled attracted credit whilst others didn't.

AB
ID: 66705 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
aad

Send message
Joined: 30 Mar 09
Posts: 63
Credit: 621,582,726
RAC: 4
Message 66708 - Posted: 19 Oct 2017, 22:54:48 UTC

All the work I did today is in the state of 'invalid'(Completed, can't validate)
or 'Validation inconclusive'(Completed, validation inconclusive)
Not a single wu is validated today!
State: All (2124) · In progress (159) · Validation pending (1) · Validation inconclusive (528) · Valid (393) · Invalid (554) · Error (489)


I stop doin' work for the wastebasket....
ID: 66708 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,449
RAC: 22,365
Message 66709 - Posted: 20 Oct 2017, 1:17:43 UTC - in response to Message 66708.  
Last modified: 20 Oct 2017, 1:19:49 UTC

All the work I did today is in the state of 'invalid'(Completed, can't validate)
or 'Validation inconclusive'(Completed, validation inconclusive)
Not a single wu is validated today!
State: All (2124) · In progress (159) · Validation pending (1) · Validation inconclusive (528) · Valid (393) · Invalid (554) · Error (489)


I stop doin' work for the wastebasket....


I sent this to a teammate the other day:
State: All (3660) · In progress (400) · Validation pending (0) · Validation inconclusive (481) · Valid (2779) · Invalid (0) · Error (0)
Application: All (3660) · MilkyWay@Home (3660) · MilkyWay@Home N-Body Simulation (0)

Right now I see this:
State: All (1148) · In progress (80) · Validation pending (0) · Validation inconclusive (80) · Valid (388) · Invalid (216) · Error (384)
Application: All (1148) · MilkyWay@Home (1148) · MilkyWay@Home N-Body Simulation (0)

Today every single on of the 400 in progress and 481 inconclusive units were cancelled!! On top of that the same thing happened a few months ago!!! I will NEVER put all of my pc's here at MilkyWay again, this is getting ridiculous!!! I put my pc's here by choice and the project can't seem to figure out how to respect the help I/we give them. I have crunched for MANY projects and NOT ONE of them has had the problems that MilkyWay is having recently!!
ID: 66709 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ulrich Metzner
Avatar

Send message
Joined: 11 Apr 15
Posts: 58
Credit: 63,291,127
RAC: 0
Message 66712 - Posted: 20 Oct 2017, 11:37:40 UTC - in response to Message 66708.  

All the work I did today is in the state of 'invalid'(Completed, can't validate)
or 'Validation inconclusive'(Completed, validation inconclusive)
Not a single wu is validated today!
State: All (2124) · In progress (159) · Validation pending (1) · Validation inconclusive (528) · Valid (393) · Invalid (554) · Error (489)


I stop doin' work for the wastebasket....


Yes, there is something fishy going on:
https://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1529043651

For example:
2156510831 734325 19 Oct 2017, 6:02:10 UTC 19 Oct 2017, 8:54:50 UTC Completed, validation inconclusive 106.32 29.42 pending MilkyWay@Home v1.46 (opencl_ati_101)
2156604339 745869 19 Oct 2017, 8:55:38 UTC 19 Oct 2017, 10:25:45 UTC Completed, validation inconclusive 228.69 11.73 pending MilkyWay@Home v1.46 (opencl_nvidia_101)
2156652994 616064 19 Oct 2017, 10:27:28 UTC 20 Oct 2017, 10:46:29 UTC Completed, validation inconclusive 6,490.58 5,791.41 pending MilkyWay@Home v1.46
2157355682 569912 20 Oct 2017, 10:47:47 UTC 20 Oct 2017, 11:19:22 UTC Completed, validation inconclusive 120.84 23.95 pending MilkyWay@Home v1.46 (opencl_ati_101)
2157370914 733142 20 Oct 2017, 11:20:21 UTC 1 Nov 2017, 11:20:21 UTC In progress --- --- --- MilkyWay@Home v1.46 (opencl_nvidia_101)

I think in the next step the WU is cancelled, because of too many results... :/

Have also stopped crunching until this is resolved.
Aloha, Uli

ID: 66712 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
HassanShebli

Send message
Joined: 2 Oct 10
Posts: 74
Credit: 18,362,557
RAC: 0
Message 66713 - Posted: 20 Oct 2017, 11:44:14 UTC
Last modified: 20 Oct 2017, 11:45:31 UTC

I stopped crunching as all have (Completed, can't validate)

When It'll be up again?
ID: 66713 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,449
RAC: 22,365
Message 66714 - Posted: 20 Oct 2017, 12:22:30 UTC

I stopped too, this is the 2nd time this has happened this year, 500 workunits of mine each time, just gone poof all the work for nothing.
ID: 66714 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ulrich Metzner
Avatar

Send message
Joined: 11 Apr 15
Posts: 58
Credit: 63,291,127
RAC: 0
Message 66716 - Posted: 20 Oct 2017, 18:06:18 UTC

Ok, since nobody cares, i aborted all remaining WUs and set milkyway to nnt.
I'm out until further notice.
Aloha, Uli

ID: 66716 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile TimeRanger

Send message
Joined: 31 Oct 10
Posts: 83
Credit: 38,632,375
RAC: 0
Message 66717 - Posted: 20 Oct 2017, 18:50:29 UTC - in response to Message 66712.  

All the work I did today is in the state of 'invalid'(Completed, can't validate)
or 'Validation inconclusive'(Completed, validation inconclusive)
Not a single wu is validated today!
State: All (2124) · In progress (159) · Validation pending (1) · Validation inconclusive (528) · Valid (393) · Invalid (554) · Error (489)


I stop doin' work for the wastebasket....


Yes, there is something fishy going on:
https://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1529043651

For example:
2156510831 734325 19 Oct 2017, 6:02:10 UTC 19 Oct 2017, 8:54:50 UTC Completed, validation inconclusive 106.32 29.42 pending MilkyWay@Home v1.46 (opencl_ati_101)
2156604339 745869 19 Oct 2017, 8:55:38 UTC 19 Oct 2017, 10:25:45 UTC Completed, validation inconclusive 228.69 11.73 pending MilkyWay@Home v1.46 (opencl_nvidia_101)
2156652994 616064 19 Oct 2017, 10:27:28 UTC 20 Oct 2017, 10:46:29 UTC Completed, validation inconclusive 6,490.58 5,791.41 pending MilkyWay@Home v1.46
2157355682 569912 20 Oct 2017, 10:47:47 UTC 20 Oct 2017, 11:19:22 UTC Completed, validation inconclusive 120.84 23.95 pending MilkyWay@Home v1.46 (opencl_ati_101)
2157370914 733142 20 Oct 2017, 11:20:21 UTC 1 Nov 2017, 11:20:21 UTC In progress --- --- --- MilkyWay@Home v1.46 (opencl_nvidia_101)

I think in the next step the WU is cancelled, because of too many results... :/

Have also stopped crunching until this is resolved.


I just set No New Tasks also. I haven't received a credit since early AM on the 18th - everything since then - ZILCH.
ID: 66717 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Cliff
Avatar

Send message
Joined: 28 Nov 14
Posts: 51
Credit: 86,696,721
RAC: 0
Message 66718 - Posted: 21 Oct 2017, 0:51:59 UTC

I have set my computers to NNT, aborted those WU on my main rig and will NOT crunch any MW@H tasks until I am SURE that they will be handled correctly.

Everyone crunching for this project has had their computer time and the money spent paying for the power running their computer wasted.

I find this disgraceful and will not crunch until I can be sure I'm not throwing good money after bad henceforth.
Regards,
Cliff.
--
Been there Done That, still no Damn T-Shirt
ID: 66718 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Avlin

Send message
Joined: 27 Feb 10
Posts: 1
Credit: 19,709,885
RAC: 1,535
Message 66719 - Posted: 21 Oct 2017, 7:41:58 UTC
Last modified: 21 Oct 2017, 7:44:37 UTC

You would at least score any points for those tasks that were cheated, but because of your fault they were not recognized as correct.

Assignments are rejected by hundreds, and the CPU time is spent on them.

Because now all the dissatisfied will be cut off from the project and there will be no one to cheat the tasks.
ID: 66719 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,449
RAC: 22,365
Message 66720 - Posted: 21 Oct 2017, 11:06:26 UTC - in response to Message 66719.  

You would at least score any points for those tasks that were cheated, but because of your fault they were not recognized as correct.

Assignments are rejected by hundreds, and the CPU time is spent on them.

Because now all the dissatisfied will be cut off from the project and there will be no one to cheat the tasks.


I think you meant to use the word 'crunched' instead of "cheated", but yes I knew that.

But as I said earlier this same scenario played out a few months ago where hundreds of tasks were lost by me alone, not counting all the tasks lost by others that were all in various stages of being crunched. I only keep a 1.5 day cache of workunits on my pc's, that's still alot considering I can do a gpu unit in about 70 seconds and to lose them twice in a few months just isn't worth my time and effort anymore. I even contributed cash to this project at one time.
ID: 66720 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Wisesooth

Send message
Joined: 2 Oct 14
Posts: 43
Credit: 54,799,473
RAC: 1,632
Message 66723 - Posted: 21 Oct 2017, 16:13:13 UTC

"To err is human; to forgive is not company policy." Is that how it goes?

Let's plant our personal feelings aside and look at the problem. Once we discover what is wrong and what is right, let us help these budding academics instead of rejecting them as a person.

1. These points and "attaguy" badges are worth nothing but bragging rights. That stuff is for gamers, not us.
2. "No more work" will not help them clear the queue. It just postpones a solution to the problem.
3. The nature of the work they do is heuristic, not deterministic, which does not guarantee convergence. Third-order differential equations are difficult to solve. Tensors are even more difficult to solve. Let's cut these people some slack, shall we?
4. We have the option to abort what comes in by user. Let's do that.
5. Sidd and Jake, have you considered a "cleaner" project? This could initiate an algorithm to identify and compile a profile of tasks to kill and then automatically initiate an "abort by user" message. Those of us who want to help you clear the queue could add the cleaner project to their list of projects. That would clear the queue. When tasks stop coming in, shutdown the cleaner project.
ID: 66723 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rbrahn

Send message
Joined: 16 Jul 17
Posts: 6
Credit: 222,105,326
RAC: 0
Message 66724 - Posted: 21 Oct 2017, 16:20:29 UTC - in response to Message 66723.  

Hear hear.
I'm still crunching in the hopes that invalidating WUs helps clear the queue and gets us back to normal operation faster. I could delete the WUs but I fear that won't help clear them (they appear to be aborting once about 7 comp attempts fail to validate). If someone tells me to start aborting, I'll do that, until then, ragequitting doesn't get anyone nearer a solution.
ID: 66724 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : News : Clearing the Queue

©2024 Astroinformatics Group