Clearing the Queue

Sidd
Project developer
Project tester
Project scientist
Joined: 19 May 14
Posts: 60
Credit: 322,410
RAC: 855

Message 66695 - Posted: 18 Oct 2017, 19:21:21 UTC

Hey All,

We will have to clear the workunit queue for both separation and nbody. This will cause many invalid workunits. We apologize for this, but it is the only way to clear the system so that it can return to normal operation.


Thanks for your continued support,
Jake and Sidd

TimeRanger
Joined: 31 Oct 10
Posts: 74
Credit: 22,897,766
RAC: 28,409

Message 66697 - Posted: 18 Oct 2017, 19:39:48 UTC - in response to Message 66695.

I have quite a few that have resulted in ERROR due to being Cancelled By Server, apparently without running.

Hubbie
Joined: 30 Nov 16
Posts: 1
Credit: 1,000,141
RAC: 3,294

Message 66698 - Posted: 18 Oct 2017, 19:47:43 UTC - in response to Message 66695.

Hi guys,

I don't know whether you're aware of it, or whether it's expected, but already validated workunits are now being cancelled as well. If so, please disregard my post.

For example http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1507357863

Kind regards,

Hubbie

Mark W. Patton
Joined: 24 Apr 09
Posts: 13
Credit: 65,216,932
RAC: 19,097

Message 66699 - Posted: 18 Oct 2017, 22:41:21 UTC - in response to Message 66695.

I'll just be glad when you are up and running at full capacity.

TimeRanger
Joined: 31 Oct 10
Posts: 74
Credit: 22,897,766
RAC: 28,409

Message 66700 - Posted: 19 Oct 2017, 2:47:24 UTC - in response to Message 66698.
Last modified: 19 Oct 2017, 2:50:03 UTC

Same here ... 137 cancelled without running, over 40 completed, Validated, then cancelled ??? Not a single valid unit showing up since the crash - they are all running full length, then getting cancelled.

Alan Barnes
Joined: 30 Nov 13
Posts: 7
Credit: 946,792
RAC: 2,479

Message 66701 - Posted: 19 Oct 2017, 9:35:53 UTC

I have noticed that recently a number of completed work-units (MilkyWay@Home v1.46) have been marked as "Didn't need", only some of which have been allocated credit. Please explain!

Also I obtained a number of "Completed, can't validate" results. Again I suppose this is due to the recent Transitioner problems etc.

Alan Barnes

mikey
Joined: 8 May 09
Posts: 2032
Credit: 179,632,378
RAC: 238,197

Message 66702 - Posted: 19 Oct 2017, 11:09:58 UTC - in response to Message 66701.

If you look under your "didn't need" workunits you will see that they each say "errors WU cancelled", so they too were part of the clearing out process.

Alan Barnes
Joined: 30 Nov 13
Posts: 7
Credit: 946,792
RAC: 2,479

Message 66705 - Posted: 19 Oct 2017, 15:22:17 UTC - in response to Message 66702.

But that doesn't entirely explain why some "Didn't need" WUs which were cancelled attracted credit whilst others didn't.

AB

aad
Joined: 30 Mar 09
Posts: 51
Credit: 248,234,327
RAC: 327,461

Message 66708 - Posted: 19 Oct 2017, 22:54:48 UTC

All the work I did today is in the state 'Invalid' (Completed, can't validate)
or 'Validation inconclusive' (Completed, validation inconclusive).
Not a single WU has been validated today!

State: All (2124) · In progress (159) · Validation pending (1) · Validation inconclusive (528) · Valid (393) · Invalid (554) · Error (489)


I stop doin' work for the wastebasket....
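
For anyone who wants to put numbers on a status summary like the one above, here is a minimal sketch in Python. It assumes the line is copied from the task list exactly as shown, with "·" between entries and each count in parentheses. The function names `parse_state_line` and `shares` are made up for illustration and are not part of BOINC.

```python
import re

def parse_state_line(line):
    """Parse 'State: All (2124) · In progress (159) · ...' into {label: count}."""
    body = line.split(":", 1)[1]          # drop the leading 'State' label
    counts = {}
    for part in body.split("·"):
        m = re.fullmatch(r"\s*(.+?)\s*\((\d[\d,]*)\)\s*", part)
        if m:
            counts[m.group(1)] = int(m.group(2).replace(",", ""))
    return counts

def shares(counts):
    """Return each state's fraction of the 'All' total."""
    total = counts["All"]
    return {label: n / total for label, n in counts.items() if label != "All"}

STATE_LINE = ("State: All (2124) · In progress (159) · Validation pending (1) · "
              "Validation inconclusive (528) · Valid (393) · Invalid (554) · Error (489)")

counts = parse_state_line(STATE_LINE)
for label, share in shares(counts).items():
    print(f"{label}: {counts[label]} ({share:.1%})")
```

For the counts quoted above, Invalid plus Error comes to 1,043 of 2,124 tasks, a little over 49%, while Valid is 393, about 18.5%.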

mikey
Joined: 8 May 09
Posts: 2032
Credit: 179,632,378
RAC: 238,197

Message 66709 - Posted: 20 Oct 2017, 1:17:43 UTC - in response to Message 66708.
Last modified: 20 Oct 2017, 1:19:49 UTC

I sent this to a teammate the other day:
State: All (3660) · In progress (400) · Validation pending (0) · Validation inconclusive (481) · Valid (2779) · Invalid (0) · Error (0)
Application: All (3660) · MilkyWay@Home (3660) · MilkyWay@Home N-Body Simulation (0)

Right now I see this:
State: All (1148) · In progress (80) · Validation pending (0) · Validation inconclusive (80) · Valid (388) · Invalid (216) · Error (384)
Application: All (1148) · MilkyWay@Home (1148) · MilkyWay@Home N-Body Simulation (0)

Today every single one of the 400 in-progress and 481 inconclusive units was cancelled!! On top of that, the same thing happened a few months ago!!! I will NEVER put all of my PCs here at MilkyWay again; this is getting ridiculous!!! I put my PCs here by choice, and the project can't seem to figure out how to respect the help I/we give them. I have crunched for MANY projects and NOT ONE of them has had the problems that MilkyWay is having recently!!

Ulrich Metzner
Joined: 11 Apr 15
Posts: 40
Credit: 16,078,599
RAC: 32,754

Message 66712 - Posted: 20 Oct 2017, 11:37:40 UTC - in response to Message 66708.

Yes, there is something fishy going on:
https://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=1529043651

For example:
Task | Computer | Sent | Time reported or deadline | Status | Run time (sec) | CPU time (sec) | Credit | Application
2156510831 | 734325 | 19 Oct 2017, 6:02:10 UTC | 19 Oct 2017, 8:54:50 UTC | Completed, validation inconclusive | 106.32 | 29.42 | pending | MilkyWay@Home v1.46 (opencl_ati_101)
2156604339 | 745869 | 19 Oct 2017, 8:55:38 UTC | 19 Oct 2017, 10:25:45 UTC | Completed, validation inconclusive | 228.69 | 11.73 | pending | MilkyWay@Home v1.46 (opencl_nvidia_101)
2156652994 | 616064 | 19 Oct 2017, 10:27:28 UTC | 20 Oct 2017, 10:46:29 UTC | Completed, validation inconclusive | 6,490.58 | 5,791.41 | pending | MilkyWay@Home v1.46
2157355682 | 569912 | 20 Oct 2017, 10:47:47 UTC | 20 Oct 2017, 11:19:22 UTC | Completed, validation inconclusive | 120.84 | 23.95 | pending | MilkyWay@Home v1.46 (opencl_ati_101)
2157370914 | 733142 | 20 Oct 2017, 11:20:21 UTC | 1 Nov 2017, 11:20:21 UTC | In progress | --- | --- | --- | MilkyWay@Home v1.46 (opencl_nvidia_101)

I think in the next step the WU is cancelled, because of too many results... :/

Have also stopped crunching until this is resolved.
____________
Aloha, Uli

HassanShebli
Joined: 2 Oct 10
Posts: 72
Credit: 17,079,518
RAC: 2

Message 66713 - Posted: 20 Oct 2017, 11:44:14 UTC
Last modified: 20 Oct 2017, 11:45:31 UTC

I stopped crunching, as all my tasks show "Completed, can't validate".

When will it be up again?

mikey
Joined: 8 May 09
Posts: 2032
Credit: 179,632,378
RAC: 238,197

Message 66714 - Posted: 20 Oct 2017, 12:22:30 UTC

I stopped too. This is the 2nd time this has happened this year, 500 workunits of mine each time, just gone poof, all the work for nothing.

Ulrich Metzner
Joined: 11 Apr 15
Posts: 40
Credit: 16,078,599
RAC: 32,754

Message 66716 - Posted: 20 Oct 2017, 18:06:18 UTC

OK, since nobody cares, I aborted all remaining WUs and set MilkyWay to NNT (No New Tasks).
I'm out until further notice.
____________
Aloha, Uli

TimeRanger
Joined: 31 Oct 10
Posts: 74
Credit: 22,897,766
RAC: 28,409

Message 66717 - Posted: 20 Oct 2017, 18:50:29 UTC - in response to Message 66712.

I just set No New Tasks also. I haven't received a credit since early AM on the 18th - everything since then - ZILCH.

Cliff
Joined: 28 Nov 14
Posts: 45
Credit: 53,851,606
RAC: 114,161

Message 66718 - Posted: 21 Oct 2017, 0:51:59 UTC

I have set my computers to NNT, aborted those WU on my main rig and will NOT crunch any MW@H tasks until I am SURE that they will be handled correctly.

Everyone crunching for this project has had their computer time and the money spent paying for the power running their computer wasted.

I find this disgraceful and will not crunch until I can be sure I'm not throwing good money after bad henceforth.
____________
Regards,
Cliff.
--
Been there Done That, still no Damn T-Shirt

Avlin
Joined: 27 Feb 10
Posts: 1
Credit: 10,098,286
RAC: 6,265

Message 66719 - Posted: 21 Oct 2017, 7:41:58 UTC
Last modified: 21 Oct 2017, 7:44:37 UTC

You would at least score any points for those tasks that were cheated, but because of your fault they were not recognized as correct.

Assignments are rejected by hundreds, and the CPU time is spent on them.

Because now all the dissatisfied will be cut off from the project and there will be no one to cheat the tasks.

mikey
Joined: 8 May 09
Posts: 2032
Credit: 179,632,378
RAC: 238,197

Message 66720 - Posted: 21 Oct 2017, 11:06:26 UTC - in response to Message 66719.

I think you meant to use the word 'crunched' instead of "cheated", but yes I knew that.

But as I said earlier, this same scenario played out a few months ago, when hundreds of tasks were lost by me alone, not counting all the tasks lost by others that were in various stages of being crunched. I only keep a 1.5-day cache of workunits on my PCs, but that's still a lot considering I can do a GPU unit in about 70 seconds, and losing them twice in a few months just isn't worth my time and effort anymore. I even contributed cash to this project at one time.

Wisesooth
Joined: 2 Oct 14
Posts: 32
Credit: 19,769,743
RAC: 29,432

Message 66723 - Posted: 21 Oct 2017, 16:13:13 UTC

"To err is human; to forgive is not company policy." Is that how it goes?

Let's put our personal feelings aside and look at the problem. Once we discover what is wrong and what is right, let us help these budding academics instead of rejecting them as people.

1. These points and "attaguy" badges are worth nothing but bragging rights. That stuff is for gamers, not us.
2. "No more work" will not help them clear the queue. It just postpones a solution to the problem.
3. The nature of the work they do is heuristic, not deterministic, which does not guarantee convergence. Third-order differential equations are difficult to solve. Tensors are even more difficult to solve. Let's cut these people some slack, shall we?
4. We have the option to abort what comes in by user. Let's do that.
5. Sidd and Jake, have you considered a "cleaner" project? This could run an algorithm to identify and compile a profile of tasks to kill and then automatically issue an "abort by user" message. Those of us who want to help you clear the queue could add the cleaner project to our list of projects. That would clear the queue. When tasks stop coming in, shut down the cleaner project.
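
Point 5 could be prototyped on the volunteer side before anything is built on the server. The sketch below is only that, under stated assumptions: the `Task` record, its fields, and the example task names are hypothetical; the cutoff is simply the time of the announcement at the top of this thread; and the project URL is taken from the links posted here. The only real tool it refers to is `boinccmd`, whose `--task <url> <name> abort` operation aborts a single task. The code builds the command lines and never runs them.

```python
from dataclasses import dataclass
from datetime import datetime, timezone

# Assumption: project URL as it appears in links posted in this thread.
PROJECT_URL = "http://milkyway.cs.rpi.edu/milkyway/"
# Assumption: tasks received before the announcement are the stale ones.
CUTOFF = datetime(2017, 10, 18, 19, 21, 21, tzinfo=timezone.utc)

@dataclass
class Task:
    """Hypothetical local record of one task on a volunteer's machine."""
    name: str
    project_url: str
    received: datetime
    started: bool

def select_for_abort(tasks, cutoff=CUTOFF, project_url=PROJECT_URL):
    """Pick unstarted tasks of this project that were received before the cutoff."""
    return [t for t in tasks
            if t.project_url == project_url
            and t.received < cutoff
            and not t.started]

def abort_commands(tasks):
    """Build (but do not execute) one boinccmd abort command per task."""
    return [["boinccmd", "--task", t.project_url, t.name, "abort"] for t in tasks]

# Made-up example data, for illustration only.
example_tasks = [
    Task("example_task_a", PROJECT_URL,
         datetime(2017, 10, 18, 6, 0, tzinfo=timezone.utc), started=False),
    Task("example_task_b", PROJECT_URL,
         datetime(2017, 10, 18, 7, 0, tzinfo=timezone.utc), started=True),
    Task("example_task_c", PROJECT_URL,
         datetime(2017, 10, 19, 9, 0, tzinfo=timezone.utc), started=False),
]

stale = select_for_abort(example_tasks)
for cmd in abort_commands(stale):
    print(" ".join(cmd))
```

Leaving started tasks alone is a design choice in this sketch, not something the proposal specifies; a real cleaner would need the project to publish the actual profile of tasks to drop.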

rbrahn
Joined: 16 Jul 17
Posts: 5
Credit: 23,215,027
RAC: 317,048

Message 66724 - Posted: 21 Oct 2017, 16:20:29 UTC - in response to Message 66723.

Hear hear.
I'm still crunching in the hopes that invalidating WUs helps clear the queue and gets us back to normal operation faster. I could delete the WUs, but I fear that won't help clear them (they appear to be aborting once about 7 comp attempts fail to validate). If someone tells me to start aborting, I'll do that; until then, ragequitting doesn't get anyone nearer a solution.
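
The behaviour described here (and the "too many results" guess earlier in the thread) can be modelled as a simple cap on attempts per workunit. This is a toy sketch, assuming a quorum of 2 valid results and a cap of 7 total results. The 7 is the estimate from the post above, not a value read from the project's configuration, and real validation compares result contents rather than the labels used here.

```python
def workunit_outcome(results, min_quorum=2, max_total_results=7):
    """Toy model of one workunit.

    results: outcome labels in arrival order, e.g. 'valid',
    'inconclusive', 'invalid' or 'error'.
    min_quorum and max_total_results are assumed values.
    """
    valid = 0
    for n, outcome in enumerate(results, start=1):
        if outcome == "valid":
            valid += 1
            if valid >= min_quorum:
                return "validated"
        if n >= max_total_results:
            # Cap reached without a quorum: the workunit is given up on.
            return "cancelled: too many results"
    return "in progress"

print(workunit_outcome(["inconclusive"] * 4))   # still being re-issued
print(workunit_outcome(["inconclusive"] * 7))   # cap reached
```

Under this model, continuing to crunch does drain a stuck workunit, because each failed attempt moves it one step closer to the cap.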



Copyright © 2017 AstroInformatics Group