Aaargh! Servers are out of new work!(2)"
log in

Advanced search

Message boards : Number crunching : Aaargh! Servers are out of new work!(2)"

Author Message
John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45224 - Posted: 23 Dec 2010 | 1:10:51 UTC

A restart
____________
Go away, I was asleep


TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 45237 - Posted: 23 Dec 2010 | 13:07:31 UTC - in response to Message 45224.

No need to John, all is working fine now again.
A new thread is tarting with faith...
I will see what happend when I start my rigs this night.
____________
Greetings from,
TJ

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 45238 - Posted: 23 Dec 2010 | 13:38:13 UTC
Last modified: 23 Dec 2010 | 13:50:58 UTC

12/23/2010 8:34:47 AM Milkyway@home Message from server: No work sent
Workunits waiting for validation 25,208

Here we go again!!

IMO this is getting a bit ridiculous.

Is this an "unfixable " problem??

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 45243 - Posted: 23 Dec 2010 | 18:22:00 UTC - in response to Message 45238.

Didn't I say so...
____________
Greetings from,
TJ

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45273 - Posted: 24 Dec 2010 | 16:31:04 UTC

Validator looks to be going: 55k+ pending.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 45274 - Posted: 24 Dec 2010 | 16:35:24 UTC - in response to Message 45238.

12/23/2010 8:34:47 AM Milkyway@home Message from server: No work sent
Workunits waiting for validation 25,208

Here we go again!!

IMO this is getting a bit ridiculous.

Is this an "unfixable " problem??


Apparently it is. I really wish the admins would build a new server from scratch to replace this server.

And yes, I would donate money to the project to facilitate this endeavor.

-Mike
____________

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 45275 - Posted: 24 Dec 2010 | 16:51:37 UTC
Last modified: 24 Dec 2010 | 16:57:10 UTC

Hey, I already told them I would build them a new one custom for their needs!
Or give more cash, whatever they need to keep the server running smoothly!
No response to that as of yet.
Feels way better to me than just dumping money into some "fund" and not knowing where it actually goes.

But is this a software or hardware issue??

David

(and yes I am prepared to put my money where my mouth is, within reason,lol)

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 45276 - Posted: 24 Dec 2010 | 17:10:04 UTC - in response to Message 45275.


But is this a software or hardware issue??

David

(and yes I am prepared to put my money where my mouth is, within reason,lol)


It is more than likely, a software issue.

Building a new (different) server would ensure that they would be able to test it before replacing the existing one.

Also, a new (different) server would reduce the chance of inheriting an old bug from the old server.

Building a new (different) server would also give the admins a chance to incorporate some of the changes that have been requested by the people who support and crunch for the project. Maybe even increase the cache size or other changes.

-Mike
____________

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 45277 - Posted: 24 Dec 2010 | 17:22:40 UTC

All excellent points Mike.
Hopefully the new year will bring some renewed energy to solve these issues.
Having a limited understanding of the real "nuts n bolts" of this kind of programming I am at somewhat of a loss to understand.
However, I do believe in the importance of this project beyond it just generating numbers for people out here to count and compete with each other about!
I remain optimistic and ready to help in any way that I can!

Anyone listening????

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 45287 - Posted: 25 Dec 2010 | 1:08:20 UTC

Wow, it looks like I'm going to get my Christmas wish of an endless flow of WU's !!
LOL, Yeah, I ought to get a life ,huh?

Profile Blurf
Volunteer moderator
Project administrator
Send message
Joined: 13 Mar 08
Posts: 619
Credit: 25,447,954
RAC: 0
Message 45288 - Posted: 25 Dec 2010 | 2:22:06 UTC

Just got back from my parent's xmas dinner festivities...I'll look into a possible hardware fund drive after the New Year
____________

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 45289 - Posted: 25 Dec 2010 | 4:51:34 UTC

If we do this, I suggest we go for 2 servers - its well within reach on the basis of a target suggestion 3cents per day per contributor for 2011 - ie $10 up front. With the number of active users we would easily raise the cash, even allowing for a relatively small proportion of active users contributing.

Not a bad New Years resolution at 3 cents a day ......

Regards
Zy

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 45292 - Posted: 25 Dec 2010 | 9:21:29 UTC
Last modified: 25 Dec 2010 | 9:22:46 UTC

Yes, reminds me of the line in the movie "Contact".
Why build only one when you can have two (and keep the second one secret,lol)
Having a backup/baseline machine would be a very good idea.
And yes, even if only a relatively small percentage of us contribute it is still very "do able".
Great idea Zydor.

Oh, and Merry Christmas to everyone out there who celebrates today!

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 84
Message 45294 - Posted: 25 Dec 2010 | 10:54:45 UTC

Hi,

I really like the words posted here!
And I'm willing to contribute one extra $ to supply the dev's with newest hardware for app-development.
http://en.wikipedia.org/wiki/Sandy_Bridge_%28microarchitecture%29
This is a link to newest intel CPU-Technology with advanced SSE Instruction-set called AVX which will be distributed from january on.
As we have seen, dev's are able to write excellent apps if they have a hardware to test ( as seen by a generous donation from CNCGURU ).
And they should have at least one HD69xx GPU as OpenCL is said to be much improved compaired to the 'old' HD58xx GPU's.

Regards
Alexander

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45308 - Posted: 26 Dec 2010 | 14:39:22 UTC

Validator looks to be going AGAIN...32k+ pending
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

bertodell
Send message
Joined: 15 Dec 10
Posts: 14
Credit: 1,385,867
RAC: 0
Message 45309 - Posted: 26 Dec 2010 | 14:45:10 UTC - in response to Message 45308.

Yea I'm not getting any GPU work at all
____________

bertodell
Send message
Joined: 15 Dec 10
Posts: 14
Credit: 1,385,867
RAC: 0
Message 45315 - Posted: 26 Dec 2010 | 15:27:33 UTC

Got a little bit of work for my GPU,but it's not getting work again,it's up and down time for the servers.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45331 - Posted: 27 Dec 2010 | 14:14:23 UTC
Last modified: 27 Dec 2010 | 14:14:57 UTC

Validator backing up again (36+K waiting), but work still being distributed.
____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 45341 - Posted: 27 Dec 2010 | 20:40:50 UTC

AAARRRGGGHHH

28/12/2010 7:39:56 AM | Milkyway@home | Requesting new tasks for ATI GPU
28/12/2010 7:39:59 AM | Milkyway@home | Scheduler request completed: got 0 new tasks
28/12/2010 7:39:59 AM | Milkyway@home | Message from Milkyway@home: No work sent
28/12/2010 7:39:59 AM | Milkyway@home | Message from Milkyway@home: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Ba
Send message
Joined: 6 Nov 09
Posts: 12
Credit: 347,317,219
RAC: 0
Message 45342 - Posted: 27 Dec 2010 | 20:49:30 UTC

This validator issue seems to be getting more frequent, the problem is i default back to collatz and it drops me several hours work.
With a single 4770 not such a problrm but with two 5970's the cache in gone in mins even with 8 cores.

Profile Blurf
Volunteer moderator
Project administrator
Send message
Joined: 13 Mar 08
Posts: 619
Credit: 25,447,954
RAC: 0
Message 45344 - Posted: 27 Dec 2010 | 22:15:41 UTC

The issue has been escalated but people are going to have to be patient until the New Year. Doubt labstaff is even around right now.
____________

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45348 - Posted: 28 Dec 2010 | 1:17:54 UTC

If it is the same problem that occured it has been fixed twice this week.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 2,694
Message 45353 - Posted: 28 Dec 2010 | 15:38:26 UTC

Anyone getting continuous GPU wu? We definitely need a server status page like DNETC@Home so that we can distinguish the n-body and gpu work available.

Profile Volodymyr Shcherbyna
Send message
Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 14
Message 45354 - Posted: 28 Dec 2010 | 17:20:54 UTC - in response to Message 45353.

Hello Everyone,

I have a problem simular on the subject ... On one machine I keep getting the following messages:

28/12/2010 18:12:56 Milkyway@home update requested by user
28/12/2010 18:12:57 Milkyway@home Sending scheduler request: Requested by user.
28/12/2010 18:12:57 Milkyway@home Requesting new tasks
28/12/2010 18:12:59 Milkyway@home Scheduler request completed: got 0 new tasks
28/12/2010 18:12:59 Milkyway@home Message from server: No work sent
28/12/2010 18:12:59 Milkyway@home Message from server: (reached daily quota of 1 tasks)

It is Windows 2000 machine and it happens only there, on all other machines there are no problems. Can it be related to Windows 2000?

Thanks!

____________

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45355 - Posted: 28 Dec 2010 | 19:24:24 UTC - in response to Message 45354.

Hello Everyone,

I have a problem simular on the subject ... On one machine I keep getting the following messages:

28/12/2010 18:12:56 Milkyway@home update requested by user
28/12/2010 18:12:57 Milkyway@home Sending scheduler request: Requested by user.
28/12/2010 18:12:57 Milkyway@home Requesting new tasks
28/12/2010 18:12:59 Milkyway@home Scheduler request completed: got 0 new tasks
28/12/2010 18:12:59 Milkyway@home Message from server: No work sent
28/12/2010 18:12:59 Milkyway@home Message from server: (reached daily quota of 1 tasks)

It is Windows 2000 machine and it happens only there, on all other machines there are no problems. Can it be related to Windows 2000?

Thanks!

You are putting out bad returns.

Name de_separation_14_3s_fix_1_488830_1290690754_1
Workunit 192286101
Created 25 Nov 2010 13:59:27 UTC
Sent 25 Nov 2010 14:02:05 UTC
Received 25 Nov 2010 14:03:12 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status -185 (0xffffffffffffff47)
Computer ID 208967
Report deadline 3 Dec 2010 14:02:05 UTC
Run time 0
CPU time 0
stderr out <core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
CreateProcess() failed -
</message>
]]>

Validate state Invalid
Claimed credit 0
Granted credit 0
Database Error
application version v0.00


____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile Volodymyr Shcherbyna
Send message
Joined: 28 Apr 10
Posts: 16
Credit: 9,668,276
RAC: 14
Message 45358 - Posted: 28 Dec 2010 | 21:26:43 UTC - in response to Message 45355.

I see!

But BOINC does not report any errors on that machine ... Strange ...

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 84
Message 45359 - Posted: 28 Dec 2010 | 21:40:36 UTC - in response to Message 45355.


Validate state Invalid
Claimed credit 0
Granted credit 0
Database Error
application version v0.00



How do you get an application version v0.00 ?

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 45360 - Posted: 28 Dec 2010 | 22:01:53 UTC - in response to Message 45359.

Via a compute error...

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45365 - Posted: 31 Dec 2010 | 21:43:31 UTC

Looks like things are back again, andnot Monday as we thought!
____________
Go away, I was asleep


Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45367 - Posted: 31 Dec 2010 | 21:51:10 UTC - in response to Message 45365.

Looks like things are back again, andnot Monday as we thought!



Still had time to happen again! :p
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

bertodell
Send message
Joined: 15 Dec 10
Posts: 14
Credit: 1,385,867
RAC: 0
Message 45371 - Posted: 31 Dec 2010 | 22:52:46 UTC

looks like a lot of people lost a lot of points from milkway,dunno if the server screwed up or what,I hope what was lost can be reclaimed,this is insane.
____________

Profile Beyond
Send message
Joined: 15 Jul 08
Posts: 383
Credit: 501,817,389
RAC: 4
Message 45372 - Posted: 31 Dec 2010 | 23:25:03 UTC - in response to Message 45371.
Last modified: 31 Dec 2010 | 23:55:14 UTC

looks like a lot of people lost a lot of points from milkway,dunno if the server screwed up or what,I hope what was lost can be reclaimed,this is insane.

I'll repost from the other thread: The first stats dump to Free-DC and BoincStats were probably from an earlier stats run. If you look at your current totals on the MW site I think you'll see that your credits are correct. I'm betting the next stats dump to the above sites will fix everything.

Edit: Both stats sites have been updated and are now showing correct numbers.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45404 - Posted: 4 Jan 2011 | 18:42:55 UTC

Server is working again.

Is it possible again to have the results around for more than 1 second?
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

JAMC
Send message
Joined: 9 Sep 08
Posts: 96
Credit: 336,443,946
RAC: 0
Message 45426 - Posted: 8 Jan 2011 | 2:01:30 UTC

boom-boom... out go the lights...

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 915
Credit: 74,781,320
RAC: 237
Message 45427 - Posted: 8 Jan 2011 | 3:20:32 UTC

I still am receiving work, looks like it was a minor bump as I only had to do 2 Collatz units.

I still have a large cache from last weekend.
____________

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 45505 - Posted: 12 Jan 2011 | 0:29:09 UTC
Last modified: 12 Jan 2011 | 0:31:32 UTC

Server status is still green, but I got only 3WU's and that's it.
Does more crunchers have this experience?
The amount of tasks ready to send was (is) dropping it should have been as i now see 4770 ready. Hope I get a bunch.
____________
Greetings from,
TJ

Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 130
Credit: 203,351,125
RAC: 726
Message 45512 - Posted: 12 Jan 2011 | 3:13:08 UTC

Snow drifts in the server perhaps?
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45518 - Posted: 12 Jan 2011 | 9:17:00 UTC

GPU work coming in dribs and drabs, showing in BM statics as a small down turn. Backing up with the longer Collatz work now.
____________
Go away, I was asleep


Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45521 - Posted: 12 Jan 2011 | 12:49:21 UTC

Only n-body wus.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile Kirdnem - Vince (GPU)
Avatar
Send message
Joined: 22 Feb 09
Posts: 10
Credit: 223,930,144
RAC: 209,675
Message 45522 - Posted: 12 Jan 2011 | 13:27:21 UTC

I have 7 very hungry 5850's 5970 cards idling ?

any idea when work will flow again for the GPU's

DNETC will be benifiting in the mean time but i would much prefer Milkyway
____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 2,694
Message 45523 - Posted: 12 Jan 2011 | 15:19:36 UTC

Couple of months ago, an outage was considered to be an exception which was happening maximum 2 times in a month. Nowadays, crunchers accept the outage, which happens at least 2 times per week, as a part of the project.

I did say this, and will be saying that, we need better servers. If crunchers allocate their resources to buy hardware for the science, the project staff is obligated to provide a service that meets the crunchers behavior.

The existence of backup projects doesn't soften the excuse of sequential outages. I don't know whether the cause is server load, validation or feeder but apparently something is wrong in the server side.

Still, milkyway@home is my primary project but when the opencl becomes an industry platform, which has been becoming really fast, i wonder how many people will stick to this project. At least the Prime Grid did a wonderful job implementing it into their application.

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 45524 - Posted: 12 Jan 2011 | 15:54:21 UTC

I got a message thru, lets see if it helps!

Un4given
Send message
Joined: 14 Feb 09
Posts: 8
Credit: 9,196,709
RAC: 1,362
Message 45525 - Posted: 12 Jan 2011 | 15:58:37 UTC

Well, this thread answered my question. I was wondering if I was having BOINC issues. Just got my 6950 unlocked to 6970 specs and moved my 4890 to my secondary rig and was hoping to punch out some work units, but I guess that isn't going to happen for awhile.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45527 - Posted: 12 Jan 2011 | 20:35:43 UTC

Clearly there is no Milkyway GPU work coming through, despite the server status claiming plenty of new work is ready to send.

It looks like other people have a correct surmise that it's only nBody work.
____________
Go away, I was asleep


Bryan [TeaM]
Send message
Joined: 26 Jun 09
Posts: 47
Credit: 254,030,335
RAC: 212
Message 45528 - Posted: 12 Jan 2011 | 21:24:42 UTC - in response to Message 45523.

i wonder how many people will stick to this project. At least the Prime Grid did a wonderful job implementing it into their application.


Some did bail quite a while ago. I only run MW now when DNETC is down which is the opposite of what I would prefer. Shame, MW doesn't even support the automatic backup capability of BOINC.

This used to be one of the premier projects for reliability but things have changed. Very unfortunate!
____________
Bryan

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 45529 - Posted: 12 Jan 2011 | 21:28:25 UTC

Don't know what you mean about MW not supporting the automatic backup capability of BOINC. I have Collatz and DNETC at 0% resource share and BOINC pretty seamlessly swaps over to one of those when MW has no work.

I run MW pretty much 100% GPU resource allocation and have no problems..other than MW doesn't have any GPU work right now.

I've PM'd Matt about the work outrage, maybe I'll also PM Travis.

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 45530 - Posted: 12 Jan 2011 | 21:45:12 UTC - in response to Message 45529.
Last modified: 12 Jan 2011 | 21:45:44 UTC

Don't know what you mean about MW not supporting the automatic backup capability of BOINC. I have Collatz and DNETC at 0% resource share and BOINC pretty seamlessly swaps over to one of those when MW has no work.

I run MW pretty much 100% GPU resource allocation and have no problems..other than MW doesn't have any GPU work right now.

I've PM'd Matt about the work outrage, maybe I'll also PM Travis.


He was talking about having a significant amount of workunits CACHED to help lessen the burden of the very frequent outages.

A 20 minute cache is not enough for this project.
____________

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 45532 - Posted: 12 Jan 2011 | 22:46:28 UTC
Last modified: 12 Jan 2011 | 22:47:44 UTC

Personally, Ive about had it with this nonsense.
They don't bother to communicate with us what they are doing most of the time.
They don't seem to be willing or able to solve a recurring problem.
Arrogance for the first ,incompetence for the latter.
I would have fired the lot of them by now if they were any of my employees!
Think I'll take my 40 cores and 19 GPU's and go play somewhere I'm respected just a little more.

In fact I'm already gone,lol!

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 45533 - Posted: 12 Jan 2011 | 22:57:40 UTC
Last modified: 12 Jan 2011 | 23:00:42 UTC

Well we were getting double credits there for a couple of days...maybe this is pay back! :)

[edit]PM'd Travis as well.

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 45535 - Posted: 13 Jan 2011 | 0:10:03 UTC

Yes! I just got 4 WU's the ATI are firing up.

See our patience is rewarded (is this good English?)
____________
Greetings from,
TJ

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 45536 - Posted: 13 Jan 2011 | 0:13:04 UTC - in response to Message 45535.

I think we need more patience. The 4WU's ran, and where uploaded again.
But no new ones, but I keep looking.
____________
Greetings from,
TJ

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 45537 - Posted: 13 Jan 2011 | 1:58:55 UTC

I started up some new GPU runs, that should get things going again.

-Matthew

Profile Tretboot
Send message
Joined: 20 Aug 10
Posts: 7
Credit: 53,721,698
RAC: 0
Message 45540 - Posted: 13 Jan 2011 | 4:13:24 UTC - in response to Message 45537.

I got my 24 new WUs! Yay! I still wish i could store a few more, i go through them so fast :)

James Nunley
Send message
Joined: 29 Nov 07
Posts: 39
Credit: 60,421,187
RAC: 35,277
Message 45542 - Posted: 13 Jan 2011 | 6:48:52 UTC - in response to Message 45532.

Personally, Ive about had it with this nonsense.
They don't bother to communicate with us what they are doing most of the time.
They don't seem to be willing or able to solve a recurring problem.
Arrogance for the first ,incompetence for the latter.
I would have fired the lot of them by now if they were any of my employees!
Think I'll take my 40 cores and 19 GPU's and go play somewhere I'm respected just a little more.

In fact I'm already gone,lol!



I often wonder what people don't understand about the word

V O L U N T E E R

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45544 - Posted: 13 Jan 2011 | 10:15:15 UTC

My GPU work started to flow earllier and I returned the first result an hour or so. So, back to normal (I hope).
____________
Go away, I was asleep


TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 45545 - Posted: 13 Jan 2011 | 10:46:23 UTC - in response to Message 45544.
Last modified: 13 Jan 2011 | 10:46:54 UTC

I noticed the same John, but I am now on the high power rate (my high power consuming rigs run nights). So I start to finish the lot.

But I see all de_separation_11, 13, 14, 15 and 16 (run 1.26) not the 17 (run 2.12), so I guess it is not totally "normal" just jet.

Happy crunching.
____________
Greetings from,
TJ

Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 130
Credit: 203,351,125
RAC: 726
Message 45594 - Posted: 16 Jan 2011 | 13:34:19 UTC

Snow drifts in server again, clean and kick please.
____________

Profile Berserk_Tux
Avatar
Send message
Joined: 2 Jan 08
Posts: 79
Credit: 365,471,675
RAC: 0
Message 45595 - Posted: 16 Jan 2011 | 14:46:26 UTC - in response to Message 45594.

Snow drifts in server again, clean and kick please.


Here we go again.

____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45610 - Posted: 18 Jan 2011 | 15:02:20 UTC

Server page looks wierd
____________
Go away, I was asleep


Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 45613 - Posted: 18 Jan 2011 | 18:17:44 UTC

Thanks, I gave it a kick and took some old runs down. It looks like things are calming down... I'll keep an eye on it today.

-Matthew

scottishwebcamslive.com
Avatar
Send message
Joined: 10 Oct 07
Posts: 79
Credit: 69,337,972
RAC: 0
Message 45614 - Posted: 18 Jan 2011 | 18:45:34 UTC
Last modified: 18 Jan 2011 | 19:24:21 UTC

hello,

the problem of running out of work can be cured in two ways

1. you up the amount of WU per core that we can download ( which for some reason your desperate not to do )

2. hold a far bigger cashe of work units on your own server so that even if it jams for an hour or two we still can download work until it starts producing new work again ( although that wont work either .... just now for instance theres 5000 WU on the server page waiting to be worked on it says and yet nothing for my two machines :( )


GPU technology is making leaps and bounds ( twin dual GPU 5970's here ) and having 10 or 15 mins of work when theres a server glitch which can last hours just isnt cutting it anymore

best regards
Ian
____________
....Please Join team Scotland HERE

JAMC
Send message
Joined: 9 Sep 08
Posts: 96
Credit: 336,443,946
RAC: 0
Message 45615 - Posted: 18 Jan 2011 | 19:32:23 UTC

What happened to the longer running 17s WU's??

Profile Beyond
Send message
Joined: 15 Jul 08
Posts: 383
Credit: 501,817,389
RAC: 4
Message 45617 - Posted: 18 Jan 2011 | 20:07:03 UTC - in response to Message 45613.

Thanks, I gave it a kick and took some old runs down. It looks like things are calming down... I'll keep an eye on it today.

-Matthew

Hi Matthew,

I know queue size has been discussed many times before, but now that we have the N-Body WUs for CPUs how about dedicating the CPUs to them and leave the old WUs for the GPUs. That way GPU cache size could be increased and turn around time would still be faster than it is now for both types of WUs.

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 84
Message 45618 - Posted: 18 Jan 2011 | 20:32:09 UTC - in response to Message 45617.


I know queue size has been discussed many times before, but now that we have the N-Body WUs for CPUs how about dedicating the CPUs to them and leave the old WUs for the GPUs. That way GPU cache size could be increased and turn around time would still be faster than it is now for both types of WUs.


I agree, this would make app_infos unnecessary. At least for non-OpenCL users.

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 45620 - Posted: 18 Jan 2011 | 22:38:34 UTC

The server seems to be running smoothly now - let me know if it isn't working well on your end.

I'll discuss some of you ideas with the rest of the group; I will let you know what happens.

-Matthew

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45622 - Posted: 18 Jan 2011 | 23:22:50 UTC

Thanks Matthew, all running normally here
____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 45625 - Posted: 19 Jan 2011 | 4:52:56 UTC - in response to Message 45620.

The server seems to be running smoothly now - let me know if it isn't working well on your end.

I'll discuss some of you ideas with the rest of the group; I will let you know what happens.

-Matthew

I'm not getting double credits now...bugger!

Profile Berserk_Tux
Avatar
Send message
Joined: 2 Jan 08
Posts: 79
Credit: 365,471,675
RAC: 0
Message 45630 - Posted: 19 Jan 2011 | 13:35:58 UTC - in response to Message 45625.

Here we go again!! No work.
____________

scottishwebcamslive.com
Avatar
Send message
Joined: 10 Oct 07
Posts: 79
Credit: 69,337,972
RAC: 0
Message 45631 - Posted: 19 Jan 2011 | 13:37:43 UTC

hello,

there is always a third way of doing this which would involve alot more work on your part i would suspect
that is make the WU way longer than 90 seconds on some of our machines ( say 10 or 15 minute length or longer each ) and let us choose to do those WU on our faster machines if you dont want to figure out some way of sending them out to faster machines automatically

just a thought

Ian
____________
....Please Join team Scotland HERE

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45632 - Posted: 19 Jan 2011 | 16:11:31 UTC

Matthew

The servers are going funny again, like they did a few days ago.

Database/file status
State #
Results ready to send 2,876
Results in progress 181,275
Workunits waiting for validation 56,749
Workunits waiting for assimilation 176
Workunits waiting for deletion 3
Results waiting for deletion 269
Transitioner backlog (hours) 0


Can you give the servers a kick again, please?
____________
Go away, I was asleep


cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 45688 - Posted: 22 Jan 2011 | 14:45:17 UTC
Last modified: 22 Jan 2011 | 15:21:15 UTC

This is becoming so common place that we aren't even bothering to post about it anymore!!
Why can't this be fixed?
Is it so astronomically difficult that it just cannot be solved?
Cmon guys I need my BILLION credits before I can go home, LOL!
Yes, they are still sending work but the validator is choking and the RAC's are moving backwards even as we crunch!

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45690 - Posted: 22 Jan 2011 | 15:27:36 UTC

Twice yesterday and once so far today is too often. Whatever is being done is only last hours not even days.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 45700 - Posted: 22 Jan 2011 | 18:48:29 UTC
Last modified: 22 Jan 2011 | 18:49:32 UTC

Well I'm not getting new work in the last 2 hours while there are WU's though.
The validater is in "stress-mode" as it says on the server status page: Workunits waiting for validation 167,508 , 411 of them are mine.

How just when posting this, 18:49UTC, I got 2 WU's.
____________
Greetings from,
TJ

loeakaodas
Send message
Joined: 2 Jan 09
Posts: 34
Credit: 93,631,891
RAC: 0
Message 45737 - Posted: 24 Jan 2011 | 16:49:26 UTC

Looks like the server needs a kick or two.

data-driven web pages milkyway Running
upload/download server milkyway Running
scheduler milkyway Running
feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running

____________

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 45741 - Posted: 24 Jan 2011 | 20:02:37 UTC
Last modified: 24 Jan 2011 | 20:03:09 UTC

Server-side, things look like they should be stable; Travis has asked users to stop certain 'iffy' behaviors (http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=2179#45739) as they may be responsible for overloading the server.

Once their behavior stops (Either by choice, or by ban from us admins), we should see things start to calm down.

Cheers,
Matthew

Profile Berserk_Tux
Avatar
Send message
Joined: 2 Jan 08
Posts: 79
Credit: 365,471,675
RAC: 0
Message 45762 - Posted: 25 Jan 2011 | 14:20:55 UTC - in response to Message 45741.

No work again.
____________

Profile kashi
Send message
Joined: 30 Dec 07
Posts: 309
Credit: 148,432,104
RAC: 0
Message 45768 - Posted: 25 Jan 2011 | 16:21:24 UTC

No teams also.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 45853 - Posted: 28 Jan 2011 | 11:06:34 UTC

The validator retention is growing again, and the servers are not sending me work.
____________
Go away, I was asleep


vandiesel
Send message
Joined: 10 May 10
Posts: 27
Credit: 43,104,187
RAC: 0
Message 45854 - Posted: 28 Jan 2011 | 11:37:34 UTC

ditto

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45855 - Posted: 28 Jan 2011 | 12:54:43 UTC

Assimilator is offline. 37k results pending.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45924 - Posted: 3 Feb 2011 | 0:30:30 UTC

Assimilator is down again. 32k pending.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 130
Credit: 203,351,125
RAC: 726
Message 46031 - Posted: 7 Feb 2011 | 3:04:48 UTC

Kick it like a football!
____________

Profile Blurf
Volunteer moderator
Project administrator
Send message
Joined: 13 Mar 08
Posts: 619
Credit: 25,447,954
RAC: 0
Message 46032 - Posted: 7 Feb 2011 | 3:21:57 UTC

Matt has been working on the server tonight.
____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 46036 - Posted: 7 Feb 2011 | 6:44:53 UTC - in response to Message 46032.

Matt has been working on the server tonight.

Arrrrggghhhh!

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46037 - Posted: 7 Feb 2011 | 12:44:25 UTC - in response to Message 46036.

Forecast to be up again?

Profile Blurf
Volunteer moderator
Project administrator
Send message
Joined: 13 Mar 08
Posts: 619
Credit: 25,447,954
RAC: 0
Message 46049 - Posted: 7 Feb 2011 | 18:17:03 UTC

Not sure why there's no work--escalating now.
____________

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 46053 - Posted: 7 Feb 2011 | 19:58:32 UTC

I started up some new searches and killed some older ones, and gave the whole thing a reboot. Hopefully it will run smoothly now - if not, I'll get back on it.

Also, don't worry too much if the server status page reads 'not running' for several components - the 'disabled' option doesn't work the way we want it to (yet), so when we pause things to do work the status switches to 'not running'. DO worry if it stays like that for more than an hour, though...

Thanks for keeping us informed!

-Matthew

Profile Bymark
Avatar
Send message
Joined: 6 Mar 09
Posts: 48
Credit: 385,976,815
RAC: 0
Message 46054 - Posted: 7 Feb 2011 | 21:13:41 UTC - in response to Message 46053.
Last modified: 7 Feb 2011 | 21:24:06 UTC

We here as users, must refresh, uppdate, to get some new work.
Please, fix that too...

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46056 - Posted: 8 Feb 2011 | 0:34:10 UTC - in response to Message 46054.

We here as users, must refresh, uppdate, to get some new work.
Please, fix that too...


Sure, I needed to press the update to get new GPU packages... =( Very Strage!

Regards!

YoDude.SETI.USA [TopGun]
Send message
Joined: 29 May 09
Posts: 31
Credit: 33,829,769
RAC: 0
Message 46060 - Posted: 8 Feb 2011 | 6:29:44 UTC

.....and here I was proudly explaining to all my friends how they should run MW too, just like me.....THEN.......@&%^#*% no new work!

How will I ever explain this to my wife and kids...Their father can't get new work!


Honey.....honey...come back, you really weren't supposed to see that post. It's ok, really. I promise they will have something for me soon. Really, they will, I'm so sure of it......Come on sweetie, don't be like that it'll be ok, I'm sure.......I swear it.

:)

loeakaodas
Send message
Joined: 2 Jan 09
Posts: 34
Credit: 93,631,891
RAC: 0
Message 46095 - Posted: 9 Feb 2011 | 18:23:50 UTC

It appears someone's messing with the server, or there's a problem...

2/9/2011 1:20:57 PM Milkyway@home Message from server: Server error: feeder not running


____________

swiftmallard
Avatar
Send message
Joined: 18 Jul 09
Posts: 157
Credit: 162,217,420
RAC: 196,351
Message 46096 - Posted: 9 Feb 2011 | 18:40:23 UTC

2/9/2011 11:13:16 AM Milkyway@home Message from server: Project is temporarily shut down for maintenance

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 46100 - Posted: 9 Feb 2011 | 19:51:39 UTC

Oh! Boy!

The servers are running, but the assimilators are off!
____________
Go away, I was asleep


www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46101 - Posted: 9 Feb 2011 | 20:24:18 UTC

oohhh boy...

2/9/2011 6:23:16 PM Milkyway@home Sending scheduler request: Requested by user.
2/9/2011 6:23:16 PM Milkyway@home Requesting new tasks for GPU
2/9/2011 6:23:18 PM Milkyway@home Scheduler request completed: got 0 new tasks
2/9/2011 6:23:18 PM Milkyway@home Message from server: No work available

So... othing to do again...

I think that we have more compute Power than the server are able to create WU for us...

regards!

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 46102 - Posted: 9 Feb 2011 | 20:32:16 UTC

Bear with us for a bit, the RPI CS account servers are down, and it's making it hard for us to access the Milkyway@home server. I'm doing everything I can to keep it going, but I think we won't be fully operational until the account servers come back up.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 46238 - Posted: 14 Feb 2011 | 20:43:04 UTC

Same old, same old

15/02/2011 7:42:16 AM | Milkyway@home | Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 46249 - Posted: 15 Feb 2011 | 20:17:47 UTC

Now work again!. Sometimes I get "only N-body work", sometimes not.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

loeakaodas
Send message
Joined: 2 Jan 09
Posts: 34
Credit: 93,631,891
RAC: 0
Message 46350 - Posted: 25 Feb 2011 | 16:04:00 UTC

Looks like there isn't any more GPU work and the assimilator has stopped working.
____________

Sabroe_SMC
Avatar
Send message
Joined: 2 Aug 08
Posts: 23
Credit: 104,125,949
RAC: 17,611
Message 46351 - Posted: 25 Feb 2011 | 17:38:43 UTC - in response to Message 46350.

Server status

Program Host Status
data-driven web pages milkyway Running
upload/download server milkyway Running
scheduler milkyway Running
feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running


____________
Greetz to all

Profile Blurf
Volunteer moderator
Project administrator
Send message
Joined: 13 Mar 08
Posts: 619
Credit: 25,447,954
RAC: 0
Message 46352 - Posted: 25 Feb 2011 | 21:09:33 UTC

They're up now....
____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 46383 - Posted: 28 Feb 2011 | 22:48:32 UTC

Ho hum...down again.

Results ready to send 0

Results in progress 146,646

Workunits waiting for validation 51,748

I've been without work for over an hour and yet the result in progress is amazingly large! There are some reeaaalllyyyy slow systems out there crunching this project.

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46384 - Posted: 28 Feb 2011 | 22:50:30 UTC - in response to Message 46383.
Last modified: 28 Feb 2011 | 22:50:50 UTC

Ho hum...down again.

Results ready to send 0

Results in progress 146,646

Workunits waiting for validation 51,748

I've been without work for over an hour and yet the result in progress is amazingly large! There are some reeaaalllyyyy slow systems out there crunching this project.


Thanks gosh I have an ATI 5870, 87 seconds each WU... of course, when there is a WU available. ARGH!

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 46385 - Posted: 28 Feb 2011 | 22:55:10 UTC - in response to Message 46384.
Last modified: 28 Feb 2011 | 22:57:07 UTC

Ho hum...down again.

Results ready to send 0

Results in progress 146,646

Workunits waiting for validation 51,748

I've been without work for over an hour and yet the result in progress is amazingly large! There are some really, reeaaalllyyyy slow systems out there crunching this project.


Thanks gosh I have an ATI 5870, 87 seconds each WU... of course, when there is a WU available. ARGH!

Yes, and my 3850 (AGP card) which takes ~540 seconds per wu in my old P4 with HT on, which means it gets 12 wu's so its cache is ~105 minutes is out of work as well. Which means there are some reeeeaaaaallllyyyy slow systems out there.

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 223,368,243
RAC: 161,780
Message 46388 - Posted: 1 Mar 2011 | 5:03:03 UTC - in response to Message 46383.

Or hosts that have figured out that MW is a secondary project for reliability reasons -- my primary GPU projects are Collatz and Dnetc -- both of which run more regularly than MW as well as supporting single precision GPU's.

I realize there is a fair amount of load on this project with relatively limited resources -- and also suspect that folks who run NW as their only GPU project are likely to be a tad unhappy....

Ho hum...down again.

Results ready to send 0

Results in progress 146,646

Workunits waiting for validation 51,748

I've been without work for over an hour and yet the result in progress is amazingly large! There are some reeaaalllyyyy slow systems out there crunching this project.


____________

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 915
Credit: 74,781,320
RAC: 237
Message 46389 - Posted: 1 Mar 2011 | 5:18:04 UTC

I have to leave MW as my primary ATI GPU project since the server does not support the "0" share setting. Collats is my backup.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 46390 - Posted: 1 Mar 2011 | 10:13:13 UTC

When Milkyway stops delivering new work my Collatz back up takes over. When Milkyway is working again I NNT Collatz and let the cache run down.

I have just come to the PCs and see Milkyway was not giving out work and I have only just lifter the NNT on Collatz. Milkyway returned it's last crunched unit at 01.05am this morning.


Aaaarrgggh - all that wasted time.
____________
Go away, I was asleep


vaio
Send message
Joined: 5 Oct 07
Posts: 5
Credit: 92,090,917
RAC: 0
Message 46391 - Posted: 1 Mar 2011 | 12:05:58 UTC

Annoying isn't it?
Was hoping to hit 50 mil mark this week before starting my shutdown.
____________
Team
Forum

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 46393 - Posted: 1 Mar 2011 | 13:40:41 UTC

Still in with a good chance this week with that RAC. But their servers need to be working for at least tomorrow, assuning your shutdown is Saturday?
____________
Go away, I was asleep


Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 46394 - Posted: 1 Mar 2011 | 14:14:45 UTC

Up to 103k pending.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 46396 - Posted: 1 Mar 2011 | 23:24:42 UTC

I think I got it going again - thanks!

We'll have some new runs up soon, should clear out the gutters a bit...

-Matthew

Jesse Viviano
Send message
Joined: 4 Feb 11
Posts: 60
Credit: 25,198,767
RAC: 0
Message 46397 - Posted: 1 Mar 2011 | 23:49:39 UTC

Something just happened because the project is out of work again.

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46398 - Posted: 2 Mar 2011 | 0:31:34 UTC

Tired of that! I just attached to Collatz and set the priority for this project to 10%... It's a shame that all the time the server get out of work. Regards!

[SG-SPEG]muto1899
Send message
Joined: 19 Jan 11
Posts: 1
Credit: 15,165,411
RAC: 0
Message 46401 - Posted: 2 Mar 2011 | 9:23:50 UTC - in response to Message 46398.

Just got about ten WUs about an hour ago. Now I get again no WUs

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46404 - Posted: 2 Mar 2011 | 20:09:51 UTC

Got around 30 WU again and... STOP again... no more packs for GPUs... OUCH!

HELLO SERVERS ADMIN, WHAT YOU GUYS NEED??

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 46405 - Posted: 2 Mar 2011 | 20:18:34 UTC

We just started some new runs, so they might still be filtering into the system. I'll be keeping an eye on things throughout the day.

-Matthew

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 46406 - Posted: 2 Mar 2011 | 21:16:39 UTC

Going AGAIN! 12k pending.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Jesse Viviano
Send message
Joined: 4 Feb 11
Posts: 60
Credit: 25,198,767
RAC: 0
Message 46407 - Posted: 2 Mar 2011 | 21:17:38 UTC

I noticed something: every time a backlog forms with the queue for results waiting for validation, the server runs out of work. Is there something wrong with the validator?

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 223,368,243
RAC: 161,780
Message 46408 - Posted: 2 Mar 2011 | 22:28:49 UTC - in response to Message 46407.

Yup -- it is been that way for many many months -- fails semi-regularly for reasons as yet over these many months, unresolved (indeterminate??).

It is one of the primary reasons that MW has become a part time project.
____________

DanNeely
Send message
Joined: 6 Oct 09
Posts: 39
Credit: 77,212,649
RAC: 0
Message 46409 - Posted: 2 Mar 2011 | 23:17:56 UTC - in response to Message 46407.

I noticed something: every time a backlog forms with the queue for results waiting for validation, the server runs out of work. Is there something wrong with the validator?


Results from the validator are used to determine what to set the new WUs up as; I assume that means if validation stops so does WU creation.

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 183
Credit: 4,799
RAC: 0
Message 46410 - Posted: 3 Mar 2011 | 0:00:28 UTC

Travis and I hammered out a small issue, things seem to be running very smoothly now.

-Matthew

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46411 - Posted: 3 Mar 2011 | 4:13:35 UTC - in response to Message 46410.

Travis and I hammered out a small issue, things seem to be running very smoothly now.

-Matthew


Thanks God Matthew... let us keep crunching please! We have some ATI babies that needs to crunch!

Best Regards!

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46438 - Posted: 4 Mar 2011 | 11:33:19 UTC - in response to Message 46411.

Travis and I hammered out a small issue, things seem to be running very smoothly now.

-Matthew


Thanks God Matthew... let us keep crunching please! We have some ATI babies that needs to crunch!

Best Regards!


Works great for 1 day... receiving right now :

3/4/2011 8:31:17 AM Milkyway@home Reporting 1 completed tasks, requesting new tasks for GPU
3/4/2011 8:31:20 AM Milkyway@home Scheduler request completed: got 0 new tasks
3/4/2011 8:31:20 AM Milkyway@home Message from server: (Project has no jobs available)

My GPU is running out of WU... again... =(

Regards!

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 46454 - Posted: 5 Mar 2011 | 23:07:58 UTC
Last modified: 5 Mar 2011 | 23:12:34 UTC

not getting any...

6/03/2011 10:05:41 AM | Milkyway@home | Sending scheduler request: Requested by user.
6/03/2011 10:05:41 AM | Milkyway@home | Requesting new tasks for ATI GPU
6/03/2011 10:05:46 AM | Milkyway@home | Scheduler request completed: got 0 new tasks
6/03/2011 10:05:46 AM | Milkyway@home | No work sent
6/03/2011 10:05:46 AM | Milkyway@home | Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

[edit]
Good news!

6/03/2011 10:11:23 AM | Milkyway@home | Sending scheduler request: To fetch work.
6/03/2011 10:11:23 AM | Milkyway@home | Reporting 2 completed tasks, requesting new tasks for ATI GPU
6/03/2011 10:11:30 AM | Milkyway@home | Scheduler request completed: got 15 new tasks

Cliff
Send message
Joined: 26 Nov 09
Posts: 25
Credit: 37,186,285
RAC: 0
Message 46515 - Posted: 9 Mar 2011 | 3:36:52 UTC

not getting any GPU work units.
3/8/2011 8:21:10 PM Milkyway@home Scheduler request completed: got 0 new tasks
3/8/2011 8:21:10 PM Milkyway@home Message from server: No work sent
3/8/2011 8:21:26 PM Milkyway@home Computation for task de_separation_19_3s_fix_4_2060697_1299638139_0 finished
3/8/2011 8:22:20 PM Milkyway@home Sending scheduler request: To fetch work.
3/8/2011 8:22:20 PM Milkyway@home Reporting 1 completed tasks, requesting new tasks for GPU
3/8/2011 8:22:22 PM Milkyway@home Scheduler request completed: got 0 new tasks
3/8/2011 8:22:22 PM Milkyway@home Message from server: No work sent
3/8/2011 8:23:32 PM Milkyway@home Sending scheduler request: To fetch work.
3/8/2011 8:23:32 PM Milkyway@home Requesting new tasks for GPU
3/8/2011 8:23:34 PM Milkyway@home Scheduler request completed: got 0 new tasks
3/8/2011 8:23:34 PM Milkyway@home Message from server: No work sent
3/8/2011 8:25:44 PM Milkyway@home Sending scheduler request: To fetch work.
3/8/2011 8:25:44 PM Milkyway@home Requesting new tasks for GPU
3/8/2011 8:25:46 PM Milkyway@home Scheduler request completed: got 0 new tasks
3/8/2011 8:25:46 PM Milkyway@home Message from server: No work sent
3/8/2011 8:27:56 PM Milkyway@home Sending scheduler request: To fetch work.
3/8/2011 8:27:56 PM Milkyway@home Requesting new tasks for GPU
3/8/2011 8:27:58 PM Milkyway@home Scheduler request completed: got 0 new tasks
3/8/2011 8:27:58 PM Milkyway@home Message from server: No work sent
3/8/2011 8:34:03 PM Milkyway@home Sending scheduler request: To fetch work.
3/8/2011 8:34:03 PM Milkyway@home Requesting new tasks for GPU
3/8/2011 8:34:05 PM Milkyway@home Scheduler request completed: got 0 new tasks
3/8/2011 8:34:05 PM Milkyway@home Message from server: No work sent

FruehwF
Send message
Joined: 28 Feb 10
Posts: 120
Credit: 109,830,617
RAC: 0
Message 46517 - Posted: 9 Mar 2011 | 11:58:48 UTC

I have bought a HD4850 from the secondhand market (45 Euro) and brought it online 2 days ago.
Collatz did work fine from the beginning. From Milkyway I was a little bit disappointed for the little cache. I had to make an update request every half an hour. This was not practicable for a system which is 7/24 online.

But from today morning (MEZ) there is every 2 minutes :) an automatic update request and the system gets enough work. The turnaround time is now 0.03 Days.
If this behavior continues I would be satisfied. :)
My question: Was there a change on the server, or depends the interval of the requests on the individual statistic of my machine?

Settings:
BOINC Manager V 6.12.15

<global_preferences>
<run_on_batteries>1</run_on_batteries>
<run_if_user_active>1</run_if_user_active>
<run_gpu_if_user_active>1</run_gpu_if_user_active>
<suspend_cpu_usage>0.000000</suspend_cpu_usage>
<start_hour>0.000000</start_hour>
<end_hour>0.000000</end_hour>
<net_start_hour>0.000000</net_start_hour>
<net_end_hour>0.000000</net_end_hour>
<leave_apps_in_memory>0</leave_apps_in_memory>
<confirm_before_connecting>0</confirm_before_connecting>
<hangup_if_dialed>0</hangup_if_dialed>
<dont_verify_images>0</dont_verify_images>
<work_buf_min_days>0.000000</work_buf_min_days>
<work_buf_additional_days>0.500000</work_buf_additional_days>
<max_ncpus_pct>100.000000</max_ncpus_pct>
<cpu_scheduling_period_minutes>15.000000</cpu_scheduling_period_minutes>
<disk_interval>60.000000</disk_interval>
<disk_max_used_gb>10.000000</disk_max_used_gb>
<disk_max_used_pct>50.000000</disk_max_used_pct>
<disk_min_free_gb>0.100000</disk_min_free_gb>
<vm_max_used_pct>75.000000</vm_max_used_pct>
<ram_max_used_busy_pct>50.000000</ram_max_used_busy_pct>
<ram_max_used_idle_pct>90.000000</ram_max_used_idle_pct>
<max_bytes_sec_up>0.000000</max_bytes_sec_up>
<max_bytes_sec_down>0.000000</max_bytes_sec_down>
<cpu_usage_limit>25.000000</cpu_usage_limit>
<daily_xfer_limit_mb>0.000000</daily_xfer_limit_mb>
<daily_xfer_period_days>0</daily_xfer_period_days>
</global_preferences>

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 46522 - Posted: 10 Mar 2011 | 22:37:21 UTC - in response to Message 46517.

But from today morning (MEZ) there is every 2 minutes :) an automatic update request and the system gets enough work. The turnaround time is now 0.03 Days.
If this behavior continues I would be satisfied. :)
My question: Was there a change on the server, or depends the interval of the requests on the individual statistic of my machine?


Hi,

You are using a test version of BOINC manager V 6.12.15 which may be an issue.
However when running a few hours the statistics of your result do indeed matter and is anticipated. So you see MW runs smoothly.

By the way this is not a good thread for your question as it is for as the servers give no new work. And you get plenty of work...
____________
Greetings from,
TJ

FruehwF
Send message
Joined: 28 Feb 10
Posts: 120
Credit: 109,830,617
RAC: 0
Message 46534 - Posted: 11 Mar 2011 | 13:09:51 UTC - in response to Message 46522.


By the way this is not a good thread for your question as it is for as the servers give no new work. And you get plenty of work...


Thanks for the info, and sry for my spamming.

Jesse Viviano
Send message
Joined: 4 Feb 11
Posts: 60
Credit: 25,198,767
RAC: 0
Message 46553 - Posted: 14 Mar 2011 | 3:02:26 UTC - in response to Message 46517.

BOINC development version 6.12.18 is out. Testing 6.12.15 is not going to do the developers any good anymore.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 46571 - Posted: 16 Mar 2011 | 9:05:35 UTC

16/03/2011 8:03:31 PM | Milkyway@home | Incomplete read of 1084.000000 < 5KB for de_separation_20_3s_fix_4_709807_1300265776_search_parameters - truncating
16/03/2011 8:05:04 PM | Milkyway@home | [error] File de_separation_10_3s_fix_5_710538_1300266237_search_parameters has wrong size: expected 636, got 0

Arrrgghhh!

Cartoonman
Send message
Joined: 10 Dec 09
Posts: 18
Credit: 6,518,358
RAC: 0
Message 46596 - Posted: 17 Mar 2011 | 21:51:47 UTC
Last modified: 17 Mar 2011 | 21:54:43 UTC

Feeder, along with a bunch of other things, aren't running. can't get new tasks.


3/17/2011 5:52:19 PM Milkyway@home Scheduler request completed: got 0 new tasks
3/17/2011 5:52:19 PM Milkyway@home Message from server: Server error: feeder not running


http://milkyway.cs.rpi.edu/milkyway/server_status.php

Cartoonman
Send message
Joined: 10 Dec 09
Posts: 18
Credit: 6,518,358
RAC: 0
Message 46604 - Posted: 18 Mar 2011 | 3:07:13 UTC - in response to Message 46596.

/\ Feeder back online.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 46649 - Posted: 20 Mar 2011 | 12:06:16 UTC

Validator is down again. 18k+
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46683 - Posted: 22 Mar 2011 | 18:40:39 UTC

Everything is down again...

ouch!

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 46685 - Posted: 22 Mar 2011 | 20:34:28 UTC

Aaaarrrggghhhhh!

John Clark
Send message
Joined: 4 Oct 08
Posts: 1613
Credit: 62,056,968
RAC: 27,572
Message 46686 - Posted: 22 Mar 2011 | 20:42:13 UTC
Last modified: 22 Mar 2011 | 20:43:07 UTC

Aaaaarrrrgggghhh!


Soon to be downloading half a days Collatz work.
____________
Go away, I was asleep


Profile Keith Myers
Send message
Joined: 24 Jan 11
Posts: 51
Credit: 32,667,342
RAC: 54,679
Message 46687 - Posted: 22 Mar 2011 | 22:37:29 UTC

Don't jump the gun too soon! Everything is up and running again after the outage. 5K worth of work to send out available.
____________

vandiesel
Send message
Joined: 10 May 10
Posts: 27
Credit: 43,104,187
RAC: 0
Message 46690 - Posted: 23 Mar 2011 | 0:31:28 UTC

I already left, be back when dnetc breaks or when I can be ersed to change over again

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,777,096
RAC: 628
Message 46794 - Posted: 28 Mar 2011 | 21:58:59 UTC

Problems again. No new work and ready work cannot be sent.
Six servers are red. We have to wait a while.
____________
Greetings from,
TJ

Jesse Viviano
Send message
Joined: 4 Feb 11
Posts: 60
Credit: 25,198,767
RAC: 0
Message 46795 - Posted: 28 Mar 2011 | 22:22:29 UTC - in response to Message 46794.

I think that this outage might be intentional. The server has been sending out work units that cannot validate because either nobody can generate two results close enough to each other so that they can validate, or the validator fails and declares a validate error. Therefore, the flow of work units was probably stopped to fix what is wrong.

www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 46798 - Posted: 28 Mar 2011 | 23:22:38 UTC

Servers Back... happy crunching!

regards!

Profile heffalumpen
Avatar
Send message
Joined: 6 Nov 09
Posts: 5
Credit: 11,782,288
RAC: 8,049
Message 46829 - Posted: 29 Mar 2011 | 22:05:26 UTC - in response to Message 46798.

Servers Back... happy crunching!

regards!



And down again !!!
____________

Profile heffalumpen
Avatar
Send message
Joined: 6 Nov 09
Posts: 5
Credit: 11,782,288
RAC: 8,049
Message 46833 - Posted: 29 Mar 2011 | 23:26:54 UTC - in response to Message 46829.

And up again...... :-))
____________

sandor
Send message
Joined: 7 May 10
Posts: 8
Credit: 39,602,929
RAC: 68
Message 46897 - Posted: 2 Apr 2011 | 1:11:38 UTC

My Fan just spun down. Out of Work again.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 46904 - Posted: 2 Apr 2011 | 2:13:57 UTC

Aaaaarrrrgggghhhhhh!

Jesse Viviano
Send message
Joined: 4 Feb 11
Posts: 60
Credit: 25,198,767
RAC: 0
Message 46922 - Posted: 2 Apr 2011 | 7:33:43 UTC
Last modified: 2 Apr 2011 | 7:34:09 UTC

From what I know about BOINC and the server status page, there is plenty of work units to process. However, the scheduler processes cannot get to them because they use the feeder to maintain a shared connection to the database via the feeder, which maintains one or more database connections for the scheduler processes to use because this saves the overhead of creating and destroying a connection to the database each time a new scheduler process is spawned to deal with each scheduler request. Since the feeder is down, there is no way for the scheduler to contact the database.

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 46985 - Posted: 5 Apr 2011 | 23:51:39 UTC
Last modified: 5 Apr 2011 | 23:52:05 UTC

Feeders not running!!!
And for some reason I can't get thru to any of the admins!!
Backwards we go again!!!!

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 46986 - Posted: 5 Apr 2011 | 23:55:15 UTC

Hope springs eternal, the feeder came back up in about 10 mins - but there is now no GPU WUs available - AMD or NVidia. Only CPU WUs. Hey Ho ..... :)

Regards
Zy

Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist
Send message
Joined: 8 May 10
Posts: 576
Credit: 15,704,253
RAC: 0
Message 46987 - Posted: 5 Apr 2011 | 23:59:15 UTC - in response to Message 46985.

Feeders not running!!!
And for some reason I can't get thru to any of the admins!!
Backwards we go again!!!!
I was just restarting everything.

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 46988 - Posted: 6 Apr 2011 | 0:46:20 UTC
Last modified: 6 Apr 2011 | 0:58:45 UTC

Results ready to send 5
Results in progress 158,943
Workunits waiting for validation 17,284

How did we jump to this?????
Come on guys I love this project!
If I just wanted points I'd have all my guys on primegrid!!

Profile nenym
Send message
Joined: 16 Jan 09
Posts: 5
Credit: 28,130,382
RAC: 1,399
Message 46989 - Posted: 6 Apr 2011 | 1:53:25 UTC

There is a lot of tasks in my buffer. What has happend, new task limit or unwanted server code change?

Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist
Send message
Joined: 8 May 10
Posts: 576
Credit: 15,704,253
RAC: 0
Message 46991 - Posted: 6 Apr 2011 | 2:01:17 UTC - in response to Message 46989.

There is a lot of tasks in my buffer. What has happend, new task limit or unwanted server code change?
Working on the server right now. Don't log out or you won't be able to get in again right now...

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 915
Credit: 74,781,320
RAC: 237
Message 47001 - Posted: 6 Apr 2011 | 6:08:04 UTC
Last modified: 6 Apr 2011 | 6:08:54 UTC

I saw that I had 98 WU cached and went WTF, guess something is being changed in the background.

[edit]
Of course the only way I can read the thread is to hit post to thread.
[/edit]
____________

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47009 - Posted: 6 Apr 2011 | 9:50:22 UTC

OK big changes taking place but why is only one of my 5 getting any work?
keep getting:

4/6/2011 5:39:20 AM Milkyway@home Message from server: Tasks for CPU are available, but your preferences are set to not accept them

same settings on all the machines so why is only one getting work?
Plus, what's up with .98CPU usage???
This effectively blocks other cpu crunching??
I smell an app_info file somewhere but is anyone else having this kind of problem where all their machines are not getting work?

Thanks,
David

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47010 - Posted: 6 Apr 2011 | 10:00:12 UTC - in response to Message 47009.

Upload/Download server is still disabled, he's still beavering away.

Re CPU rate - thats been raised over at News forum, looks like its a consequence of the server upgrades. Doubt its got to the top of the to do list yet.

Regards
Zy

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47011 - Posted: 6 Apr 2011 | 10:09:15 UTC

Yeah, I seen the "Disabled" but didn't know if I could trust it seeing as my one guy keeps getting work.
Geez, my top guy has lost 35K RAC so far, donno if he's ever going to see that magic 1 MIL mark, LOL!
Well, I'm leaving my top guy waiting and send all the others to work elsewhere today while they iron things out.
Thanks,
David

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47012 - Posted: 6 Apr 2011 | 10:14:06 UTC

There's still some holes he's plugging. the large majority have not had a feed for getting on for 12 hours - looks like the spurt of WUs to a few middle of the night was more server code corrupted than anything. He jumped on that fairly quickly. Could even be that the larger downloads were a result of corrupted code, not deliberate cache increase, have to wait and see.

A few pots of coffee to go methinks .... :)

Regards
Zy

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47014 - Posted: 6 Apr 2011 | 10:19:56 UTC
Last modified: 6 Apr 2011 | 10:21:03 UTC

My one guy who's been getting the work has 80 wu's sitting in "Completed, validation inconclusive" state but still that system has climbed 2K in RAC so some are getting validated.
Not showing any errors.

And wowee what have we got here???

MilkyWay@home Version 2 Beta Applications | MilkyWay@Home Version 3 | MilkyWay@Home w/ GPU



--------------------------------------------------------------------------------

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47015 - Posted: 6 Apr 2011 | 10:24:50 UTC - in response to Message 47014.

Its indicating the level of work he's doing.

"BOINC Server Upgrade" may well be a euphamism for "Project Admin Nightmare", but not this much - he's usually too good to get caught this badly. Methinks there is much more to this than a "simple" upgrade.

Keeps the brain cells ticking over as we wait if nothing else :)

Regards
Zy

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47017 - Posted: 6 Apr 2011 | 10:40:08 UTC

OMG, Did I miss something big?
BOINC server upgrade??
Not just Milkyway?
I have been babysitting the kids (computers) remotely lately as my "real job" here has been taking ALL my time!!

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47018 - Posted: 6 Apr 2011 | 10:50:35 UTC

I have been babysitting the kids (computers)......


rofl ..... love it :)

Regards
Zy

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47019 - Posted: 6 Apr 2011 | 11:13:23 UTC

Well, yeah! there are parallels here.
As I got serious last summer with this crunching hobbie, I kept buying components and putting together new machines.
It's been a learning process to see how different things work together best and alot of times overclocking(since I just couldn't leave well enough alone,lol)everything can go really crazy and some configurations would "behave" better than others.
So they took on lives of their own over time and I have learned a bit and can now say that my kids are all very well behaved (most of the time)
And they do keep "Daddy" busy and out of trouble!!

vandiesel
Send message
Joined: 10 May 10
Posts: 27
Credit: 43,104,187
RAC: 0
Message 47020 - Posted: 6 Apr 2011 | 11:14:52 UTC - in response to Message 47009.
Last modified: 6 Apr 2011 | 11:21:41 UTC

Yep only one machine could get work out of 3, I have not checked the cpu usage that you refer too.


edit-small rant


Dnetc down for me, 4 machines playing up big time with the running out of hcc over at wcg, the other wcg hfcc/muscular dystrophy projects couple with gpu crunching are taking its toll on my wcg rac, been working at them for the past three days, looks like I have finally found some relief

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47021 - Posted: 6 Apr 2011 | 11:18:01 UTC
Last modified: 6 Apr 2011 | 11:18:34 UTC

I didn't get to check the "real" cpu usage
Was just commenting on the message bar saying instead of the .05 I'm used to seeing it said .98!!
And now that machine has run out of work too.

vandiesel
Send message
Joined: 10 May 10
Posts: 27
Credit: 43,104,187
RAC: 0
Message 47023 - Posted: 6 Apr 2011 | 11:53:04 UTC

The machine that was left with mw running in conjunction with wcg(cpu crunching) the cpu usage was fluctuating it should be at 100% but was going from 80/90, I switch it over to collatz and back to 100% cpu utilization. **** this this is becoming more like work rather than a hobby

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47035 - Posted: 6 Apr 2011 | 14:35:59 UTC

I was reminded by Beyond in another thread about running with an app_info.xml and once I did that the work units flowed like water!!

Thanks again Beyond!!!

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47101 - Posted: 7 Apr 2011 | 1:15:02 UTC
Last modified: 7 Apr 2011 | 1:15:47 UTC

Simply incredible!
I just woke up from taking a little nap to find my no.1 guy
OUT OF WORK!!
All the others still have plenty of work

4/6/2011 9:02:52 PM Milkyway@home Sending scheduler request: To fetch work.
4/6/2011 9:02:52 PM Milkyway@home Requesting new tasks for GPU
4/6/2011 9:02:53 PM Milkyway@home Scheduler request completed: got 0 new tasks
4/6/2011 9:02:53 PM Milkyway@home Message from server: No work available
4/6/2011 9:04:59 PM Milkyway@home Sending scheduler request: To fetch work.
4/6/2011 9:04:59 PM Milkyway@home Requesting new tasks for GPU
4/6/2011 9:05:00 PM Milkyway@home Scheduler request completed: got 0 new tasks
4/6/2011 9:05:00 PM Milkyway@home Message from server: No work available

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47102 - Posted: 7 Apr 2011 | 1:18:08 UTC

They have turned off supply for the moment whilst Travis works on the server. He's hoping to get it back up tonight.

Regards
Zy

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47104 - Posted: 7 Apr 2011 | 1:32:11 UTC
Last modified: 7 Apr 2011 | 1:35:30 UTC

oh ok thats what I get for nappin!! LOL
And wow, I didn't think the others were THAT much slower!!

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47107 - Posted: 7 Apr 2011 | 1:47:20 UTC - in response to Message 47104.

Incoming ..... rofl :)

Regards
Zy

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 47108 - Posted: 7 Apr 2011 | 2:26:21 UTC
Last modified: 7 Apr 2011 | 2:37:56 UTC

I don't get it!
my slowest cruncher is getting new wu's but my no.1 gets nada!!!
Of course I have changed nothing from before when I had 1500 wu cache!!!
These new wu's are coming in trickles but why is the main guy getting absolutely nothing?
I've had enough, I will now shut up and let them figure it all out!!

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47109 - Posted: 7 Apr 2011 | 2:39:33 UTC

Its going to be unpredictable whilst Travis is working on it. Could be short time - could be long - never know in server sagas .....

I hear the frustration, and understand it, but its life. It will take as long as it takes to fix. Travis will be just as keen to get it going.

Just have to wait it out.

Regards
Zy

sandor
Send message
Joined: 7 May 10
Posts: 8
Credit: 39,602,929
RAC: 68
Message 47137 - Posted: 7 Apr 2011 | 16:12:29 UTC

If I don't get new WU soon I'll be forced to turn up my heat. :o

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47141 - Posted: 7 Apr 2011 | 17:00:02 UTC - in response to Message 47137.

Nip over to DNETC, that'll keep the circulation going.

Take a fire extinguisher though :)

Regards
Zy

Joshrandom
Avatar
Send message
Joined: 6 Mar 08
Posts: 3
Credit: 63,015,221
RAC: 79,748
Message 47144 - Posted: 7 Apr 2011 | 17:35:39 UTC - in response to Message 47141.

Sadly DNETC seems to currently be in an even worse position than Milkyway. :(

vandiesel
Send message
Joined: 10 May 10
Posts: 27
Credit: 43,104,187
RAC: 0
Message 47145 - Posted: 7 Apr 2011 | 17:40:27 UTC

nothing for 2 days, site down, not looking to good for dnetc

Heavy Metal Dungeon Keeper
Avatar
Send message
Joined: 31 Aug 07
Posts: 21
Credit: 21,003,952
RAC: 0
Message 47146 - Posted: 7 Apr 2011 | 17:53:19 UTC

An update about Dnetc here:

http://boincstats.com/forum/forum_thread.php?id=6315
____________

Profile Chris
Avatar
Send message
Joined: 16 Dec 10
Posts: 42
Credit: 93,121,110
RAC: 171,315
Message 47150 - Posted: 7 Apr 2011 | 18:56:38 UTC
Last modified: 7 Apr 2011 | 19:00:18 UTC

Uii, i don't know yet that i can get piratet windows with Dnetc units.... :-p

OK, lets wait and let collatz smile....
____________


www.MeuPicanto.com
Send message
Joined: 1 Feb 11
Posts: 24
Credit: 26,780,798
RAC: 0
Message 47241 - Posted: 9 Apr 2011 | 16:13:26 UTC

No more WU? that's it? All servers green and nothing... this is happening with all you guys? Thanks!

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,353,250
RAC: 234,091
Message 47242 - Posted: 9 Apr 2011 | 16:34:55 UTC - in response to Message 47241.

Take a peek at the News Forum. Project down at present, they are putting humpty back together. Be a while yet. Also note the deprecation of MW AMD Applications on restart - a new AMD app is going to be distributed, and if you run app_info file, you'll need to change it. If you dont, no worries, all happens automatically.

Meanwhile I suggest you run some WUs from another project for a bit - going to be a while here, tonight Eastern time or tomorrow, difficult to say, just keep an eye on MW for the restart.

Regards
Zy

Post to thread

Message boards : Number crunching : Aaargh! Servers are out of new work!(2)"


Main page · Your account · Message boards


Copyright © 2013 AstroInformatics Group