Aaargh! Server out of new work!
log in

Advanced search

Message boards : Number crunching : Aaargh! Server out of new work!

Author Message
John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 39188 - Posted: 27 Apr 2010 | 14:03:28 UTC
Last modified: 27 Apr 2010 | 14:07:30 UTC

At this moment the server has no new work to send and my limited Milkyway caches are dry, and so am I!

HELP!
____________
Go away, I was asleep


CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 39189 - Posted: 27 Apr 2010 | 14:18:30 UTC

+1
____________

Profile David Glogau*
Avatar
Send message
Joined: 12 Aug 09
Posts: 172
Credit: 645,240,165
RAC: 0
Message 39191 - Posted: 27 Apr 2010 | 14:20:07 UTC

Typical, just as I pass 200 million everything falls over!
____________

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,339,711
RAC: 399,830
Message 39195 - Posted: 27 Apr 2010 | 14:41:45 UTC

Well Travis did acknowledge impending doom & gloom :)

Murphy is alive and well!

Regards
Zy

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 39197 - Posted: 27 Apr 2010 | 14:49:30 UTC
Last modified: 27 Apr 2010 | 14:49:51 UTC

so it gonna be a week of the server unavailability?
http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=1713&nowrap=true#39165
____________

Per Lindstrom
Send message
Joined: 12 Apr 10
Posts: 3
Credit: 5,198,376
RAC: 0
Message 39204 - Posted: 27 Apr 2010 | 17:32:37 UTC - in response to Message 39191.
Last modified: 27 Apr 2010 | 17:44:39 UTC

Typical, just as I pass 200 million everything falls over!


200 million, congrats!

A link from Collatz message board.

http://boinc.thesonntags.com/collatz/forum_thread.php?id=472

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 39206 - Posted: 27 Apr 2010 | 17:40:26 UTC - in response to Message 39191.

Typical, just as I pass 200 million everything falls over!

congrats, man :-) good job and RAC :-)

I switched to collatz for a while
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 39215 - Posted: 27 Apr 2010 | 19:13:11 UTC

Been running Collatz since I started this thread.

This is the third break in work generation in the time I am trying to see what the steady RAC my ATI GPU will produce. It produced an RAC of 154K on Collatz, and I am hoping to break 200K but these server "no work" is side stepping this ambition. ;-((
____________
Go away, I was asleep


Profile Cori
Avatar
Send message
Joined: 27 Aug 07
Posts: 647
Credit: 27,592,547
RAC: 0
Message 39217 - Posted: 27 Apr 2010 | 20:03:30 UTC - in response to Message 39188.

... my limited Milkyway caches are dry, and so am I! ...

Thank god I'm drinking a fresh cold beer so just my ATI card has run dry so far.
____________
Lovely greetings, Cori

Per Lindstrom
Send message
Joined: 12 Apr 10
Posts: 3
Credit: 5,198,376
RAC: 0
Message 39225 - Posted: 27 Apr 2010 | 21:41:58 UTC

An odd thing, I have exactly 50 tasks waiting for credit, is this an coincidence?

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 39227 - Posted: 27 Apr 2010 | 22:07:05 UTC - in response to Message 39225.

It could well be related -- the number of work units awaiting validation is way up as well. I'm guessing the status page is doing the 'politician' thing and not representing things as they are (aside from the no work status).

An odd thing, I have exactly 50 tasks waiting for credit, is this an coincidence?


____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 39259 - Posted: 28 Apr 2010 | 14:31:13 UTC
Last modified: 28 Apr 2010 | 14:31:58 UTC

Work available again.

Travis has kicked the servers, the work waiting validation is dropping quickly.
____________
Go away, I was asleep


Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 39389 - Posted: 3 May 2010 | 2:54:12 UTC

Validator crapped. Also no tasks.

Workunits waiting for validation 17,366

____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 30 Aug 07
Posts: 1976
Credit: 26,480
RAC: 0
Message 39402 - Posted: 3 May 2010 | 16:00:45 UTC - in response to Message 39389.

Dropped a kung fu bomb on the server last night, looks like things are running. I'm pretty close to tracking down the problem -- it seems after a couple days of uptime it can have a memory leak. Trying to track it down (but it'll take a little time as I have to wait for it to happen again).
____________

Profile Cori
Avatar
Send message
Joined: 27 Aug 07
Posts: 647
Credit: 27,592,547
RAC: 0
Message 39404 - Posted: 3 May 2010 | 19:31:22 UTC - in response to Message 39402.

Dropped a kung fu bomb on the server last night...

You can ask me next time, I'm known as notorious 'Kaboom girlie' in BOINCdom...



*LOL*
____________
Lovely greetings, Cori

Per Lindstrom
Send message
Joined: 12 Apr 10
Posts: 3
Credit: 5,198,376
RAC: 0
Message 39491 - Posted: 7 May 2010 | 13:58:52 UTC
Last modified: 7 May 2010 | 13:59:45 UTC

Message from server: No work available.
Switching to Collatz.

hoarfrost
Send message
Joined: 4 Sep 09
Posts: 20
Credit: 123,873,987
RAC: 0
Message 39492 - Posted: 7 May 2010 | 14:09:20 UTC

+1

NGS~R.Stanneveld
Send message
Joined: 13 Dec 09
Posts: 5
Credit: 38,436,391
RAC: 0
Message 39493 - Posted: 7 May 2010 | 14:22:47 UTC

+1
____________

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 39494 - Posted: 7 May 2010 | 14:30:44 UTC

+1
____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 39495 - Posted: 7 May 2010 | 14:31:09 UTC

+1

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 39498 - Posted: 7 May 2010 | 14:56:31 UTC

Just noticed that instead of seeing FreeHAL and Einstein running on the CPU and Milkyway running on the GPU, I only see Einstein (and there is serious upload and download problems problems on theat project.

Now over to Collatz for a quarter of a day's work.
____________
Go away, I was asleep


SerVal
Send message
Joined: 1 Nov 08
Posts: 1
Credit: 26,534,297
RAC: 0
Message 39501 - Posted: 7 May 2010 | 15:59:57 UTC
Last modified: 7 May 2010 | 16:04:07 UTC

No work available ...
Until WU exec time 1.5 minutes(my Rad-5870), we will have same situation very soon.
SerVal, Moscow, Russia.

Divide Overflow
Avatar
Send message
Joined: 16 Feb 09
Posts: 109
Credit: 11,089,510
RAC: 0
Message 39502 - Posted: 7 May 2010 | 17:44:18 UTC

I hope Travis was able to get information on the memory leak he was looking for!

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 30 Aug 07
Posts: 1976
Credit: 26,480
RAC: 0
Message 39504 - Posted: 7 May 2010 | 19:12:42 UTC - in response to Message 39502.

Sadly, it wasn't the memory leak but something else, looking into it right now.
____________

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 39505 - Posted: 7 May 2010 | 20:58:54 UTC

Looks like it is back up as I just got work. Of course I also turned on Collatz as I will be away this weekend and I want to keep the card running.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 39552 - Posted: 9 May 2010 | 9:02:42 UTC

Anybody having trouble uploading results?

This seems to be intermittent, and I am getting this in the BM Messages tab -

09/05/2010 09:57:14 Milkyway@home update requested by user
09/05/2010 09:57:25 Milkyway@home Sending scheduler request: Requested by user.
09/05/2010 09:57:25 Milkyway@home Reporting 10 completed tasks, requesting new tasks for GPU
09/05/2010 09:57:46 Project communication failed: attempting access to reference site
09/05/2010 09:57:48 Internet access OK - project servers may be temporarily down.
09/05/2010 09:57:50 Milkyway@home Scheduler request failed: Failure when receiving data from the peer



____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 39555 - Posted: 9 May 2010 | 9:57:31 UTC

Must be due to the pentathlon, which is now concentrating on MW for the next week.

I've already seen BOINC hit my backup project.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 39603 - Posted: 11 May 2010 | 7:00:58 UTC

ARRRGH! Out again.

Server Status 0 ready to send.

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 30 Aug 07
Posts: 1976
Credit: 26,480
RAC: 0
Message 39604 - Posted: 11 May 2010 | 7:14:55 UTC - in response to Message 39603.

GDI.
____________

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 30 Aug 07
Posts: 1976
Credit: 26,480
RAC: 0
Message 39605 - Posted: 11 May 2010 | 7:20:31 UTC - in response to Message 39604.

Should be back up and running now.

Time to add another bug to our longstanding issues list:

Travis cracking open a beer after a long day causes the server to crash.

To the previous list of:

Travis going to sleep frequently causes the server to crash.
and
Travis going on a ski/snowboarding trip frequently causes the server to crash.
____________

Profile Tomasz R. Gwiazda
Avatar
Send message
Joined: 23 Mar 09
Posts: 13
Credit: 100,032,796
RAC: 0
Message 39606 - Posted: 11 May 2010 | 7:26:57 UTC
Last modified: 11 May 2010 | 7:28:07 UTC

don't see any progres.
On server status page there are WU
on host: No work available


Edit: Now it's working.
Thanks
____________

Join us at www.boincatpoland.org

Odd-Rod
Send message
Joined: 7 Sep 07
Posts: 355
Credit: 989,303
RAC: 889
Message 39608 - Posted: 11 May 2010 | 9:27:20 UTC - in response to Message 39605.

Time to add another bug to our longstanding issues list:

Travis cracking open a beer after a long day causes the server to crash.

To the previous list of:

Travis going to sleep frequently causes the server to crash.
and
Travis going on a ski/snowboarding trip frequently causes the server to crash.

Lets see now, gathering all the words that are common to all three of the above:
"Travis causes the server to crash" ;-)
I suggest you have another beer after that revelation!
Travis, it would appear that you and Murphy are well acquainted.
Time to remove tongue from cheek...

Profile Cori
Avatar
Send message
Joined: 27 Aug 07
Posts: 647
Credit: 27,592,547
RAC: 0
Message 39618 - Posted: 11 May 2010 | 19:19:13 UTC - in response to Message 39608.

Time to add another bug to our longstanding issues list:

Travis cracking open a beer after a long day causes the server to crash.

To the previous list of:

Travis going to sleep frequently causes the server to crash.
and
Travis going on a ski/snowboarding trip frequently causes the server to crash.

Lets see now, gathering all the words that are common to all three of the above:
"Travis causes the server to crash" ;-)
I suggest you have another beer after that revelation!
Travis, it would appear that you and Murphy are well acquainted.
Time to remove tongue from cheek...


1+ *smiles*
____________
Lovely greetings, Cori

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 30 Aug 07
Posts: 1976
Credit: 26,480
RAC: 0
Message 39637 - Posted: 12 May 2010 | 4:48:33 UTC - in response to Message 39608.

Time to add another bug to our longstanding issues list:

Travis cracking open a beer after a long day causes the server to crash.

To the previous list of:

Travis going to sleep frequently causes the server to crash.
and
Travis going on a ski/snowboarding trip frequently causes the server to crash.

Lets see now, gathering all the words that are common to all three of the above:
"Travis causes the server to crash" ;-)
I suggest you have another beer after that revelation!
Travis, it would appear that you and Murphy are well acquainted.
Time to remove tongue from cheek...


Well technically I'd have to say at least 90% of the server crashes were because of me in some way shape or form :) Updating code and causing all those pesky bugs!
____________

Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 128
Credit: 203,351,125
RAC: 1,217
Message 39769 - Posted: 17 May 2010 | 1:44:19 UTC

Is Murphy is sitting in for Travis tonight?
____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 39775 - Posted: 17 May 2010 | 9:50:02 UTC

Murphy must have been offered a beer down the pub...things look good again.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 39776 - Posted: 17 May 2010 | 11:31:28 UTC - in response to Message 39775.

Murphy must have been offered a beer down the pub...things look good again.




Ssshhh, there!

Tempting fate are we?
____________
Go away, I was asleep


Haris Dublas
Send message
Joined: 25 Feb 10
Posts: 49
Credit: 10,136,474
RAC: 0
Message 39801 - Posted: 18 May 2010 | 16:25:52 UTC

Cant be avoided. Lots of people will run out of work with hosts like this, http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=171705.

That host have over 2000 tasks in cache while my 3850 is limited to only 6 tasks in progress. *sigh*

Profile Paul D. Buck
Send message
Joined: 12 Apr 08
Posts: 621
Credit: 161,934,067
RAC: 0
Message 39804 - Posted: 18 May 2010 | 16:39:00 UTC - in response to Message 39801.

Cant be avoided. Lots of people will run out of work with hosts like this, http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=171705.

That host have over 2000 tasks in cache while my 3850 is limited to only 6 tasks in progress. *sigh*

Don't know how that happened! I sure hope Travis takes a look at that one ... no way he should be getting more than 24 tasks at a time ... something bad is happening ... and it does not seem like he is returning any at all either ...

Chris S
Avatar
Send message
Joined: 20 Sep 08
Posts: 1357
Credit: 173,075,472
RAC: 13
Message 39805 - Posted: 18 May 2010 | 17:13:22 UTC

He's also using a very old version of Boinc and MW app. Time to pull the plug methinks....

____________
Don't drink water, that's the stuff that rusts pipes

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 39815 - Posted: 19 May 2010 | 1:39:58 UTC - in response to Message 39805.

He's also using a very old version of Boinc and MW app. Time to pull the plug methinks....

I'm using 5.10.45, no problem with it. I don't see those as the problem. Seems to be 2000+ ghost wus.


How about posting some of the messages from the timeframe the tasks kept downloading?
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Haris Dublas
Send message
Joined: 25 Feb 10
Posts: 49
Credit: 10,136,474
RAC: 0
Message 39820 - Posted: 19 May 2010 | 4:19:25 UTC

Somebody poke Travis, the tasks are now 2598. :)

That host is a wingman. At this rate, he might become a wingman on future tasks as well and we'll be stuck in pending limbo.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 39831 - Posted: 19 May 2010 | 14:34:51 UTC - in response to Message 39820.

I would suggest putting MW on NNW untill this is fixed.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Haris Dublas
Send message
Joined: 25 Feb 10
Posts: 49
Credit: 10,136,474
RAC: 0
Message 39836 - Posted: 19 May 2010 | 16:34:12 UTC
Last modified: 19 May 2010 | 16:38:02 UTC

That host is not mine, it belongs to Lord Nelloz. I only found out about it after I checked a task of mine that's been inconclusive for a week now.

By the way, that host have 2817 tasks now.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 39970 - Posted: 27 May 2010 | 8:34:24 UTC

Arrrgghhh.

Profile Simplex0
Avatar
Send message
Joined: 11 Nov 07
Posts: 232
Credit: 178,221,048
RAC: 0
Message 39972 - Posted: 27 May 2010 | 9:33:11 UTC

Gnnnnnnnnh! :)

Profile magyarficko
Send message
Joined: 22 Jan 09
Posts: 35
Credit: 46,731,190
RAC: 0
Message 39973 - Posted: 27 May 2010 | 9:49:39 UTC


Pbbfflliitt!


____________

Profile AriZonaMoon*
Avatar
Send message
Joined: 29 Sep 08
Posts: 1610
Credit: 41,867,203
RAC: 0
Message 39979 - Posted: 27 May 2010 | 15:37:10 UTC


haha - I dont think I should answer that one officially. LoLz
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 39981 - Posted: 27 May 2010 | 16:25:10 UTC

I see the Server Status shows them all working (green), but there is no work yet.

Anyone know when new work might be created and issued?
____________
Go away, I was asleep


Brickhead
Avatar
Send message
Joined: 20 Mar 08
Posts: 92
Credit: 1,558,924,535
RAC: 991,951
Message 39982 - Posted: 27 May 2010 | 16:30:38 UTC - in response to Message 39981.

Could it be that http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=171705 is soaking it all up? That host's task count has now increased to 2920 :(
____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 39986 - Posted: 27 May 2010 | 17:15:18 UTC - in response to Message 39982.

Could it be that http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=171705 is soaking it all up? That host's task count has now increased to 2920 :(


How come that guy haven't been banned yet?

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 39989 - Posted: 27 May 2010 | 17:21:45 UTC

Well boys and girls,

After a few times hitting the "Update" button, I got new stuff to crunch again.
____________
Greetings from,
TJ

Profile Mankka*
Send message
Joined: 10 Mar 10
Posts: 2
Credit: 51,125,508
RAC: 64
Message 39991 - Posted: 27 May 2010 | 17:42:37 UTC - in response to Message 39989.

I also got fresh WU's (was quite dry for some hours), seems we are up and running again =)

Profile Blurf
Volunteer moderator
Project administrator
Send message
Joined: 13 Mar 08
Posts: 617
Credit: 25,447,954
RAC: 0
Message 39992 - Posted: 27 May 2010 | 19:41:20 UTC - in response to Message 39986.

Could it be that http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=171705 is soaking it all up? That host's task count has now increased to 2920 :(


How come that guy haven't been banned yet?


The information on this cruncher has been sent up to Travis to be investigated further.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 40773 - Posted: 3 Jul 2010 | 21:03:54 UTC

Although the Server Status says all is running OK, we have been out of work for 8+ hours now.

Looks like we need someone to kick the servers soon?
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 40846 - Posted: 8 Jul 2010 | 7:45:28 UTC
Last modified: 8 Jul 2010 | 7:48:33 UTC

Aaargh!

The servers have no work ready to dispense.

Kick the servers, we are nearly starving.


Travis must be asleep (most likely) or have gone away snowboarding?

____________
Go away, I was asleep


SkyeHunter
Send message
Joined: 6 Mar 09
Posts: 41
Credit: 38,856,291
RAC: 0
Message 40847 - Posted: 8 Jul 2010 | 7:49:16 UTC
Last modified: 8 Jul 2010 | 7:49:30 UTC

Must be... The trolls are acting up again...

scottishwebcamslive.com
Avatar
Send message
Joined: 10 Oct 07
Posts: 79
Credit: 69,337,972
RAC: 0
Message 40848 - Posted: 8 Jul 2010 | 9:28:15 UTC

Hello,

This is rediculous half on half off !!
Some work really needs to be done to the continuous output of work from the server on this project

best regards
Ian
____________
....Please Join team Scotland HERE

Profile Simplex0
Avatar
Send message
Joined: 11 Nov 07
Posts: 232
Credit: 178,221,048
RAC: 0
Message 40849 - Posted: 8 Jul 2010 | 10:02:14 UTC

I wonder if the Milkyway server was made by the same guy that made this Alarm Clock?

scottishwebcamslive.com
Avatar
Send message
Joined: 10 Oct 07
Posts: 79
Credit: 69,337,972
RAC: 0
Message 40850 - Posted: 8 Jul 2010 | 11:11:05 UTC
Last modified: 8 Jul 2010 | 11:12:42 UTC

Hi,

Nice one lol
I just wonder if the thing has Texas Instruments on the front :))

best regards
Ian
____________
....Please Join team Scotland HERE

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 40852 - Posted: 8 Jul 2010 | 11:59:10 UTC

Still nowt to crunch!
____________
Go away, I was asleep


Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 40854 - Posted: 8 Jul 2010 | 17:15:23 UTC

Validator crapped out again.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Chris S
Avatar
Send message
Joined: 20 Sep 08
Posts: 1357
Credit: 173,075,472
RAC: 13
Message 40855 - Posted: 8 Jul 2010 | 18:20:54 UTC

OK, back to Collatz, sigh ....
____________
Don't drink water, that's the stuff that rusts pipes

Profile Scarecrow
Avatar
Send message
Joined: 30 Jan 09
Posts: 56
Credit: 73,625
RAC: 0
Message 40859 - Posted: 9 Jul 2010 | 2:32:51 UTC


Told ya so.

Skysnake
Send message
Joined: 31 Oct 09
Posts: 20
Credit: 11,930,190
RAC: 101
Message 40900 - Posted: 13 Jul 2010 | 21:02:41 UTC

and again no work :(

Profile Frank van den Brink
Avatar
Send message
Joined: 18 Jul 09
Posts: 7
Credit: 21,008,342
RAC: 0
Message 40901 - Posted: 13 Jul 2010 | 21:20:39 UTC - in response to Message 40900.

My GPU-fan went eerily silent a couple of minutes ago. I guess all those Collatz crunchers returned the favour with Collatz down and sucked our well dry as well :-)
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 40902 - Posted: 13 Jul 2010 | 21:23:33 UTC
Last modified: 13 Jul 2010 | 22:11:05 UTC

Argh!

The servers have little or no work to distribute, so we are in for another dry period until the servers are kicked.

It looks like the validator has a backlog of over 20K results to clear (again)

Oh woe is me, with Collatz offline as well


Just attached to DNETC as well for my HD4850 and the old AGP HD3850 so when Milkyway caches (24 WUS) are empty, then DNETC will crunch until MW returns.

On the older ATI cards the output looks like it is less productive, in output terms, than MW (lower RACs per GPU).

____________
Go away, I was asleep


ztmike
Send message
Joined: 4 Jun 09
Posts: 45
Credit: 447,355
RAC: 0
Message 40903 - Posted: 13 Jul 2010 | 21:35:43 UTC

Noooooooo

I just got my ati 5870 and amd x6 rig with every part now in it and no work to do :(

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 40905 - Posted: 13 Jul 2010 | 21:57:35 UTC

I also keep an account on DNETC, just in case. I downloaded 50 5xxx units to keep my GPU busy.
____________

Profile David Glogau*
Avatar
Send message
Joined: 12 Aug 09
Posts: 172
Credit: 645,240,165
RAC: 0
Message 40906 - Posted: 14 Jul 2010 | 0:29:45 UTC

This is great news. Time to swap out a failing PSU and do some maintainence

Dusting now, caugh, caugh.
____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 40912 - Posted: 14 Jul 2010 | 5:45:08 UTC

out of work. this starts to happen so often.

ztmike
Send message
Joined: 4 Jun 09
Posts: 45
Credit: 447,355
RAC: 0
Message 40913 - Posted: 14 Jul 2010 | 6:06:32 UTC - in response to Message 40912.
Last modified: 14 Jul 2010 | 6:07:44 UTC

out of work. this starts to happen so often.


And its going to be out of work for awhile ..

http://milkyway.cs.rpi.edu/milkyway/server_status.php

Its got nothing to send..and with a growing number of people getting their WUs done that have yet passed validation..I imagine its going to be another mad rush when the server does get up again..

I got to say though..this is getting silly for server downtime. I think they need a new guy behind the scenes for the server or something..

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 40914 - Posted: 14 Jul 2010 | 7:26:43 UTC - in response to Message 40913.

out of work. this starts to happen so often.


And its going to be out of work for awhile ..

http://milkyway.cs.rpi.edu/milkyway/server_status.php

Its got nothing to send..and with a growing number of people getting their WUs done that have yet passed validation..I imagine its going to be another mad rush when the server does get up again..

I got to say though..this is getting silly for server downtime. I think they need a new guy behind the scenes for the server or something..


I believe that the guy is good enough but the hardware lacks the capability to catch up. Maybe it is time to renew or upgrade the server.

Profile Scarecrow
Avatar
Send message
Joined: 30 Jan 09
Posts: 56
Credit: 73,625
RAC: 0
Message 40915 - Posted: 14 Jul 2010 | 8:48:34 UTC

Look at bright side... just think how boring the graphs would be without the occasional high hills and deep, wide valleys.
Then again, boring isn't so bad.

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 40916 - Posted: 14 Jul 2010 | 9:03:35 UTC - in response to Message 40914.

out of work. this starts to happen so often.


And its going to be out of work for awhile ..

http://milkyway.cs.rpi.edu/milkyway/server_status.php

Its got nothing to send..and with a growing number of people getting their WUs done that have yet passed validation..I imagine its going to be another mad rush when the server does get up again..

I got to say though..this is getting silly for server downtime. I think they need a new guy behind the scenes for the server or something..


I believe that the guy is good enough but the hardware lacks the capability to catch up. Maybe it is time to renew or upgrade the server.


More and more people use faster GPU's, which results in more wu's to be processed. Maybe the next part, which lacks the required capabilities is the network (collatz!!!).
This is a good example how evolution works!
@ all responsible stuff: please think about a solution for at least 6 months or so!

Regards,
Alexander

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 40917 - Posted: 14 Jul 2010 | 9:10:10 UTC - in response to Message 40915.

Look at bright side...


Hmmm ...

Always look at the byte side of life !
There's a song as well ..

Alexander

[boinc.at] Nowi
Send message
Joined: 22 Mar 09
Posts: 89
Credit: 343,138,406
RAC: 346,055
Message 40918 - Posted: 14 Jul 2010 | 9:16:27 UTC
Last modified: 14 Jul 2010 | 9:17:46 UTC

If it is possible from the science, much longer WU could be a solution. It will reduce traffic and server load. Of course then a checkpoint-system must be implemented in the application code.

Maybe the constraint is not the server, but the brain...
____________

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 40919 - Posted: 14 Jul 2010 | 10:32:47 UTC - in response to Message 40918.

Maybe the constraint is not the server, but the brain...


This is the reason why I will not join this team.

Skysnake
Send message
Joined: 31 Oct 09
Posts: 20
Credit: 11,930,190
RAC: 101
Message 40920 - Posted: 14 Jul 2010 | 10:37:32 UTC

There is 1 WU available, let´s look who get it ;)

Perhaps the server works now

scottishwebcamslive.com
Avatar
Send message
Joined: 10 Oct 07
Posts: 79
Credit: 69,337,972
RAC: 0
Message 40925 - Posted: 14 Jul 2010 | 11:48:25 UTC

Hello,

I must admit not usually being a complainer but having to baby sit this project through server downtime is a bit of a pain now
I would really like to just hook the system up leave it crunching and forget about it but now find myself having to check up on it way to often

more workunits per core has been discussed and ruled out the above suggestion of longer work units would cure two problems 1. less down time for our machines 2. less spin down down and spin up between work units
this idea also doesnt seem to be getting any attention for some reason

collatz doesnt quite give the same credit but seems online more often than here and dnetc seems to give more credit per gpu work done using longer work units

I'm stuck here just now because neither of the two projects above will work with my 3 ATI cards in crossfire but the longer this fiasco with the server carries on the more likely I am to set up another couple of machines splitting the cards so they each can work on the projects above

hopefully the hardware IT dept will announce on here there next set of upgrades soon so we can have a more consistant flow of work

best regards
Ian
____________
....Please Join team Scotland HERE

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 40932 - Posted: 14 Jul 2010 | 19:23:38 UTC - in response to Message 40920.

There is 1 WU available, let´s look who get it ;)

Perhaps the server works now


I just got ~20 wu's.
Don't give up!

Profile Simplex0
Avatar
Send message
Joined: 11 Nov 07
Posts: 232
Credit: 178,221,048
RAC: 0
Message 40933 - Posted: 14 Jul 2010 | 19:52:45 UTC - in response to Message 40932.

There is 1 WU available, let´s look who get it ;)

Perhaps the server works now


I just got ~20 wu's.
Don't give up!


Dang! I just down loaded wu's for Folding@home and they will take
hours to finish and not just 1 minute like the Milkyway wu's.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 40934 - Posted: 14 Jul 2010 | 20:43:02 UTC - in response to Message 40925.

Hello,

I must admit not usually being a complainer but having to baby sit this project through server downtime is a bit of a pain now
I would really like to just hook the system up leave it crunching and forget about it but now find myself having to check up on it way to often

more workunits per core has been discussed and ruled out the above suggestion of longer work units would cure two problems 1. less down time for our machines 2. less spin down down and spin up between work units
this idea also doesnt seem to be getting any attention for some reason

collatz doesnt quite give the same credit but seems online more often than here and dnetc seems to give more credit per gpu work done using longer work units

I'm stuck here just now because neither of the two projects above will work with my 3 ATI cards in crossfire but the longer this fiasco with the server carries on the more likely I am to set up another couple of machines splitting the cards so they each can work on the projects above

hopefully the hardware IT dept will announce on here there next set of upgrades soon so we can have a more consistant flow of work

best regards
Ian

Why do you need to babysit this project? Just set a backup project and let BOINC do its thing. Yes it is annoying when MW stops but with two other project that support ATI cards your ATI GPUs shouldn't run out of work.

scottishwebcamslive.com
Avatar
Send message
Joined: 10 Oct 07
Posts: 79
Credit: 69,337,972
RAC: 0
Message 40935 - Posted: 14 Jul 2010 | 23:31:52 UTC - in response to Message 40934.

hi,

great idea but in my case my rig with three crossfired cards wont run the other two projects so if this one dies thats it :(

best regards
Ian
____________
....Please Join team Scotland HERE

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 40937 - Posted: 15 Jul 2010 | 6:12:04 UTC - in response to Message 40934.
Last modified: 15 Jul 2010 | 6:13:18 UTC

Hello,

I must admit not usually being a complainer but having to baby sit this project through server downtime is a bit of a pain now
I would really like to just hook the system up leave it crunching and forget about it but now find myself having to check up on it way to often

more workunits per core has been discussed and ruled out the above suggestion of longer work units would cure two problems 1. less down time for our machines 2. less spin down down and spin up between work units
this idea also doesnt seem to be getting any attention for some reason

collatz doesnt quite give the same credit but seems online more often than here and dnetc seems to give more credit per gpu work done using longer work units

I'm stuck here just now because neither of the two projects above will work with my 3 ATI cards in crossfire but the longer this fiasco with the server carries on the more likely I am to set up another couple of machines splitting the cards so they each can work on the projects above

hopefully the hardware IT dept will announce on here there next set of upgrades soon so we can have a more consistant flow of work

best regards
Ian

Why do you need to babysit this project? Just set a backup project and let BOINC do its thing. Yes it is annoying when MW stops but with two other project that support ATI cards your ATI GPUs shouldn't run out of work.


I think the point is that the project has been down repeatedly and since the main application as well as the amount of current output heavily relies on fast gpus, then it should have been expected that the server load will be heavy also. Apparently, everybody knows the down is about feeder can't catching up, or validation too loaded or server load. This is not surprise. So if to continue with current equipment, there will be much more downs for sure.
My point is to indicate the lack of hardware so that proper adjustments are to be made.

hoarfrost
Send message
Joined: 4 Sep 09
Posts: 20
Credit: 123,873,987
RAC: 0
Message 40938 - Posted: 15 Jul 2010 | 12:04:21 UTC

Zero work again.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 40939 - Posted: 15 Jul 2010 | 12:07:16 UTC

Just as I am returning 2 rigs to Milkyway, do I detect another stoppage with ZERO work ready to send?

AAArrrgh!
____________
Go away, I was asleep


Ralph
Send message
Joined: 30 Oct 07
Posts: 39
Credit: 389,931
RAC: 1,059
Message 40940 - Posted: 15 Jul 2010 | 15:34:54 UTC

Yup - same here.

I'll watch the speed that the server status page shows the "results in progress" declining.

uBronan
Avatar
Send message
Joined: 9 Feb 09
Posts: 166
Credit: 27,520,813
RAC: 0
Message 40941 - Posted: 15 Jul 2010 | 16:20:51 UTC

lol
this projects on ati cards suffer from work most of the time
these nvidia projects are much slower but get steadily work :)
nvidia luckers xD

____________
Its new, its relative fast... my new bicycle

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 40942 - Posted: 15 Jul 2010 | 17:01:57 UTC

What we are missing is a statement from the project stuff. Is somebody working for a solution?

Profile Simplex0
Avatar
Send message
Joined: 11 Nov 07
Posts: 232
Credit: 178,221,048
RAC: 0
Message 40943 - Posted: 15 Jul 2010 | 17:12:30 UTC
Last modified: 15 Jul 2010 | 17:13:27 UTC

Just thinking.

With the constantly malfunction server in mind, the never ending story of the 'Almost there' GPU page that turned up to be the 'Never there' GPU page and the currently ongoing project of the 'MilkyWay@home screensaver coming soon' that also seams to go the same way.

Would it not be just hilarious if the screensaver would come out when there was just one wu left to crunch :)

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 40944 - Posted: 15 Jul 2010 | 18:25:10 UTC

Hi,
this is a copy from the gpugrid forum, 'GPUGRID and ATI'

Hi,

We'll probably be in a position to start testing of an ATI application after SDK 2.2 is released in a month or so. It will require a 5800-series card initially.

MJH

It is dated 10th July. And they talk about OpenCL, which seems to be a source of problems until now. They say, its not stable.
And they posted, that 48xx-cards will not be supported. They lack some capabilities. But for the lucky owners of a 58xx card a new star is rising ...

Alexander

Profile AriZonaMoon*
Avatar
Send message
Joined: 29 Sep 08
Posts: 1610
Credit: 41,867,203
RAC: 0
Message 40945 - Posted: 16 Jul 2010 | 15:46:06 UTC
Last modified: 16 Jul 2010 | 15:48:07 UTC

I wish I had a 58xx card, so I could complain more often ;-))
Maybe better to sing for her..

- To Miss Milkyway -

"Ain't no sunshine when she's gone.
It's not warm when she`s away.
Ain't no sunshine when she's gone
and she's always gone too long
anytime she goes away."

;-)
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 40946 - Posted: 16 Jul 2010 | 16:04:50 UTC

On POETS Day there in no need to complain
as there is work elsewhere to gain.
Just move to Collatz and see,
or even to DNETC will take the p**.
____________
Go away, I was asleep


BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 40949 - Posted: 16 Jul 2010 | 17:12:07 UTC - in response to Message 40942.

Indeed, the lack of information this time is getting a tad tedious. Oh well, at least both Dnetc and Collatz are (at the moment) running.

What we are missing is a statement from the project stuff. Is somebody working for a solution?


____________

Profile Simplex0
Avatar
Send message
Joined: 11 Nov 07
Posts: 232
Credit: 178,221,048
RAC: 0
Message 40950 - Posted: 16 Jul 2010 | 17:46:50 UTC - in response to Message 40945.

I wish I had a 58xx card, so I could complain more often ;-))
Maybe better to sing for her..

- To Miss Milkyway -

"Ain't no sunshine when she's gone.
It's not warm when she`s away.
Ain't no sunshine when she's gone
and she's always gone too long
anytime she goes away."

;-)


Jup! There is so many more important things to do in life than constantly glaring at a server when The sun is shining :)

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 40952 - Posted: 16 Jul 2010 | 19:58:08 UTC - in response to Message 40949.

Correction -- Collatz is encountering one of their daily Comcast can't figure it out connectivity problems.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 40953 - Posted: 16 Jul 2010 | 20:18:52 UTC
Last modified: 16 Jul 2010 | 20:19:17 UTC

Milkyway servers are up, but most are not running nor have any work.

Collatz is not available due to ISP problems

FreeHAL seems to have gone away (ISP as well or ...)

Just leaves DNETC

Ho! Hum!
____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 40954 - Posted: 16 Jul 2010 | 21:42:17 UTC

We're baaaack.

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 40956 - Posted: 16 Jul 2010 | 22:45:05 UTC

Sorta, I still don't have any work from here and I also have a pretty full cache of DNETC work to keep the 5830 warm.
____________

Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 128
Credit: 203,351,125
RAC: 1,217
Message 41065 - Posted: 25 Jul 2010 | 22:32:21 UTC

Kick It!
____________

Profile David Glogau*
Avatar
Send message
Joined: 12 Aug 09
Posts: 172
Credit: 645,240,165
RAC: 0
Message 41066 - Posted: 25 Jul 2010 | 22:59:57 UTC

I am down as well.
____________

Profile Ian
Avatar
Send message
Joined: 9 Feb 10
Posts: 7
Credit: 182,808,408
RAC: 224
Message 41067 - Posted: 25 Jul 2010 | 23:02:14 UTC - in response to Message 41065.

Shake it!

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41068 - Posted: 26 Jul 2010 | 7:15:04 UTC

Looks like the validator again?
____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41070 - Posted: 26 Jul 2010 | 10:57:33 UTC
Last modified: 26 Jul 2010 | 10:59:23 UTC

So all the GPU folks returned their wu's in less than 4 hrs (60,000 wu's) and now the CPU's are returning their wu's at 10,000 per 4hrs.



Care of Scarecrow

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 41072 - Posted: 26 Jul 2010 | 13:02:05 UTC

Maybe we should make a collection for something like an 'Weekend Automatic Validator Kicker'.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 41073 - Posted: 26 Jul 2010 | 13:16:44 UTC

Could something be set up to automatically restart it?
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 41074 - Posted: 26 Jul 2010 | 14:55:25 UTC - in response to Message 41073.

Could something be set up to automatically restart it?


I think so. Since at least one server gets info about nr. of wu's waiting for validation there could be a limit set up, which causes an auto-restart (like windows does ater updates), local or remote.
But this may depend on the operating-system and security-settings they use. What should work in every situation is a remote operated relay, whose contact is parallel to the reset-button of the PC.

Alexander

Profile RAMen
Avatar
Send message
Joined: 8 Apr 08
Posts: 41
Credit: 132,616,981
RAC: 192,733
Message 41075 - Posted: 26 Jul 2010 | 15:01:24 UTC
Last modified: 26 Jul 2010 | 15:21:26 UTC

With apologies to DEVO ... (ah the eighties)
Whip it good - Werkstatt just dusted off the archives in the back of my head

When something's going wrong
You must kick it
Now whip it
into shape.
Shape it up
Get straight
go forward
move ahead
Try to detect it
it's not too late
to whip it
whip it good

When a good time turns around
you must kick it
You will never live it down
unless you kick it
No one gets away
until they kick it
I say kick it
kick it good
I say whip it
kick it good
crack that whip
give the past the slip

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41090 - Posted: 26 Jul 2010 | 21:37:44 UTC

It's baaaaack....

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41217 - Posted: 3 Aug 2010 | 11:02:12 UTC

No work, again, for a little while now, and caches running down.

There are 42,827 WUs awaiting validation, and possibly growing. I expect the usual validation server problem is associated with the lack of new work, as seen several times before.

Can someone kick the server, or it's over to Collatz for a burst of WUs there. :((
____________
Go away, I was asleep


Profile Cori
Avatar
Send message
Joined: 27 Aug 07
Posts: 647
Credit: 27,592,547
RAC: 0
Message 41227 - Posted: 3 Aug 2010 | 16:49:09 UTC

I need wooooooooooooooooooooooooork! ;-))))))
____________
Lovely greetings, Cori

Profile Crunch3r
Volunteer developer
Avatar
Send message
Joined: 17 Feb 08
Posts: 358
Credit: 256,958,531
RAC: 4,878
Message 41229 - Posted: 3 Aug 2010 | 17:14:35 UTC - in response to Message 41227.

I need wooooooooooooooooooooooooork! ;-))))))




____________

Join BOINC United now!

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41230 - Posted: 3 Aug 2010 | 17:49:21 UTC
Last modified: 3 Aug 2010 | 17:51:36 UTC

It seems we have a repetitive problem -- symptoms are that work isn't being produced for download and completed work isn't getting validated.

Perhaps there is some form of memory leak going on here -- and absent solving the root cause problem, perhaps a scheduled stop/start process to act as a bandaid might be an at least marginally proactive approach to the problem.

I haven't seen any information from Travis or others at the project regarding what they are seeing there (or if they are seeing anything there or for that matter that they are looking into it).

Perhaps we are seeing 'BOINC project middle age' syndrome in terms of the project side activity and communications level....

It would be nice if they could put this repetitive problem to bed - that might allow Travis even more off site time.
____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 41231 - Posted: 3 Aug 2010 | 17:52:31 UTC - in response to Message 41230.

It seems we have a repetitive problem -- symptoms are that work isn't being produced for download and completed work isn't getting validated.

I haven't seen any information from Travis or others at the project regarding what they are seeing there (or if they are seeing anything there or for that matter that they are looking into it).

Perhaps we are seeing 'BOINC project middle age' syndrome in terms of the project side activity and communications level....

It would be nice if they could put this repetitive problem to bed - that might allow Travis even more off site time.


i agree because it gets more annoying each time. There hasn't been a month withouth an at least one day long break out.

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 41232 - Posted: 3 Aug 2010 | 18:03:36 UTC

i can say even more - this happens on bi-weekly basis...
____________

Profile Cori
Avatar
Send message
Joined: 27 Aug 07
Posts: 647
Credit: 27,592,547
RAC: 0
Message 41233 - Posted: 3 Aug 2010 | 18:18:29 UTC - in response to Message 41229.

I need wooooooooooooooooooooooooork! ;-))))))





*ROFL*

PS. Besser wär:




____________
Lovely greetings, Cori

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 182
Credit: 4,799
RAC: 0
Message 41235 - Posted: 3 Aug 2010 | 18:32:11 UTC

I gave the server a kick, let me know if it worked.

-Matthew

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 41236 - Posted: 3 Aug 2010 | 18:43:15 UTC - in response to Message 41235.

I gave the server a kick, let me know if it worked.

-Matthew


Unfortunately not.

Alexander

Profile Cori
Avatar
Send message
Joined: 27 Aug 07
Posts: 647
Credit: 27,592,547
RAC: 0
Message 41239 - Posted: 3 Aug 2010 | 19:55:04 UTC

Yay, now there's frrrresh crrrrunchies! THX!
____________
Lovely greetings, Cori

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41241 - Posted: 3 Aug 2010 | 20:38:30 UTC

It's working fine for me ATM, and I am d/ling new work as needed.
____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41258 - Posted: 4 Aug 2010 | 8:27:25 UTC

Arrrgghhhh!

Tapio
Send message
Joined: 8 Mar 09
Posts: 187
Credit: 10,072,025
RAC: 14,622
Message 41259 - Posted: 4 Aug 2010 | 8:35:08 UTC - in response to Message 41229.

I need wooooooooooooooooooooooooork! ;-))))))





Can anyone translate text to english please.

And yes: there is no work this time but DNETC@home is good backup project. :-)

Profile Simplex0
Avatar
Send message
Joined: 11 Nov 07
Posts: 232
Credit: 178,221,048
RAC: 0
Message 41260 - Posted: 4 Aug 2010 | 8:37:09 UTC
Last modified: 4 Aug 2010 | 8:37:48 UTC

No wonder all the problem This is the server kicking staf ;)

Profile mickydl*
Avatar
Send message
Joined: 1 Dec 09
Posts: 8
Credit: 688,790
RAC: 0
Message 41261 - Posted: 4 Aug 2010 | 10:04:11 UTC - in response to Message 41259.

I need wooooooooooooooooooooooooork! ;-))))))





Can anyone translate text to english please.

And yes: there is no work this time but DNETC@home is good backup project. :-)


Bundesagentur für Arbeit = Federal Employment Office of Germany

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41262 - Posted: 4 Aug 2010 | 11:03:01 UTC

Kick the serveR

Kick the validator!
____________
Go away, I was asleep


CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 41263 - Posted: 4 Aug 2010 | 11:11:39 UTC

again...
____________

Profile David Glogau*
Avatar
Send message
Joined: 12 Aug 09
Posts: 172
Credit: 645,240,165
RAC: 0
Message 41265 - Posted: 4 Aug 2010 | 11:44:36 UTC

Travis you selfish bas!ard, you snuck out out of the building to have some sort of a life didn't you ?

OK Matthew, kicking the server really is not working so well these days. I suggest you try kicking the computer lab staff!
If they are bigger than you, feel free to use a bat :P

/OK back to my live ROV feeds.
____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 41267 - Posted: 4 Aug 2010 | 14:01:02 UTC
Last modified: 4 Aug 2010 | 14:01:48 UTC

Every time a staff member kicks server, it gets broken sooner. Maybe all it needs a love? or a Xeon CPU he can make love to? so don't kick it, just love it.

Also I'm just happy to set DNETC@Home as a backup project.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41270 - Posted: 4 Aug 2010 | 16:46:38 UTC

Looks like the server(s) have been working and dishing out work for a few hours now. Just need to NNT Collatz and run a 5 hour cache down then MW resumes.
____________
Go away, I was asleep


Jnargus
Send message
Joined: 26 Jun 08
Posts: 5
Credit: 4,148,054
RAC: 4,684
Message 41271 - Posted: 4 Aug 2010 | 17:46:02 UTC

bundesagentur für arbeitspakete translates to:
Federal department for work packages.

Now that's funny :-)

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41274 - Posted: 4 Aug 2010 | 19:36:15 UTC

I see the validator is building again
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41285 - Posted: 5 Aug 2010 | 11:11:32 UTC

It was nearly Aaargh! again a couple of hours ago this morning. The validator seemed to be building up and the "Ready to send" read ZERO!

Seems to continue to be running OK ATM.
____________
Go away, I was asleep


Chris S
Avatar
Send message
Joined: 20 Sep 08
Posts: 1357
Credit: 173,075,472
RAC: 13
Message 41287 - Posted: 5 Aug 2010 | 16:45:28 UTC

A usual hiccup !

____________
Don't drink water, that's the stuff that rusts pipes

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41340 - Posted: 9 Aug 2010 | 15:44:42 UTC
Last modified: 9 Aug 2010 | 15:45:56 UTC

It looks like we are rapidly heading to a new Aaargh session according to the current server status.

The work ready to send had dropped well below it's normal levels (now = 304). But it does seem to be increasing slightly.

But the real give away is the numbers awaiting the validator - now 23,335. When we are running normally this is around 6,300.

Are we getting ready for the chorus - HELP! Please get ready to kick the servers?
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41341 - Posted: 9 Aug 2010 | 16:09:53 UTC
Last modified: 9 Aug 2010 | 16:10:34 UTC

Definitely moved to Aaarrrrggghhhh!

The work ready is now ZERO and the "waiting-for-validation" has risen to over 30K
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41343 - Posted: 9 Aug 2010 | 17:32:47 UTC
Last modified: 9 Aug 2010 | 17:33:09 UTC

Still no work

Please give the servers a kick soon!!
____________
Go away, I was asleep


BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41344 - Posted: 9 Aug 2010 | 17:37:00 UTC - in response to Message 41343.

I realize others have suggested this in the past, but it seems that either a bit of serious root cause identification/resolution effort ought to be done, or, alternatively, a automatic stop/restart process (perhaps once a week) to clear things out (or even daily off hours to insure no more than single day outages) needs to be in place.

By the way, since duplicate posts are subject to filtering, shouldn't duplicate outages also be dealt with? <just joking>



Still no work

Please give the servers a kick soon!!


____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41345 - Posted: 9 Aug 2010 | 17:38:46 UTC

A preemptive server reboot daily or every 48hrs sounds like a good idea!

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 41346 - Posted: 9 Aug 2010 | 17:38:55 UTC

The best and fastest way to get someone to restart the validator is to send an email to astro@cs.rpi.edu because all 11 administrators receive it. I have already sent an email this time.

I do not believe they monitor these forums on a regular basis. Last time I sent an email, they thanked me for reporting it even though the validator had been down for over 6 hours.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 41347 - Posted: 9 Aug 2010 | 18:19:35 UTC

Work is up now.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41348 - Posted: 9 Aug 2010 | 19:39:31 UTC

Sighing with relief, and letting the raw patch at the back of my throat time to get better.

Where is my voice?
____________
Go away, I was asleep


Dirk Broer
Send message
Joined: 11 Dec 09
Posts: 11
Credit: 36,365,922
RAC: 62,768
Message 41353 - Posted: 10 Aug 2010 | 9:00:37 UTC

Feeder is not running

10-08-2010 10:51:55 Milkyway@home Sending scheduler request: Requested by user.
10-08-2010 10:51:55 Milkyway@home Reporting 2 completed tasks, requesting new tasks for GPU
10-08-2010 10:51:57 Milkyway@home Scheduler request completed: got 0 new tasks
10-08-2010 10:51:57 Milkyway@home Message from server: Server error: feeder not running

____________

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 41354 - Posted: 10 Aug 2010 | 10:06:58 UTC - in response to Message 41353.

I've sent a email to stuff that servers are down.

Alexander

Dirk Broer
Send message
Joined: 11 Dec 09
Posts: 11
Credit: 36,365,922
RAC: 62,768
Message 41355 - Posted: 10 Aug 2010 | 10:47:03 UTC - in response to Message 41354.

So did I, and the feeder works again!
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41356 - Posted: 10 Aug 2010 | 11:33:09 UTC

I presume a third request to the admins for a server reboot will not go amiss. The Validator has a balance of Workunits waiting for validation 34,273 , so the awaiting for work must have happened a while ago.
____________
Go away, I was asleep


Cartoonman
Send message
Joined: 10 Dec 09
Posts: 18
Credit: 6,518,358
RAC: 0
Message 41357 - Posted: 10 Aug 2010 | 11:45:49 UTC

.. well, no new Wu's again. just finished my batch now...

i was wondering why there was so many Wu's waiting to report...

Profile David Glogau*
Avatar
Send message
Joined: 12 Aug 09
Posts: 172
Credit: 645,240,165
RAC: 0
Message 41359 - Posted: 10 Aug 2010 | 11:52:18 UTC

Since I live on the other side of the planet, I think the project admin should give me a big RED button to push whenever this happens, so the servers reset.

At least three of my babies switch automatically over to the backup project now.
____________

Tapio
Send message
Joined: 8 Mar 09
Posts: 187
Credit: 10,072,025
RAC: 14,622
Message 41360 - Posted: 10 Aug 2010 | 12:03:25 UTC - in response to Message 41357.

.. well, no new Wu's again. just finished my batch now...

i was wondering why there was so many Wu's waiting to report...


I dont know about other users but i'm switched to dnetc@home. I have 12 tasks cache waiting to run. I run them out when this project is running again but not before it.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41361 - Posted: 10 Aug 2010 | 12:16:24 UTC

I moved over to Collatz as the ATI HD3850 crunches DNETC incredibly slowly compered to either Milkyway (preferred) or Collatz (back up).

Now waiting the results of E-mails to the project admins to re-boot the servers.
____________
Go away, I was asleep


Chris S
Avatar
Send message
Joined: 20 Sep 08
Posts: 1357
Credit: 173,075,472
RAC: 13
Message 41362 - Posted: 10 Aug 2010 | 12:30:10 UTC

I'm already crunching elsewhere with backup projects! Micro management sucks though....

____________
Don't drink water, that's the stuff that rusts pipes

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 41365 - Posted: 10 Aug 2010 | 12:40:58 UTC

The reliabilty of this project is almost getting as bad as SETI. I managed to get about 12 work units 30 minutes ago, then it stopped again.

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 41367 - Posted: 10 Aug 2010 | 14:06:09 UTC - in response to Message 41365.

The reliabilty of this project is almost getting as bad as SETI. I managed to get about 12 work units 30 minutes ago, then it stopped again.


Have you ever tried orbit@home or lhc@home?
That could change your mind!

For your GPU Collatz Conjecture could be a backup project.

Alexander

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 41368 - Posted: 10 Aug 2010 | 14:42:16 UTC - in response to Message 41367.


Have you ever tried orbit@home or lhc@home?
That could change your mind!

For your GPU Collatz Conjecture could be a backup project.

Alexander


I haven't tried orbit@home, and I gave up on lhc@home awhile ago.

As far as Collatz goes, after I installed my second ATI 5970 card, all Collatz will do for me is lock up my system. I wish I could get it to run. Evidently it doesn't like an i7 980x cpu, Win7 64bit, and 2 ATI 5970 cards.

Mike..

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 41369 - Posted: 10 Aug 2010 | 15:00:33 UTC - in response to Message 41368.


As far as Collatz goes, after I installed my second ATI 5970 card, all Collatz will do for me is lock up my system. I wish I could get it to run. Evidently it doesn't like an i7 980x cpu, Win7 64bit, and 2 ATI 5970 cards.

Mike..


Mike,
collatz likes i7, win64 and 2 ATI-cards. As you can see, my mainsys is a similar configuration, except that I do not have 2 5970 but one 5830 and one 4870 and 'only' 8 threads. And collatz works fine.

But when I take a look onto your computers, I cannot find one with ATI-GPU's. There are two listed with nVidia.
Maybe you have a more basic problem?

Alexander

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 41371 - Posted: 10 Aug 2010 | 15:25:22 UTC - in response to Message 41369.
Last modified: 10 Aug 2010 | 15:27:06 UTC


Mike,
collatz likes i7, win64 and 2 ATI-cards. As you can see, my mainsys is a similar configuration, except that I do not have 2 5970 but one 5830 and one 4870 and 'only' 8 threads. And collatz works fine.

But when I take a look onto your computers, I cannot find one with ATI-GPU's. There are two listed with nVidia.
Maybe you have a more basic problem?

Alexander


That's strange, when I look at my computers the first one listed at the top is the system I am talking about with the 2 ATI 5970 cards. I see 10 systems when I go to my list of computers.

Mike...

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41372 - Posted: 10 Aug 2010 | 15:42:01 UTC

An automatic pre-emptive stop/start of the server (or server processes) is something of a brute force *work-around* which doesn't deal with what appears to be a root cause problem that could use some analysis and resolution efforts.

Back in the day when Travis was more closely involved, pleas here for that sort of corrective action seemed to have more effect.

Seemingly at this point, it is more a case of auto-pilot (where the best that can be had is frequent reboots) as the various admins have a lot of other things on their plate in addition to this project.


____________

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41373 - Posted: 10 Aug 2010 | 15:51:24 UTC - in response to Message 41365.

Well not quite -- I mean the approach these days at SETI is a weekly *three day* outage -- preceded by 12 to 24 hour traffic jam and then followed by a post outage traffic jam of 12 to 24 hours. I believe the idea was to improve reliability when the outage wasn't going on -- it hasn't yet done that.

So for SETI, what is now in place is a part time project, but their message boards run about 162 hours a week. Rather a fair amount of resource there for message boards it seems to me. SETI moved to close to the bottom of my list rather a long time ago.

I suspect a large part of the problem here is that to a fair degree, the now *Doctor* Travis has moved on (as is to be expected) and there no longer is the motivational force behind this project.


The reliabilty of this project is almost getting as bad as SETI. I managed to get about 12 work units 30 minutes ago, then it stopped again.


____________

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41374 - Posted: 10 Aug 2010 | 16:23:21 UTC - in response to Message 41361.

For me, Milkyway dropped to my second project simply because I have a flock of GPU's that MW doesn't support (ie non-double precision cards).

With DNetC now available which provides largely the same GPU support as Collatz, MW will drop down to my number three project in terms of TC within a couple of months. It is interesting the sort of reliability that Collatz and Dnetc can provide with quite limited resources (of course with lower user counts).

I agree with you regarding the somewhat finicky nature of Dnetc -- there are clearly some GPU configurations it doesn't play well with (like the dual 5970 ATI's and your 3850), and it can push the cards to a distracting degree -- I can't run Dnet on my primary computer when I am doing even ordinary tasks, compared to Collatz.

I guess we can hope that Travis is able to pass on the torch here to someone at RPI who will be 'invested' in the project as he was in the past. My other hope is that additional 'low end' GPU projects, particularly ATI GPU projects, start showing as well.


I moved over to Collatz as the ATI HD3850 crunches DNETC incredibly slowly compered to either Milkyway (preferred) or Collatz (back up).

Now waiting the results of E-mails to the project admins to re-boot the servers.


____________

Chris S
Avatar
Send message
Joined: 20 Sep 08
Posts: 1357
Credit: 173,075,472
RAC: 13
Message 41375 - Posted: 10 Aug 2010 | 16:36:09 UTC

Well not quite -- I mean the approach these days at SETI is a weekly *three day* outage -- preceded by 12 to 24 hour traffic jam and then followed by a post outage traffic jam of 12 to 24 hours. I believe the idea was to improve reliability when the outage wasn't going on -- it hasn't yet done that.


As I understand it, the 3 day outage is to let Nitpicker run on 10 years worth of results to sift for likely candidates to re-examine. When they tried it in real time it zonked the servers and the database out. You can't upload or download work for 3 days, but the message boards are only out for 9-12 hours as they were before.

I suspect a large part of the problem here is that to a fair degree, the now *Doctor* Travis has moved on (as is to be expected) and there no longer is the motivational force behind this project.


He did say that he would be around but not have as much involvement as before, so you are about right in what you say. The point is that there is DNETC which gives about 90% of credits you get here, and also Collatz which gives about 60%.
That is of course running GPU's.

Talking of GPU's I have said over and over again, that the basic Boinc infrastructure used by the majority of projects was just not designed for the high levels of data throughput that the onslaught of GPU crunching has unleashed. Servers were scoped out to deal with CPU work and it is not surprising to me at all that all the popular projects are struggling.

If you couple that with a general slow down of the www/Internet due to the world population approaching 7 billion, and the fact that China has nearly 20% of that, and is expanding its web presence at an exponential rate, everything is creaking at the seams.

Will DC survive ??






____________
Don't drink water, that's the stuff that rusts pipes

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 41377 - Posted: 10 Aug 2010 | 16:46:33 UTC - in response to Message 41374.


With DNetC now available which provides largely the same GPU support as Collatz, MW will drop down to my number three project in terms of TC within a couple of months. It is interesting the sort of reliability that Collatz and Dnetc can provide with quite limited resources (of course with lower user counts).

I guess we can hope that Travis is able to pass on the torch here to someone at RPI who will be 'invested' in the project as he was in the past. My other hope is that additional 'low end' GPU projects, particularly ATI GPU projects, start showing as well.


BarryAZ,
I'm afraid you're right. My hope is that someone follows his footsteps.

As I've learned from GPUGRID, cal is an 'outdated' programming tool. They are working on OpenCL Apps, which should be a step to an independency of the type of your card as long as it supports OpenCL.

Two or three month's ago somebody posted here in this forum, that other projects are working on GPU-Apps. Let's see if I can find that again.

Alexander

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41379 - Posted: 10 Aug 2010 | 18:00:11 UTC - in response to Message 41375.

Yes, I got that -- if that is the 'permanent' plan for SETI, then the folks most affected by that are the 'SETI uber alles' crowd -- for that group, the self-imposed 'only one project' makes them dependent on a project that, aside from the newsgroups, runs only 50% of the time and is pretty well stressed with folks crowding into the 'half-space' for uploads and downloads.

The plus side is that the much smaller active BOINC crowd that might have originally been brought into distributed processing by SETI and doesn't have a 'SETI only' DP world view might well help out other projects which are not half time.




As I understand it, the 3 day outage is to let Nitpicker run on 10 years worth of results to sift for likely candidates to re-examine. When they tried it in real time it zonked the servers and the database out. You can't upload or download work for 3 days, but the message boards are only out for 9-12 hours as they were before.



____________

Profile David Glogau*
Avatar
Send message
Joined: 12 Aug 09
Posts: 172
Credit: 645,240,165
RAC: 0
Message 41380 - Posted: 10 Aug 2010 | 18:17:43 UTC
Last modified: 10 Aug 2010 | 18:24:52 UTC

I run a GTX260 24/7 on Seti. it is no big deal. I needed to adjust my cache to 4 days to assure constant work, but that's no big deal.

As to DnetC, I find it makes an excellent backup project for me. I am running it now on two, twin 5790 computers. One 5790 and 4770 computer. And one single 5790 computer, so it is pretty adaptable from my point of view.

OK must go MW just downloaded on one of the boxes, so lets see if I can get the others back!
____________

Cartoonman
Send message
Joined: 10 Dec 09
Posts: 18
Credit: 6,518,358
RAC: 0
Message 41381 - Posted: 10 Aug 2010 | 18:23:04 UTC

hmm, i got some WU's, assimilation seems to be up and running. have to wait though. AFAICS, there are a small bunch ready to send.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41382 - Posted: 10 Aug 2010 | 19:24:30 UTC

Work is avilable again
____________
Go away, I was asleep


Profile skildude
Avatar
Send message
Joined: 13 Jan 08
Posts: 51
Credit: 46,285,288
RAC: 394
Message 41383 - Posted: 10 Aug 2010 | 19:33:59 UTC

no validation though
____________
Blah blah blah you know the rest

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41384 - Posted: 10 Aug 2010 | 19:47:04 UTC - in response to Message 41383.

no validation though



It may have still been catshing up with the backlog when you looked. The Validator backlog now reads 6,287 WUs
____________
Go away, I was asleep


BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41385 - Posted: 10 Aug 2010 | 21:02:24 UTC

Looks like the root cause is either winning or being looked into -- web pages are on -- sort of in a SETI emulation mode we are at the moment.

____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41387 - Posted: 10 Aug 2010 | 22:52:37 UTC

Back to Collatz, which has good uptime since Comcast repaired/replaced parts of the BB line.
____________
Go away, I was asleep


Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 41390 - Posted: 11 Aug 2010 | 1:42:11 UTC
Last modified: 11 Aug 2010 | 1:43:21 UTC

I just setup Collatz to be my primary project and Milky Way to be my backup. I really like Milky Way, but when you average in the downtime and the almost nonexistant cache, I am getting much better credit returns at Collatz.

If only I could get DNETC working om my system, I would set up as #1=DNETC, #2=Collatz, and #3=Milky Way.

I will keep checking back with the hope that I will find that the reliabilty of this project has improved.

Mike...

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41407 - Posted: 12 Aug 2010 | 3:16:57 UTC

OK -- so could we PLEASE get some feedback from folks at the project as to what is going (or not going) on?

One thing more troublesome than outages is outages in a vacuum.

____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41410 - Posted: 12 Aug 2010 | 9:53:00 UTC - in response to Message 41407.

OK -- so could we PLEASE get some feedback from folks at the project as to what is going (or not going) on?

One thing more troublesome than outages is outages in a vacuum.

Erm..front page.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41529 - Posted: 17 Aug 2010 | 11:49:19 UTC
Last modified: 17 Aug 2010 | 12:28:37 UTC

Now the Server Status page is beginning to look ominous regarding another validatior backing up problem.

When all is running smoothly the status page Workunits waiting for validation usually runs with between 6,250 and 6,4000 results waiting. I see the status now gives us with 12,139 waiting validation. If this continues then the whole shebang will be down again.

I have PMed Matthew as a precaution, which I hope I am wrong about.


[Edit mode on - 35 minutes after first posting this potential panic]

Thanks Matthew.

The validator is back to it's usual numbers waithing for validation.

PANIC avoided

[Edit mode off]

____________
Go away, I was asleep


Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 128
Credit: 203,351,125
RAC: 1,217
Message 41625 - Posted: 21 Aug 2010 | 3:05:44 UTC

Knock knock!
____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41626 - Posted: 21 Aug 2010 | 4:35:45 UTC
Last modified: 21 Aug 2010 | 4:36:46 UTC

Sending scheduler request: To fetch work.
Requesting new tasks for GPU
Scheduler request completed: got 0 new tasks
Message from project server: No work sent
Message from project server: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Is N-Body the only style of work being assigned now? Or is this just a glitch in the matrix?

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 41627 - Posted: 21 Aug 2010 | 4:50:07 UTC - in response to Message 41626.
Last modified: 21 Aug 2010 | 4:51:09 UTC

Sending scheduler request: To fetch work.
Requesting new tasks for GPU
Scheduler request completed: got 0 new tasks
Message from project server: No work sent
Message from project server: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Is N-Body the only style of work being assigned now? Or is this just a glitch in the matrix?


I was getting the same thing, so I reset the project. It didn't help. The server page shows workunits available, but I am unable to get any of them.

They could be CPU N-Body workunits. But, the validator is backed up and looks like it is about to crash, also.

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 41628 - Posted: 21 Aug 2010 | 5:50:02 UTC

No panic boys, it is weekend....

Servers and validators have some days of too every now and then.
____________
Greetings from,
TJ

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41629 - Posted: 21 Aug 2010 | 6:03:16 UTC - in response to Message 41627.

OK -- so this problem -- which has shown up every week or so remains unresolved -- frankly, not really a surprise.

Sending scheduler request: To fetch work.
Requesting new tasks for GPU
Scheduler request completed: got 0 new tasks
Message from project server: No work sent
Message from project server: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Is N-Body the only style of work being assigned now? Or is this just a glitch in the matrix?


I was getting the same thing, so I reset the project. It didn't help. The server page shows workunits available, but I am unable to get any of them.

They could be CPU N-Body workunits. But, the validator is backed up and looks like it is about to crash, also.


____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41630 - Posted: 21 Aug 2010 | 7:40:30 UTC

It's the validator again - it has backed up with 59K worth of work to knock off.
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41637 - Posted: 21 Aug 2010 | 15:08:55 UTC

Are the servers dishing up nBody work ATM?
____________
Go away, I was asleep


Profile Toppie*
Send message
Joined: 28 Mar 09
Posts: 67
Credit: 680,319,947
RAC: 952,331
Message 41638 - Posted: 21 Aug 2010 | 15:14:24 UTC - in response to Message 41637.

Looks like it. I am getting Nbody only at the moment. With about

10% giving "error while computing".

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41641 - Posted: 21 Aug 2010 | 15:58:17 UTC
Last modified: 21 Aug 2010 | 16:00:40 UTC

I've gone back to Collatz until the normal Milkyway WUs are being dished out.

I presume the Nbody work has it's own client, so how do I get my system to download this work? I am assuming this Nbody work is for GPUs, which I see from elsewhere appear not to exist?
____________
Go away, I was asleep


Profile Toppie*
Send message
Joined: 28 Mar 09
Posts: 67
Credit: 680,319,947
RAC: 952,331
Message 41642 - Posted: 21 Aug 2010 | 16:03:36 UTC - in response to Message 41641.

Nope. CPU only. Apparently GPU to come later.

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 41678 - Posted: 22 Aug 2010 | 23:33:32 UTC

Did the project stop using the old workunits in favor of the N-Body workunits? For about the last day, I can only get one or two workunits every now and then. And, the ones I do get are resends because of invalid results from another user. It looks like there are no new workunits being sent out.

Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 128
Credit: 203,351,125
RAC: 1,217
Message 41680 - Posted: 23 Aug 2010 | 0:19:07 UTC

And about every hour I loose and hour or so of completed work! How does that happen?
____________

Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 128
Credit: 203,351,125
RAC: 1,217
Message 41685 - Posted: 23 Aug 2010 | 2:05:49 UTC - in response to Message 41680.

And about every hour I loose and hour or so of completed work! How does that happen?


It happens because the work units are validated! Monday started early....
____________

Profile verstapp
Avatar
Send message
Joined: 26 Jan 09
Posts: 585
Credit: 464,286,454
RAC: 996
Message 41692 - Posted: 23 Aug 2010 | 8:11:21 UTC

Feeder not running again.
____________
Cheers,

PeterV

.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41696 - Posted: 23 Aug 2010 | 13:26:55 UTC

Ican dowload normal MW GPU WUS now, so the system is feeding both Nbody and normal again (I hope)
____________
Go away, I was asleep


Profile Toppie*
Send message
Joined: 28 Mar 09
Posts: 67
Credit: 680,319,947
RAC: 952,331
Message 41700 - Posted: 23 Aug 2010 | 16:53:46 UTC - in response to Message 41696.

Aaah...the smell of GPU's working again.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41767 - Posted: 26 Aug 2010 | 21:20:27 UTC

It's Friday, the server has been working well for a few days, time for it to fall over...

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41771 - Posted: 26 Aug 2010 | 23:33:10 UTC

Then you'll head for Collatz as DNETC is in the middle of a server upgrade/change.
____________
Go away, I was asleep


Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 41774 - Posted: 27 Aug 2010 | 17:09:04 UTC

BOOM!!!
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41775 - Posted: 27 Aug 2010 | 19:08:51 UTC - in response to Message 41767.

Indeed, absent dealing with the underlying problem (which is growing something of a beard), lots of server regular stop/starts just might keep things going. We the weekend in view, it seems that on every Friday a single or double server reboot should be done automatically.

It's Friday, the server has been working well for a few days, time for it to fall over...


____________

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41776 - Posted: 27 Aug 2010 | 19:12:24 UTC - in response to Message 41771.


At Dnetc, the maintenance was announced (though only an hour before the upload/download servers were taken offline), and then the message boards were up for another several hours. But as of about 5PM yesterday (PDT) poof -- it went gone. In the announcement, it was noted the outage was expected to take 5 hours -- perhaps each of the hours turns out to be 600 minutes long....

Then you'll head for Collatz as DNETC is in the middle of a server upgrade/change.


____________

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41777 - Posted: 27 Aug 2010 | 19:16:32 UTC - in response to Message 41776.

Ouch -- things might just have gotten worse -- can't connect to Collatz either.
____________

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41778 - Posted: 27 Aug 2010 | 19:25:20 UTC

So it is clear that a number of us poor fools are (and have been) aware of MW being in dead state for several hours, do we have any wagers on the awareness of this in RPI land?

This many months long intermittent problem has not been a subject of much public discussion coming from the folks at RPI -- which of course does little in the way of confidence (or compassion) building......
____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41780 - Posted: 27 Aug 2010 | 19:42:14 UTC

Aaaarrgghhh, my GPUs are cold!

If you look at the history, picking this outrage was pretty easy.

Profile Bruce
Avatar
Send message
Joined: 28 Apr 08
Posts: 1415
Credit: 2,716,428
RAC: 0
Message 41782 - Posted: 27 Aug 2010 | 19:56:35 UTC
Last modified: 27 Aug 2010 | 20:03:05 UTC

Just to add insult to injury,Climate Prediction is having problems too!! I cant upload any trickles. As it turns out CPDN is making room on one of thier disks so all I can do is wait.
;-p
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41783 - Posted: 27 Aug 2010 | 19:56:35 UTC

Definitely!

DNETC is unobtainable still

Milkyway is not dishing work as the most of the servers are down

Collatz is obtainable but intermittent and the site is slow.

That's the main GPU projects gone south.
____________
Go away, I was asleep


Dirk Broer
Send message
Joined: 11 Dec 09
Posts: 11
Credit: 36,365,922
RAC: 62,768
Message 41787 - Posted: 27 Aug 2010 | 20:18:17 UTC

feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
...
____________

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 41788 - Posted: 27 Aug 2010 | 20:25:32 UTC - in response to Message 41783.

Definitely!

DNETC is unobtainable still

Milkyway is not dishing work as the most of the servers are down

Collatz is obtainable but intermittent and the site is slow.

That's the main GPU projects gone south.


Rosetta has not had work either.

I now have no tasks to do. Oh well.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Haris Dublas
Send message
Joined: 25 Feb 10
Posts: 49
Credit: 10,136,474
RAC: 0
Message 41789 - Posted: 27 Aug 2010 | 20:50:07 UTC

Collatz probably went down due to the mass switching of people from MW and DNETC.

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41790 - Posted: 27 Aug 2010 | 21:32:08 UTC - in response to Message 41789.

I wonder if MW will be offline for the entire weekend. Is Travis the only person at RPI who can figure this one out?
____________

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41792 - Posted: 27 Aug 2010 | 21:40:07 UTC

Dnetc will be offline for at least another day -- during their software update/upgrade one of the HD's on the RAID failed -- they are in rebuild mode.

I hope they were at least in RAID 5 mode and not RAID 0 as THAT would be seriously ugly. Ideally folks running multi-drive RAID arrays are running controllers that handle RAID 5 plus a hot spare. The drives are not that expensive, but server class RAID 5 + hot spare controllers can be a bit pricey.
____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41795 - Posted: 27 Aug 2010 | 22:28:02 UTC

Wooohoo! I got a full MW cache. :)

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41796 - Posted: 27 Aug 2010 | 23:29:53 UTC - in response to Message 41795.

It's alive!!
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41797 - Posted: 27 Aug 2010 | 23:44:10 UTC

Yes, mine are now filling up since I suspended Collatz. Now need to work off the Collatz cache between Milkyway sessions.
____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41801 - Posted: 28 Aug 2010 | 6:00:14 UTC

Validator needs a kick. It's only validating wu's that are paired. All single wu's are not being validated.

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 41846 - Posted: 31 Aug 2010 | 7:04:27 UTC

Well it is zero again. The validater has "overheathed".
Time to let my rigs to cool as well (and saving some energy costs).
____________
Greetings from,
TJ

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41847 - Posted: 31 Aug 2010 | 7:27:01 UTC - in response to Message 41846.

Dnetc may be back and running by the end of the week. The dreaded 'software upgrade' -- stress tested their server and they had a 'mid upgrade' RAID drive failure as well as a memory module failure. They are pretty much in a full rebuild mode for now.
____________

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 41848 - Posted: 31 Aug 2010 | 10:33:52 UTC

f.ck, again... I've got used to shutdown every week on weekends, but it's Tuesday only. common guys, you might be kidding me - one day of work and then one day off.

Could you PLS fix the server???
____________

Chris S
Avatar
Send message
Joined: 20 Sep 08
Posts: 1357
Credit: 173,075,472
RAC: 13
Message 41853 - Posted: 31 Aug 2010 | 14:43:08 UTC

Dnetc may be back and running by the end of the week. The dreaded 'software upgrade' -- stress tested their server and they had a 'mid upgrade' RAID drive failure as well as a memory module failure. They are pretty much in a full rebuild mode for now.


That is good to hear :-)

This place is just unreliable so what we need is more Boinc ATI projects!!!!
____________
Don't drink water, that's the stuff that rusts pipes

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41855 - Posted: 31 Aug 2010 | 17:17:57 UTC - in response to Message 41853.

One thing that seems odd to me. The problem appears fairly straightforward (at least the symptoms are pretty obvious and *repetitive*). The workaround resolution (either stop/start processes or a full server stop/restart) also seems reasonably straightforward.

Actually a couple of questions (although I realize that RPI folks rarely clock in over here)

Why does it take so long (12 hours or more) to go from symptom to restart?

Wouldn't it be possible to auotmate the stop/restart process and run it say every 48 hours?


I figure since this problem has been going on for months, in addition to efforts to track down the root cause, efforts to implement a workaround would be 'resource appropriate' and would have been in place by now.
____________

Fred J. Verster
Send message
Joined: 22 Apr 09
Posts: 38
Credit: 27,377,932
RAC: 0
Message 41856 - Posted: 31 Aug 2010 | 18:59:42 UTC - in response to Message 41855.

That sure would be nice as there are still very little projects using ATI GPU's
MilkyWay and Collatz C. are the only known to use ATI GPU's.

SETI@Home , also at SETI BÊTA, a usergroup The LUNATICS are testing ATI GPU's for AP computing, they already have some working app.'s.
Here is the latest installer for FERMI GPU's.
And you'll find one of the ATI GPU app.'s as well.


____________

Knight Who says Ni

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 41857 - Posted: 31 Aug 2010 | 19:18:14 UTC - in response to Message 41855.

One thing that seems odd to me. The problem appears fairly straightforward (at least the symptoms are pretty obvious and *repetitive*). The workaround resolution (either stop/start processes or a full server stop/restart) also seems reasonably straightforward.

Actually a couple of questions (although I realize that RPI folks rarely clock in over here)

Why does it take so long (12 hours or more) to go from symptom to restart?

Wouldn't it be possible to auotmate the stop/restart process and run it say every 48 hours?


I figure since this problem has been going on for months, in addition to efforts to track down the root cause, efforts to implement a workaround would be 'resource appropriate' and would have been in place by now.


It looks like there is something going on at RPI. Can you remember the posting 'Screensaver coming soon' ? Or can you remember the project DNA@HOME ? Milkyway3 ?

It should not be a problem to detect that the validator stops validating. And of course, they do detect that because they stop producing wu's. But we all miss the next step which should be a rework of the validator, be it hardware, software or setup or what else. Or at least a quick restart.

It looks like nobody is responsible there. This is the best way to kill not only a project but the whole idea of distributed computing. Project responsibles should be serious in their handling of the project issues.

Alexander

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 41858 - Posted: 31 Aug 2010 | 19:19:23 UTC - in response to Message 41856.
Last modified: 31 Aug 2010 | 19:21:24 UTC

Dnetc -- when running, also supports ATI GPU's -- they hope to be back up and running again later this week (they encountered something of the worst case scenario -- memory and hard drive failure while in the middle of a software upgrade). They are in recovery mode for now.

I'm still sort of bemused by the informational (and response time) black hole we often encounter here. To a certain degree (good new/bad news I suppose) it seems that folks here have gotten acclimated to the non response (or delayed response) here.


That sure would be nice as there are still very little projects using ATI GPU's
MilkyWay and Collatz C. are the only known to use ATI GPU's.

SETI@Home , also at SETI BÊTA, a usergroup The LUNATICS are testing ATI GPU's for AP computing, they already have some working app.'s.
Here is the latest installer for FERMI GPU's.
And you'll find one of the ATI GPU app.'s as well.


____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41861 - Posted: 31 Aug 2010 | 20:59:18 UTC

Barry

All the DNETC web pages have returned (Home, account and forums). The only bit missing ATM is new work (a few hours yet) and the servers accepting crunched work "waiting to report"
____________
Go away, I was asleep


Profile Tretboot
Send message
Joined: 20 Aug 10
Posts: 7
Credit: 53,721,698
RAC: 0
Message 41862 - Posted: 31 Aug 2010 | 22:22:51 UTC

It really should not be so hard to automaticly restart what is failling every day or 2 days or so. Most MMOs that i played over the years also have a daily downtime to prevent stuff like this. (totally different application, but i guess the problem is sort of the same).

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 41864 - Posted: 31 Aug 2010 | 22:56:30 UTC
Last modified: 31 Aug 2010 | 22:58:35 UTC

Memroy leak kills system every 2 to 3 days, therefore reboot every 1 to 2 days until the source of the memory leak is found. Simple really.

So reboot every Monday, Wedensday and Friday morning.

ps. I think the system is conspiring to slow down my obtainment of major milestone...in this case 100mill cobblers on MW. Last time it was 100mill overall.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41865 - Posted: 31 Aug 2010 | 23:21:44 UTC

Work available again
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41963 - Posted: 7 Sep 2010 | 8:01:23 UTC

Looks like one of the work creator servers are down and work available is falling rapidly.

Time to turn back to Collatz?
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41971 - Posted: 7 Sep 2010 | 14:09:24 UTC

Told you so, there is no no work to dish out. Collatz coming soon to a screen near you.
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 41980 - Posted: 7 Sep 2010 | 19:30:20 UTC

Got it wrong - all is sweetness and light again!
____________
Go away, I was asleep


Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 41986 - Posted: 7 Sep 2010 | 22:14:42 UTC

I ran out long enough that my backup project kicked in and got me something to crunch.

Currently working on 50 DNETC units, but have 24 MW units waiting in the wings.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 42065 - Posted: 12 Sep 2010 | 13:10:40 UTC

Servers need kicking again, or we are outta work
____________
Go away, I was asleep


TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 42070 - Posted: 12 Sep 2010 | 14:41:05 UTC - in response to Message 42065.

There are still some units to crunch but I am not getting one anymore. It is the validator again. (And I just thought this is the second weekend with good crunching...)
____________
Greetings from,
TJ

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 42082 - Posted: 12 Sep 2010 | 21:29:20 UTC - in response to Message 42070.

Well at least there is some consistency in the problem -- if we didn't have 'no new work' and 'validator not validator' runs -- we'd not know this was MilkyWay.
____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 42317 - Posted: 23 Sep 2010 | 10:22:13 UTC

Arrggghhhh!!!!

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 42319 - Posted: 23 Sep 2010 | 12:17:31 UTC

Just starting to get messages, in BM, saying no work sent.

I see the validator is up over 16K, so the system may need a quick kick. Or we will be ....



Aaaarrrgh!!
____________
Go away, I was asleep


Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 42322 - Posted: 23 Sep 2010 | 16:05:37 UTC

I have got DNETC set as my backup GPU project and 6.11.7 is working perfectly on that front, it requests 1 unit at a time and keeps after MW for work.

This last one I needed to crunch 2 DNETC before I got more MW work.
____________

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 42454 - Posted: 29 Sep 2010 | 22:50:11 UTC


Aaaarrrgh!!
____________

Profile Toppie*
Send message
Joined: 28 Mar 09
Posts: 67
Credit: 680,319,947
RAC: 952,331
Message 42516 - Posted: 3 Oct 2010 | 9:07:03 UTC - in response to Message 42454.

Aaaarrgh!

Fred J. Verster
Send message
Joined: 22 Apr 09
Posts: 38
Credit: 27,377,932
RAC: 0
Message 42521 - Posted: 3 Oct 2010 | 12:44:48 UTC - in response to Message 42516.

Maybe, you are going too fast, I see incredible RAC's here....... ;^)
I still get one at a time, but it's goes fast on a 5870 GPU, especially compaired to a CPU.

____________

Knight Who says Ni

Profile Toppie*
Send message
Joined: 28 Mar 09
Posts: 67
Credit: 680,319,947
RAC: 952,331
Message 42532 - Posted: 3 Oct 2010 | 20:11:03 UTC - in response to Message 42521.

I got a few "Message from server: No work sent" msgs and my cache was dwindling.
Went into panic mode. Too soon I guess...

JAMC
Send message
Joined: 9 Sep 08
Posts: 96
Credit: 336,443,946
RAC: 0
Message 42534 - Posted: 3 Oct 2010 | 21:17:38 UTC

Reeee-boooot.... :)

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 42537 - Posted: 3 Oct 2010 | 21:37:57 UTC

Looks like I ran out long enough to crunch 2 DNETC wu today.
____________

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 42577 - Posted: 5 Oct 2010 | 9:05:01 UTC

5/10/2010 7:57:18 PM | Milkyway@home | Message from Milkyway@home: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.


Arrrrggghhh!

Fred J. Verster
Send message
Joined: 22 Apr 09
Posts: 38
Credit: 27,377,932
RAC: 0
Message 42581 - Posted: 5 Oct 2010 | 11:09:19 UTC - in response to Message 42577.

5/10/2010 7:57:18 PM | Milkyway@home | Message from Milkyway@home: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.


Arrrrggghhh!


That's what I got a few hours back.
But I haven't been able to find out, what exactly is going on, expected
a message on the Front Page, about the new NBODY-simulation
and where you can find this, so I can Down Load this file and use it?





____________

Knight Who says Ni

Profile White Mountain Wes
Avatar
Send message
Joined: 24 Jul 09
Posts: 21
Credit: 1,972,254
RAC: 1,628
Message 42583 - Posted: 5 Oct 2010 | 16:21:27 UTC - in response to Message 42581.

5/10/2010 7:57:18 PM | Milkyway@home | Message from Milkyway@home: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.


Arrrrggghhh!


That's what I got a few hours back.
But I haven't been able to find out, what exactly is going on, expected
a message on the Front Page, about the new NBODY-simulation
and where you can find this, so I can Down Load this file and use it?





I've been geting that same message all morning. I too have no idea what I am supposed to do about it. Can somebody in the know please enlighten us? Thanks.
____________

Profile Beyond
Send message
Joined: 15 Jul 08
Posts: 383
Credit: 501,817,389
RAC: 6
Message 42585 - Posted: 5 Oct 2010 | 16:55:13 UTC - in response to Message 42583.

5/10/2010 7:57:18 PM | Milkyway@home | Message from Milkyway@home: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Arrrrggghhh!

That's what I got a few hours back.
But I haven't been able to find out, what exactly is going on, expected
a message on the Front Page, about the new NBODY-simulation
and where you can find this, so I can Down Load this file and use it?

I've been geting that same message all morning. I too have no idea what I am supposed to do about it. Can somebody in the know please enlighten us? Thanks.

AFAIK those are WUs that so far only run on CPUs. There doesn't seem to be any GPU WUs available. List of available apps:

http://milkyway.cs.rpi.edu/milkyway/apps.php

Profile White Mountain Wes
Avatar
Send message
Joined: 24 Jul 09
Posts: 21
Credit: 1,972,254
RAC: 1,628
Message 42586 - Posted: 5 Oct 2010 | 17:23:53 UTC - in response to Message 42585.

5/10/2010 7:57:18 PM | Milkyway@home | Message from Milkyway@home: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Arrrrggghhh!

That's what I got a few hours back.
But I haven't been able to find out, what exactly is going on, expected
a message on the Front Page, about the new NBODY-simulation
and where you can find this, so I can Down Load this file and use it?

I've been geting that same message all morning. I too have no idea what I am supposed to do about it. Can somebody in the know please enlighten us? Thanks.

AFAIK those are WUs that so far only run on CPUs. There doesn't seem to be any GPU WUs available. List of available apps:

http://milkyway.cs.rpi.edu/milkyway/apps.php


But I don't have a GPU.
____________

Pascal Meeuws
Send message
Joined: 9 Jan 10
Posts: 1
Credit: 1,622,068
RAC: 81
Message 42587 - Posted: 5 Oct 2010 | 17:57:12 UTC - in response to Message 42586.

Hello Wes,

Did you by any means install an optimized client.
If so, you can add the standard NBody client into your app_info file.

Details for a Linux version is in your mailbox

Regards,

Pascal
____________
It's 100% certain. There is no intelligent life in this universe.

Profile Beyond
Send message
Joined: 15 Jul 08
Posts: 383
Credit: 501,817,389
RAC: 6
Message 42588 - Posted: 5 Oct 2010 | 18:32:59 UTC - in response to Message 42585.

5/10/2010 7:57:18 PM | Milkyway@home | Message from Milkyway@home: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Arrrrggghhh!

That's what I got a few hours back.
But I haven't been able to find out, what exactly is going on, expected
a message on the Front Page, about the new NBODY-simulation
and where you can find this, so I can Down Load this file and use it?

I've been geting that same message all morning. I too have no idea what I am supposed to do about it. Can somebody in the know please enlighten us? Thanks.

AFAIK those are WUs that so far only run on CPUs. There doesn't seem to be any GPU WUs available. List of available apps:

http://milkyway.cs.rpi.edu/milkyway/apps.php


Looks like you're running Windows with an optimized client. You won't get the nbody work unless you add the app to your app_info.xml or go back to the automatically downloaded client. You have several days worth of work still in your queue anyway, so why worry?

http://milkyway.cs.rpi.edu/milkyway/results.php?hostid=91971

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 182
Credit: 4,799
RAC: 0
Message 42589 - Posted: 5 Oct 2010 | 19:31:07 UTC
Last modified: 5 Oct 2010 | 19:32:27 UTC

*facepalm*

Sorry, my mistake. I pulled all of the separation runs, and left only nbody runs on the server - and of course, nbody doesn't run on GPUs. (yet) :P

I put some new separation runs up just now, so your GPUs should start getting some work.

This shouldn't happen again...

Cheers,
Matthew

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 42591 - Posted: 5 Oct 2010 | 19:42:19 UTC

Thanks, Matt :-)
____________

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 42592 - Posted: 5 Oct 2010 | 20:04:26 UTC - in response to Message 42589.

*facepalm*

Sorry, my mistake. I pulled all of the separation runs, and left only nbody runs on the server - and of course, nbody doesn't run on GPUs. (yet) :P

I put some new separation runs up just now, so your GPUs should start getting some work.

This shouldn't happen again...

Cheers,
Matthew


No problem, things happen sometimes.
Would it be possible to split the server-status that we can see which app has work and which does not?

Alexander

DanNeely
Send message
Joined: 6 Oct 09
Posts: 39
Credit: 77,212,649
RAC: 0
Message 42598 - Posted: 5 Oct 2010 | 22:36:55 UTC - in response to Message 42592.


Would it be possible to split the server-status that we can see which app has work and which does not?


Einstein at home does a full breakout on their server status page, so it's definately doable. Not sure how much of their stuff is custom though.

http://einstein.phys.uwm.edu/server_status.html[/url]

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 42736 - Posted: 10 Oct 2010 | 22:01:12 UTC
Last modified: 10 Oct 2010 | 22:02:22 UTC

Validator seems to be going out again.

Results ready to send 2,475
Results in progress 171,488
Workunits waiting for validation 8,070
Workunits waiting for assimilation 125
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 42744 - Posted: 10 Oct 2010 | 23:38:22 UTC - in response to Message 42736.

Validator seems to be going out again.


I whish you didn't post this, now it happend..."it" is out.
____________
Greetings from,
TJ

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 42745 - Posted: 10 Oct 2010 | 23:49:09 UTC

No work sent and the validator is now up to 48K. The last time this happened it was cleared relatively quickly. Hope this is the case again!!
____________
Go away, I was asleep


Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 42842 - Posted: 14 Oct 2010 | 14:25:28 UTC

Up to 21k waiting for validation.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 42843 - Posted: 14 Oct 2010 | 14:41:41 UTC
Last modified: 14 Oct 2010 | 15:18:28 UTC

over 30k now...

update
44k now and separation_assimilator is not Running
____________

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 42846 - Posted: 14 Oct 2010 | 15:45:26 UTC

I just finished my last wu, all they have on the server is Nbody work now.

Back to DNETC then.
____________

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 42848 - Posted: 14 Oct 2010 | 17:02:42 UTC

back to work :-)

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 42857 - Posted: 14 Oct 2010 | 21:20:16 UTC

I cane here

I saw the server status

I hope I can crunch
____________
Go away, I was asleep


Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 42886 - Posted: 16 Oct 2010 | 12:32:21 UTC

Feeder not running

Dirk Broer
Send message
Joined: 11 Dec 09
Posts: 11
Credit: 36,365,922
RAC: 62,768
Message 42887 - Posted: 16 Oct 2010 | 12:36:44 UTC - in response to Message 42886.

Feeder not running


data-driven web pages milkyway Running
upload/download server milkyway Running
scheduler milkyway Running
feeder milkyway Not Running

transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running

____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 42890 - Posted: 16 Oct 2010 | 17:03:27 UTC

Still no work, and DNETC is down. Just leaves Collatz to crunch
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 42895 - Posted: 16 Oct 2010 | 20:10:50 UTC
Last modified: 16 Oct 2010 | 20:12:28 UTC

It's back, but we lost over 7 crunching hours.

At least the repair was during the week end, so we have some hope the whole week end will not be lost?
____________
Go away, I was asleep


Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 42897 - Posted: 17 Oct 2010 | 2:22:39 UTC

Validator crapped again.



Workunits waiting for validation 112,104.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 42902 - Posted: 17 Oct 2010 | 3:57:37 UTC

I am still getting work even with the validator up that high.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 42906 - Posted: 17 Oct 2010 | 8:29:23 UTC

So am I, and the RAC count on the rig in question is slowly rising. Does this mean a way has been found to keep the system going with the validator so high or are we looking at imminent collapse?
____________
Go away, I was asleep


Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 42910 - Posted: 17 Oct 2010 | 10:36:33 UTC

Well, i currently have 44.26 pending credit which actually equals to (44.26/0.05)*213.76=189220 granted credit. Hope it validates soon.

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 42922 - Posted: 18 Oct 2010 | 4:03:43 UTC

The CPU work units have been fixed, now if only we could download them!

The current server status is:

data-driven web pages milkyway Running
upload/download server milkyway Running
scheduler milkyway Running
feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 42936 - Posted: 18 Oct 2010 | 14:56:05 UTC

Looking at the server status now, all seems to be up and running as expected. Even the validator is back to the accustomed 6.3K WUs.
____________
Go away, I was asleep


Profile Beyond
Send message
Joined: 15 Jul 08
Posts: 383
Credit: 501,817,389
RAC: 6
Message 43038 - Posted: 21 Oct 2010 | 2:53:33 UTC

Someone want to kick the server? This is what I'm seeing on all my machines:

Milkyway@home 10/20/2010 9:48:50 PM Requesting new tasks for ATI GPU
Milkyway@home 10/20/2010 9:48:51 PM Scheduler request completed: got 0 new tasks
Milkyway@home 10/20/2010 9:48:51 PM Message from Milkyway@home: No work sent
Milkyway@home 10/20/2010 9:48:51 PM Message from Milkyway@home: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 43039 - Posted: 21 Oct 2010 | 2:59:26 UTC

Workunits waiting for validation 38,275
separation_assimilator milkyway Not Running

Seems this is a regular occorance now. Whatever was done recently needs more work.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 43041 - Posted: 21 Oct 2010 | 4:00:02 UTC

Well, it looks like the situation has gone from bad to worse.

feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 182
Credit: 4,799
RAC: 0
Message 43042 - Posted: 21 Oct 2010 | 4:00:21 UTC

Should be working now.

-Matthew

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 43043 - Posted: 21 Oct 2010 | 4:15:30 UTC - in response to Message 43042.

Thank you, Matthew.

You were restarting the server when I made my post. I should have waited a minute longer before posting.

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 43044 - Posted: 21 Oct 2010 | 4:26:47 UTC

Then again maybe not!

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 43045 - Posted: 21 Oct 2010 | 4:28:10 UTC - in response to Message 43044.
Last modified: 21 Oct 2010 | 4:29:40 UTC

Ok, now we have gone from worse to bad, again. Hopefully Matthew will double check the server before he calls it a night.

separation_assimilator milkyway Not Running

cncguru
Avatar
Send message
Joined: 11 Jun 10
Posts: 329
Credit: 1,166,219,987
RAC: 0
Message 43046 - Posted: 21 Oct 2010 | 4:40:49 UTC

oh geez, off to Collatz we go!

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 182
Credit: 4,799
RAC: 0
Message 43047 - Posted: 21 Oct 2010 | 5:36:28 UTC

I worked my magic, hopefully things will be smooth again in a few minutes. I'll give Travis a head's up just in case.

-Matthew

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 43048 - Posted: 21 Oct 2010 | 5:52:50 UTC - in response to Message 43047.
Last modified: 21 Oct 2010 | 5:53:33 UTC

It failed again. It looks like it doesn't like the new work units.

separation_assimilator milkyway Not Running

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 43049 - Posted: 21 Oct 2010 | 6:49:30 UTC

Well, we are completely out of work, now.

Results ready to send 0

Does that mean we have completed mapping the Milky Way and can move on to mapping the Andromeda Galaxy, now?

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43051 - Posted: 21 Oct 2010 | 8:27:01 UTC
Last modified: 21 Oct 2010 | 8:27:31 UTC

<GRIN> and out of work as well.
____________
Go away, I was asleep


Profile [seti.international] Philip J. Fry
Avatar
Send message
Joined: 30 Apr 09
Posts: 67
Credit: 3,120,494
RAC: 0
Message 43056 - Posted: 21 Oct 2010 | 9:18:10 UTC

Ohh..

S@h is down.
MW@h is down.

Which other 'astronomy project' have CPU and nVIDIA apps?
(Except Einstein@home - which have not a real nVIDIA app)

____________
Best regards!

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 43059 - Posted: 21 Oct 2010 | 15:23:27 UTC

Atleast it is back to normal with the continual problems.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 43142 - Posted: 25 Oct 2010 | 2:10:22 UTC
Last modified: 25 Oct 2010 | 2:19:40 UTC

Well, we are completely out of work, again.

Results ready to send: 0

separation_assimilator milkyway Not Running

An emial was sent to the admins....

KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 43145 - Posted: 25 Oct 2010 | 4:01:54 UTC

Get the same here, out of work.

Brian Priebe
Send message
Joined: 27 Nov 09
Posts: 98
Credit: 171,795,000
RAC: 74,830
Message 43218 - Posted: 28 Oct 2010 | 10:01:53 UTC
Last modified: 28 Oct 2010 | 10:09:27 UTC

Is the MW server out of GPU WU's again? (I will be out of WU's to process in about 10min.) BOINC shows MW server declining to send out any GPU work when asked.

EDIT: Problem solved itself 9min after this post :).

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43224 - Posted: 28 Oct 2010 | 15:02:12 UTC

There is 2,403 WUs ready to send, but most of the other servers are currently down.

feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running

Clearly somebody is on the case!

____________
Go away, I was asleep


Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 43228 - Posted: 28 Oct 2010 | 19:20:11 UTC

Validator not working as 63586 wus waiting for validation. Also, what is the mystery of 6300 wus that validator has never been validating? I have never seen it below that value.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 43229 - Posted: 28 Oct 2010 | 19:45:30 UTC - in response to Message 43228.

Validator not working as 63586 wus waiting for validation. Also, what is the mystery of 6300 wus that validator has never been validating? I have never seen it below that value.

Once the validator gets back to 'normal' check your pending and any inconclusive results....

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 43230 - Posted: 28 Oct 2010 | 19:45:48 UTC - in response to Message 43228.

Also, what is the mystery of 6300 wus that validator has never been validating? I have never seen it below that value.


I've wondered about this, also.

It might be WUs like this one. After sending it out five times, it could not be validated.

http://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=167143584

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 43231 - Posted: 28 Oct 2010 | 19:47:27 UTC - in response to Message 43230.

Could it be the wus that don't disappear from months ago?
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43234 - Posted: 29 Oct 2010 | 0:31:56 UTC

Aaaahhhaaarrrgh!

Workunits waiting for validation 137,577

____________
Go away, I was asleep


Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 43236 - Posted: 29 Oct 2010 | 2:28:44 UTC - in response to Message 43234.

Aaaahhhaaarrrgh!

Workunits waiting for validation 137,577


It is still going up.

Workunits waiting for validation 170,249

I am surprised that we are still getting WUs with it being that high.

I sent an email to the admins requesting that it be restarted....

Brian Priebe
Send message
Joined: 27 Nov 09
Posts: 98
Credit: 171,795,000
RAC: 74,830
Message 43237 - Posted: 29 Oct 2010 | 5:40:04 UTC - in response to Message 43236.
Last modified: 29 Oct 2010 | 5:41:12 UTC

Now at 225,581 and counting. We should start a betting pool to see who can predict how high it will get...

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43238 - Posted: 29 Oct 2010 | 7:43:01 UTC

Are the usual validator rules not broke?

We are still crunching, despite the result Workunits waiting for validation 270,699
____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 43240 - Posted: 29 Oct 2010 | 9:21:09 UTC - in response to Message 41070.
Last modified: 29 Oct 2010 | 9:22:56 UTC




Care of Scarecrow


Returning them at a rate of 50,000 every 3 hrs!

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43252 - Posted: 29 Oct 2010 | 13:30:44 UTC

At that rate we will be back to normal quite quickly.

That assumes the validator trip over bug waits until things drop to 6.3K waiting.
____________
Go away, I was asleep


Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 43270 - Posted: 29 Oct 2010 | 16:31:51 UTC

I ran out for a little bit, and also so a message that the project was down for maintenance once.
____________

Profile Werkstatt
Send message
Joined: 19 Feb 08
Posts: 297
Credit: 105,742,273
RAC: 141
Message 43382 - Posted: 1 Nov 2010 | 15:11:18 UTC

Validator shows more than 48k wu pending.
I have sent a email to the project .

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43386 - Posted: 1 Nov 2010 | 16:29:36 UTC

The server status shows the majority of then red, showing someone is working on them. Let us hope the 24 cache is long enough for them to come back online and dish out new work <cross fingers>
____________
Go away, I was asleep


msattler
Send message
Joined: 15 Jul 08
Posts: 288
Credit: 5,474,012
RAC: 0
Message 43387 - Posted: 1 Nov 2010 | 17:12:10 UTC

Fresh out of work here.....a pair of GTX295s makes pretty short work out of that small cache.
____________
I am the Kittyman.

Please visit and give a Click for Seti City.




msattler
Send message
Joined: 15 Jul 08
Posts: 288
Credit: 5,474,012
RAC: 0
Message 43389 - Posted: 1 Nov 2010 | 17:27:16 UTC

Looks like we are back underway......
Just reported my cache and got 17 new WUs.
____________
I am the Kittyman.

Please visit and give a Click for Seti City.




BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 43392 - Posted: 1 Nov 2010 | 18:41:41 UTC - in response to Message 43389.

Or not:

11/1/2010 11:40:02 AM Milkyway@home Reporting 24 completed tasks, not requesting new tasks
11/1/2010 11:40:04 AM Milkyway@home Scheduler request completed
11/1/2010 11:40:04 AM Milkyway@home Message from server: Project is temporarily shut down for maintenance


Looks like we are back underway......
Just reported my cache and got 17 new WUs.


____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 43428 - Posted: 2 Nov 2010 | 19:20:03 UTC

Server down..

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 43429 - Posted: 2 Nov 2010 | 19:32:38 UTC

..and up.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43430 - Posted: 2 Nov 2010 | 20:18:59 UTC

Still no work being distributed though. Running on reducing cache.
____________
Go away, I was asleep


Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,339,711
RAC: 399,830
Message 43431 - Posted: 2 Nov 2010 | 20:38:11 UTC - in response to Message 43430.

I keep getting one machine running dry, and not being fed. The other is ok albeit gets short at times. Its my dual 5970 machine that is not getting fed quick enough, keeps getting "zero sent" whilst there are WUs available. At literally the same instance in time, my other machine with a 5850 gets WUs.....

I have switched the dual 5970 to Collatz for now until supply is sorted out, as I am having to baby sit it, and its running at half capacity at MW at present

Regards
Zy

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 43432 - Posted: 2 Nov 2010 | 20:39:33 UTC - in response to Message 43431.

I keep getting one machine running dry, and not being fed. The other is ok albeit gets short at times. Its my dual 5970 machine that is not getting fed quick enough, keeps getting "zero sent" whilst there are WUs available. At literally the same instance in time, my other machine with a 5850 gets WUs.....

I have switched the dual 5970 to Collatz for now until supply is sorted out, as I am having to baby sit it, and its running at half capacity at MW at present

Regards
Zy

B A C K U P P R O J E C T !

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 43454 - Posted: 3 Nov 2010 | 21:14:58 UTC

We are in "code red" at the moment.
____________
Greetings from,
TJ

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 182
Credit: 4,799
RAC: 0
Message 43456 - Posted: 3 Nov 2010 | 22:13:27 UTC

We're working on it.

-Matthew

55degrees
Send message
Joined: 8 Sep 09
Posts: 62
Credit: 61,330,584
RAC: 0
Message 43457 - Posted: 3 Nov 2010 | 22:17:32 UTC - in response to Message 43432.

B A C K U P P R O J E C T !


this has probably been hammered to pulp, but I am not interested in scouring this whopper thread.

I have a feeling backupproject is, in the current ati environement, not as envisioned when collatz thrives on memory clock and mw does not. changes have to be made if interested in keeping the gpus setup optimally for each project and that requires intervention.


BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 43461 - Posted: 3 Nov 2010 | 23:26:40 UTC - in response to Message 43457.

I agree, I run three ATI GPU projects, Dnetc and Collatz run at least moderately well together. When I add MW into the mix, the only way MW will process work is when I temporarily suspend the other two projects.

Since MW is, frankly, the LEAST reliable of the three projects at this point, what I do is suspend the other two projects only when I can check back often enough to make sure that MW hasn't gone walkabout (like it has this afternoon), so I can pull either or both Collatz and Dnetc off of suspend mode, at which point they take over GPU processing (even if there is MW work).



B A C K U P P R O J E C T !


this has probably been hammered to pulp, but I am not interested in scouring this whopper thread.

I have a feeling backupproject is, in the current ati environement, not as envisioned when collatz thrives on memory clock and mw does not. changes have to be made if interested in keeping the gpus setup optimally for each project and that requires intervention.




____________

Profile [seti.international] Philip J. Fry
Avatar
Send message
Joined: 30 Apr 09
Posts: 67
Credit: 3,120,494
RAC: 0
Message 43462 - Posted: 4 Nov 2010 | 0:06:34 UTC - in response to Message 43456.

We're working on it.

-Matthew


Thanks for info!

____________
Best regards!

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 43463 - Posted: 4 Nov 2010 | 0:28:47 UTC - in response to Message 43462.

Though a bit more info might be a good thing (ie outage for hours, days)...

We're working on it.

-Matthew


Thanks for info!


____________

Profile Mike
Avatar
Send message
Joined: 31 Oct 10
Posts: 134
Credit: 2,061,348
RAC: 0
Message 43561 - Posted: 7 Nov 2010 | 9:08:15 UTC
Last modified: 7 Nov 2010 | 9:17:55 UTC

I always get the message won´t finnish in time.
Thats just a joke.

My monster does all the units i have in my cache in less a day.
Sadly i have to crunch einstein and collatz.

Boinc 6.10.58
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43563 - Posted: 7 Nov 2010 | 9:26:55 UTC

Mike

Many of the BM Messages tab messages are there to be ignored if you know your monster is meeting it's project targets.
____________
Go away, I was asleep


Profile Mike
Avatar
Send message
Joined: 31 Oct 10
Posts: 134
Credit: 2,061,348
RAC: 0
Message 43564 - Posted: 7 Nov 2010 | 9:51:56 UTC - in response to Message 43563.

Mike

Many of the BM Messages tab messages are there to be ignored if you know your monster is meeting it's project targets.


At least i dont get work.

I´m running 6 cores @ 3.8 GHZ atm but nothing to crunch.

____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43584 - Posted: 8 Nov 2010 | 2:20:49 UTC

Validator up to 50K ATM, so trouble is here.

My MW tasks report but are now not getting new work.

I will leave the good cache and revert to Collatz.
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43753 - Posted: 11 Nov 2010 | 20:43:23 UTC

Validator choking on returned work.

Is this another Aaaarrrrgggghhhh moment soon?
____________
Go away, I was asleep


55degrees
Send message
Joined: 8 Sep 09
Posts: 62
Credit: 61,330,584
RAC: 0
Message 43822 - Posted: 14 Nov 2010 | 11:30:31 UTC
Last modified: 14 Nov 2010 | 11:31:21 UTC

11/14/2010 6:14:52 AM Milkyway@home Sending scheduler request: To fetch work.
11/14/2010 6:14:52 AM Milkyway@home Requesting new tasks for GPU
11/14/2010 6:14:55 AM Milkyway@home Scheduler request completed: got 0 new tasks
11/14/2010 6:14:55 AM Milkyway@home Message from server: No work sent
11/14/2010 6:14:55 AM Milkyway@home Message from server: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.
11/14/2010 6:17:02 AM Milkyway@home Sending scheduler request: To fetch work.
11/14/2010 6:17:02 AM Milkyway@home Requesting new tasks for GPU
11/14/2010 6:17:05 AM Milkyway@home Scheduler request completed: got 0 new tasks
11/14/2010 6:17:05 AM Milkyway@home Message from server: No work sent
11/14/2010 6:17:05 AM Milkyway@home Message from server: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.
11/14/2010 6:19:35 AM Milkyway@home update requested by user
11/14/2010 6:19:37 AM Milkyway@home Sending scheduler request: Requested by user.
11/14/2010 6:19:37 AM Milkyway@home Requesting new tasks for GPU
11/14/2010 6:19:40 AM Milkyway@home Scheduler request completed: got 0 new tasks
11/14/2010 6:19:40 AM Milkyway@home Message from server: No work sent
11/14/2010 6:19:40 AM Milkyway@home Message from server: Your app_info.xml file doesn't have a version of MilkyWay@Home N-Body Simulation.


server status indicates WUs available, running. I run gpus, not cpu. not getting WUs. I post this because the messages kind of suggest no WUs because I am not running n-body simulation. that has not been a problem until about an hour ago. something change, lately?

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,339,711
RAC: 399,830
Message 43823 - Posted: 14 Nov 2010 | 11:50:31 UTC - in response to Message 43822.
Last modified: 14 Nov 2010 | 11:54:24 UTC

The validator is choking again. An hour ago is was about par, 6,500 waiting for validation. Over the last hour its steadily risen, and its now well over 50,000 and still rising. When it gets like that it trys to balance off WU production V Supply V Validation and you end up with bursts of available WUs etc, but its an inexorable move to falling over. See what happens, lately its survived on over 200,000 waiting validation, but at present the runes are not good, and back up projects need checking and winding up. Would be no surprise if it fell over totally within an hour from now.

EDIT:
Hope springs eternal - the GPU WU assimilator just went offline, could be someone is beavering away up there , lets hope so (and plaudits as its Sunday and a day off as such for them)

Regards
Zy

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43825 - Posted: 14 Nov 2010 | 13:33:07 UTC

And still rising - now to 72,490

Some admin needs to sort out the servers, they deserve a good kicking.
____________
Go away, I was asleep


KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 43827 - Posted: 14 Nov 2010 | 14:54:08 UTC

Not gotten squat the past few days. GPU's sitting there doing nothing.

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,339,711
RAC: 399,830
Message 43828 - Posted: 14 Nov 2010 | 14:58:53 UTC - in response to Message 43827.
Last modified: 14 Nov 2010 | 15:03:32 UTC

Supply was fine up to earlier this morning. There must be another reason connected with your setup - what is your cache level, held WUs for different Projects, and BOINC stated run time for each of your GPU Project WUs?

EDIT:
Just had a look at your machines, your 3300 has been performing fine, the others are empty, so there is a difference somewhere in the setups between the main machine and the others which were not getting work. None will right now due to the validator

Regards
Zy

KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 43830 - Posted: 14 Nov 2010 | 15:14:38 UTC - in response to Message 43828.

The 3300 is the one showing me in red,multiple lines, no work received. I switched it to Collatz. It's been doing this for a few days. I should be over or near 5 mill by now.
I run one week MW and one Collatz.

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,339,711
RAC: 399,830
Message 43831 - Posted: 14 Nov 2010 | 15:55:40 UTC - in response to Message 43830.
Last modified: 14 Nov 2010 | 16:00:02 UTC

The reason for my question was that the 3300 crunched over 60 this morning up to the time the servers fell over, yet the others show no crunching. Those 60 have not yet been validated due to this mornings problems, but they were downloaded and crunched, so there must be a difference in the setups somewhere. Cant do much anyway until the servers back up properly as cant test anything.

EDIT:
Just started getting new work and the Validator figure is dropping, might be on the up now - see what happens.

Regards
Zy

KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 43832 - Posted: 14 Nov 2010 | 16:03:02 UTC - in response to Message 43831.
Last modified: 14 Nov 2010 | 16:04:31 UTC

Thanks for the replies.
The 3300 is dedicated 24/7, the others only run in downtime which has not be much of late. I will switch back to MW to see if it is now working properly.

Update: It is now working. Thanks!

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43864 - Posted: 16 Nov 2010 | 1:19:18 UTC

Aaargh!

Validator up to 33,059 WUs and we are heading to another validator choke?
____________
Go away, I was asleep


CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 43866 - Posted: 16 Nov 2010 | 1:36:15 UTC

i'm not getting WUs...

KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 43867 - Posted: 16 Nov 2010 | 1:49:22 UTC

Ditto.

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 43868 - Posted: 16 Nov 2010 | 2:07:19 UTC

now i'm getting WUs, let's crunch :-)

TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 43869 - Posted: 16 Nov 2010 | 2:11:18 UTC - in response to Message 43868.
Last modified: 16 Nov 2010 | 2:13:01 UTC

Won't be for long I think. A few hours ago the six servers from the bottom up where red. And the validator is rising. I have a lot of pages with "waiting for validation". But yes I am also getting WU's, as is there a small delay at times.
I go sleeping, see tomorrow what happend overnight.
____________
Greetings from,
TJ

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43874 - Posted: 16 Nov 2010 | 9:01:34 UTC

Still getting work OK, but the validator is now up to 90,699. It cannot be long before the servers need kicking to make things work smoothly again.
____________
Go away, I was asleep


TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 43876 - Posted: 16 Nov 2010 | 10:05:03 UTC

Not getting new work for the last 4 hours.
____________
Greetings from,
TJ

Profile kashi
Send message
Joined: 30 Dec 07
Posts: 309
Credit: 148,432,104
RAC: 0
Message 43877 - Posted: 16 Nov 2010 | 10:16:01 UTC

Waiting for validation just keeps rising, so I have pulled the plug until it starts to go down.

Current _2s tasks are completing in 6-95 seconds or so on some Linux machines, so this may be putting an extra strain on the validator. _3s tasks take 5 hours or longer on some of these same Linux machines.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43879 - Posted: 16 Nov 2010 | 13:53:23 UTC

Validator still high, but has dropped to 50,732 since I last posted.

Still getting new work.
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43914 - Posted: 17 Nov 2010 | 12:36:08 UTC
Last modified: 17 Nov 2010 | 12:37:14 UTC

Aaargh!

Validator up to 14,734
____________
Go away, I was asleep


KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 43918 - Posted: 17 Nov 2010 | 14:38:06 UTC

No new work for me.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 43929 - Posted: 17 Nov 2010 | 20:21:11 UTC

Back to normal again, if you can define normal?
____________
Go away, I was asleep


Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 128
Credit: 203,351,125
RAC: 1,217
Message 43966 - Posted: 19 Nov 2010 | 2:00:20 UTC

Normal it is!
____________

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44023 - Posted: 21 Nov 2010 | 19:04:02 UTC

And, so it begins, yet again:

Workunits waiting for validation 13,877

-Mike

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44025 - Posted: 21 Nov 2010 | 20:30:34 UTC

Heading for normal again -

Workunits waiting for validation 6,402
____________
Go away, I was asleep


Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44027 - Posted: 21 Nov 2010 | 20:48:54 UTC - in response to Message 44025.
Last modified: 21 Nov 2010 | 21:48:31 UTC

Heading for normal again -

Workunits waiting for validation 6,402


Interesting thing is:

The server reduced the number of workunits waiting to be validated by just marking all of them as invalid.

The validator is continuing to mark most of the workunits returned as invalid. I hope this gets fixed soon.

I sent an email to the admins about this problem.

-Mike

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 44033 - Posted: 21 Nov 2010 | 22:08:58 UTC - in response to Message 44027.

Heading for normal again -

Workunits waiting for validation 6,402


Interesting thing is:

The server reduced the number of workunits waiting to be validated by just marking all of them as invalid.

The validator is continuing to mark most of the workunits returned as invalid. I hope this gets fixed soon.

I sent an email to the admins about this problem.

-Mike

Good thing it works great!
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 44058 - Posted: 22 Nov 2010 | 18:30:57 UTC

Kaboom!!
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 44059 - Posted: 22 Nov 2010 | 18:44:43 UTC

bugger...

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44061 - Posted: 22 Nov 2010 | 19:25:22 UTC

Aaargh! Down again.

Hopefully when it comes back up it will be sporting new GPU workunits that run for 30 minutes on a HD5970, with the appropriate credit granted. This would make my day.

-Mike

KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 44069 - Posted: 22 Nov 2010 | 20:41:15 UTC

I'm getting "feeder not running" error?

KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 44070 - Posted: 22 Nov 2010 | 20:47:57 UTC - in response to Message 44061.

Aaargh! Down again.

Hopefully when it comes back up it will be sporting new GPU workunits that run for 30 minutes on a HD5970, with the appropriate credit granted. This would make my day.

-Mike



That sounds like a plan. Good luck and share some times.
I am still waiting for the 58xx prices to drop so I can snag a few. Sadly, I may be waiting awhile.

Profile Purple Rabbit
Avatar
Send message
Joined: 9 Nov 08
Posts: 43
Credit: 62,531,510
RAC: 68,424
Message 44073 - Posted: 22 Nov 2010 | 21:11:49 UTC
Last modified: 22 Nov 2010 | 21:12:54 UTC

The good news is that "something" is happening as opposed to having the server contemplating its navel :)

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44074 - Posted: 22 Nov 2010 | 21:22:14 UTC - in response to Message 44073.
Last modified: 22 Nov 2010 | 21:23:40 UTC

The good news is that "something" is happening as opposed to having the server contemplating its navel :)


Well, we are not really sure anything is happening, other than the server is down. I haven't seen any posts from the admins about the problem.

I think they think that "posts" means, post something post-problem. Like after it has been fixed.

-Mike

Edited to add: I hope they are not on Thanksgiving break until November 29.

Profile prairie69
Send message
Joined: 2 Nov 08
Posts: 11
Credit: 162,463
RAC: 0
Message 44075 - Posted: 22 Nov 2010 | 21:46:30 UTC - in response to Message 44059.

bugger...


Couldn't have said it better.

VBobier
Avatar
Send message
Joined: 29 Jul 08
Posts: 259
Credit: 1,320,525
RAC: 0
Message 44077 - Posted: 22 Nov 2010 | 21:52:21 UTC - in response to Message 44069.

I'm getting "feeder not running" error?

Me too, Can't report any work, video card near starvation...
____________
12/21/2012? Bah Humbug!

Profile Purple Rabbit
Avatar
Send message
Joined: 9 Nov 08
Posts: 43
Credit: 62,531,510
RAC: 68,424
Message 44078 - Posted: 22 Nov 2010 | 21:59:31 UTC - in response to Message 44074.
Last modified: 22 Nov 2010 | 22:11:58 UTC

The good news is that "something" is happening as opposed to having the server contemplating its navel :)


Well, we are not really sure anything is happening, other than the server is down. I haven't seen any posts from the admins about the problem.


Past downtimes have shown the entire server up and ready, but just not doing anything. Now part of the server is down and not doing anything. I'm guessing this is better :) I'm processing Collatz and DNETC one task at a time as a backup (resource share 0) until everything is copacetic again with MW.

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 44079 - Posted: 22 Nov 2010 | 22:08:43 UTC - in response to Message 44074.


Edited to add: I hope they are not on Thanksgiving break until November 29.

That's a good posibility, I would think the school is only closed on Thanksgiving day. Unless it could be restarted/fixed away from campus.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

KWH*
Avatar
Send message
Joined: 24 Aug 10
Posts: 180
Credit: 44,748,528
RAC: 115
Message 44080 - Posted: 22 Nov 2010 | 22:12:39 UTC - in response to Message 44078.

The good news is that "something" is happening as opposed to having the server contemplating its navel :)


Well, we are not really sure anything is happening, other than the server is down. I haven't seen any posts from the admins about the problem.


Past downtimes have shown the server up and ready, but just not doing anything. Now part of the server is down and not doing anything. I'm guessing this is better :) I'm processing Colatz and DNETC one task at a time as a backup (resource share 0) until everything is copacetic again with MW.


Yeah, I am back on Collatz as well. Seems like MW starts going good then it craps out. Is the server(s) run from the team owners location or from "HQ"?

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44081 - Posted: 22 Nov 2010 | 22:15:06 UTC - in response to Message 44079.


Edited to add: I hope they are not on Thanksgiving break until November 29.

That's a good posibility, I would think the school is only closed on Thanksgiving day. Unless it could be restarted/fixed away from campus.


The college that I work at is closed all this week so the students can head home for the holiday. We closed Saturday and will not reopen until the 29th. I am sure they probably are doing the same thing.

-Mike

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 30 Aug 07
Posts: 1976
Credit: 26,480
RAC: 0
Message 44094 - Posted: 23 Nov 2010 | 0:36:33 UTC - in response to Message 44081.

I'll be around this week making sure things are running, but our labstaff wont so if there's a serious problem it wont be fixed until after the break. However, it seems our serious problem for the holidays already happened over this weekend (the corrupted disk) and it was fixed today, so hopefully it will be smooth sailing here on out.
____________

Profile prairie69
Send message
Joined: 2 Nov 08
Posts: 11
Credit: 162,463
RAC: 0
Message 44105 - Posted: 23 Nov 2010 | 2:44:08 UTC

Thanks, Travis!!

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44107 - Posted: 23 Nov 2010 | 3:15:01 UTC
Last modified: 23 Nov 2010 | 3:15:17 UTC

Workunits waiting for validation 59,155
Workunits waiting for assimilation 147
Workunits waiting for deletion 73
Results waiting for deletion 445
Transitioner backlog (hours) 0


Well, it looks like the validator is backed up. It needs a good flushing....

Profile Mike
Avatar
Send message
Joined: 31 Oct 10
Posts: 134
Credit: 2,061,348
RAC: 0
Message 44111 - Posted: 23 Nov 2010 | 8:35:45 UTC

update

Results ready to send 2,922
Results in progress 190,573
Workunits waiting for validation 172,854
Workunits waiting for assimilation 68
Workunits waiting for deletion 4
Results waiting for deletion 4
Transitioner backlog (hours) 0
____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 44116 - Posted: 23 Nov 2010 | 14:21:15 UTC

Wasn't it kicked couple of hours ago? This server is dying:(

CTAPbIi
Send message
Joined: 4 Jan 10
Posts: 86
Credit: 51,753,924
RAC: 0
Message 44119 - Posted: 23 Nov 2010 | 16:22:38 UTC

server is down for a while. and now BOINC challenge ongoing :(

VBobier
Avatar
Send message
Joined: 29 Jul 08
Posts: 259
Credit: 1,320,525
RAC: 0
Message 44120 - Posted: 23 Nov 2010 | 16:41:04 UTC

Perhaps Milkyway needs a Fund Drive for a New Server? Over at Seti@home, We the Users bought the project two New HP Servers, Named Carolyn and Oscar, Oscar is the name of Msattlers deceased Cat, But then He started both of the Fund Raisers, The 2nd one has raised a little over $16,000 so far(lots of small donations are happening, S@H is having a weekly outage right now though for a few hours), Carolyn is now online as the Boinc Replica Database Server and by Next week Carolyn will be swapped with Jocelyn to become the Master, Oscar will take about another week or so to come online and then Oscar will replace Thumper, Cause that Wabbit is just about fricasseed.
____________
12/21/2012? Bah Humbug!

Profile GalaxyIce
Avatar
Send message
Joined: 6 Apr 08
Posts: 2018
Credit: 100,142,856
RAC: 0
Message 44124 - Posted: 23 Nov 2010 | 17:50:22 UTC - in response to Message 44119.
Last modified: 23 Nov 2010 | 18:07:59 UTC

server is down for a while. and now BOINC challenge ongoing :(

Server is back up CTAPbIi and the challenge continues ... (until the server is down again)
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44128 - Posted: 23 Nov 2010 | 20:48:45 UTC

Just another hitch building - Workunits waiting for validation 174,591
____________
Go away, I was asleep


Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44138 - Posted: 23 Nov 2010 | 23:25:04 UTC - in response to Message 44128.
Last modified: 23 Nov 2010 | 23:26:22 UTC

Just another hitch building - Workunits waiting for validation 174,591


I sure wish that the validator would start validating something! It is up to 231,744.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44139 - Posted: 24 Nov 2010 | 0:37:39 UTC
Last modified: 24 Nov 2010 | 0:40:00 UTC

And still rising - Workunits waiting for validation 258,318

Something is bound to break soon with that high number
____________
Go away, I was asleep


Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44142 - Posted: 24 Nov 2010 | 0:56:56 UTC - in response to Message 44139.

And still rising - Workunits waiting for validation 258,318

Something is bound to break soon with that high number


I am willing to bet that it will fail before it goes over 300,000.

-Mike

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 44144 - Posted: 24 Nov 2010 | 1:35:26 UTC - in response to Message 44142.

And still rising - Workunits waiting for validation 258,318

Something is bound to break soon with that high number


I am willing to bet that it will fail before it goes over 300,000.

-Mike

Workunits waiting for validation 280,041.
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Dataman
Avatar
Send message
Joined: 5 Sep 08
Posts: 14
Credit: 100,034,466
RAC: 20
Message 44145 - Posted: 24 Nov 2010 | 1:51:01 UTC

Down
____________

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44146 - Posted: 24 Nov 2010 | 1:51:37 UTC - in response to Message 44142.

And still rising - Workunits waiting for validation 258,318

Something is bound to break soon with that high number


I am willing to bet that it will fail before it goes over 300,000.

-Mike


Well, the validator made it to 282,697 before it either failed or was stopped.

feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running

-Mike[/code]

Dataman
Avatar
Send message
Joined: 5 Sep 08
Posts: 14
Credit: 100,034,466
RAC: 20
Message 44147 - Posted: 24 Nov 2010 | 1:57:17 UTC

Up
Perhaps this project should be run by Yoyo :)
I'm getting work again.
____________

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 44150 - Posted: 24 Nov 2010 | 3:06:29 UTC

284,000

...the anticipation...
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Profile TomaszPawel
Avatar
Send message
Joined: 9 Nov 08
Posts: 41
Credit: 92,786,635
RAC: 0
Message 44155 - Posted: 24 Nov 2010 | 6:27:55 UTC
Last modified: 24 Nov 2010 | 6:33:20 UTC

Up
Perhaps this project should be run by Yoyo :)
I'm getting work again.

Rather by DNETC@HOME :D

2010-11-24 07:25:01 Milkyway@home Message from server: No work available

Workunits waiting for validation 200,807
____________
A proud member of the Polish National Team

COME VISIT US at Polish National Team FORUM

Profile NullCoding*
Avatar
Send message
Joined: 23 Sep 10
Posts: 24
Credit: 10,242,586
RAC: 868
Message 44158 - Posted: 24 Nov 2010 | 8:26:52 UTC

They stopped work generation because there were duplicate assimilators running, which was messing up credit. The backlog is quite large though...and I've been getting work consistently (although I do have a huge buffer)

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44160 - Posted: 24 Nov 2010 | 8:45:25 UTC

Falling!

Falling!

Falling????

Workunits waiting for validation 137,250
____________
Go away, I was asleep


Profile GalaxyIce
Avatar
Send message
Joined: 6 Apr 08
Posts: 2018
Credit: 100,142,856
RAC: 0
Message 44161 - Posted: 24 Nov 2010 | 8:50:05 UTC - in response to Message 44158.

They stopped work generation because there were duplicate assimilators running, which was messing up credit.

Yes, that was so. But since then they have reported a corrupt disk (which may have caused all the credit problems, so they say), and now they have stopped work generation to try and get the assimilator/validator to catch up.

____________

Profile Mike
Avatar
Send message
Joined: 31 Oct 10
Posts: 134
Credit: 2,061,348
RAC: 0
Message 44163 - Posted: 24 Nov 2010 | 10:13:47 UTC


At least i hope to get work over the weekend.

____________

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44172 - Posted: 24 Nov 2010 | 16:28:27 UTC
Last modified: 24 Nov 2010 | 16:29:01 UTC

This is very bad timing with the Retupmoc Milky Way Challenge going on this week.

-Mike
____________

Profile Mike
Avatar
Send message
Joined: 31 Oct 10
Posts: 134
Credit: 2,061,348
RAC: 0
Message 44173 - Posted: 24 Nov 2010 | 16:39:13 UTC

Well, i´m crunching for 3 days now with my new GPU.
2 of them i got no work.

Murphys Law.

____________

Profile GalaxyIce
Avatar
Send message
Joined: 6 Apr 08
Posts: 2018
Credit: 100,142,856
RAC: 0
Message 44179 - Posted: 24 Nov 2010 | 17:35:37 UTC - in response to Message 44172.

This is very bad timing with the Retupmoc Milky Way Challenge going on this week.

-Mike

Let me know if it gets going again. It's a little strange to be sitting around in the middle of a race.

____________

msattler
Send message
Joined: 15 Jul 08
Posts: 288
Credit: 5,474,012
RAC: 0
Message 44182 - Posted: 24 Nov 2010 | 18:37:32 UTC

I just now got an allocation of 24 tasks.......
____________
I am the Kittyman.

Please visit and give a Click for Seti City.




msattler
Send message
Joined: 15 Jul 08
Posts: 288
Credit: 5,474,012
RAC: 0
Message 44184 - Posted: 24 Nov 2010 | 18:53:10 UTC
Last modified: 24 Nov 2010 | 19:00:06 UTC

And now nothing available again.
Either the splitters cannot keep up with demand, or maybe just a batch of work sent out for testing purposes.


EDIT...
And of course, just as I posted this, some more work popped in.
So I guess just keep hitting the buttons...if you are into that kind of thing.
LOL.
____________
I am the Kittyman.

Please visit and give a Click for Seti City.




John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44195 - Posted: 24 Nov 2010 | 21:30:45 UTC

The rig I am currently dedicating to Milkyway GPU crunching seems to have been running OK with it's minimal cache of 23 WUS (and one being crunched).
____________
Go away, I was asleep


Profile GalaxyIce
Avatar
Send message
Joined: 6 Apr 08
Posts: 2018
Credit: 100,142,856
RAC: 0
Message 44196 - Posted: 24 Nov 2010 | 21:39:29 UTC

All my ATI cards are now getting 12 WUs and crunching two at a time, in hope that the Retupmoc competition continues. ...Until it all goes pear shaped again at which time they'll all be switching over to pear@shape again.

____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44231 - Posted: 25 Nov 2010 | 17:20:16 UTC

Once again it's rising ...


Workunits waiting for validation 8,690
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44251 - Posted: 25 Nov 2010 | 20:58:11 UTC
Last modified: 25 Nov 2010 | 20:58:37 UTC

Now, isn't that strange?

Workunits waiting for validation 0
____________
Go away, I was asleep


Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44252 - Posted: 25 Nov 2010 | 21:48:17 UTC - in response to Message 44251.

Now, isn't that strange?

Workunits waiting for validation 0


After you read this post, it won't seem as strange.

http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=2080

-Mike
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44262 - Posted: 26 Nov 2010 | 1:05:22 UTC - in response to Message 44252.

Now, isn't that strange?

Workunits waiting for validation 0


After you read this post, it won't seem as strange.

http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=2080

-Mike



It dawns, but slowly!!
____________
Go away, I was asleep


Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 30 Aug 07
Posts: 1976
Credit: 26,480
RAC: 0
Message 44271 - Posted: 26 Nov 2010 | 7:06:32 UTC - in response to Message 44262.

I don't want to break things by saying it, but it seems like we might make it through the holiday without too many problems. :)
____________

Profile GalaxyIce
Avatar
Send message
Joined: 6 Apr 08
Posts: 2018
Credit: 100,142,856
RAC: 0
Message 44279 - Posted: 26 Nov 2010 | 11:25:56 UTC
Last modified: 26 Nov 2010 | 11:27:14 UTC

I take it you mean the American holiday yesterday with some Americans taking a day of vacation today (Friday) to make a four-day weekend ?

Or are we talking about the annual year end holidays in a month or so?
____________

Profile Travis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Send message
Joined: 30 Aug 07
Posts: 1976
Credit: 26,480
RAC: 0
Message 44284 - Posted: 26 Nov 2010 | 16:56:04 UTC - in response to Message 44279.
Last modified: 26 Nov 2010 | 16:56:23 UTC

Just the thanksgiving holiday. I'm sure we'll have all kinds of things break before the end of the year. Fate doesn't like me sleeping too well.
____________

Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 44382 - Posted: 28 Nov 2010 | 10:11:06 UTC

Separation_assimilator not running. This is probably due to high load? It appears very often these days.

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44387 - Posted: 28 Nov 2010 | 13:34:39 UTC

Back to normal, again, I think?
____________
Go away, I was asleep


Profile Arif Mert Kapicioglu
Send message
Joined: 14 Dec 09
Posts: 158
Credit: 572,866,449
RAC: 4,516
Message 44422 - Posted: 29 Nov 2010 | 21:01:43 UTC

feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running

Profile GalaxyIce
Avatar
Send message
Joined: 6 Apr 08
Posts: 2018
Credit: 100,142,856
RAC: 0
Message 44424 - Posted: 29 Nov 2010 | 21:21:06 UTC - in response to Message 44422.

feeder milkyway Not Running
transitioner milkyway Not Running
milkyway_purge milkyway Not Running
file_deleter milkyway Not Running
nbody_assimilator milkyway Not Running
separation_assimilator milkyway Not Running

Yes, back to normal I think
____________

JAMC
Send message
Joined: 9 Sep 08
Posts: 96
Credit: 336,443,946
RAC: 0
Message 44425 - Posted: 29 Nov 2010 | 21:34:21 UTC

SNAFU

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 182
Credit: 4,799
RAC: 0
Message 44426 - Posted: 29 Nov 2010 | 22:33:03 UTC

Should be back up now.

-Matthew

Profile Mike
Avatar
Send message
Joined: 31 Oct 10
Posts: 134
Credit: 2,061,348
RAC: 0
Message 44441 - Posted: 30 Nov 2010 | 4:24:54 UTC

Still no work.

____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44449 - Posted: 30 Nov 2010 | 8:33:14 UTC

Work flowing now, but I needed Collatz overnight.
____________
Go away, I was asleep


Profile The Gas Giant
Avatar
Send message
Joined: 24 Dec 07
Posts: 1947
Credit: 240,865,573
RAC: 0
Message 44450 - Posted: 30 Nov 2010 | 8:56:44 UTC

128,000 waiting for validation

Profile Mike
Avatar
Send message
Joined: 31 Oct 10
Posts: 134
Credit: 2,061,348
RAC: 0
Message 44652 - Posted: 4 Dec 2010 | 8:50:03 UTC

I dont get work again.

____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44653 - Posted: 4 Dec 2010 | 8:58:38 UTC
Last modified: 4 Dec 2010 | 8:59:13 UTC

Back to normal all ready again - Workunits waiting for validation 15
____________
Go away, I was asleep


Profile Mike
Avatar
Send message
Joined: 31 Oct 10
Posts: 134
Credit: 2,061,348
RAC: 0
Message 44658 - Posted: 4 Dec 2010 | 14:14:32 UTC

Still not getting work.
My card is sitting idle since 3 hours while i was shopping.

____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44661 - Posted: 4 Dec 2010 | 15:04:48 UTC

Mike

Try a Project Reset at the BM "Projects" Tab
____________
Go away, I was asleep


msattler
Send message
Joined: 15 Jul 08
Posts: 288
Credit: 5,474,012
RAC: 0
Message 44662 - Posted: 4 Dec 2010 | 15:05:24 UTC - in response to Message 44658.

Still not getting work.
My card is sitting idle since 3 hours while i was shopping.

Check again....
Server status is all go right now. I have my 24 tasks in cache, and am getting new ones as they complete to top off.
____________
I am the Kittyman.

Please visit and give a Click for Seti City.




Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 128
Credit: 203,351,125
RAC: 1,217
Message 44672 - Posted: 4 Dec 2010 | 17:13:40 UTC

Crash and burn here...
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44674 - Posted: 4 Dec 2010 | 17:27:51 UTC

Same here.

Collatz calls about now!
____________
Go away, I was asleep


Profile KWSN Checklist
Avatar
Send message
Joined: 12 Aug 08
Posts: 128
Credit: 203,351,125
RAC: 1,217
Message 44861 - Posted: 9 Dec 2010 | 23:56:04 UTC

Going..Going....Going........
____________

JAMC
Send message
Joined: 9 Sep 08
Posts: 96
Credit: 336,443,946
RAC: 0
Message 44864 - Posted: 10 Dec 2010 | 0:40:43 UTC

Kaa-blam-oh!

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44868 - Posted: 10 Dec 2010 | 2:29:32 UTC

I knew this was going to happen. The credit dispenser was handing out credits twice as fast as it should have been. Now, it has run out of credits to dispense. Would someone please refill it with credits so we can start collecting again.
____________

Cartoonman
Send message
Joined: 10 Dec 09
Posts: 18
Credit: 6,518,358
RAC: 0
Message 44869 - Posted: 10 Dec 2010 | 2:41:01 UTC
Last modified: 10 Dec 2010 | 2:51:48 UTC

once again, no more work. Plan B: collatz.

OOOH! Primegrid's got a new ATI app using CL. i gotta try this.

Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 44871 - Posted: 10 Dec 2010 | 3:09:30 UTC - in response to Message 44869.

I also am crunching SETI on my HD5830 using a opencl app from Lunatics.
____________

Matthew
Volunteer moderator
Project developer
Project scientist
Send message
Joined: 6 May 09
Posts: 182
Credit: 4,799
RAC: 0
Message 44877 - Posted: 10 Dec 2010 | 5:55:01 UTC

*KICK*

*chunk* *chunk* *chunk* *putt* *putt* *vaaaarrooooOOOOOOMMMMMM*

Should be working now :)

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 44878 - Posted: 10 Dec 2010 | 6:04:58 UTC - in response to Message 44877.

*KICK*

*chunk* *chunk* *chunk* *putt* *putt* *vaaaarrooooOOOOOOMMMMMM*

Should be working now :)


Thank you!

Did you refill the credit dispenser, also?
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44881 - Posted: 10 Dec 2010 | 9:07:24 UTC

Then it NNT time for Collatz just now,
to let Milkyway a chance to crunch, and how.
All on an old AGP HD3850 once more for a day,
while time passes on this POETS Day.
____________
Go away, I was asleep


John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44984 - Posted: 13 Dec 2010 | 9:20:55 UTC

Is the validator fracked again?

Work needing clearing is building, but new work, etc, is flowing OK.
____________
Go away, I was asleep


Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 44993 - Posted: 14 Dec 2010 | 21:44:02 UTC

Now we just have to wait and see what got nuked this time.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 44994 - Posted: 14 Dec 2010 | 21:48:53 UTC

That wish should soon be granted.

Web site accessible again, although many of the servers are down, being worked on I guess?
____________
Go away, I was asleep


Profile Blurf
Volunteer moderator
Project administrator
Send message
Joined: 13 Mar 08
Posts: 617
Credit: 25,447,954
RAC: 0
Message 44995 - Posted: 14 Dec 2010 | 22:03:23 UTC

Travis has been notified and is working on the issue
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 45022 - Posted: 15 Dec 2010 | 12:59:09 UTC

Validator waiting building up quickly again?
____________
Go away, I was asleep


bertodell
Send message
Joined: 15 Dec 10
Posts: 14
Credit: 1,385,867
RAC: 0
Message 45029 - Posted: 15 Dec 2010 | 15:20:55 UTC

Yea I got a bunch waiting to be validated,dunno what's up

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 45073 - Posted: 16 Dec 2010 | 20:54:29 UTC

Servers down for maintenance (I think) and validator building again.

Running outta work (again).
____________
Go away, I was asleep


TJ
Send message
Joined: 12 Aug 09
Posts: 251
Credit: 77,776,909
RAC: 1,061
Message 45084 - Posted: 17 Dec 2010 | 1:23:44 UTC

There is not match running at the moment. Pitty as my computer goes on holiday for almost a week.
____________
Greetings from,
TJ

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 45088 - Posted: 17 Dec 2010 | 9:23:35 UTC

In a few hours my MW GPU (HD3850) will turn it's attention from Collatz back to MW.
____________
Go away, I was asleep


Profile arkayn
Avatar
Send message
Joined: 14 Feb 09
Posts: 914
Credit: 74,780,239
RAC: 249
Message 45092 - Posted: 17 Dec 2010 | 15:57:48 UTC

After updating my drivers from 10.11 APP to 10.12 APP I went back to DnetC and went through a few of those yesterday.

I keep both Collatz and DnetC on a 0 share.
____________

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45154 - Posted: 20 Dec 2010 | 22:50:33 UTC

Validator again = 42k+
____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Sabroe_SMC
Avatar
Send message
Joined: 2 Aug 08
Posts: 23
Credit: 104,125,949
RAC: 29,516
Message 45157 - Posted: 20 Dec 2010 | 23:08:18 UTC - in response to Message 45154.

No New Work is send AAAAAAARGH
____________
Greetz to all

Profile Blurf
Volunteer moderator
Project administrator
Send message
Joined: 13 Mar 08
Posts: 617
Credit: 25,447,954
RAC: 0
Message 45163 - Posted: 21 Dec 2010 | 0:35:17 UTC - in response to Message 45157.

No New Work is send AAAAAAARGH


Admins have been notified.
____________

Siegfried Niklas
Send message
Joined: 27 Apr 08
Posts: 3
Credit: 104,258,152
RAC: 0
Message 45206 - Posted: 22 Dec 2010 | 16:27:46 UTC

[As of 22 Dec 2010 16:21:48 UTC]
Server status
Program Host Status
data-driven web pages milkyway: Running
upload/download server milkyway: Running
scheduler milkyway: Running
feeder milkyway Not Running
transitioner milkyway: Not Running
milkyway_purge milkyway: Not Running
file_deleter milkyway: Not Running
nbody_assimilator milkyway: Not Running
separation_assimilator milkyway: Not Running

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,339,711
RAC: 399,830
Message 45211 - Posted: 22 Dec 2010 | 20:28:41 UTC
Last modified: 22 Dec 2010 | 20:31:49 UTC

I suggest a new thread is started (hint hint John :) ), and this one closed. Call the new one - say - "Aaargh! Server out of new work!(2)" as this one is getting very big and slow(ish) loading even for high end PCs/Cards. For those with low-medium cards its probably getting a pain to wait for it to load up and type into.

Regards
Zy

Profile banditwolf
Avatar
Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 295,133
RAC: 0
Message 45213 - Posted: 22 Dec 2010 | 20:52:39 UTC - in response to Message 45211.

I suggest a new thread is started (hint hint John :) ), and this one closed. Call the new one - say - "Aaargh! Server out of new work!(2)" as this one is getting very big and slow(ish) loading even for high end PCs/Cards. For those with low-medium cards its probably getting a pain to wait for it to load up and type into.

Regards
Zy


Actually if you set your MW preferences you can limit the number of posts so it loads faster.

If a thread contains more than this number of posts
only display the first one and this many of the last ones

____________
Doesn't expecting the unexpected make the unexpected the expected?
If it makes sense, DON'T do it.

Zydor
Avatar
Send message
Joined: 24 Feb 09
Posts: 608
Credit: 85,339,711
RAC: 399,830
Message 45214 - Posted: 22 Dec 2010 | 20:55:59 UTC - in response to Message 45213.

Nice one - good save - hadnt noticed that before, works a treat :)

Regards
Zy

BarryAZ
Send message
Joined: 1 Sep 08
Posts: 512
Credit: 222,446,536
RAC: 158,326
Message 45217 - Posted: 22 Dec 2010 | 22:35:35 UTC - in response to Message 45163.

This is good -- did they indicate whether (several hours later now) they would be able to help out on this?


No New Work is send AAAAAAARGH


Admins have been notified.


____________

Profile mdhittle*
Avatar
Send message
Joined: 25 Jun 10
Posts: 284
Credit: 260,490,091
RAC: 0
Message 45218 - Posted: 22 Dec 2010 | 22:39:44 UTC - in response to Message 45217.

This is good -- did they indicate whether (several hours later now) they would be able to help out on this?


No New Work is send AAAAAAARGH


Admins have been notified.



Actually, the message you quoted was posted yesterday in response to the server problems yesterday. The server come back up right after Blurf posted that.

Hopefully they are working on the current problem.
____________

John Clark
Send message
Joined: 4 Oct 08
Posts: 1610
Credit: 61,911,637
RAC: 27,980
Message 45223 - Posted: 23 Dec 2010 | 1:08:33 UTC

Now setting up a new replacement thread
____________
Go away, I was asleep


Message boards : Number crunching : Aaargh! Server out of new work!


Main page · Your account · Message boards


Copyright © 2013 AstroInformatics Group