Welcome to MilkyWay@home

Server Trouble

Message boards : News : Server Trouble
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 22 · Next

AuthorMessage
Septimus

Send message
Joined: 8 Nov 11
Posts: 205
Credit: 2,882,881
RAC: 267
Message 72085 - Posted: 20 Mar 2022, 9:47:37 UTC

I think there may still be a problem. I have 7 tasks showing in progress when they have been sent back. I have tried resetting the project but they are still showing. In terms of validating I think the system is around 6 days behind.
ID: 72085 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Rick

Send message
Joined: 29 Aug 21
Posts: 24
Credit: 67,217,373
RAC: 593
Message 72086 - Posted: 20 Mar 2022, 10:54:34 UTC - in response to Message 72082.  
Last modified: 20 Mar 2022, 10:57:38 UTC

My Mac's run 280X's, all repaired units + spares, best bang for the buck. Windforce OC seems to be the quickest.
ID: 72086 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,492
RAC: 22,242
Message 72087 - Posted: 20 Mar 2022, 11:14:01 UTC - in response to Message 72063.  

mikey:
time is stretchable "thing".
Sometimes it can mean

several minutes or

an hour only or maybe

days or even

weeks.

I wonder how long it took to repair this HDD?


When I did it on a VERY old Server more than 10 years ago it took 2 days to get it's act together, I was just trying to see if it worked as the Server was free and I was backing up and crunching with it. This was an old floor model Server that had SCSI drives in it.
ID: 72087 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,492
RAC: 22,242
Message 72088 - Posted: 20 Mar 2022, 11:19:47 UTC - in response to Message 72072.  

It dropped to that a while ago then stopped, and was sending out new work too. Maybe he's paused things to let the disk rebuild get done? I'm thinking these are hamster powered disks he's using.

I tried a couple of other GPU projects and some projects are terrible. SRBase only uses your first GPU. Numberfields doesn't work on 280X cards. Both of these keep giving me GPU work after I told them not to. So I keep aborting them until the server learns it's lesson!


SRBase will never learn...it sends 400+ tasks for my cpu's even when I have a cache of 0.001 and 0.001 and a resource share of zero!! I too just abort the extra tasks, not my problem I didn't ask for them in the first place, and let them deal with resending them to someone else. I usually have to suspend the Project to stop getting tasks.
ID: 72088 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,492
RAC: 22,242
Message 72089 - Posted: 20 Mar 2022, 11:30:06 UTC - in response to Message 72073.  

Hmm, I see that the status page hasn't updated in a few hours. Normally it updates once or twice an hour...

Disk rebuild times obviously vary depending on the disk, but I've seen as low as an hour or two for a 300GB 10k rpm drive, to two days for 4TB 720rpm. No idea what MW's disk specs are. The validation queue was consistently going up until Tom announced he'd replaced the drive, so the rebuild was probably quick. I get the feeling the validation queue is lower now, but the page simply isn't updating for some reason.

As far as other GPU projects go, I've been pretty happy with MLC@Home. Amicable Numbers also worked well, but that requires a ton of system RAM.

EDIT: Got exactly one new task, just now (19:06 UTC). So it's working, just light-years (heh heh) behind. Lots for it to still catch up on. EDIT 2: And 299 more 90 seconds later! So it's kind of working, here and there.


Einstein's Gamma-ray pulsar binary search #1 on GPUs and SRBase's TF tasks are both fairly short and give out good credits for each and if you set a zero resource share for Einstein you will only get 1 task at a time, meaning if you set MilkyWay at a higher resource share it will finish an Einstein task then ask MilkyWay for work then get another task from Einstein if you don't get any MIlkyWay tasks.
ID: 72089 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
FurryGuy

Send message
Joined: 1 Aug 11
Posts: 10
Credit: 51,374,490
RAC: 0
Message 72092 - Posted: 21 Mar 2022, 2:16:11 UTC - in response to Message 72033.  

We just got the repaired drive back, and we're installing it shortly. We are waiting to see if the server starts rebuilding automatically, or if we need to schedule downtime.

While this is going on, I'm also trying to kick the DB and get things flowing again. Hopefully sometime soon we can have the server running normally again.

Mr. Donlon, the only inconvenience for me is having WU fetch going on and offline.

I know all we volunteers would like to at least have a bit of an status update notification from time to time, PLEASE! :)
ID: 72092 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 72093 - Posted: 21 Mar 2022, 2:23:24 UTC - in response to Message 72086.  

My Mac's run 280X's, all repaired units + spares, best bang for the buck. Windforce OC seems to be the quickest.
Are you one of those people who have managed a solder reflow in a domestic oven? I could never get that to work.

I get whatever I can find cheap, and fix minor faults, or ignore parts I don't need like the display output, or run them at a reduced clock speed, or limit them to small tasks (VRAM wise) like MW.

The only one that stood out was a triple width instead of double, nice thick heatsink, runs very cool.

I use risers (the ones for bitcoins, on a USB lead, only pass one lane but that's all you need for Boinc). I can connect 4 to each PCI-E socket. Or 16 if I daisy chain the adapters! I've yet to try a third daisy chain for 64 cards!
ID: 72093 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 72094 - Posted: 21 Mar 2022, 2:26:48 UTC - in response to Message 72088.  
Last modified: 21 Mar 2022, 2:32:26 UTC

It dropped to that a while ago then stopped, and was sending out new work too. Maybe he's paused things to let the disk rebuild get done? I'm thinking these are hamster powered disks he's using.

I tried a couple of other GPU projects and some projects are terrible. SRBase only uses your first GPU. Numberfields doesn't work on 280X cards. Both of these keep giving me GPU work after I told them not to. So I keep aborting them until the server learns it's lesson!


SRBase will never learn...it sends 400+ tasks for my cpu's even when I have a cache of 0.001 and 0.001 and a resource share of zero!! I too just abort the extra tasks, not my problem I didn't ask for them in the first place, and let them deal with resending them to someone else. I usually have to suspend the Project to stop getting tasks.
Boinc has a bug, even with the unofficial update I got, maybe it's a bug on the server end, if I say no GPU work from Einstein, and Boinc can't get MW GPU work because there isn't any, it will get Einstein GPU work when I intended only to run that on the CPU!

The Beta Boinc I'm using gets rid of some bugs to do with getting too much work - I got it to stop it getting hundreds of tasks when I'd limited it to running on x cores on the rosetta python app. Not sure if it would stop SRBase going mad on yours, it might do. If you want the beta Boinc (running perfectly here) for 64 bit Windows, send me a private message with your email address and I'll send the three executables over in a zip (about 5MB). You can get it from Github and compile it yourself, but I have no idea how to do that. Richard Haselgrove did it for me.
ID: 72094 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bill F
Avatar

Send message
Joined: 4 Jul 09
Posts: 85
Credit: 16,692,777
RAC: 4,468
Message 72095 - Posted: 21 Mar 2022, 2:47:08 UTC - in response to Message 71995.  

There are lots of other projects to work on in the meantime.

Any specific suggestions? So far managed to keep the computer busy with what it could get between outages, and now it still has some 16-17h of work left, but it's only connected a small part of the time, sometimes as little as 2h/day, which makes it trickier to "catch" those moments when such things happen, and it's also just a dual-core Pentium with 4 GB of RAM, 32-bit OS, no dedicated GPU, and only doing CPU work, which seriously limits what it can work on, plenty of projects using much more resources, and I just want it to work on environmental or space projects (and can't exactly find any environmental ones), so after SETI@home shut down it was quite a relief to see that MW@h's separation tasks had even lower resource use and switch here.


Einstein has Tasks that work well with older systems or systems with limitations.
In October of 1969 I took an oath to support and defend the Constitution of the United States against all enemies, foreign and domestic;
There was no expiration date.


ID: 72095 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Tom Donlon
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 10 Apr 19
Posts: 408
Credit: 120,203,200
RAC: 0
Message 72096 - Posted: 21 Mar 2022, 2:56:20 UTC

Currently turning some processes off to flush the transitioner backlog. Once that's done I'll turn things back on.
ID: 72096 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 72097 - Posted: 21 Mar 2022, 3:38:03 UTC - in response to Message 72095.  

Einstein has Tasks that work well with older systems or systems with limitations.
Disable gravity tasks on the server if you have an older GPU though. They don't get anywhere - you can tell, low GPU usage, take forever.
ID: 72097 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 72098 - Posted: 21 Mar 2022, 3:39:17 UTC - in response to Message 72096.  

Currently turning some processes off to flush the transitioner backlog. Once that's done I'll turn things back on.
Thanks Tom, that poor old server has far too much to do. If I were you I'd gather lots of donations (put the donation box on the home page) and buy it some shiny new fast hardware.
ID: 72098 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Chooka
Avatar

Send message
Joined: 13 Dec 12
Posts: 101
Credit: 1,782,658,327
RAC: 0
Message 72099 - Posted: 21 Mar 2022, 5:20:46 UTC

Thanks for the updates Tom! Appreciated.

ID: 72099 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 72100 - Posted: 21 Mar 2022, 5:26:59 UTC
Last modified: 21 Mar 2022, 5:31:24 UTC

N-body now being issued for CPU.
No GPU work coming yet.

EDIT! GPU work floodgates opened! 200 tasks just appeared on a GPU.

EDIT AGAIN! Oops, 2 GPUs got work, 2 didn't. Limited supply.
ID: 72100 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 72101 - Posted: 21 Mar 2022, 5:38:44 UTC

Would it be possible to only give out Nbody for CPUs and not seperation? Seems daft to do seperation on CPUs when the GPUs are so much faster. There's not currently a way to choose Nbody only if you have a CPU and GPU in the same system.
ID: 72101 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
San-Fernando-Valley

Send message
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 72102 - Posted: 21 Mar 2022, 7:42:27 UTC - in response to Message 72097.  

Hi Peter:
how old is old?

Einstein has Tasks that work well with older systems or systems with limitations.
Disable gravity tasks on the server if you have an older GPU though. They don't get anywhere - you can tell, low GPU usage, take forever.


Einstein GW:
NVIDIA GTX 750ti around max. 27 minutes credit 1000 (on old slow system/board/cpu)
NVIDIA Titan V around max. 11 minutes credit 1000

No OC.
About half of the time needed is GPU work, the rest is CPU.

I find that ok.
AND there are always plenty WUs.
AND there are even muc faster GR tasks that give you around 3600 credits ...

Just my humble 2 cents ...
Have a nice day!
cheers
ID: 72102 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 72103 - Posted: 21 Mar 2022, 8:04:28 UTC - in response to Message 72102.  
Last modified: 21 Mar 2022, 8:05:18 UTC

Hi Peter:
how old is old?

Einstein GW:
NVIDIA GTX 750ti around max. 27 minutes credit 1000 (on old slow system/board/cpu)
NVIDIA Titan V around max. 11 minutes credit 1000

No OC.
About half of the time needed is GPU work, the rest is CPU.

I find that ok.
AND there are always plenty WUs.
AND there are even muc faster GR tasks that give you around 3600 credits ...

Just my humble 2 cents ...
Have a nice day!
cheers
I have five Radeon R9 280X cards, only 6 months older than your 750Ti. But yours is OpenCL 3.0 and mine are only 1.2, perhaps Gravity needs a newer OpenCL? Or maybe Gravity uses Cuda on Nvidia cards.
ID: 72103 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
San-Fernando-Valley

Send message
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 72105 - Posted: 21 Mar 2022, 9:15:37 UTC - in response to Message 72103.  

Hi Peter:
how old is old?

Einstein GW:
NVIDIA GTX 750ti around max. 27 minutes credit 1000 (on old slow system/board/cpu)
NVIDIA Titan V around max. 11 minutes credit 1000

No OC.
About half of the time needed is GPU work, the rest is CPU.

I find that ok.
AND there are always plenty WUs.
AND there are even muc faster GR tasks that give you around 3600 credits ...

Just my humble 2 cents ...
Have a nice day!
cheers
I have five Radeon R9 280X cards, only 6 months older than your 750Ti. But yours is OpenCL 3.0 and mine are only 1.2, perhaps Gravity needs a newer OpenCL? Or maybe Gravity uses Cuda on Nvidia cards.



Well, thanks for the info ... Very interesting to know!
I don't go into such details - my fault.

I guess things have to be carefully brokendown to/for each singular situation.

But what I just learned again (over and over) that each cruncher should try and find out what is the best for him/her.
Since we all have different hard-/software it is somewhat "dangerous" to give "general" recommendations.

Well, it sunny outside, but cold, anyway I'll go for a walk and not worry about not getting any tasks!
Have a great day.
ID: 72105 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 72106 - Posted: 21 Mar 2022, 9:23:00 UTC - in response to Message 72105.  

Well, it sunny outside, but cold, anyway I'll go for a walk and not worry about not getting any tasks!
Have a great day.
Same weather here, I'm off to buy some cockatiels!
ID: 72106 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
San-Fernando-Valley

Send message
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 72108 - Posted: 21 Mar 2022, 9:31:41 UTC - in response to Message 72106.  


... Same weather here, I'm off to buy some cockatiels!

Nice hobby!
ID: 72108 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 22 · Next

Message boards : News : Server Trouble

©2024 Astroinformatics Group