| log in |
Message boards : Number crunching : new workunit queue size (6)
| Author | Message |
|---|---|
|
Let me know if the increase queue size helps at all with the work availability. | |
| ID: 13057 | Rating: 0 | rate:
| |
Let me know if the increase queue size helps at all with the work availability. Looking forward to it Travis, but right now I have this;
No work availabilty just now [edit] I just got a task. Whooohoooo! :) ____________ | |
| ID: 13058 | Rating: 0 | rate:
| |
Let me know if the increase queue size helps at all with the work availability. Yeah, I just restarted the server :P Work should start flowing now. ____________ | |
| ID: 13059 | Rating: 0 | rate:
| |
Let me know if the increase queue size helps at all with the work availability. Not really 27/02/2009 17:34:31 Milkyway@home Sending scheduler request: Requested by user. 27/02/2009 17:34:31 Milkyway@home Reporting 1 completed tasks, requesting new tasks 27/02/2009 17:34:36 Milkyway@home Scheduler request completed: got 0 new tasks 27/02/2009 17:35:07 Milkyway@home Sending scheduler request: Requested by user. 27/02/2009 17:35:07 Milkyway@home Requesting new tasks 27/02/2009 17:35:12 Milkyway@home Scheduler request completed: got 0 new tasks :( | |
| ID: 13060 | Rating: 0 | rate:
| |
|
Still nothing here. | |
| ID: 13061 | Rating: 0 | rate:
| |
Let me know if the increase queue size helps at all with the work availability. Yes, I just got a task, but no more;
____________ | |
| ID: 13062 | Rating: 0 | rate:
| |
|
Hmmm... | |
| ID: 13064 | Rating: 0 | rate:
| |
|
Nothing at all since that one task;
____________ | |
| ID: 13065 | Rating: 0 | rate:
| |
|
> got 49 WU's on one machine, 50 WU's on another but these are the exception... mostly '0 new tasks' after repeated manual primes... | |
| ID: 13067 | Rating: 0 | rate:
| |
|
It's gonna take the server awhile to catch up I think. Give it a little time :) | |
| ID: 13068 | Rating: 0 | rate:
| |
|
Well I haven't been following their work feeder problem that closely. However, in general BOINC terms, if you cannot transition enough work fast enough into the schedulers limited size queue, then this is what you end up seeing even if there is plenty of tasks coming out of the work generator(s). | |
| ID: 13069 | Rating: 0 | rate:
| |
|
Just got 80 for the Quad. | |
| ID: 13071 | Rating: 0 | rate:
| |
|
| |
| ID: 13072 | Rating: 0 | rate:
| |
Change can be good :) | |
| ID: 13073 | Rating: 0 | rate:
| |
|
Yup, the new limit came down right away on two machines with one manual update. Looks good so far. | |
| ID: 13074 | Rating: 0 | rate:
| |
Well I haven't been following their work feeder problem that closely. However, in general BOINC terms, if you cannot transition enough work fast enough into the schedulers limited size queue, then this is what you end up seeing even if there is plenty of tasks coming out of the work generator(s). Yeah, I'm thinking this might be the problem. Now I just have to figure out how to increase the scheduler's queue size. ____________ | |
| ID: 13075 | Rating: 0 | rate:
| |
|
I'm not seeing a reduction in the amount of '0 new tasks' and am not hitting the WU download limit. | |
| ID: 13079 | Rating: 0 | rate:
| |
|
Sitting idle here ... sigh ... when I checked the server page it said 500 available ... now 437 ... refetch master file, still nothing ... | |
| ID: 13081 | Rating: 0 | rate:
| |
|
Nearly 80 new WU here aswell. | |
| ID: 13088 | Rating: 0 | rate:
| |
|
I'm wondering if one could code a script which would request an update at say 15 second intervals automatically -- that would eventually get work, of course hammering the server might not have the right effect if such a script got out in the wild.... | |
| ID: 13089 | Rating: 0 | rate:
| |
|
Still no improvement here, wu unit increase made no difference at all, I've still got computers Idle and comms backed of for hours. | |
| ID: 13094 | Rating: 0 | rate:
| |
|
It's really hard to catch WUs because there's several "Scheduler request completed: got 0 new tasks" messages before one gets work eventually. | |
| ID: 13096 | Rating: 0 | rate:
| |
|
Yep, not seeing a change here either. Took 30 mins of updating every min to get like 3 mins of work. Then back to nothing. | |
| ID: 13106 | Rating: 0 | rate:
| |
|
It's taking several tries to get any work, and not even getting the full 20/core either... | |
| ID: 13107 | Rating: 0 | rate:
| |
|
In the 8 hours since I last posted I have had 4-5 small batches of work and thats it. | |
| ID: 13183 | Rating: 0 | rate:
| |
Well I haven't been following their work feeder problem that closely. However, in general BOINC terms, if you cannot transition enough work fast enough into the schedulers limited size queue, then this is what you end up seeing even if there is plenty of tasks coming out of the work generator(s). Hmmm... Sorry, about the delay replying. I've been working on getting a newly acquired firewall appliance straightened out and configured, so I had to block the whole rpi.edu domain to guarantee I didn't miss data points for my hosts, so I couldn't even look at the website again until now. IIRC, the big problem in increasing the scheduler queue the way it's designed is you have to allocate more physical memory to the shared segment. If you don't have any to spare, then I guess you would be pretty well boned from a quick fix POV. ;-) Alinator | |
| ID: 13186 | Rating: 0 | rate:
| |
|
Looks like lots more WUs ready to distribute, but getting any work seems to be almost impossible. | |
| ID: 13187 | Rating: 0 | rate:
| |
|
I guess as it's the weekend and Seti always goes down at the weekend as well, a few other projects are going to get some extra work done again :) | |
| ID: 13188 | Rating: 0 | rate:
| |
|
As of 1:44:14 UTC server status shows 0 results ready to send and the validator seems to have stopped. Rats, I may have to temporarily go back to SETI on these two machines. | |
| ID: 13192 | Rating: 0 | rate:
| |
|
Is it just me or has it been harder to get work since the cache limit was bumped up? | |
| ID: 13195 | Rating: 0 | rate:
| |
|
The assimilator/validator was crashed for the last hour or so, which would make it hard to get work :P | |
| ID: 13197 | Rating: 0 | rate:
| |
|
Just got 17 new tasks after many update tries | |
| ID: 13198 | Rating: 0 | rate:
| |
The assimilator/validator was crashed for the last hour or so, which would make it hard to get work :P I'm talking about all day, not just the last hour or so. ____________ Calm Chaos Forum...Join Calm Chaos Now | |
| ID: 13201 | Rating: 0 | rate:
| |
The assimilator/validator was crashed for the last hour or so, which would make it hard to get work :P I actually think you might be right here. If the queue on the scheduler is not large enough, this means it takes less requests to clean out it's queue. I'm going to lower the number to 6 and see if that helps. ____________ | |
| ID: 13202 | Rating: 0 | rate:
| |
|
Ahh, never mind - just saw the message about limiting it to 6... Ok. | |
| ID: 13203 | Rating: 0 | rate:
| |
2/27/2009 19:45:33|Milkyway@home|Message from server: (reached per-CPU limit of 6 tasks) See the above post. I dropped the queue down to 6 per core to see if this will help more people get work. It was looking like with the limit at 20, the scheduler was running out of work to send very quickly and more people were getting the out of work message. *edit* before you flip out too much -- this is just temporary to see if this helps with work availability. if it doesn't help i'll increase the queue again. also, once we get a larger queue for the scheduler we'll be able to increase the work unit queue as well. ____________ | |
| ID: 13205 | Rating: 0 | rate:
| |
|
Hi Travis, | |
| ID: 13206 | Rating: 0 | rate:
| |
Yea, I saw that after I posted - How fast do you generate work? With the number of GPU clients, and more to come, you may have to do review how the work is generated. I think there are several projects that have a separate server just for work generation.... I also remember you saying something about a budget and not having another machine. ____________ . | |
| ID: 13208 | Rating: 0 | rate:
| |
Work is generated more than fast enough. I've never seen less than 300 WUs available server-side today. What I'm thinking the problem is, AFAIK, is that the scheduler uses a shared memory queue to store WUs which it can send out to clients, the feeder puts WUs into this queue for the scheduler. What's happening is that before the feeder can re-fill up the queue for the scheduler, the queue gets emptied by WU requests so people are getting no work sent. I'm pretty sure we need to increase the scheduler's queue size, but we need to do some work with labstaff to get that done, so it probably won't happen until early next week (like monday). So until then, I think the 6 WU queue should help keep enough work available for the scheduler. ____________ | |
| ID: 13209 | Rating: 0 | rate:
| |
|
I don't know how many GPUs are already in use on this project, but with more likely to be used soon I would think it a good idea to increase the size of each wu, so there will not need to be so many requests. | |
| ID: 13210 | Rating: 0 | rate:
| |
|
Travis, | |
| ID: 13211 | Rating: 0 | rate:
| |
|
Do you have the option to limit DL's to perhaps 10 at once but keep the total limit at 20 per core? That should ease the hits on the scheduler yet leave us with a little better comfort level if we do manage get a full quota, and let us get a partial load to hold us over in the mean time. | |
| ID: 13212 | Rating: 0 | rate:
| |
I don't know how many GPUs are already in use on this project, but with more likely to be used soon I would think it a good idea to increase the size of each wu, so there will not need to be so many requests. I'd rather have a real fix for the problem, because if we do this the same problem will just show up again when we get more users and starting seeing the same amount of workunit requests... ____________ | |
| ID: 13215 | Rating: 0 | rate:
| |
Do you have the option to limit DL's to perhaps 10 at once but keep the total limit at 20 per core? That should ease the hits on the scheduler yet leave us with a little better comfort level if we do manage get a full quota, and let us get a partial load to hold us over in the mean time. I was thinking about doing something like this. Going to let the 6 WU queue go for awhile and see if it helps anything first. ____________ | |
| ID: 13216 | Rating: 0 | rate:
| |
|
Up until yesterday my quad had very rarely run out of work. Now, in the last 24 hours, it has run completely dry probably 5 times or more and all I get is the same as others: | |
| ID: 13218 | Rating: 0 | rate:
| |
I haven't been able to keep my caches full (set for .1 days) for a while now. Cutting back to 6 per core is just going to make it beg even more. Seems to me this would cause more problems for the server instead of it making fewer requests at a longer interval. so I'll help out. Setting all systems to pull from einstein when the server here won't honor requests for work. Not a complaint as I know you got problems, but would rather have something to do than the computers sitting at idle for long stretches. Maybe this will take some of the load off. Probably wouldn't hurt if others did the same thing over the weekend until you got this worked out. (but then again if more people were doing that you would probably think 6 is enough) <smile> ____________ | |
| ID: 13219 | Rating: 0 | rate:
| |
Up until yesterday my quad had very rarely run out of work. Now, in the last 24 hours, it has run completely dry probably 5 times or more and all I get is the same as others: I've been debugging the new code for the assimilator/validator which lets us do some validation of workunits to keep our searches from getting screwed up by invalid results. This has caused the server to crash quite a few times this evening, so that might be causing a lot of the lack of work. The assimilator/validator doesn't seem to be crashing anymore *fingers crossed* so work availability should be better from here on out. ____________ | |
| ID: 13221 | Rating: 0 | rate:
| |
The assimilator/validator doesn't seem to be crashing anymore *fingers crossed* so work availability should be better from here on out. My quad seems to be keeping filled so far. It's running the GPU app, 0.19. ____________ | |
| ID: 13223 | Rating: 0 | rate:
| |
|
Gday all.. Just saw this result in my task list wondering why it is so??? | |
| ID: 13224 | Rating: 0 | rate:
| |
|
What lowering to 6 does is temporarily (say for maybe a couple of hours) reduce the false (success / 0 new work) messages by replacing them with 'met your CPU limit of 6'. Then, when the completed work drops the queue back from 20 to 6, the same problem pops up again -- but *more* frequently. Now one needs to hit the server for more work almost continuously since it may take about 15 minutes of server pounding for more work to get 45 minutes of work by which time two more work units have completed.
____________ | |
| ID: 13225 | Rating: 0 | rate:
| |
|
Yup -- I think you are spot on here.
____________ | |
| ID: 13226 | Rating: 0 | rate:
| |
|
Feed the beast. Limiting on the workunit queue size to 6 will not stop the beast from starvation. Feed the beast. | |
| ID: 13227 | Rating: 0 | rate:
| |
Gday all.. Just saw this result in my task list wondering why it is so??? Hmmm... Hard to say, but assuming it wasn't just a coincidence the most likely reason is that the backend 'lost' the output file (due to the troubleshooting at MW) for some reason. If there aren't any further repeats of it, then I wouldn't worry about it. Alinator | |
| ID: 13228 | Rating: 0 | rate:
| |
|
Sorry, but reducing the wu limit to 6 has only made things worse. | |
| ID: 13229 | Rating: 0 | rate:
| |
|
2009-02-28 09:22:33|Milkyway@home|Message from server: No work sent | |
| ID: 13234 | Rating: 0 | rate:
| |
i see that more WUs is with granted credit 0. I'm seeing WUs claiming zero credit and being awarded the credit they jolly well deserve. (OK, it's some GPU which show zero crunching time but I've timed some of them on a stopwatch and they do actually take a few seconds. I mean, I wouldn't even see them if it was zero seconds, would I) ____________ | |
| ID: 13236 | Rating: 0 | rate:
| |
|
At this moment I see also workunits that are getting q credits. I did not see this before this night. Now it looks like to happen with about 10% of the wu. | |
| ID: 13238 | Rating: 0 | rate:
| |
|
I forgot. For me it seems to work more smoothfull. I get less wu but the system is running most of time now. The message reaches cpu limit is ocuring regurly, but this is the setting at this moment. This means that the work available for my serer is at that moment the maximum that it is allowed, so I think this has made the situation more stable. | |
| ID: 13239 | Rating: 0 | rate:
| |
I forgot. For me it seems to work more smoothfull. I get less wu but the system is running most of time now. The message reaches cpu limit is ocuring regurly, but this is the setting at this moment. This means that the work available for my serer is at that moment the maximum that it is allowed, so I think this has made the situation more stable. Hallelujah, someone's happy ;) ____________ | |
| ID: 13241 | Rating: 0 | rate:
| |
|
Compared to yesterday my rigs (CPU only) seem to be running well, and fed. Yesterday, with the cache at 20 per CPU, I was in the same position as everyone else - zilch. | |
| ID: 13243 | Rating: 0 | rate:
| |
|
Still have problems with comps running dry (overnight). | |
| ID: 13244 | Rating: 0 | rate:
| |
Compared to yesterday my rigs (CPU only) seem to be running well, and fed. Yesterday, with the cache at 20 per CPU, I was in the same position as everyone else - zilch. Everyone? Zilch? I got a shed load of WUs yesterday, I can tell you. Did anyone else get any? Did everyone get zilch WUs yesterday? Was I the only one who got any WUs yesterday? If you're going to bitch and complain, at least don't drag me into your 'everyone'. ____________ | |
| ID: 13245 | Rating: 0 | rate:
| |
At this moment I see also workunits that are getting q credits. I did not see this before this night. Now it looks like to happen with about 10% of the wu.I am seeing a rate of about 15% and it only just started with the newer work units.... | |
| ID: 13247 | Rating: 0 | rate:
| |
Compared to yesterday my rigs (CPU only) seem to be running well, and fed. Yesterday, with the cache at 20 per CPU, I was in the same position as everyone else - zilch. You'd have to be someone first... | |
| ID: 13249 | Rating: 0 | rate:
| |
|
With the 20 limit, I got more 0 responses than before, but on the other hand I got more WUs when I occasionally got some. | |
| ID: 13265 | Rating: 0 | rate:
| |
|
If there are so many problems with getting as many WU's out as is necessary to keep all crunching cores happily working, why do not you increase the crunching length of a WU? If you increase it by a factor of two, then the amount of SQL-requests to the dataserver should be lowered and server responsiveness should increase accordingly... | |
| ID: 13269 | Rating: 0 | rate:
| |
|
OK shoot me for this! | |
| ID: 13272 | Rating: 0 | rate:
| |
|
I still run out of work, two CPU apps dried up over the last 3 hours left to their own devices. The small WU queue did improve the situation a lot though. | |
| ID: 13273 | Rating: 0 | rate:
| |
OK shoot me for this! Bang! Travis has recently repeated that there are plenty of WUs on the server side. The problem is releasing them to the clients (us), which I understand requires a server memory fix which will be done on Monday, hopefully, the 20/6 WUs being a temporary fix. It has nothing to do do with WU shortage so blaming ATI will do nothing for you. Go here; http://milkyway.cs.rpi.edu/milkyway/server_status.php There are 700 WUs for you not being gobbled by ATIs [edit] it just went up to 800 + ____________ | |
| ID: 13274 | Rating: 0 | rate:
| |
Although the server side says there is plenty of WUs to be released so, clearly, the issuing to work to requests are the problem still. Again I am, uncharacteristicly, out of work. Sore? Try a few more idiotic statements to entertain us. ____________ | |
| ID: 13282 | Rating: 0 | rate:
| |
|
1 WU today 0 yesterday. | |
| ID: 13284 | Rating: 0 | rate:
| |
|
:lolol: :lolol: :lolol: :lolol: | |
| ID: 13287 | Rating: 0 | rate:
| |
|
Funny kindergarten here. | |
| ID: 13294 | Rating: 0 | rate:
| |
1 WU today 0 yesterday. The work can be had, I've been getting some across my entire Pharm, so you have some other reason your not getting any WU's ... | |
| ID: 13297 | Rating: 0 | rate:
| |
|
Beats me, ill check out the second PC to see if it has any work. | |
| ID: 13299 | Rating: 0 | rate:
| |
|
[/quote] | |
| ID: 13301 | Rating: 0 | rate:
| |
Beats me, ill check out the second PC to see if it has any work. Just Reset the Project, don't Detach but just reset it if it doesn't have any work & see if you get some. You may not right first but in awhile you might again ... | |
| ID: 13303 | Rating: 0 | rate:
| |
Beats me, ill check out the second PC to see if it has any work. Wait a second... Resetting the project as a workaround to get more work might be counter productive. The reason is that even if you already have the third party apps to install afterwards, your host is going to have to re-download all the input files it needs that got dumped from the reset. If running stock, it will have to download the app again too. This is probably not going to do much to help the overall situation. ;-) Alinator | |
| ID: 13307 | Rating: 0 | rate:
| |
|
Hmm ya, one out of two working is good enough. | |
| ID: 13309 | Rating: 0 | rate:
| |
Beats me, ill check out the second PC to see if it has any work. No, Resetting the Project won't get rid of any Optimized App's, Third Party App's I don't know about but the Optimized ones don't get taken out when Resetting. Yes you'll have to re-download the input files & if you have Dial-Up it could take some time or pose a problem. I do it all the time but I have Cable so it's no big deal for me to re-download the input files, it only takes a few seconds for me to do that. It works for me but if somebody has a problem with it then don't do it, more work for me then ... :) | |
| ID: 13310 | Rating: 0 | rate:
| |
I don't know how many GPUs are already in use on this project, but with more likely to be used soon I would think it a good idea to increase the size of each wu, so there will not need to be so many requests. Larger work units would be a real fix, and would give you more time to sort out the problem with how slowly work is being released at times. ____________ | |
| ID: 13315 | Rating: 0 | rate:
| |
I don't know how many GPUs are already in use on this project, but with more likely to be used soon I would think it a good idea to increase the size of each wu, so there will not need to be so many requests. I brought this up a couple weeks ago and Travis said they contain enough info and the 'group' doesn't want to increase the size any more. ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 13317 | Rating: 0 | rate:
| |
I don't know how many GPUs are already in use on this project, but with more likely to be used soon I would think it a good idea to increase the size of each wu, so there will not need to be so many requests. LOL, I'd love to know what majority group thinks everything is fine as it is :) ____________ | |
| ID: 13319 | Rating: 0 | rate:
| |
Whoever he is supposed to see about the project. Probably the same people who don't think a second server is needed. :P ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 13320 | Rating: 0 | rate:
| |
I expect the top crunchers amassed most of their credits before the ATIs were available for optimized crunching. ATI GPU crunching is relatively quite recent and I expect people want a stable period of crunching with the ATIs before saying how they are doing with them. ____________ | |
| ID: 13321 | Rating: 0 | rate:
| |
Agreed, not all CC's will dump the whole project directory on a reset. However, all will dump the stock app on one. However, that wasn't the real point. The new GPU apps are bandwidth and work sponges from the Projects POV. Therefore, 'casual' resetting forces those hosts to pull all the input files and a whole new set of work from empty, with the attendant DB and and bandwidth load that goes along with that. Given that the project is having trouble keeping up as it is, anything that increase that even more only makes things worse for the big picture. Also, there is no guarantee that the scheduler queue hasn't been sucked dry just as the host gets reset, so the whole process would have been done for no net gain in that case. Alinator | |
| ID: 13328 | Rating: 0 | rate:
| |
|
I know have work on all computers and including the GPU where I was getting practically nothing. | |
| ID: 13331 | Rating: 0 | rate:
| |
I know have work on all computers and including the GPU where I was getting practically nothing. Server-side, it looks like the change helped work availability quite a bit. ____________ | |
| ID: 13334 | Rating: 0 | rate:
| |
I know have work on all computers and including the GPU where I was getting practically nothing. LOL... Well, we had our three steps backwards over the last 36 hours or so. So I guess that would be about 2 1/2 forward now? ;-) Alinator | |
| ID: 13336 | Rating: 0 | rate:
| |
I know have work on all computers and including the GPU where I was getting practically nothing. The only problem- and it's a big one- is when you have 'got 0 new tasks' multiple times in a row, which does still happen, and run dry and the reconnect time goes to 2 or 3 hours and you have nothing to crunch until then... any way to change/fix(?) that? | |
| ID: 13341 | Rating: 0 | rate:
| |
I know have work on all computers and including the GPU where I was getting practically nothing. It depends on what mean by 'fix'. If you are talking about MW being able to feed work to the scheduler faster, then no, not until Monday at the earliest. If you're talking about not running dry, then you have two choices: 1.) Sit at the console all the time and pound on the update button when you are out. 2.) Run a backup project and just ride out these difficulties until a satisfactory backend fix can be implemented. Alinator | |
| ID: 13343 | Rating: 0 | rate:
| |
I think he was talking about the 3 hour delay. ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 13345 | Rating: 0 | rate:
| |
So far, I've seen *none* of the 0 responses I spoke too soon. Quite a few 0 responses hidden in the logs, and running dry on one occasion. But still a significant improvement over both the previous per-core quotas tried, as far as my crunchers are concerned. ____________ | |
| ID: 13355 | Rating: 0 | rate:
| |
So far, I've seen *none* of the 0 responses Should be even better now that the workunits should take around twice as long to crunch. ____________ | |
| ID: 13356 | Rating: 0 | rate:
| |
Huh. Making us work twice as slow eh? :p ____________ | |
| ID: 13360 | Rating: 0 | rate:
| |
'If you are talking about MW being able to feed work to the scheduler faster...' I'm not. 'I think he was talking about the 3 hour delay.' I was. | |
| ID: 13366 | Rating: 0 | rate:
| |
|
Good to see that the idea of increasing the size of a work unit was eventually taken on board. Making them smaller always seemed crazy when they are mostly completed so fast :) | |
| ID: 13367 | Rating: 0 | rate:
| |
Should be even better now that the workunits should take around twice as long to crunch. Ooh 2 seconds then. ____________ | |
| ID: 13380 | Rating: 0 | rate:
| |
Should be even better now that the workunits should take around twice as long to crunch. Hey now, some of the GPU apps are taking a whole 3 seconds. ____________ | |
| ID: 13382 | Rating: 0 | rate:
| |
Should be even better now that the workunits should take around twice as long to crunch. Need a faster app now. ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 13391 | Rating: 0 | rate:
| |
Should be even better now that the workunits should take around twice as long to crunch. Wonderful, we've solved the problem of WU availability, but now you've effectively cut WU credits almost in half (for me) yet again! | |
| ID: 13396 | Rating: 0 | rate:
| |
Should be even better now that the workunits should take around twice as long to crunch. Just because they take longer to crunch doesn't mean that they don't give more credits. :D ____________ | |
| ID: 13397 | Rating: 0 | rate:
| |
|
Unless it's the same credits as the previous amount of work. | |
| ID: 13398 | Rating: 0 | rate:
| |
Should be even better now that the workunits should take around twice as long to crunch. No. The longer WUs are granted more credit each. ____________ | |
| ID: 13399 | Rating: 0 | rate:
| |
but now you've effectively cut WU credits almost in half (for me) yet again! Apologies! Problem seems to have been rectified now, but earlier my WU's were running twice as long and still getting same amount of credits. | |
| ID: 13400 | Rating: 0 | rate:
| |
I know have work on all computers and including the GPU where I was getting practically nothing. I think the increasing back-off times are built into the BOINC core client, so there's nothing anyone at MW can do about it. Berkeley's idea behind this design was to ease the burden for project servers after being offline for a while (as seems to be common over at SAH), making clients contact them over a longer period of time instead of all at once. ____________ | |
| ID: 13401 | Rating: 0 | rate:
| |
|
Travis, thanks for keeping my beasties fed. | |
| ID: 13402 | Rating: 0 | rate:
| |
|
My quady appears to be much happier now, as does my 2 * work C2D's and my old P4. I wouldn't have thought that decreasing the cached wu's per cpu would work, but this and extending the crunching time of the wu's appears to have done the job! Well done Travis. | |
| ID: 13417 | Rating: 0 | rate:
| |
|
I second the two previous posts. Thanks, Travis (also for the work you've done *before* visible success). | |
| ID: 13420 | Rating: 0 | rate:
| |
|
My i7 is just fine, but my Core2Quad running on GPU is almost never running at its full potential. I usually only get 10 WUs when a request goes in, it crunches 8 in about 5-10 minutes, starts up two, gets a few more, maybe 8 more, sometimes 4, sometimes none. But at any rate, quite often today I've seen my system crunching 6 WUs and having none complete, so it made a request and just got nothing, or perhaps only 6. But I'm pretty positive its not had 24 WUs at any time today. | |
| ID: 13427 | Rating: 0 | rate:
| |
|
All running well here, thanks Travis | |
| ID: 13429 | Rating: 0 | rate:
| |
|
OK -- what I see is that this works ok *if* the only application running is Milkyway -- I'm doing that as a test on a batch of computers and it does work. However, if one is doing multiple projects (like I normally do), then the small cache tends to result in other projects *with lower resource shares* actually getting a larger proportion of CPU cycles because they have larger caches with similar due dates (examples in particular would be Spinhenge and Poem for me, but it also seems to apply to a lesser degree with SETI, Einstein, Rosetta). The only project which stays reasonable is Climate -- but that's because the due dates are so far out. I know have work on all computers and including the GPU where I was getting practically nothing. ____________ | |
| ID: 13437 | Rating: 0 | rate:
| |
Hey now, some of the GPU apps are taking a whole 3 seconds. Not really. But the latest test app does not use a full CPU core to poll the GPU all the time. That lowers the CPU load quite a bit and therefore the reported time, which is actually the CPU time. Ice has posted already one result that took him mere 0.96 CPU seconds to crunch (but a bit more on the GPU of course). It is a bit hard to get some meaningful timing for a GPU app. If only one WU at a time would run, one could report the wall clock time. but my app tries to overlap several WUs to increase the GPU load. Therefore the wall clock time is also not reliable (you can have a look in the stderr.txt output). Actually I can let the app report any time you want. So if you have a wish... If you want we can skew the cpcs values (they are skewed anyway with GPU apps) on the stats sites with some creative timing ;) | |
| ID: 13440 | Rating: 0 | rate:
| |
Hey now, some of the GPU apps are taking a whole 3 seconds. Yes of course. I have a stopwatch on my mobile, not too accurate for timing but, prior to the increased length WU since yesterday, they were taking around 8 seconds GMT time. As for the sub-second you quoted above - I caught one even quicker ;)
____________ | |
| ID: 13474 | Rating: 0 | rate:
| |
|
Thank you Travis for the other requested changes. | |
| ID: 13635 | Rating: 0 | rate:
| |
Message boards :
Number crunching :
new workunit queue size (6)