| log in |
Message boards : Number crunching : new workunit limit
| Author | Message |
|---|---|
|
It looks like the transitioner really can't keep up with what's going on with milkyway right now, so in order to speed things up i would like to reduce the workunit limit (5 would be ideal, 10 passable), to reduce the size of the database, which would speed things up. | |
| ID: 6951 | Rating: 0 | rate:
| |
|
5 doesn't do it unless they are made longer, as in an hour not 10 min. | |
| ID: 6953 | Rating: 0 | rate:
| |
|
Travis | |
| ID: 6958 | Rating: 0 | rate:
| |
|
The killer for my quads is the 'communication deferred' time in the BOINC projects tab- is that set by the project or BOINC? Even with network activity set to always on, when the communication deferred time goes to 2 or 3+ hours I lose all hope of keeping WU's cached and always run out and that's with 20 WU/core limit- we have to get longer WU's to make the change to 5WU's/core work and I guess that means we all have to run test apps as well... | |
| ID: 6959 | Rating: 0 | rate:
| |
|
This is pure non-sense... Even with 20 WUs per core the queue was stalling unless I manually requested more work. | |
| ID: 6972 | Rating: 0 | rate:
| |
|
11/29/2008 2:44:13 PM|Milkyway@home|Sending scheduler request: Requested by user. Requesting 3317 seconds of work, reporting 3 completed tasks I'm going to lower the WU limit to 5 and if this is really unworkable i'll raise it. It's really unworkable. You should raise it. ____________ | |
| ID: 6974 | Rating: 0 | rate:
| |
|
This stragety seems to be working for me. My Quads are staying at 20 tasks. As soon as any are done they are reported and replaced. | |
| ID: 6975 | Rating: 0 | rate:
| |
This stragety seems to be working for me. My Quads are staying at 20 tasks. As soon as any are done they are reported and replaced. Same here but also my pendings are still growing. ____________ A clear conscience is usually the sign of a bad memory | |
| ID: 6977 | Rating: 0 | rate:
| |
This stragety seems to be working for me. My Quads are staying at 20 tasks. As soon as any are done they are reported and replaced. I'm hoping with the lower limit the transitioner will be able to keep up with the work requests. I'll bump things up to 8 and see how that works out -- I don't want people getting communication deferreds if they're crunching too fast. The assimilator/validator for the new app are a lot faster than the old one, so when we make the switch to running only the new app, this should help a bit as well. ____________ | |
| ID: 6979 | Rating: 0 | rate:
| |
|
No problem, you did also say that the work units would be 12 to 20 times longer, right? If you REALLY want to lower the stress on the transitioner, then you must increase the length of the work unit. A 25 minute per core cache is only going to increase the stress on the transitioner as it will compel everyone running MW to be consinuously hitting the server. It looks like the transitioner really can't keep up with what's going on with milkyway right now, so in order to speed things up i would like to reduce the workunit limit (5 would be ideal, 10 passable), to reduce the size of the database, which would speed things up. ____________ | |
| ID: 6980 | Rating: 0 | rate:
| |
|
I see you just bumped up the cache to 8 WU's from 5. But until you have reasonably timed WU's, keeping things running smoothly is just a dream. | |
| ID: 6982 | Rating: 0 | rate:
| |
Seriously, the problem isn't a 5, 10 or 20 WU cache limit, it is the 5 minute WU, How many times has this been said? I know I have. It was why the old,old,old wu's were made into hours in the first place, until the optimised apps came along. ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 6984 | Rating: 0 | rate:
| |
Seriously, the problem isn't a 5, 10 or 20 WU cache limit, it is the 5 minute WU, I know we need longer WUs. In fact, I think this next meeting will be all about how we can get them much longer :P ____________ | |
| ID: 6988 | Rating: 0 | rate:
| |
Seriously, the problem isn't a 5, 10 or 20 WU cache limit, it is the 5 minute WU, I presume you can now think about more science to make the WUs longer, more useful to you science. On an aside - So far my PCs are being kept fed, and the work ready to send, on the servers, has been high (compared the past) which means that the work is there to satisfy demand. The only problem I see, when the computers here are unsupervised, is the build up of the time due to deferring communications for xxxx As long as this does not rise above, say, 20 minutes I think the current WUs-per-core limit might work OK. | |
| ID: 6991 | Rating: 0 | rate:
| |
Seriously, the problem isn't a 5, 10 or 20 WU cache limit, it is the 5 minute WU, Yes, but unfortunately that takes time :( I'm going to see what we can do more short-term, until we can take the analysis up to the next level (and hopefully make the WUs really long). ____________ | |
| ID: 6994 | Rating: 0 | rate:
| |
|
Not quite the same but when I get 'No work' it will deferr: 1 min, 1 min, 1 min, 3 hours (or some varation). | |
| ID: 6995 | Rating: 0 | rate:
| |
|
Well, it might take longer for you as since you are both the cook and bottle washer, the more time you spend nursing the server, the less time you have for all the good stuff (smile>)
____________ | |
| ID: 6996 | Rating: 0 | rate:
| |
|
Seems like only a temp fix: | |
| ID: 7006 | Rating: 0 | rate:
| |
|
It's not gonna work! already I'm getting repeated "No Work Sent" messages and my pc's are backing of to between 1 & 3 hours. Without constant attendance they'll spend most of the time with an empty cache. | |
| ID: 7008 | Rating: 0 | rate:
| |
It's not gonna work! already I'm getting repeated "No Work Sent" messages and my pc's are backing of to between 1 & 3 hours. Without constant attendance they'll spend most of the time with an empty cache. ...same here :( | |
| ID: 7009 | Rating: 0 | rate:
| |
|
I suspect that all the change really did was reduce the number of work units folks were pulling from the server for about an hour, and that hour is up, so the server is now getting the same sort of constant drone from hungry workstations. It's not gonna work! already I'm getting repeated "No Work Sent" messages and my pc's are backing of to between 1 & 3 hours. Without constant attendance they'll spend most of the time with an empty cache. ____________ | |
| ID: 7010 | Rating: 0 | rate:
| |
|
Mine are heading that way, but have not reached that point. Most will be out within 30 minutes and the rest will take a little longer. | |
| ID: 7011 | Rating: 0 | rate:
| |
|
Right, I am now pulling more work down from SETI, POEM and Spinhenge as well. Mine are heading that way, but have not reached that point. Most will be out within 30 minutes and the rest will take a little longer. ____________ | |
| ID: 7012 | Rating: 0 | rate:
| |
|
Travis, this is hopeless! You need 1 app and spend your time working on it instead of trying to fix all the problems with 10 apps! Its not about the crunchers.... its about the science! While I do like the credit's I'm getting from the opp. app.(credit whore)I joined for the project itself! Get 1 app. working and dont accept work from other apps! A simple "reset project" or "detach/reattach" should take care of it. Put everyone on level ground and get rid of all this drama! If you want to update the app later,... cool EVERYONE updates! | |
| ID: 7014 | Rating: 0 | rate:
| |
|
i guess i will now have to find another project to crunch. i really don't want to crunch seti anymore. all this waiting for work and the new lower 8 wu limits means, based on the past few days, over 1/3rd of the day my fleet will be idle waiting for something to do..esp when the time often goes to 2 hrs and 3 hrs before the server is even contacted again. milkyway has become too hands on lately for me. | |
| ID: 7016 | Rating: 0 | rate:
| |
|
Hello Travis, | |
| ID: 7019 | Rating: 0 | rate:
| |
|
this project has now become a waste of my time. | |
| ID: 7022 | Rating: 0 | rate:
| |
this project has now become a waste of my time. Oh come on, at least read the damn thread. Travis has already said that increasing the size of WUs is on the agenda, but that doing this isn't as easy as we may think. | |
| ID: 7023 | Rating: 0 | rate:
| |
this project has now become a waste of my time. 1% resource share seems to be doing well with the project at the present time. the only real computer now running the project is My atom 230 system. My dual xeon and core-i7 systems just go through the project too fast to even worry about running the project on at the present time. | |
| ID: 7026 | Rating: 0 | rate:
| |
|
30/11/2008 10:19:29|Milkyway@home|Message from server: No work sent | |
| ID: 7031 | Rating: 0 | rate:
| |
|
One thing to consider here, given the fragility of the servers, the workload presented by the use of the optimized client, and the 'interesting' approach for awarding credits, it seems to make some sense to NOT use the optimized client. Since the current award scheme seems to be based on cpu time spent and not work done, there is no 'credit penalty' for running the vastly less efficient regular client. The other benefit is that with the much longer running time of the regular client, and the miniscule cache, users don't spend an inordinate amount of time trying to pry a 5 minute work unit from the gasping server, rather, each work unit runs 5 hours. | |
| ID: 7041 | Rating: 0 | rate:
| |
|
LOL... | |
| ID: 7043 | Rating: 0 | rate:
| |
|
| |
| ID: 7044 | Rating: 0 | rate:
| |
This should be a non-issue now that we're moving over to the new application. WUs for that should be quite a bit longer. ____________ | |
| ID: 7047 | Rating: 0 | rate:
| |
|
I presume the best way to install the new client is to detachand reattach, as it is now the stock client? | |
| ID: 7055 | Rating: 0 | rate:
| |
I presume the best way to install the new client is to detachand reattach, as it is now the stock client? Isn't that just the app so it can be optimized? The new app should auto. download right? ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 7056 | Rating: 0 | rate:
| |
I presume the best way to install the new client is to detachand reattach, as it is now the stock client? Yeah, the new app should automatically download. Update: I edited the news post so people don't get confused and think they need to compile it themselves :) ____________ | |
| ID: 7057 | Rating: 0 | rate:
| |
I presume the best way to install the new client is to detachand reattach, as it is now the stock client? Make sure you dont have an app_info file in the milkyway folder, then the new program will arrive as soon as its released. ____________ | |
| ID: 7058 | Rating: 0 | rate:
| |
|
Detached my first quad and reattached. The system automatically downloaded the new client and the 8 WUs-per-core limit and got on crunching. | |
| ID: 7060 | Rating: 0 | rate:
| |
Detached my first quad and reattached. The system automatically downloaded the new client and the 8 WUs-per-core limit and got on crunching. Hmmm... Well one thing to keep in mind here, is that the detach and or reset will dump the app_info file (and opti) from the project directory. What isn't clear so far is; Are the Astronomy searches (gs series) completed now? If not, then you need to build the custom app_info I mentioned in another thread or you are going to have a TDCF mess if and when the CC picks up an Astronomy task and runs it with the stock version. Alinator | |
| ID: 7061 | Rating: 0 | rate:
| |
Detached my first quad and reattached. The system automatically downloaded the new client and the 8 WUs-per-core limit and got on crunching. WUs from nm_testX and nm_stripeX (not nm_stripeX_1) don't have as much work as the nm_stripeX_1s. I've left those searches generating more work because they're almost completed and i want to see what the final results are. After those are done the WUs should be about 4x longer than what they are right now. ____________ | |
| ID: 7062 | Rating: 0 | rate:
| |
Detached my first quad and reattached. The system automatically downloaded the new client and the 8 WUs-per-core limit and got on crunching. We won't be generating any more work for the old application. So the gs series that's out there right now is done. I'll probably be starting up some more gs WUs using the new app after i get some results with the nm ones. GS is doing a genetic search server side, while NM is doing an asynchronous newton method. GS is more of a global search, while the newton method we're testing is local search with a fast convergence rate. Right now we're trying to get some very accurate numbers on stripes 79, 82 and 86 for an upcoming publication. ____________ | |
| ID: 7063 | Rating: 0 | rate:
| |
|
AHHHH.... | |
| ID: 7064 | Rating: 0 | rate:
| |
Yep, Just noticed that you are moving to the new app - very good. Ignore my previous comment :) I guess we will see how the new app performs with the cache settings and server load.. Hope this fixes a bunch of things. | |
| ID: 7065 | Rating: 0 | rate:
| |
|
When you increse the length of the wu please ensure that the check-point error that has been showing up is fixed. With the shorter wu's it is not a problem with longer wu's and particularly on a slower machine if an error occurs because of a fault in the check-pointing a cruncher loses the credits for an hours work it is annoying. | |
| ID: 7067 | Rating: 0 | rate:
| |
When you increse the length of the wu please ensure that the check-point error that has been showing up is fixed. With the shorter wu's it is not a problem with longer wu's and particularly on a slower machine if an error occurs because of a fault in the check-pointing a cruncher loses the credits for an hours work it is annoying. Yep, that needs to be fixed, but keeping them in memory is a partial workaround. Alinator | |
| ID: 7068 | Rating: 0 | rate:
| |
|
Same here -- about 2.5 times the processing time. The new work units take about 12 minutes to process. The 8 wu queue is still short (under 2 hours), a 20 work unit queue would yield a full 4 hour queue, still short, but workable I guess. (I'd love to see 30 minute WU's with the 20 unit queue). Detached my first quad and reattached. The system automatically downloaded the new client and the 8 WUs-per-core limit and got on crunching. ____________ | |
| ID: 7069 | Rating: 0 | rate:
| |
When you increse the length of the wu please ensure that the check-point error that has been showing up is fixed. With the shorter wu's it is not a problem with longer wu's and particularly on a slower machine if an error occurs because of a fault in the check-pointing a cruncher loses the credits for an hours work it is annoying. v0.6 has the code to fix the checkpointing problems, so it should be ok. ____________ | |
| ID: 7073 | Rating: 0 | rate:
| |
|
OK.I seem to be having a problem. Newbie to Milky Way here. The WUs I downloaded says time to completion is 00:08:20 but after 03:33:18 hours of crunching, I'm only 19.87% completed and time to completion is 02:13:24 and increasing. My computer is - GenuineIntel | |
| ID: 7110 | Rating: 0 | rate:
| |
|
Don't worry about it at this point. MW is a tight deadline project for almost all host with the stock apps. This often drives the tasks into HP when you first attach (due to no TDCF data for the project), or running several projects and/or low Resource Share for MW on hosts which have been around awhile. | |
| ID: 7133 | Rating: 0 | rate:
| |
Don't worry about it at this point. MW is a tight deadline project for almost all host with the stock apps. This often drives the tasks into HP when you first attach (due to TDCF data for the project), or running several projects and/or low Resource Share for MW on hosts which have been around awhile. I think I need a week or so to settle down after the personal attention required to MW over the last week or so to keep the milk chocolate crunching ;) I can't complain though - my RAC is still flyin' :) Welcome Sorceress - nice avatar... ____________ | |
| ID: 7135 | Rating: 0 | rate:
| |
|
I don't think it has ever run in normal mode for me. Not that it's a big deal, I manually get work from the project I want to do, when I want to. | |
| ID: 7136 | Rating: 0 | rate:
| |
I think I need a week or so to settle down after the personal attention required to MW over the last week or so to keep the milk chocolate crunching ;) I can't complain though - my RAC is still flyin' :) LOL... I feel your pain. Now if we can get off of InstaPurge and back to something a little more user friendly, I'll be a Happy Camper again! ;-) Alinator | |
| ID: 7137 | Rating: 0 | rate:
| |
Yes it can be a pain not only to supply a free PC or two to a project like this, but then to try and work out what to do to keep up with it all. But then there are good people like you Alinator to help out the likes of me, so thanks very much for that. And also thanks to Travis and MW mod/science staff here - you're doing a fine job - and thanks! ____________ | |
| ID: 7160 | Rating: 0 | rate:
| |
|
Thanks Ice. I just got home and noticed MW is running w/high priority on both machines. Don't know what happened, but its working much better. Noticed also the 'time to completion' is more accurate. I still won't be able to complete much of the WU I recieved. I take my projects seriously and get irritated when I can't get my work done. My older machine trashed the graphics card and it took a while to get a new one. I was hesitant to use my newer machine on boinc but then I said what the heck. Now Im cruching on TWO! Thanks all for the help! Glad to be aboard. | |
| ID: 7161 | Rating: 0 | rate:
| |
Thanks Ice. I just got home and noticed MW is running w/high priority on both machines. Don't know what happened, but its working much better. Noticed also the 'time to completion' is more accurate. I still won't be able to complete much of the WU I recieved. I take my projects seriously and get irritated when I can't get my work done. My older machine trashed the graphics card and it took a while to get a new one. I was hesitant to use my newer machine on boinc but then I said what the heck. Now Im cruching on TWO! Thanks all for the help! Glad to be aboard. Try enabling experimental apps in the Milkyway settings for your account. That will allow you to get some work with the optimized app for shorter crunch times. ____________ | |
| ID: 7170 | Rating: 0 | rate:
| |
Thanks Ice. I just got home and noticed MW is running w/high priority on both machines. Don't know what happened, but its working much better. Noticed also the 'time to completion' is more accurate. I still won't be able to complete much of the WU I recieved. I take my projects seriously and get irritated when I can't get my work done. My older machine trashed the graphics card and it took a while to get a new one. I was hesitant to use my newer machine on boinc but then I said what the heck. Now Im cruching on TWO! Thanks all for the help! Glad to be aboard. Should have to do that as the stock app is now the new optimized version. It should download automatically. Also, no WUs are being generated for the old version anymore. ____________ | |
| ID: 7180 | Rating: 0 | rate:
| |
|
That option is not available in Milky Way preferences, Bigred. | |
| ID: 7193 | Rating: 0 | rate:
| |
That option is not available in Milky Way preferences, Bigred. Just checked and it is there, "run test applications"? ____________ A clear conscience is usually the sign of a bad memory | |
| ID: 7195 | Rating: 0 | rate:
| |
That option is not available in Milky Way preferences, Bigred. However, it is not needed to check that now. The "test application" was promoted to the "stock application". As of today, that setting will not cause a change for any of us...but you could potentially regret it later if they release a buggy application that was marked as "test", but you had forgotten that you had enabled it... | |
| ID: 7197 | Rating: 0 | rate:
| |
|
I am getting no work sent? Now for almost a day? | |
| ID: 7205 | Rating: 0 | rate:
| |
|
Sorry folks, my bad. ;) Was not looking in the right place. I have this option disabled on advice from others as well. OH, and Rick you can some of my WUs. I have way too many to complete in time. After 05:33:12 hrs I'm 33.2% complete on just one. I have 15 more to go by 12.03.08 That's means roughly 16 hr to complete a WU that was suppose to take 08:20 hrs. Is this normal?? Why is it taking so long to complete a MW WU? I finish most of my other projects WUs well withing the time limits. I am at a loss... | |
| ID: 7209 | Rating: 0 | rate:
| |
I am getting no work sent? Now for almost a day? Rick, do a detach / reattach and you should get work. ____________ A clear conscience is usually the sign of a bad memory | |
| ID: 7212 | Rating: 0 | rate:
| |
|
Thanks to Caspr I am a happy camper. I detached/re-attached to the project and the new WUs are running well. I had the 1.22 WUs but now I have the Optimized 0.04. After 24mins I'm at 43% complete. Yippee!! Thanks Caspr!! xoxox | |
| ID: 7227 | Rating: 0 | rate:
| |
Message boards :
Number crunching :
new workunit limit