| log in |
Message boards : News : New Nbody Run
| Author | Message |
|---|---|
|
A new NBody run has been posted. We are trying to determine the initial dark matter distribution of some test data. There have been some improvements to our model which should hopefully result in faster convergence times. The run is titled nbody_100K_Plum_EMD. All previous Nbody runs have been taken down. | |
| ID: 53981 | Rating: 0 | rate:
| |
|
Hi Colin, | |
| ID: 53983 | Rating: 0 | rate:
| |
|
i have one of these now, All other runs on this machine take 2.x hours max to run. This one has a forecasted run time of 42 hours | |
| ID: 53985 | Rating: 0 | rate:
| |
|
Hi Robert, | |
| ID: 53997 | Rating: 0 | rate:
| |
|
I haven't received any new downloads for this project in days now? Any idea the problem, is it something I'm doing wrong? | |
| ID: 54087 | Rating: 0 | rate:
| |
|
Your deadlines are 11 days out, apps with deadlines 4 days out aren't getting a single CPU cycle. If your 4 apps continues to hog all 4 CPUs on this machine, you'll find the MIPS I donate no longer available to you. Simple guys. Back it off and quickly. You have 16 elapsed hours from the time of this post. | |
| ID: 54088 | Rating: 0 | rate:
| |
|
Your new Nbody_110K_Plum_xxxxxxxx burns through my system in 1:33 (1 minute 33 seconds) | |
| ID: 54098 | Rating: 0 | rate:
| |
If your 4 apps continues to hog all 4 CPUs on this machine, you'll find the MIPS I donate no longer available to you. Simple guys. Back it off and quickly. You have 16 elapsed hours from the time of this post. First of all, CALM DOWN! I think this is caused by a bug that was introduced with the new dont_use_dcf command in Boinc 7.0.24. It makes the client assume that all tasks are about to miss the deadline, thus run them in high priority mode. This should affect all tasks of all projects that set dont_use_dcf. By the way, if I understand correctly how dont_use_dcf works, one should try to avoid using an app_info.xml on such projects, bug or not. | |
| ID: 54099 | Rating: 0 | rate:
| |
If your 4 apps continues to hog all 4 CPUs on this machine, you'll find the MIPS I donate no longer available to you. Simple guys. Back it off and quickly. You have 16 elapsed hours from the time of this post. For those of us not experienced in editing or removing sections of the Boinc Mgr program, how do you change whatever is causing this problem. On the other hand 3 other projects do not have this problem and the PS tasks also do not have this problem. So why do the Nbody tasks have this issue? Why not modify what ever the offending code is in the task rather than have us novices go monkying about in the programs behind boinc mgr? | |
| ID: 54102 | Rating: 0 | rate:
| |
For those of us not experienced in editing or removing sections of the Boinc Mgr program, how do you change whatever is causing this problem. You can't. The problem is in your local Boinc installation and you'll have to recompile the code or find someone to do it for you. This is the patch I used: diff -Naur boinc_core_release_7_0_25/client/cpu_sched.cpp boinc_core_release_7_0_25_patched/client/cpu_sched.cpp --- boinc_core_release_7_0_25/client/cpu_sched.cpp 2012-04-14 10:02:12.000000000 +0200 +++ boinc_core_release_7_0_25_patched/client/cpu_sched.cpp 2012-04-22 02:14:30.000000000 +0200 @@ -470,7 +470,7 @@ // treat projects with DCF>90 as if they had deadline misses // - if (!p->dont_use_dcf && p->duration_correction_factor < 90.0) { + if (p->dont_use_dcf || p->duration_correction_factor < 90.0) { if (p->rsc_pwf[rsc_type].deadlines_missed_copy <= 0) { continue; } This is part of the code where Boinc looks for tasks to run in high priority. Basically it means, if a project uses DCF (Duration Correction Factor) and the value is somewhat reasonable, consider to leave the task alone. Otherwise it's a candidate for special treatment. Unfortunately, if a project does not use DCF, the "leave it alone" part is never reached. On the other hand 3 other projects do not have this problem and the PS tasks also do not have this problem. So why do the Nbody tasks have this issue? If a project uses DCF, which I think most still do, you won't notice anything of this. Look in your client_state.xml file, if you find a line reading "<dont_use_dcf/>" the project that it belongs to should be affected. The whole project. And I say "should" because Milkyway is the only one for me, too, so I can't verify this. But the PS tasks do have the problem, that's why I looked into this in the first place. I noticed both of my GPUs running nothing but Milkyway for half a day. Why not modify what ever the offending code is in the task rather than have us novices go monkying about in the programs behind boinc mgr? The offending code is not in the task, nor is it with the project. It is at your end and you can't change it yourself. The project could probably work around it, but you would have to do that for every single project seperately. It's more efficient to fix the problem where it is. I'll look over at the Boinc site for a contact address, if I find one I'll suggest to change this in the next release. But if somebody knows whom to inform, please feel free to point them here. | |
| ID: 54103 | Rating: 0 | rate:
| |
|
Interesting that PS tasks go bad on your GPU while Nbody tasks go crazy on my CPU. | |
| ID: 54107 | Rating: 0 | rate:
| |
For those of us not experienced in editing or removing sections of the Boinc Mgr program, how do you change whatever is causing this problem. Where is the file that this patch should be applied to? Only something for MWAH? or to a file in the BOINC Mgr? | |
| ID: 54108 | Rating: 0 | rate:
| |
|
reverted back to older version of boinc to solve the problem | |
| ID: 54122 | Rating: 0 | rate:
| |
reverted back to older version of boinc to solve the problem oh, really??? I must TRY this solution! I hate it when GPU tasks are *PAUSED* while CPU-ONLY app "N-Body" is running in high-priority ! This is DEFINITELY a bug. Yes, BOINC Manager bug, probably. I will downgrade the BOINC immediatelly , NOW! :) Thanx for this solution, greg_be ! ____________ | |
| ID: 54125 | Rating: 0 | rate:
| |
reverted back to older version of boinc to solve the problem -------------------- CONFIRMED - reverting back to BOINC ver. 6.12.34 solved the GPU issue! (on BOINC ver. 7.xx, GPU tasks were NOT running while NBody goes crazy on all 8 CPUs, 1 minute 30 sec. run per WU... while guessing estimated: 3 hours). - need to find the DCF for this app and change it manually to something reasonable on my mobile CPU Core i7 @ 1.73 :) | |
| ID: 54132 | Rating: 0 | rate:
| |
|
As I wrote, the bug was introduced with a change in 7.0.24, so previous releases are safe from it. Of course older 7.0 versions can't be recommended, so a possible solution would be to switch back to BOINC 6 as you did. Now we all know that you can't simply downgrade from v7 to v6 (we have read the release notes, haven't we?) so here's instructions how to do it. | |
| ID: 54133 | Rating: 0 | rate:
| |
As I wrote, the bug was introduced with a change in 7.0.24, so previous releases are safe from it. Of course older 7.0 versions can't be recommended, so a possible solution would be to switch back to BOINC 6 as you did. Now we all know that you can't simply downgrade from v7 to v6 (we have read the release notes, haven't we?) so here's instructions how to do it. That is true, you do lose all your work from Vs 7 when you revert to 6. But then that is a price to pay to have control over your system again, unless you did what they recommended. I didn't let the work continue and just reverted back before all the tasks finished. Oh well. Someone else can have that headache for me. Happy with Vs. 6 now. | |
| ID: 54149 | Rating: 0 | rate:
| |
|
Sorry for not paying a lot of attention to this thread. I didn't realize people could comment on the news items. | |
| ID: 54259 | Rating: 0 | rate:
| |
|
Hello, Colin Rice. All is running fine here. | |
| ID: 54419 | Rating: 0 | rate:
| |
suddenly: there is limit of only one task for my computer - the same happened on my core i7 (1.73 GHz, 8 threads) notebook several days ago, also when the ESTIMATED time finally did get near the truth: 1 minute (estimated are 3 minutes). I still have in setting that it shall download work for 1 day... same situation: I get from the server: not sending work, reached limit on tasks in progress for computer. Hmmm, interesting... Does not seem to be BOINC bug. This is specific for Milkyway project or SERVER only. Can You see the logs what seems to be the cause, please? Cause it makes me wonder why the good estimated time causes a thing so bad! Fascinating, isn't it? I just cant get it. :))) Overtonesinger | |
| ID: 54420 | Rating: 0 | rate:
| |
Message boards :
News :
New Nbody Run