Can't Complete WU In Time
Author | Message |
---|---|
Send message Joined: 20 Nov 21 Posts: 5 Credit: 2,518,990 RAC: 0 |
Running Win10 on a 16 GB RAM laptop and just noticed that I have a work unit whose time to completion is July 9 but the remaining time is 2180 days 23 hours and 35 seconds. I'm not certain this will be completed in time for this study, using 2 CPUs, as that's nearly six years. Just noticed that the time remaining is growing each second instead of getting any smaller. What's my next step? I'd add a screenshot but I haven't the foggiest how to do that here. |
Send message Joined: 13 Apr 17 Posts: 256 Credit: 604,411,638 RAC: 0 |
... unhide your computers, so that the experts can have a look and maybe help you |
Send message Joined: 20 Nov 21 Posts: 5 Credit: 2,518,990 RAC: 0 |
How does one unhide a computer? |
Send message Joined: 22 May 13 Posts: 3 Credit: 828,272,986 RAC: 4,124 |
It is a setting that can be enabled on a per-project basis. If you go to your MilkyWay@home account page (you should be able to just right click your name in the top right-hand corner of this page) and then to MilkyWay@home preferences, there is a checkbox that needs to be selected for "Should MilkyWay@home show your computers on its web site?" Once that is checked, others will be able to see the specs of any computers you have linked to your MilkyWay@home account, as well as view some task information, which really helps in troubleshooting. |
Send message Joined: 5 Jul 11 Posts: 990 Credit: 376,143,149 RAC: 6 |
It is a setting that can be enabled on a per project basis. If you go under your Milkyway@home account page(should be able to just |
Send message Joined: 22 May 13 Posts: 3 Credit: 828,272,986 RAC: 4,124 |
Yeah you're correct Peter, that was a typo. I think I was typing out "right hand corner" and accidentally also wrote right click. You just need to click your name to get to your account page. |
Send message Joined: 24 Jan 11 Posts: 708 Credit: 543,294,568 RAC: 140,060 |
Stop BOINC and reboot the computer. No BOINC task ever takes years to complete. So you know something went wrong with your host. |
Send message Joined: 5 Jul 11 Posts: 990 Credit: 376,143,149 RAC: 6 |
"Stop BOINC and reboot the computer." Is that really necessary? Why not just abort the task? |
Send message Joined: 20 Nov 21 Posts: 5 Credit: 2,518,990 RAC: 0 |
Okay, I tried simply restarting Milkyway@home with no change (the time starts growing again). I then simply restarted BOINC, with no change. Tried to check the box to make my laptop visible, but the box would not show as checked. Noted the "change preferences" link and clicked that, but the box still would not change. Finally, I simply deleted the project (the only one I'm running), then deleted BOINC and Oracle. Downloaded and installed both, then the project. I was able to get the box checked and saved. So, this laptop is available for the experts. I'm running the project on two laptops, so can the experts see the difference or should I suspend the project on the other? I tend to run them both 24/7 for other programs. I truly appreciate the help and apologize for the delay in responding. |
Send message Joined: 28 May 17 Posts: 76 Credit: 4,389,075,401 RAC: 87,958 |
If you are running them on a laptop then the laptop is probably not fast enough to complete them in time. If it was completing them in time before but it's not now, then it might be because the GPU or CPU is throttling due to overheating. This could be caused by bad ventilation, like placing the laptop on a bed or carpet where the vents beneath the laptop are blocked, or there is a lot of dust in the heatsink fins and vents blocking air passage. That would be my first guess. There are different tools you can download for Windows that will show whether your CPU or GPU is running at its full potential, as well as their temperature: MSI Afterburner for GPUs (better support for Nvidia GPUs); CPU-Z to show CPU information; Windows Task Manager to show both GPU and CPU usage stats. To name a few. |
Send message Joined: 5 Jul 11 Posts: 990 Credit: 376,143,149 RAC: 6 |
MSI Afterburner for everything. Shows usage and temperature for all CPUs and GPUs. I check it every day on my 7 PCs to make sure nothing is overheating and everything is processing hard. A 16GB RAM laptop is in no way too slow to run Milkyway in the generous time limit. |
Send message Joined: 8 May 09 Posts: 3319 Credit: 520,337,727 RAC: 21,641 |
"MSI Afterburner for everything. Shows usage and temperature for all CPUs and GPUs. I check it every day on my 7 PCs to make sure nothing is overheating and everything is processing hard." Ummm, he's completing tasks already (run time in seconds / CPU time in seconds / credit / application). His Ryzen 3 machine: 1,226.29 / 9.47 / 227.11 / Milkyway@home Separation v1.46 (opencl_ati_101) windows_x86_64; 1,062.53 / 2,046.19 / 28.84 / Milkyway@home N-Body Simulation v1.82 (mt) windows_x86_64. His i7 machine: 4,031.02 / 3,979.44 / 228.83 / Milkyway@home Separation v1.46 windows_x86_64; 286.62 / 1,179.92 / 25.09 / Milkyway@home N-Body Simulation v1.82 (mt) windows_x86_64. His problem is that he's trying to do both CPU and GPU tasks on that laptop, and it's just going to be too slow to do that with much of a cache at all. Yes, you are right, Peter, in that the tasks will take a long time, but he also CAN do them if he manages his cache size much better. On his Ryzen 3 machine he's returning tasks in about a day, while on his i7 machine he's returning them in about 2 days. |
Send message Joined: 5 Jul 11 Posts: 990 Credit: 376,143,149 RAC: 6 |
"Ummm he's completing tasks already:" What? MW doesn't need loads of cache; I run GPU and CPU on all sorts of old machines. That 4000 seconds you quote is only about an hour, and that's fine for a CPU. |
Send message Joined: 20 Nov 21 Posts: 5 Credit: 2,518,990 RAC: 0 |
*edit* I've noticed that the "Remaining" time only increases on the tasks using 2 CPUs. The others (0.721 CPUs and the one using memory) are working great. Should I increase or decrease the setting for CPU usage in BOINC/Computing Preferences/Usage Limits (set at 50%)? It seems the only way to stop the 2-CPU tasks from causing the problem is to restart the laptop; it then works for a WU or two and the problem returns. The current WU started at a little short of 1:01:00 remaining and, after an hour and 42 minutes elapsed, it stands at 1:39:39 and is increasing about one second every three seconds. I've installed MSI Afterburner. The CPU and memory show 1200 MHz and the temperature is 60C. I can increase the core and memory speeds (but haven't). Would any other numbers from that program help me with this? I apologize for the long time between posts. I have the preferences set to immediately notify me by email but it simply doesn't do that. Roseanne Roseannadanna had it right: "if it's not something it's something else!" |
Send message Joined: 5 Jul 11 Posts: 990 Credit: 376,143,149 RAC: 6 |
I use MSI Afterburner to make sure the CPU and GPU are actually doing something. Change the preferences so you have a graph of CPU usage [1]. You should get nearly 100% if using all the cores. If you're doing N-body tasks, these start on 1 core, then use all cores (up to 16 per task if you haven't created a special config for them) after a few minutes. If you're doing separation tasks, these should use 1 core each continuously. [1] To do this, right click the graphs and choose properties. Find "CPU usage" and tick it, then press OK. Don't use things like "CPU1 usage"; those are for individual cores. If your laptop was just slow (e.g. overheating) it still wouldn't be taking years for a task. To get that slow they're actually stuck. I've never seen tasks stick. Try a few of them, aborting any that don't show enough usage in Afterburner. Tell us how often it happens. And somebody may be able to look at the output of the faulty tasks on your computer's page here on MW. |
Send message Joined: 8 May 09 Posts: 3319 Credit: 520,337,727 RAC: 21,641 |
*edit* One other thing you might look at is in the BOINC Manager itself: under Options, Computing preferences, in the 2nd section down ('When to suspend'), turn all of those off and see whether the times stop going up. You can then go back in and adjust the delay to something less than the defaults if you are using the laptop for something besides crunching. |
Send message Joined: 2 Nov 10 Posts: 25 Credit: 1,894,269,109 RAC: 0 |
Hal, you are chasing ghosts. You don't have some weird intermittent fault in your computers. What you don't have is an appreciation that once your computer uploads the results of your computation, the job is not completed. Until the task is validated, invalidated or errors out, its clock is still running. If there is enough delay in the validation process, you can run out of time and Milky will cancel the task. It ain't right but it is the way it is. And it isn't you or your computers; keep on truckin'. |
Send message Joined: 5 Jul 11 Posts: 990 Credit: 376,143,149 RAC: 6 |
"Hal, ..." No, that time is not shown in his BOINC Manager. His problem is that he isn't completing them to even get to the send-back phase. |
Send message Joined: 13 Apr 17 Posts: 256 Credit: 604,411,638 RAC: 0 |
"Hal, ..." NOW, that is news - thanks for the info! BTW ... who is Hal? It's just getting too hot ... |
Send message Joined: 5 Jul 11 Posts: 990 Credit: 376,143,149 RAC: 6 |
I believe the above information is incorrect. You have to report it in by the deadline. If your wingman takes longer, that isn't your problem. "Hal, ... NOW, that is news - thanks for the info! BTW ... who is Hal?" There is no apparent core overheat. What are you doing, Dave? "It's just getting too hot ..." Ever heard of AC? I'm using it in Scotland! This extra ONE degree centigrade seems to make people think global warming exists. It's just some hot weather, for Guinness sake. |
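
For readers wondering why a "Remaining" estimate can grow instead of shrink, as described earlier in the thread, here is a minimal Python sketch. It assumes a deliberately simplified model in which the remaining time is extrapolated only from elapsed time and the fraction of the task completed; the real BOINC client's estimator is more involved, so this is an illustration and not its actual code. The function names and all sample values are hypothetical, and the 0.75 fraction was picked only because it makes the estimate grow by about one second every three seconds, so nothing should be inferred from it about the actual task.

```python
def remaining_seconds(elapsed_s, fraction_done):
    """Naive estimate: assume the rest of the task runs at the average rate so far.

    Simplified model for illustration only (an assumption, not the BOINC client's code).
    """
    if not 0.0 < fraction_done <= 1.0:
        raise ValueError("fraction_done must be in the range (0, 1]")
    return elapsed_s * (1.0 - fraction_done) / fraction_done


def looks_stuck(readings):
    """readings: time-ordered list of (elapsed_s, fraction_done) pairs.

    Returns True when time advanced between the first and last reading
    but the fraction done did not.
    """
    first_elapsed, first_fraction = readings[0]
    last_elapsed, last_fraction = readings[-1]
    return last_elapsed > first_elapsed and last_fraction <= first_fraction


# Hypothetical readings: (elapsed seconds, fraction done)
healthy = [(1800, 0.50), (2700, 0.75)]  # progress advances, estimate shrinks
stuck = [(3600, 0.75), (3603, 0.75)]    # progress frozen, estimate grows

for name, readings in (("healthy", healthy), ("stuck", stuck)):
    estimates = [remaining_seconds(e, f) for e, f in readings]
    verdict = "stuck" if looks_stuck(readings) else "progressing"
    print(name, estimates, verdict)
```

In this model, a task whose progress has stopped gets a larger estimate with every passing second, because elapsed time keeps rising while the fraction done stays put. That matches the view above that a task showing years of remaining time is stuck, not just slow.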
©2024 Astroinformatics Group