Welcome to MilkyWay@home

Can't Complete WU In Time

Message boards : Number crunching : Can't Complete WU In Time
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
haiweepp

Send message
Joined: 20 Nov 21
Posts: 5
Credit: 2,518,990
RAC: 0
Message 73942 - Posted: 2 Jul 2022, 3:24:22 UTC

Running Win10 on a 16Gb Ram laptop and just noticed that I have a work unit that's time to completion is July 9 but the remaining time is 2180 days 23 hours and 35 seconds. I'm not certain this will be completed in time for this study, using 2 CPUs, as that's nearly six years. Just noticed that the time remaining is growing each second instead of getting any smaller. What's my next step?

I'd add a screenshot but I haven't the foggiest how to do that here.
ID: 73942 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
San-Fernando-Valley

Send message
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 73943 - Posted: 2 Jul 2022, 6:53:34 UTC - in response to Message 73942.  

... unhide your computers, so that the experts can have a look and maybe help you
ID: 73943 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
haiweepp

Send message
Joined: 20 Nov 21
Posts: 5
Credit: 2,518,990
RAC: 0
Message 73945 - Posted: 3 Jul 2022, 2:45:26 UTC - in response to Message 73943.  

How does one unhide a computer?
ID: 73945 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Icecold

Send message
Joined: 22 May 13
Posts: 3
Credit: 828,154,267
RAC: 0
Message 73946 - Posted: 3 Jul 2022, 3:37:43 UTC - in response to Message 73945.  

It is a setting that can be enabled on a per project basis. If you go under your Milkyway@home account page(should be able to just right click your name on the top right hand corner of this page), and go to MilkyWay@home preferences, there is a checkbox that needs selected for Should MilkyWay@home show your computers on its web site? Once that is checkmarked, others will be able to see the specs of any computers you have linked to your Milkyway@home account, as well as view some task information which really helps in troubleshooting.
ID: 73946 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 73947 - Posted: 3 Jul 2022, 11:19:10 UTC - in response to Message 73946.  

It is a setting that can be enabled on a per project basis. If you go under your Milkyway@home account page(should be able to just right click your name on the top right hand corner of this page), and go to MilkyWay@home preferences, there is a checkbox that needs selected for Should MilkyWay@home show your computers on its web site? Once that is checkmarked, others will be able to see the specs of any computers you have linked to your Milkyway@home account, as well as view some task information which really helps in troubleshooting.
ID: 73947 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Icecold

Send message
Joined: 22 May 13
Posts: 3
Credit: 828,154,267
RAC: 0
Message 73949 - Posted: 3 Jul 2022, 13:25:55 UTC - in response to Message 73947.  

Yeah you're correct Peter, that was a typo. I think I was typing out "right hand corner" and accidentally also wrote right click. You just need to click your name to get to your account page.
ID: 73949 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Keith Myers
Avatar

Send message
Joined: 24 Jan 11
Posts: 696
Credit: 540,036,538
RAC: 86,690
Message 73950 - Posted: 3 Jul 2022, 17:08:44 UTC - in response to Message 73942.  

Stop BOINC and reboot the computer.

No BOINC task ever takes years to complete.

So you know something went wrong with your host.
ID: 73950 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 73952 - Posted: 3 Jul 2022, 18:19:28 UTC - in response to Message 73950.  

Stop BOINC and reboot the computer.

No BOINC task ever takes years to complete.

So you know something went wrong with your host.
Is that really necessary? Why not just abort the task?
ID: 73952 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
haiweepp

Send message
Joined: 20 Nov 21
Posts: 5
Credit: 2,518,990
RAC: 0
Message 73955 - Posted: 5 Jul 2022, 20:02:11 UTC - in response to Message 73946.  

Okay, I tried simply rebooting Milkyway@home with no change (starts growing again). I then simply rebooted Boinc, with no change. Tried to check the box to make my laptop visible, but the box would not show checked. Noted the "change preferences" link, clicked that but the box still would not change. Finally, I simply deleted both the project (only one I'm running) then deleted Boinc and Oracle. Downloaded and installed both then the project. I was able to get the box checked and saved. So, this laptop is available for the experts. I'm running the project on two laptops so can the experts see the difference or should I suspend the project on the other? I tend to run them both 24/7 for other programs. I truly appreciate the help and apologize for the delay in responding.
ID: 73955 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Skillz

Send message
Joined: 28 May 17
Posts: 76
Credit: 4,386,301,288
RAC: 0
Message 73957 - Posted: 6 Jul 2022, 18:18:19 UTC

If you are running them on a laptop then the laptop is probably not fast enough to complete them in time. If it was completing them in time before but it's not now, then it might be because the GPU or CPU is throttling due to over heating. This could be caused by bad ventilation like placing the laptop on a bed or carpet where the vents beneath the laptop are blocked of their is a lot of dust in the heatsink fins and vents blocking air passage. That would be my first guess.

There are different tools you can download for windows that will show whether your CPU or GPU is running at it's full potential as well as the temperature of them.

MSI Afterburner for GPUs (Better support for Nvidia GPUs)
CPU-z to show CPU information
Windows Task manager to show both GPU and CPU usage stats.

To name a few
ID: 73957 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 73958 - Posted: 6 Jul 2022, 19:27:54 UTC - in response to Message 73957.  
Last modified: 6 Jul 2022, 19:28:46 UTC

MSI Afterburner for everything. Shows usage and temperature for all CPUs and GPUs. I check it every day on my 7 PCs to make sure nothing is overheating and everything is processing hard.

A 16GB RAM laptop is in no way too slow to run Milkyway in the generous time limit.
ID: 73958 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,492
RAC: 22,242
Message 73959 - Posted: 7 Jul 2022, 11:28:17 UTC - in response to Message 73958.  

MSI Afterburner for everything. Shows usage and temperature for all CPUs and GPUs. I check it every day on my 7 PCs to make sure nothing is overheating and everything is processing hard.

A 16GB RAM laptop is in no way too slow to run Milkyway in the generous time limit.


Ummm he's completing tasks already:
His Ryzen 3 machine
1,226.29 9.47 227.11 Milkyway@home Separation v1.46 (opencl_ati_101) windows_x86_64

1,062.53 2,046.19 28.84 Milkyway@home N-Body Simulation v1.82 (mt) windows_x86_64

His I7 machine:
4,031.02 3,979.44 228.83 Milkyway@home Separation v1.46 windows_x86_64

286.62 1,179.92 25.09 Milkyway@home N-Body Simulation v1.82 (mt) windows_x86_64

His problem is he's trying to do both cpu and gpu tasks on that laptop and it's just going to be too slow to do that with much of a cache at all, yes you are right Peter in that the tasks will take a long time but yes he also CAN do them if he manages his cache size much better. On his Ryzen 3 machine he's returning tasks in about a day while on his I7 machine he's returning them in about 2 days.
ID: 73959 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 73960 - Posted: 7 Jul 2022, 12:16:39 UTC - in response to Message 73959.  
Last modified: 7 Jul 2022, 12:18:23 UTC

Ummm he's completing tasks already:
His Ryzen 3 machine
1,226.29 9.47 227.11 Milkyway@home Separation v1.46 (opencl_ati_101) windows_x86_64

1,062.53 2,046.19 28.84 Milkyway@home N-Body Simulation v1.82 (mt) windows_x86_64

His I7 machine:
4,031.02 3,979.44 228.83 Milkyway@home Separation v1.46 windows_x86_64

286.62 1,179.92 25.09 Milkyway@home N-Body Simulation v1.82 (mt) windows_x86_64

His problem is he's trying to do both cpu and gpu tasks on that laptop and it's just going to be too slow to do that with much of a cache at all, yes you are right Peter in that the tasks will take a long time but yes he also CAN do them if he manages his cache size much better. On his Ryzen 3 machine he's returning tasks in about a day while on his I7 machine he's returning them in about 2 days.
What? MW doesn't need loads of cache, I run GPU and CPU on all sorts of old machines. That 4000 seconds you quote is only an hour, that's fine for a CPU.
ID: 73960 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
haiweepp

Send message
Joined: 20 Nov 21
Posts: 5
Credit: 2,518,990
RAC: 0
Message 73980 - Posted: 21 Jul 2022, 19:58:06 UTC

*edit*

I've noticed that the problem of the "Remaining" time only increases when using the 2 CPUs. The others (0.721 CPUs and the one using memory) are working great,. Should I increase/decrease the setting for CPU usage in BOINC/Computing Preferences/Usage Limits (set at 50%) or amend that number? Seems the only way to stop the 2 CPUs from causing the problem is to restart the laptop and it works for a WU or two and then the problem returns. The current WU started at a little short of 1:01:00 and after an hour and 42 minutes Elapsed, it stands at 1:39:39 and increasing about one second every three seconds.

I've installed MSI Afterburner. The CPU and Memory show 1200 MHz and temp is 60C. I can increase the core and the memory speeds (but haven't). Would any other numbers from that program help me in this?

I apologize for the long time between posts. I have the preferences set to immediately notify me by email but it simply doesn't do that. Rosanna Rosannadanna had it right; "if it's not something it's something else!"
ID: 73980 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 73981 - Posted: 21 Jul 2022, 21:09:50 UTC - in response to Message 73980.  
Last modified: 21 Jul 2022, 21:13:26 UTC

I use MSI afterburner to make sure the CPU and GPU are actually doing something. Change the preferences so you have a graph of CPU usage [1]. You should get nearly 100% if using all the cores. If you're doing Nbody tasks, these start on 1 core, then use all cores (up to 16 per task if you haven't created a special config for them) after a few minutes. If you're doing seperation tasks, these should use 1 core each continuously.

[1] To do this, right click the graphs and choose properties. Find "CPU usage" and tick it, then press OK. Don't use things like "CPU1 usage", those are for individual cores.

If your laptop was just slow (eg. overheating) it still wouldn't be taking years for a task. To get that slow they're actually stuck. I've never seen tasks stick. Try a few of them, aborting any that don't show enough usage in Afterburner. Tell us how often it happens. And somebody may be able to look at the output of the faulty tasks on your computer's page here on MW.
ID: 73981 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Avatar

Send message
Joined: 8 May 09
Posts: 3315
Credit: 519,946,492
RAC: 22,242
Message 73989 - Posted: 22 Jul 2022, 9:49:33 UTC - in response to Message 73980.  

*edit*

I've noticed that the problem of the "Remaining" time only increases when using the 2 CPUs. The others (0.721 CPUs and the one using memory) are working great,. Should I increase/decrease the setting for CPU usage in BOINC/Computing Preferences/Usage Limits (set at 50%) or amend that number? Seems the only way to stop the 2 CPUs from causing the problem is to restart the laptop and it works for a WU or two and then the problem returns. The current WU started at a little short of 1:01:00 and after an hour and 42 minutes Elapsed, it stands at 1:39:39 and increasing about one second every three seconds.

I've installed MSI Afterburner. The CPU and Memory show 1200 MHz and temp is 60C. I can increase the core and the memory speeds (but haven't). Would any other numbers from that program help me in this?

I apologize for the long time between posts. I have the preferences set to immediately notify me by email but it simply doesn't do that. Rosanna Rosannadanna had it right; "if it's not something it's something else!"


One other thing you might look at is in the Boinc Manager itself under Options, computing preferences and then the 2nd section down 'when to suspend', turn all of those off and see if the times stop going up or not. You can then go back in and adjust the delay to something less than the defaults if you are using the laptop for something besides crunching.
ID: 73989 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Frank

Send message
Joined: 2 Nov 10
Posts: 25
Credit: 1,894,269,109
RAC: 0
Message 73990 - Posted: 22 Jul 2022, 14:24:53 UTC - in response to Message 73980.  

Hal,
You are chasing ghosts. You don't have some weird intermittent fault in your computers. What you don't have is an appreciation that once your computer uploads the results of your computation the job is not completed. Until the task is validated, invalidated or errors its clock is still running. If there is enough delay by the validation process, you can run out of time and Milky will cancel the task.
It ain't right but it is the way it is. And, it isn't you or your computers; keep on truckin'.
ID: 73990 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 73991 - Posted: 22 Jul 2022, 18:46:28 UTC - in response to Message 73990.  

Hal,
You are chasing ghosts. You don't have some weird intermittent fault in your computers. What you don't have is an appreciation that once your computer uploads the results of your computation the job is not completed. Until the task is validated, invalidated or errors its clock is still running. If there is enough delay by the validation process, you can run out of time and Milky will cancel the task.
It ain't right but it is the way it is. And, it isn't you or your computers; keep on truckin'.
No, that time is not shown in his Boinc manager. His problem is he isn't completing them to even get to the send back phase.
ID: 73991 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
San-Fernando-Valley

Send message
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 73997 - Posted: 23 Jul 2022, 12:47:46 UTC - in response to Message 73990.  

Hal,
... Until the task is validated, invalidated or errors its clock is still running. If there is enough delay by the validation process, you can run out of time and Milky will cancel the task. ...

NOW, that is news - thanks for the info !

BTW ... who is Hal ?

It's just getting to hot ...
ID: 73997 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 5 Jul 11
Posts: 990
Credit: 376,142,956
RAC: 2
Message 73998 - Posted: 23 Jul 2022, 15:27:18 UTC - in response to Message 73997.  
Last modified: 23 Jul 2022, 15:27:47 UTC

Hal,
... Until the task is validated, invalidated or errors its clock is still running. If there is enough delay by the validation process, you can run out of time and Milky will cancel the task. ...
NOW, that is news - thanks for the info !
I believe the above information is incorrect. You have to report it in by the deadline. If your wingman takes longer that isn't your problem.

BTW ... who is Hal ?
There is no apparent core overheat. What are you doing Dave?

It's just getting to hot ...
Ever heard of AC? I'm using it in Scotland! This extra ONE degree centigrade seems to make people think global warming exists. It's just some hot weather for Guinness sake.
ID: 73998 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : Can't Complete WU In Time

©2024 Astroinformatics Group