Welcome to MilkyWay@home

Posts by mikey

1) Message boards : Number crunching : Huge 4 CPU task stuck at 17.188%, uses 0% of CPU, estimated time to finish 8 days? (Message 70013)
Posted 8 days ago by Profilemikey
Post:
Hi,

This task for 4 CPUs: https://milkyway.cs.rpi.edu/milkyway/workunit.php?wuid=2000974954 even when suspending all other tasks, does not use my CPU at all anymore and appears stuck .

But it somehow got to 17.188%. Is my CPU too old for the rest of the task?

Should I cancel it?


Suspend every other task on the pc, set all zero resource share projects to no new tasks too so you don't get tasks from them, and see if it starts up again it could just need more memory with the other things you are running. If that doesn't work try suspending it too then resuming it. You cpu is fine.
2) Questions and Answers : Unix/Linux : Tasks disappear from machine (Message 69986)
Posted 8 Jul 2020 by Profilemikey
Post:
Yeah, I do remember updating Boinc on this machine using a PPA for some reason.
The reason now escapes me.
I think it was something to do with Rossetta, perhaps graphics (which I don't really care about anyway) but I forget. :)

Everything is good now. 99 mostly completed tasks on the server this morning against the previously errant machine.

Thanks for the replies!


No problem just glad it's all working now
3) Questions and Answers : Unix/Linux : Tasks disappear from machine (Message 69984)
Posted 8 Jul 2020 by Profilemikey
Post:
My mistake.

The littleserver1 was running a newer boinc than the rlittlelaptop machine and for some reason it had a problem with coming out of suspend when the machine was inactive.
When I was testing it, I had the machine set to run GPU tasks all the time whereas my preferences were to have it suspended on activity.

I downgraded the boinc and it seems to be working as normal now.


Those settings can also be adjusted within the Boinc Manager itself, the Boinc Manager is down by the clock and if you click the up arrow you can see it, then right click to bring up some options then left click on 'open Boinc Manager' Boinc will open up for you. After that click on Options, computing preferences and the first tab, computing, in the 2nd section is all about when to suspend Boinc. Doing it this way is a computer by computer setting but you can also do it for every pc if you click on your name at the top right of this page and then in the 2nd section. preferences, choose the top line 'computing preferences' and make your changes there.

The choices of when to suspend when the computer is busy are the default settings in every version of Boinc, the newer versions tend to fix problems the older versions had like most recently an internet security certificate needed updating so they included it in with a few fairly minor changes. But if a version works for you you do not need to update just because there is a new version out there.
4) Questions and Answers : Unix/Linux : Tasks disappear from machine (Message 69982)
Posted 7 Jul 2020 by Profilemikey
Post:
I see the credit figures and they do look reasonable.

However, if you click through to the actual tasks by machine, you will see that "rlittlelaptop" (764322) has a large number of tasks listed, whereas "littleserver1" (822434) isn't even listing any tasks for the 6th or 7th of June.
If I watch the tasks completed, I see them transition to the task page as "completed" but they soon disappear from the machine's list, in a way that the tasks don't for rlittlelaptop.

I should clarify that when I mean disappear from the machine's list, I mean on the website. The GPU jobs take about 4mins each on the machines themselves.

I just wanted to make sure they were not being purged due to computation error.

I was worried about the apparent discrepancy at the server side.


On your list of tasks it will list the ones with errors or computation errors and neither machine shows any problems with any of the tasks.
BTW only you can see the actual machine names all we see is them sorted by the last time they connected to the project and a number 1, 2 etc. If you click on my name you can see my computers but not the actual names of the pc's just numbers.
5) Questions and Answers : Unix/Linux : Tasks disappear from machine (Message 69979)
Posted 7 Jul 2020 by Profilemikey
Post:
I'm been running GPU work through a couple of machines that I have.

One of them (id=764322) is pumping through GPU work and I see them in the task view for the machine.
One of the others (id=822434), is also pumping through pure GPU work and sending it up. However, for that machine, the jobs quickly disappear from my job list leaving only a very small number remaining.

Both machines are running Linux Mint 19.3.

The first is a laptop with a different NVidia GPU.
The GPU on the second machine is a GTX 1660 (stock everything, no overclocking).

Anybody have any idea what is going on here? Is the work getting rejected for some reason?


Both machines are crunching just fine, the gpu jobs are much shorter than cpu jobs, mine are around a minute because I use older AMD gpu's, so will 'disappear' quicker from the machine. But if you look at your computers both of them are doing very well. You can see your computers here and then you can click on Tasks and see how each is doing:
[url]https://milkyway.cs.rpi.edu/milkyway/hosts_user.php?userid=1116944[/url[
6) Message boards : Number crunching : Computation errors (Message 69977)
Posted 5 Jul 2020 by Profilemikey
Post:
Hi, thank you for reply,
I suspend it by myself because I noticed the progress has been stuck in there for hours,
even I force it to run always. the elapsed time is ticking but the remaining time is extending with it.
Although I put all 12 cores to run the project, but the CPU time is only 50%,
so I still could surf websites or watch movies while the Boinc is running
I did the same in the first few days without any issues.

Anyway, now something just happened magically.
everything seems back to normal this morning, the progress is moving on.

I'm not sure what happened, but I abort all other No-Body simulation units on the queue list.


Alof of us no longer run the N-Body cpu workunits from here because of those types of problems, they have plenty of gpu workunits that finish faster and don't seem to have any problems running. There is a 10 minutes wait period before you get new tasks but if you run a zero resource project with short workunits it works out just fine.
7) Message boards : Number crunching : Computation errors (Message 69975)
Posted 4 Jul 2020 by Profilemikey
Post:
Hi, everyone!

New to the project and just started to run some tasks from last week,
Now I am working on a N-Body Simulation over 11.5 hours but it seems stuck in 34.725% forever,
the total estimated time has been extended from 7.5 hrs to near 17 hrs,
Is that normal or the files are broken? What should I do now?
Thank you very much!
----------------------------------------------------------------------------------------------------------------------
Application Milkyway@home N-Body Simulation 1.76 (mt)
Name de_nbody_06_08_2020_v176_40k__data__2_1588605902_541679
State Suspended - user request
Received 29/06/2020 23:53:31
Report deadline 11/07/2020 23:53:30
Resources 12 CPUs
Estimated computation size 11,205 GFLOPs
CPU time 11:28:46
CPU time since checkpoint 00:16:15
Elapsed time 09:00:28
Estimated time remaining 16:55:57
Fraction done 34.725%
Virtual memory size 16.23 MB
Working set size 940.00 KB
Directory slots/1
Process ID 1412
Progress rate 3.960% per hour
Executable milkyway_nbody_1.76_windows_x86_64__mt.exe
----------------------------------------------------------------------------------------------------------------------


It says 'suspended' that means your settings have the unit suspended for some reason, since your pc'sare hidden it's hard to tell but ideas could be because you have Boinc set to not run while you are using the pc, or because you are running some other Project at the same time. N-Body tasks use as many cpu cores as you have in the pc so trying to run another Project at the same time won't work very well.
8) Message boards : Number crunching : Hard Drive Space Question (Message 69963)
Posted 25 Jun 2020 by Profilemikey
Post:
I was curious about crunching some giant work units. So I increased my hard drive space that BOINC can use from 10 gigs to 200 gigs. But BOINC only uses 5.54 gigs of space on my computer. MilkyWay only uses 24.81 MB of space. Can MilkyWay send me larger work units to crunch? Or are there other projects that do have large work units?


Einstein has some large data files that the workunits use to run and Climate Prediction https://www.cpdn.org/cpdnboinc/index.php has some very large workunits that can take over 1 year to crunch.
9) Message boards : Number crunching : Computation errors (Message 69960)
Posted 22 Jun 2020 by Profilemikey
Post:
I discovered the cause.
Its a condition called sagging.
Being so damn heavy this monolithic GPU unit doesn't bode well in a standard pc tower.
It does have a metal backplate but that doesn't stop stress on the PCIE slot connector.
One slight slip and it malfunctions.
Now I have it running just by repositioning the card on the PCIE slot and tightening up the mounting bracket screw.
It really needs a gpu mounting bracket base I tried to get one it came damaged.
Guess I need to try again. .


If you have the room lay the pc down on it's side until you get one. Can you make one or use a zip tie?
https://graphicscardhub.com/gpu-brace-support/ I'm assuming that's what you mean


you are spot on. Yes thats what I need. I had a look around and yeah its either a bracket or a pc case than tower that I need.
I actually put a ruler there for now :)


I hope you can find one. If you are a wood guy you could always notch a stick and put it between the top and bottom of the case, a small screw thru the top and bottom of the case should hold the stick in place since it isn't really a bracket just a 'don't fall out' thing.
10) Message boards : Number crunching : Computation errors (Message 69958)
Posted 21 Jun 2020 by Profilemikey
Post:
I discovered the cause.
Its a condition called sagging.
Being so damn heavy this monolithic GPU unit doesn't bode well in a standard pc tower.
It does have a metal backplate but that doesn't stop stress on the PCIE slot connector.
One slight slip and it malfunctions.
Now I have it running just by repositioning the card on the PCIE slot and tightening up the mounting bracket screw.
It really needs a gpu mounting bracket base I tried to get one it came damaged.
Guess I need to try again. .


If you have the room lay the pc down on it's side until you get one. Can you make one or use a zip tie?
https://graphicscardhub.com/gpu-brace-support/ I'm assuming that's what you mean
11) Message boards : Number crunching : Computation errors (Message 69955)
Posted 20 Jun 2020 by Profilemikey
Post:
Can someone please tell me what logging option check boxes to set on BOINC to log the issue
And where to obtain the error details to post if theres anything additional i need to do.
I am not showing my pc.
News will tell you Australia is suffering a hacking episode.
COVID19 has news media trying to provoke a cold war.
It is a good computer. A properly built and maintained system by a very experienced operator. If I was inexperienced or dumb then I would show it off.
I have just refreshed my PC with desktop software than server software and new passwords as a precaustion and enhanced my router security.
My drivers are not even a month old.
There shouldn't be an issue running.
When someone can please satisfy my queries I can log the work units and post any futrther errors.
I cannot crunch any further until I can report problems properly.


Click on my name and you can view my pc's and see what people could see about your pc's, it's not enough to make them any more hackable than they are right now.

One suggestion would be to roll back the drivers to ones a bit older. Another would be to stop overclocking anything if you are are.
12) Message boards : Number crunching : Computation errors (Message 69948)
Posted 20 Jun 2020 by Profilemikey
Post:
Isn't it fantastic need to find out if it was at my end or the milkyway@home end.
I stopped work and cancelled; let the errors transmit.
Has anyone else ran into this? in current GPU work units?


Since you didn't make your computers visible and you didn't explain the problem it's very hard to try and figure out your problem.

I run gpu workunits on multiple pc's and have no problems.
13) Questions and Answers : Preferences : Non automatically fetch for work units on multiple clients (Message 69936)
Posted 18 Jun 2020 by Profilemikey
Post:
Hi all,
since i ran two BOINC intances on my workstation (Windows 10 pro) BOINC is not able to download automatically new work units.
If i start the actualisation process by hand manually it works well and i get 600 WU´s on the main instance of BOINC.
The problem is that after appx. 3.5 hours all the jobs are done and i´m not sitting on my desk 24/7 to start the process again.

Any ideas to fix the problem ?

THX

Ralf


Yup it's a known 'problem' here for gpu tasks...your pc must not ask for new gpu tasks for 10 minutes before it can get new ones, it's a setting an old Admin made that the new one can't find. Easiest answer is to setup a zero resource share project to run when you are out of MilkyWay tasks and after running a few of the other projects tasks your computer will ask for MilkyWay tasks again after more than 10 minutes has elapsed.
14) Questions and Answers : Preferences : GPU error (Message 69935)
Posted 18 Jun 2020 by Profilemikey
Post:
I doesn't matter if I have GPU checked or not I continue to get this notice!?

Milkyway@Home: Notice from BOINC
Your settings do not allow fetching tasks for NVIDIA GPU. To fix this, you can change Project Preferences on the project's web site.
6/2/2020 8:22:50 PM


You have not had a computer get any tasks since March, try reloading the gpu drivers Win10 updates often messes them up.
15) Message boards : Number crunching : gpu (Message 69934)
Posted 18 Jun 2020 by Profilemikey
Post:
I just put milkway@home on a new iMac and the gpu is suspend. Why?


Either you are using the wrong drivers or you are running too many cpu tasks. Mac's are funky and not all gpu drivers work.
16) Message boards : News : Award Badges Going Live Soon! (Message 69933)
Posted 18 Jun 2020 by Profilemikey
Post:
For some I do not see my badges can someone let me know why...

Thanks William


They only show up when you post not on your account page anymore, yours are there.
17) Message boards : Number crunching : n body (Message 69921)
Posted 14 Jun 2020 by Profilemikey
Post:
I have a 64 bit system.


I think you have cpu tasks but don't know why they aren't nbody tasks, nbody tasks by default will use all of your cpu cores on each task and your gpu grunching will slow waaaay down
18) Message boards : Number crunching : cpu and gpu (Message 69912)
Posted 12 Jun 2020 by Profilemikey
Post:
.417 cpu + 1 nivida gpu. What does .417 cpu mean?


It means it's using almost 50% of a cpu core and your gpu to run a task on your gpu
19) Message boards : Number crunching : n body (Message 69911)
Posted 12 Jun 2020 by Profilemikey
Post:
How come people that have a imac can not run nbody?


Is your system 32 or 64 bit? They do not support 32 bit Mac's.
20) Questions and Answers : Unix/Linux : MW stopped using my nvidia GPU (Message 69886)
Posted 3 Jun 2020 by Profilemikey
Post:
openSuse Linux (Tumbleweed) x86_64, nvidia driver 440.59

This was going along fine but at least a day ago all the nvidia tasks went to "waiting to run" status and have not run since. the activitiy menu options are all set to "Always" and nothing is suspended, there is no appinfo.xml no app_confing.xml cc_config.xml etc - just raw MW and BOINC. computing preferences allow for 1 cpu per GPU (I have 2 nividia cards) so that should be ok. The nvidia driver hasnt changed. I just double checked I am still part of the video group so thats not it.
How can i determine what is preventing the GPU from running?
[i]Also, as an aside, how does one view the specific computer on the website? I cant seem to sort by computer id so searching for a particular computer in my set of computers is like searching for a needle in a haystack

[Milkyway@Home] Requesting new tasks for NVIDIA GPU
[Milkyway@Home] Scheduler request completed: got 0 new tasks
[Milkyway@Home] Not sending work - last request too recent: 77 sec
[Milkyway@Home] Project requested delay of 91 seconds
[/code]


The last part is the key "[Milkyway@Home] Not sending work - last request too recent: 77 sec" MilkyWay REQUIRES 10 minutes of not asking for new work before they will send you more gpu work, setup a zero resource share project and run a couple of their workunits until MilkyWay refills the cache. Cpu workunits do not have this problem, just gpu workunits.


Next 20

©2020 Astroinformatics Group