Welcome to MilkyWay@home

Posts by UnionJack

1) Questions and Answers : Preferences : MilkyWay steals all my CPUs (Message 71261)
Posted 6 days ago by ProfileUnionJack
Post:
| It would still be nice if they would add some kind of direct control on the prefs page. More casual users will just give up and delete the project.

I agree. Other projects have done it, so why not MW@H? It's almost as though the project didn't want us to find these things out without working at it.
2) Questions and Answers : Unix/Linux : Yet another computation-error problem (Message 67330)
Posted 14 Apr 2018 by ProfileUnionJack
Post:
I don't know what's going on here, but something has changed since I wrote the above: now I don't get the computation errors. I have another problem instead, on which I'll ask another question if I can't solve it myself.
3) Questions and Answers : Unix/Linux : Yet another computation-error problem (Message 67323)
Posted 11 Apr 2018 by ProfileUnionJack
Post:
I stopped using my GPU in BOINC when a case fan failed from hard work, but now I've tried to resume. I get immediate computation errors on all MW tasks, but not Einstein@Home tasks which run fine.

OS: Gentoo Linux, kernel 4.9.76-r1
GPU: Radeon Pro WX 5100 8GB GDDR5
Driver: amdgpu-pro-opencl-17.50.511655 using mesa-17.3.8
Typical error: Computation error (0.929 CPUs + AMD/ATI GPU) ... MilkyWay@Home 1.46 (opencl_ati_101) de_modfit_14_bundle5_NoConstraintsWithDisk...
(Also with de_modfit_23)
Toolkit: wxGTK-3.0.3-r300

stdoutdae.txt shows this (T&D stripped):
OpenCL: AMD/ATI GPU 0: AMD Radeon (TM) Pro WX 5100 Graphics (POLARIS10 / DRM 3.8.0 / 4.9.76-gentoo-r1, LLVM 5.0.1) (driver version 17.3.8, device version OpenCL 1.1 Mesa 17.3.8, 16029MB, 16029MB available, 2433 GFLOPS peak)
[...]
Memory: 31.32 GB physical, 62.47 GB virtual
Disk: 39.12 GB total, 26.02 GB free
Local time is UTC +1 hours
VirtualBox version: 5.2.8_Gentoor120774
Config: don't compute while cc1 is running
Config: don't compute while cc1plus is running
Config: don't compute while cmake is running
[...]
Reading preferences override file
Preferences:
   max memory usage when active: 28859.99 MB
   max memory usage when idle: 30463.32 MB
   max disk usage: 37.06 GB
   max download rate: 2621440 bytes/sec
   max upload rate: 838861 bytes/sec

I've searched everywhere I can think of for clues to this, but there's nothing either recent or relevant.
I've tried downgrading to amdgpu-pro-opencl-17.40.492261 but it's made no difference. Those are the only two versions available in Gentoo.
What else can I try?
4) Message boards : Number crunching : Restricting CPUs per Work Unit (Message 67232)
Posted 9 Mar 2018 by ProfileUnionJack
Post:
That's what I needed to know - thank you.

Wouldn't it be good if there were a central place where all the config options were documented.
5) Message boards : Number crunching : Restricting CPUs per Work Unit (Message 67225)
Posted 8 Mar 2018 by ProfileUnionJack
Post:
I know I've seen this somewhere but I just can't find it, so apologies in advance.

I want MW@H to use no more than, say, 5 of my 12 CPU threads, and I know I have to set a value in a config file in the project's directory, but what, and which file? I don't want to apply the same restriction to all 8 of my projects.
6) Message boards : Number crunching : How to throttle AMD GPU load? (Message 67204)
Posted 5 Mar 2018 by ProfileUnionJack
Post:
I have a Radeon Pro WX 5100 GPU, which is running at near enough full load. Here is a sample line from /usr/sbin/radeontop:

gpu 85.83%, ee 0.00%, vgt 0.00%, ta 85.00%, sx 0.00%, sh 0.00%, spi 85.83%, sc 0.00%, pa 0.00%, db 0.83%, cb 0.83%, vram 15.32% 1251.52mb

Is it possible to throttle the load on the GPU to reduce the heat? It's running at 84C, which is not excessive in itself but the case fans don't like it. I've seen threads on sharing the GPU between two tasks, but I want the converse.

This is a Gentoo Linux box - not windows!
7) Message boards : News : GPU Issues Mega Thread (Message 65515)
Posted 22 Oct 2016 by ProfileUnionJack
Post:
Something else is going wrong now. I decided to refuse modfit tasks until I had some solution, so I went to my preferences page, deselected modfit, told it not to accept tasks for other applications, then updated the project preferences - both on the website and in boincmgr. It still sent me modfit tasks. I found two applications listed in account_milkyway.cs.rpi.edu_milkyway.xml: numbers 3 and 7.

Next I detached from the project, checked that no milky* or Milky* files or directories existed, went back to the project preferences web page, told it to update once more, then reattached to the project in boincmgr.

It STILL sent me modfit tasks! Grr!

How do I tame this beast?
8) Message boards : News : GPU Issues Mega Thread (Message 65509)
Posted 21 Oct 2016 by ProfileUnionJack
Post:
Nope, that wasn't it. I have another 43 tasks showing computation error.
9) Message boards : News : GPU Issues Mega Thread (Message 65503)
Posted 20 Oct 2016 by ProfileUnionJack
Post:
Hello Al,

You've got me thinking there - dangerous, but thanks!

All software in this box is as up-to-date as it can be, but you made me wonder about kernel versions and when this problem first appeared, which prompted me to look again at the amdgpu config in the kernel. I had a test-only setting in there for some reason, so I reset it and rebooted. I'm now waiting for MW@home to have some tasks for me.
10) Message boards : News : GPU Issues Mega Thread (Message 65489)
Posted 19 Oct 2016 by ProfileUnionJack
Post:
The problem is not in whether or when to use the GPU, it's that every modfit task fails. I can sit and watch while 48 tasks run for two seconds each before stopping with Computation Error.

There seems to be a coding error in modfit that prevents it from using my GPU properly, or perhaps even from using it at all. That's why I gave as much detail of the device as I could, in the hope that the devs might discover how it differs from the ones they know.
11) Message boards : News : GPU Issues Mega Thread (Message 65448)
Posted 15 Oct 2016 by ProfileUnionJack
Post:
I posted in another thread that I'm getting computation failure errors on large numbers of MW tasks, but I didn't get a reply there so I'm trying here. This is an AMD/ATI Radeon R9 380X Nitro 4G D5.

I see this in stdoutdae.txt during restart after project reset, or on BOINC start:

15-Oct-2016 12:27:23 [---] OpenCL: AMD/ATI GPU 0: AMD TONGA (DRM 3.3.0 / 4.8.1-gentoo, LLVM 3.8.1) (driver version 12.0.3, device version OpenCL 1.1 Mesa 12.0.3, 1024MB, 1024MB available, 50 GFLOPS peak)

The boincmgr task list display shows "Computation error (0.0176 CPUs +1 AMD/ATI GPU)". There's no corresponding text in the Event Log page of boincmgr, nor in any file under ~/boinc. This is a Gentoo Linux 12-core i7 box with plenty of RAM and disk space. 0.0176 is about 1/58. I'm running the latest versions available of all drivers etc.

It looks as though it's only modfit tasks that show this behaviour.

# lspci -n -s 01:00.0
01:00.0 0300: 1002:6938 (rev f1)

The details of this device are at https://pci-ids.ucw.cz/read/PC/1002/6938. I'm peterh in that discussion. I hope something there is helpful in getting to the bottom of this GPU problem.
12) Message boards : News : Updated Server Daemons and Libraries (Message 65376)
Posted 4 Oct 2016 by ProfileUnionJack
Post:
# lspci -n -s 01:00.0
01:00.0 0300: 1002:6938 (rev f1)

You can find the details of this device at https://pci-ids.ucw.cz/read/PC/1002/6938 . I'm peterh in that discussion.
13) Message boards : News : Updated Server Daemons and Libraries (Message 65375)
Posted 4 Oct 2016 by ProfileUnionJack
Post:
I'm getting Computation Error on every milkyway@home task I run. The Tasks page of BOINC manager says "Computation error (0.0176 CPUs +1 AMD/ATI GPU)". There's no corresponding text in the Event Log page, nor in any file under ~/boinc. This is a Gentoo Linux 12-core i7 box with plenty of RAM and disk space, so this isn't the Windows problem reported above. 0.0176 is about 1/58. I'm running the latest versions available of all drivers etc.

The error comes up immediately each task is started. I've reset the project but that made no difference. The job log has a date stamp of 24 Sept, and its last entry is "1474692508 ue 5545.518061 ct 1614.182000 fe 1307020000000 nm de_nbody_8_22_16_v162_2k_2_1474377580_43838_0 et 172.516924 es 0". Other projects' job logs are date-stamped today.

# lspci -n -s 01:00.0
01:00.0 0300: 1002:6938 (rev f1)
14) Questions and Answers : Preferences : OpenCL and ATI/AMD Radeon R9 380X (Message 64634)
Posted 12 Jun 2016 by ProfileUnionJack
Post:
The MilkyWay@Home server has sent me a message saying "ATI GPU R600 (R38xx) does not support OpenCL". But I don't have one of those cards; I have a Radeon R9 380X Nitro 4G D5 (Tonga) card, which does support OpenCL - I'm using it on another BOINC project. Its described here: http://pci-ids.ucw.cz/read/PC/1002/6938.

Is this a case of the application code needing an update?
15) Questions and Answers : Unix/Linux : Milkyway@Home says it's running a task on 4 CPUs, but it isn't. (Message 63318)
Posted 3 Apr 2015 by ProfileUnionJack
Post:
Initialisation is taking far too long; it's leaving three-quarters of my resources unused for long periods.

I'm seriously considering dropping this project in favour of several other physics and astrophysics projects that don't leave my system idling.

Energy to power the machine does cost money. I'm happy to spend the money on worthwhile causes and I'd love to keep this project going.

Encourage me...
16) Questions and Answers : Unix/Linux : Milkyway@Home says it's running a task on 4 CPUs, but it isn't. (Message 63244)
Posted 18 Mar 2015 by ProfileUnionJack
Post:
That could be it, yes, but if so, it's taking an unexpectedly long time to initialise. Unexpected by me, that is. I'll watch it for a while.

Thanks Alan.

Later: It's just taken 16 minutes (+/- 0s) to start using the other three cores. Some initialisation!

Oh, and I should have said I'm running BOINC 7.2.0-r1 from the Gentoo package.

--
Peter.
17) Questions and Answers : Unix/Linux : Milkyway@Home says it's running a task on 4 CPUs, but it isn't. (Message 63233)
Posted 15 Mar 2015 by ProfileUnionJack
Post:
If I suspend the task, another one starts in its place, also claiming to be running on 4 CPUs but also not. Meanwhile I have three cores at 1% load and three other projects waiting. I've seen the problem before; it seems I just have to sit and wait.

This is Gentoo Linux running on an Asus P7P55D motherboard and i5 CPU, 16 GB RAM, plenty of disc, nVidia Quadro FX580 graphics, no CUDA.

--
Peter.




©2021 Astroinformatics Group