Welcome to MilkyWay@home

Posts by UnionJack

1) Questions and Answers : Unix/Linux : Authorisation failure (Message 75226)
Posted 27 Mar 2023 by Profile UnionJack
Post:
On Gentoo (testing, up to date) I get this from milkyway@home:

Authorization required, but no authorization protocol specified.

How do I fix this, and what is an authorisation protocol anyway?
2) Questions and Answers : Preferences : no "multicore" tasks (Message 74862)
Posted 1 Jan 2023 by Profile UnionJack
Post:
Thank you for this .xml file. I have a 24-core CPU and MilkyWay@home was using 16 of them on every task, so CPU utilisation was never more than 57%.

It would help if MW@H were to offer a setting for this on the Preferences page, as other projects do.
3) Questions and Answers : Preferences : Local control of GPU use (Message 71968)
Posted 16 Mar 2022 by Profile UnionJack
Post:
OK, I did that - thanks! Here's my new cc_config.xml file (it didn't exist before):
$ cat ~/boinc/cc_config.xml
<cc_config>
<options>
<ignore_ati_dev>1</ignore_ati_dev>
</options>
</cc_config>

Then on starting BOINC:
--->8
OpenCL: AMD/ATI GPU 0: AMD Radeon Pro W5500 (navi14, LLVM 13.0.1, DRM 3.44, 5.16.14-gen (driver version 22.0.0, device version OpenCL 1.1 Mesa 22.0.0, 8192MB, 8192MB available, 3344 GFLOPS peak)
--->8
[---] Config: ignoring AMD/ATI GPU 1
[etc...]

It's a pity I can't use that GPU, but it seems AMD navi14 sources are not available on most Linux distros - only a few enterprise versions, and certainly not Gentoo. And I only went for that card for its powerful GPU, too.

Thanks also to wb8ili.
4) Questions and Answers : Preferences : Local control of GPU use (Message 71965)
Posted 16 Mar 2022 by Profile UnionJack
Post:
I have two boxes running BOINC; one of them can run opencl apps, but the one I'm writing this on can't.

Is there a way to specify in, say, boinc/global_prefs_override.xml, that the gpu should not be used at all on this machine? I see there's a run_gpu_if_user_active parameter, which I've set to 0, but what about when I'm not 'active'?
5) Questions and Answers : Preferences : MilkyWay steals all my CPUs (Message 71261)
Posted 22 Oct 2021 by Profile UnionJack
Post:
| It would still be nice if they would add some kind of direct control on the prefs page. More casual users will just give up and delete the project.

I agree. Other projects have done it, so why not MW@H? It's almost as though the project didn't want us to find these things out without working at it.
6) Questions and Answers : Unix/Linux : Yet another computation-error problem (Message 67330)
Posted 14 Apr 2018 by Profile UnionJack
Post:
I don't know what's going on here, but something has changed since I wrote the above: now I don't get the computation errors. I have another problem instead, on which I'll ask another question if I can't solve it myself.
7) Questions and Answers : Unix/Linux : Yet another computation-error problem (Message 67323)
Posted 11 Apr 2018 by Profile UnionJack
Post:
I stopped using my GPU in BOINC when a case fan failed from hard work, but now I've tried to resume. I get immediate computation errors on all MW tasks, but not Einstein@Home tasks which run fine.

OS: Gentoo Linux, kernel 4.9.76-r1
GPU: Radeon Pro WX 5100 8GB GDDR5
Driver: amdgpu-pro-opencl-17.50.511655 using mesa-17.3.8
Typical error: Computation error (0.929 CPUs + AMD/ATI GPU) ... MilkyWay@Home 1.46 (opencl_ati_101) de_modfit_14_bundle5_NoConstraintsWithDisk...
(Also with de_modfit_23)
Toolkit: wxGTK-3.0.3-r300

stdoutdae.txt shows this (T&D stripped):
OpenCL: AMD/ATI GPU 0: AMD Radeon (TM) Pro WX 5100 Graphics (POLARIS10 / DRM 3.8.0 / 4.9.76-gentoo-r1, LLVM 5.0.1) (driver version 17.3.8, device version OpenCL 1.1 Mesa 17.3.8, 16029MB, 16029MB available, 2433 GFLOPS peak)
[...]
Memory: 31.32 GB physical, 62.47 GB virtual
Disk: 39.12 GB total, 26.02 GB free
Local time is UTC +1 hours
VirtualBox version: 5.2.8_Gentoor120774
Config: don't compute while cc1 is running
Config: don't compute while cc1plus is running
Config: don't compute while cmake is running
[...]
Reading preferences override file
Preferences:
   max memory usage when active: 28859.99 MB
   max memory usage when idle: 30463.32 MB
   max disk usage: 37.06 GB
   max download rate: 2621440 bytes/sec
   max upload rate: 838861 bytes/sec

I've searched everywhere I can think of for clues to this, but there's nothing either recent or relevant.
I've tried downgrading to amdgpu-pro-opencl-17.40.492261 but it's made no difference. Those are the only two versions available in Gentoo.
What else can I try?
8) Message boards : Number crunching : Restricting CPUs per Work Unit (Message 67232)
Posted 9 Mar 2018 by Profile UnionJack
Post:
That's what I needed to know - thank you.

Wouldn't it be good if there were a central place where all the config options were documented.
9) Message boards : Number crunching : Restricting CPUs per Work Unit (Message 67225)
Posted 8 Mar 2018 by Profile UnionJack
Post:
I know I've seen this somewhere but I just can't find it, so apologies in advance.

I want MW@H to use no more than, say, 5 of my 12 CPU threads, and I know I have to set a value in a config file in the project's directory, but what, and which file? I don't want to apply the same restriction to all 8 of my projects.
10) Message boards : Number crunching : How to throttle AMD GPU load? (Message 67204)
Posted 5 Mar 2018 by Profile UnionJack
Post:
I have a Radeon Pro WX 5100 GPU, which is running at near enough full load. Here is a sample line from /usr/sbin/radeontop:

gpu 85.83%, ee 0.00%, vgt 0.00%, ta 85.00%, sx 0.00%, sh 0.00%, spi 85.83%, sc 0.00%, pa 0.00%, db 0.83%, cb 0.83%, vram 15.32% 1251.52mb

Is it possible to throttle the load on the GPU to reduce the heat? It's running at 84C, which is not excessive in itself but the case fans don't like it. I've seen threads on sharing the GPU between two tasks, but I want the converse.

This is a Gentoo Linux box - not windows!
11) Message boards : News : GPU Issues Mega Thread (Message 65515)
Posted 22 Oct 2016 by Profile UnionJack
Post:
Something else is going wrong now. I decided to refuse modfit tasks until I had some solution, so I went to my preferences page, deselected modfit, told it not to accept tasks for other applications, then updated the project preferences - both on the website and in boincmgr. It still sent me modfit tasks. I found two applications listed in account_milkyway.cs.rpi.edu_milkyway.xml: numbers 3 and 7.

Next I detached from the project, checked that no milky* or Milky* files or directories existed, went back to the project preferences web page, told it to update once more, then reattached to the project in boincmgr.

It STILL sent me modfit tasks! Grr!

How do I tame this beast?
12) Message boards : News : GPU Issues Mega Thread (Message 65509)
Posted 21 Oct 2016 by Profile UnionJack
Post:
Nope, that wasn't it. I have another 43 tasks showing computation error.
13) Message boards : News : GPU Issues Mega Thread (Message 65503)
Posted 20 Oct 2016 by Profile UnionJack
Post:
Hello Al,

You've got me thinking there - dangerous, but thanks!

All software in this box is as up-to-date as it can be, but you made me wonder about kernel versions and when this problem first appeared, which prompted me to look again at the amdgpu config in the kernel. I had a test-only setting in there for some reason, so I reset it and rebooted. I'm now waiting for MW@home to have some tasks for me.
14) Message boards : News : GPU Issues Mega Thread (Message 65489)
Posted 19 Oct 2016 by Profile UnionJack
Post:
The problem is not in whether or when to use the GPU, it's that every modfit task fails. I can sit and watch while 48 tasks run for two seconds each before stopping with Computation Error.

There seems to be a coding error in modfit that prevents it from using my GPU properly, or perhaps even from using it at all. That's why I gave as much detail of the device as I could, in the hope that the devs might discover how it differs from the ones they know.
15) Message boards : News : GPU Issues Mega Thread (Message 65448)
Posted 15 Oct 2016 by Profile UnionJack
Post:
I posted in another thread that I'm getting computation failure errors on large numbers of MW tasks, but I didn't get a reply there so I'm trying here. This is an AMD/ATI Radeon R9 380X Nitro 4G D5.

I see this in stdoutdae.txt during restart after project reset, or on BOINC start:

15-Oct-2016 12:27:23 [---] OpenCL: AMD/ATI GPU 0: AMD TONGA (DRM 3.3.0 / 4.8.1-gentoo, LLVM 3.8.1) (driver version 12.0.3, device version OpenCL 1.1 Mesa 12.0.3, 1024MB, 1024MB available, 50 GFLOPS peak)

The boincmgr task list display shows "Computation error (0.0176 CPUs +1 AMD/ATI GPU)". There's no corresponding text in the Event Log page of boincmgr, nor in any file under ~/boinc. This is a Gentoo Linux 12-core i7 box with plenty of RAM and disk space. 0.0176 is about 1/58. I'm running the latest versions available of all drivers etc.

It looks as though it's only modfit tasks that show this behaviour.

# lspci -n -s 01:00.0
01:00.0 0300: 1002:6938 (rev f1)

The details of this device are at https://pci-ids.ucw.cz/read/PC/1002/6938. I'm peterh in that discussion. I hope something there is helpful in getting to the bottom of this GPU problem.
16) Message boards : News : Updated Server Daemons and Libraries (Message 65376)
Posted 4 Oct 2016 by Profile UnionJack
Post:
# lspci -n -s 01:00.0
01:00.0 0300: 1002:6938 (rev f1)

You can find the details of this device at https://pci-ids.ucw.cz/read/PC/1002/6938 . I'm peterh in that discussion.
17) Message boards : News : Updated Server Daemons and Libraries (Message 65375)
Posted 4 Oct 2016 by Profile UnionJack
Post:
I'm getting Computation Error on every milkyway@home task I run. The Tasks page of BOINC manager says "Computation error (0.0176 CPUs +1 AMD/ATI GPU)". There's no corresponding text in the Event Log page, nor in any file under ~/boinc. This is a Gentoo Linux 12-core i7 box with plenty of RAM and disk space, so this isn't the Windows problem reported above. 0.0176 is about 1/58. I'm running the latest versions available of all drivers etc.

The error comes up immediately each task is started. I've reset the project but that made no difference. The job log has a date stamp of 24 Sept, and its last entry is "1474692508 ue 5545.518061 ct 1614.182000 fe 1307020000000 nm de_nbody_8_22_16_v162_2k_2_1474377580_43838_0 et 172.516924 es 0". Other projects' job logs are date-stamped today.

# lspci -n -s 01:00.0
01:00.0 0300: 1002:6938 (rev f1)
18) Questions and Answers : Preferences : OpenCL and ATI/AMD Radeon R9 380X (Message 64634)
Posted 12 Jun 2016 by Profile UnionJack
Post:
The MilkyWay@Home server has sent me a message saying "ATI GPU R600 (R38xx) does not support OpenCL". But I don't have one of those cards; I have a Radeon R9 380X Nitro 4G D5 (Tonga) card, which does support OpenCL - I'm using it on another BOINC project. Its described here: http://pci-ids.ucw.cz/read/PC/1002/6938.

Is this a case of the application code needing an update?
19) Questions and Answers : Unix/Linux : Milkyway@Home says it's running a task on 4 CPUs, but it isn't. (Message 63318)
Posted 3 Apr 2015 by Profile UnionJack
Post:
Initialisation is taking far too long; it's leaving three-quarters of my resources unused for long periods.

I'm seriously considering dropping this project in favour of several other physics and astrophysics projects that don't leave my system idling.

Energy to power the machine does cost money. I'm happy to spend the money on worthwhile causes and I'd love to keep this project going.

Encourage me...
20) Questions and Answers : Unix/Linux : Milkyway@Home says it's running a task on 4 CPUs, but it isn't. (Message 63244)
Posted 18 Mar 2015 by Profile UnionJack
Post:
That could be it, yes, but if so, it's taking an unexpectedly long time to initialise. Unexpected by me, that is. I'll watch it for a while.

Thanks Alan.

Later: It's just taken 16 minutes (+/- 0s) to start using the other three cores. Some initialisation!

Oh, and I should have said I'm running BOINC 7.2.0-r1 from the Gentoo package.

--
Peter.


Next 20

©2024 Astroinformatics Group