Welcome to MilkyWay@home

Posts by Aurum

1) Questions and Answers : Unix/Linux : 15 CPUs cause running tasks to stop running (Message 76285)
Posted 19 Jul 2023 by Aurum
Post:
Al, "instructions retired per second" is the secret sauce that makes this work so well. I had taken your word for it before and have all my computers running 3-thread nbodys. Thanks much for doing this useful work and explaining it so thoroughly.
I once tried to install PERF to measure instructions retired but it gave me four ways to install it and I didn't get it working. It's the only Linux program I've ever seen suggest multiple ways to install.
CPDN has issued a slug of WAH but it's windoze only. Their new guy is planning on putting out much OpenIFS for Linux this fall:
https://www.cpdn.org/forum_thread.php?id=9149&postid=69160#69160
2) Questions and Answers : Unix/Linux : 15 CPUs cause running tasks to stop running (Message 76248)
Posted 13 Jul 2023 by Aurum
Post:
I found that there was a slight degradation for each thread up to three or four, then it went downhill quite fast -- while I was working out an optimum for one of my systems, I found 2 three-thread tasks worked the CPUs harder than 1 six-thread task did
How do you define "works harder?"
And what CPU did you test this on? TIA
3) Message boards : Number crunching : Will N-Body projects all use multiple CPUs? (Message 76247)
Posted 13 Jul 2023 by Aurum
Post:
Is n-body a legitimate multi-CPU project or is it just multiple WUs in one package?
Does n-body use all allocated CPUs for the entire run?
Or does it start using all CPUs and then as parts finish CPUs go idle and wasted until there's just one left running at the end?
4) Message boards : Number crunching : N-Body Simulation MT (Message 71773)
Posted 20 Feb 2022 by Aurum
Post:
Both of the computers that have been running 1C nbody WUs all day have now reverted to DLing 16C WUs.
It cannot be something I'm doing.
These WUs are misconfigured.
5) Message boards : News : Introducing Myself (Message 71772)
Posted 20 Feb 2022 by Aurum
Post:
Hi Dylan! Welcome aboard.
I hope you have modernizing the MW Preferences configuration on your ToDo List :-)
6) Message boards : Number crunching : N-Body Simulation MT (Message 71771)
Posted 20 Feb 2022 by Aurum
Post:
First question to ask is did you preserve the file type as an .xml file? Make sure you didn't create it as app_config.xml.txt.
BOINC will ignore the file if not named correctly.
That sounds like a Windoze problem. I create and/or edit them remotely using:
sudo xed /var/lib/boinc-client/projects/milkyway.cs.rpi.edu_milkyway/app_config.xml

Second question is do you see the app_config being read for Einstein in the Event Log at startup?
If you don't see the file being recognized by BOINC it isn't being acted on and is being ignored.
Do you mean this line from the BoincTasks Message tab?
Rig-01 137251 Milkyway@Home 2/19/2022 10:06:53 PM Found app_config.xml

Both cases would explain why your tasks are still being sent as 16c tasks. Also, any old tasks in your cache before you made the change will still show as 16c. All new tasks will have the correct intended parameters.
Yea I thought maybe I wasn't clearing the decks before running the edited app_config.xml but I just ran a test again (from having no MW WUs & reading config files) and the same thing happened.
I have 2 computers where it works correctly and I've been comparing to ones that keep getting 16Cs and I can find no difference. I looked for typos in my syntax, permissions, and ownership. I'm wondering if it requires a BOINC restart (sudo /etc/init.d/boinc-client restart) to get <avg_ncpus>1.000000</avg_ncpus> written into the client_state.xml file. Problem is I'm running an eclectic mix: GAIA with no checkpointing, ARP1 (_2s, _3s & _4s) with checkpointing every few hours and QuChemPedia with checkpointing every 30 seconds or who knows when.

Are these bona vide multithreaded WUs or just 16 WUs in one wrapper that are extremely inefficient???
Up to 4 threads and I wouldn't even care but 16, yikes!!! Control of the number of threads per CPU WU could be placed in our MW Preferences if they wanted to.
7) Message boards : Number crunching : N-Body Simulation MT (Message 71769)
Posted 19 Feb 2022 by Aurum
Post:
Now they're coming as 16C but not displaying that they're 16C.
8) Message boards : Number crunching : N-Body Simulation MT (Message 71768)
Posted 19 Feb 2022 by Aurum
Post:
From my client_state:
<app_version>
    <app_name>milkyway_nbody</app_name>
    <version_num>182</version_num>
    <platform>x86_64-pc-linux-gnu</platform>
    <avg_ncpus>16.000000</avg_ncpus>
    <flops>89728291437.088242</flops>
    <plan_class>mt</plan_class>
    <api_version>7.6.33</api_version>
    <file_ref>
        <file_name>milkyway_nbody_1.82_x86_64-pc-linux-gnu__mt</file_name>
        <main_program/>
    </file_ref>
</app_version>
It should be:
<avg_ncpus>1.000000</avg_ncpus>
I checked and they actually running with 16 CPU threads so it's not just reporting ti wrong.
9) Message boards : Number crunching : N-Body Simulation MT (Message 71767)
Posted 19 Feb 2022 by Aurum
Post:
I installed my app_config and made sure I read configs before switching to All New Work. But it keeps sending me 16c WUs, hundreds of them. I don't know what to do except Abort them. Be nice if we had more control over our work. E.g, there's no way to specify number of CPUs per WU in Edit My Preferences. Nor can one get n-body CPU WUs and just GPU WUs since we can get one of the other or all three.
<app_config>
<app>
    <name>milkyway</name>
    <gpu_versions>
        <cpu_usage>1.0</cpu_usage>
        <gpu_usage>1.0</gpu_usage>
    </gpu_versions>
    <max_concurrent>4</max_concurrent>
</app>
<app>
    <name>milkyway_nbody</name>
    <plan_class>mt</plan_class>
    <avg_ncpus>1</avg_ncpus>
    <cmdline>--nthreads 1</cmdline>
    <max_concurrent>1</max_concurrent>
</app>
</app_config>
10) Questions and Answers : Preferences : no "multicore" tasks (Message 70754)
Posted 26 Apr 2021 by Aurum
Post:
I have dozens of tasks for MilkyWay@home, almost 200. Is this normal? Seems it would be an inefficient way of doing things.

It's their project and they like multicore tasks as opposed to running one task per cpu core, you can always limit it with an app_config file
This works for me:
<app_config>
<app>
    <name>milkyway</name>
    <gpu_versions>
        <cpu_usage>1.0</cpu_usage>
        <gpu_usage>0.5</gpu_usage>
    </gpu_versions>
    <max_concurrent>4</max_concurrent>
</app>
<app_version>
    <app_name>milkyway_nbody</app_name>
    <plan_class>mt</plan_class>
    <avg_ncpus>1</avg_ncpus>
    <cmdline>--nthreads 1</cmdline>
    <max_concurrent>4</max_concurrent>
</app_version>
</app_config>
11) Message boards : News : The Last Couple Days (Message 70753)
Posted 26 Apr 2021 by Aurum
Post:
Ok, time to turn in my "guy card" and give y'all a laugh at my expense. Like many of you, I participate in more than one project. For me it's this one, Prime Grid, and Einstein. My Nvidia GPUs are running PG and my AMD cards are here. I hit up Einstein when this project went through that rough spot the last couple days. Now this all relates to the thread I promise. I noticed that Einstein seemed to be slow to give me work as well, but there is chatter over there about WUs that allowed me to believe it could be me. Now this project is back to normal and as I said I was getting few WUs while others have said they getting work no prob. So i got to thinking "gee Hold, what would cause all your projects to get limited work?" I strolled over to PG to see if I found any primes being Tour de Prime and all and it hit me. You idiot, you set your WU que to 0 days. So ever hopeful, I set up an alternate venue that allowed several days que and ploped my primary MW cruncher in that venue. Told MW to get work and POW...900 WUs. Sorry if I cause you any worry Tom, looks like all is actually well with the servers, I'm just a scrub. Ok, y'all can stop laughing now lol.
Resource Zero is a great feature. I tried it before where I had more than one RZ project accepting new work but either they both sit there saying after you, no after you... Or one dominates and the other never seems to send any work. It seems to work best to just have a single RZ project.
12) Message boards : Number crunching : CPU versus GPU (Message 68972)
Posted 13 Aug 2019 by Aurum
Post:
This is what I want:
✔ Milkyway@home N-Body Simulation - CPU
Milkyway@home Separation - CPU
✔Milkyway@home Separation - GPU
13) Message boards : Number crunching : CPU versus GPU (Message 68970)
Posted 12 Aug 2019 by Aurum
Post:
I just noticed that N-Body Simulation is only CPU. So I checked the box Use CPU. But then I get both CPU & GPU WUs for Separation. Is there any reason to run both CPU & GPU Separation WUs???
Could another option be added to Preferences/Run only the selected applications:
✔ Milkyway@home N-Body Simulation
Milkyway@home Separation - CPU
✔Milkyway@home Separation - GPU
14) Message boards : Number crunching : MilkyWay takes a backseat to Einstein ??? (Message 68968)
Posted 11 Aug 2019 by Aurum
Post:
You will find running the same project on your cpu and gpu will cause problems too as they do not have separate caches.
I did not know that. What cache is that?

I was hoping Dr Jake would just adjust a server setting and put them on a level playing field. But as soon as I posted I realized I'd be told to adjust Resource Share. I never use it since it can have undesired results and besides with multiple computers I'd lose track of which was set to what.

If you watch GPU Load using GPU-z you see that MilkyWay is a short duty cycle while Einstein is medium and GPUGrid is long. By pairing longer projects with MilkyWay it gets a more efficient and complete use of the GPU.
I recently saw a post where someone was using time stamps in the logs to measure the time over multiple WUs for Linux computers to test if pairings produced more points per hour.
15) Message boards : Number crunching : MilkyWay takes a backseat to Einstein ??? (Message 68963)
Posted 11 Aug 2019 by Aurum
Post:
I like to run MilkyWay with Einstein WUs on the same GPU, a perfect pairing. However, Einstein will continue to request work but MilkyWay stops requesting work and runs out. To get more MilkyWay WUs I have to Suspend Einstein to stock up on MilkyWay.
Is it possible to make MilkyWay as aggressive as Einstein???
16) Message boards : News : 30 Workunit Limit Per Request - Fix Implemented (Message 68491)
Posted 8 Apr 2019 by Aurum
Post:
I never get enough MW WUs. I can run 200 at a time. Some computers don't get any while some others get a steady supply. They all have the same app_config.xml. Server Status shows 10,000 WUs ready but I have computers that haven't gotten any in forever.
Is there some kind of governor on this project???
I can't find anything to explain the difference.
17) Message boards : Number crunching : Linux Mint 19 Tara (Message 67676)
Posted 18 Jul 2018 by Aurum
Post:
I believe I fixed it.
First: Linux 19 does not default to include recommended dependencies. Fix with 1.2.1: https://sites.google.com/site/easylinuxtipsproject/mint-cinnamon-first
Second: Install CUDA: https://help.ubuntu.com/community/Cuda
18) Message boards : Number crunching : Linux Mint 19 Tara (Message 67674)
Posted 17 Jul 2018 by Aurum
Post:
I upgraded a computer from Linux Mint 18.3 Sylvia to Linux Mint 19 Tara and now BOINC will not run GPU WUs any more. I'm running BOINC 7.12.0. The only Notice is:
Milkyway@Home: Notice from BOINC
Your current settings do not allow tasks from this project. To fix this, you can change Project Preferences on the project's web site.
Tue 17 Jul 2018 12:14:47 AM PDT
Anyone know how I can get GPUs working again???
BTW, it took several hours to upgrade to Linux 19. NoMachine works so much faster on this headless computer than Linux 18.3 which was unbearably slow.
I don't recommend upgrading until more kinks get worked out.
Tried Folding@Home 7.5.1 & it DLs WUs but shortly after they start running they fail and another WU is DLed. All 3 1080 Ti's do the same. I have all the latest updates and the latest Nvidia driver.
19) Message boards : Number crunching : Frequency preference (Message 67673)
Posted 17 Jul 2018 by Aurum
Post:
Grammar seems ambiguous, could be worded better. Seems counter intuitive. Shouldn't higher frame rates run WUs faster???
I'm trying 30 fps but I don't see any difference. Does this parameter work???
Frequency (in Hz) that should try to complete individual work chunks. Higher numbers may run slower but will provide a more responsive system. Lower may be faster but more laggy. 
default 60 (corresponds to 60 fps)
20) Message boards : News : New nbody runs July 2 (Message 67650)
Posted 3 Jul 2018 by Aurum
Post:
I never get any nbody WUs. I have both checked in my milkyway preferences. Any idea why not???




©2024 Astroinformatics Group