Welcome to MilkyWay@home

News General

Spatzthecat
Joined: 1 Dec 10
Posts: 82
Credit: 15,452,009,012
RAC: 0
Message 71545 - Posted: 19 Dec 2021, 12:48:36 UTC

Hi Tom,
Things seem to have settled, much better - thank you.
Invalids rising slowly, now at 1077

.clair.
Joined: 3 Mar 13
Posts: 84
Credit: 779,527,712
RAC: 0
Message 71546 - Posted: 19 Dec 2021, 20:27:32 UTC

Pity the validator cannot be reprogrammed to accept the size 7 workunits as a valid size,
or is that just my dim way of solving the problem?

Spatzthecat
Joined: 1 Dec 10
Posts: 82
Credit: 15,452,009,012
RAC: 0
Message 71548 - Posted: 20 Dec 2021, 18:13:53 UTC

Hi Tom,
Invalids are rising slowly, now at 1239.

El Jefe
Joined: 19 Dec 21
Posts: 1
Credit: 73,240,965
RAC: 0
Message 71549 - Posted: 20 Dec 2021, 23:53:52 UTC

I have been having issues with the nbody tasks failing. They should process in a couple of minutes (using 16 processors out of 24 on a pair of Xeon 56xx CPUs), but instead they get part way and then lock up for the rest of the day, which means that no other tasks will run on those 16 processors. It isn't an issue of them just trying to finish, either: a task that should take a couple of minutes will sit stuck for 12 hours at 0% CPU.

If anyone has any suggestions on this, I would appreciate them.

Tom Donlon
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 10 Apr 19
Posts: 408
Credit: 120,203,200
RAC: 0
Message 71550 - Posted: 21 Dec 2021, 15:54:23 UTC - in response to Message 71549.  

I will reach out to Eric and let him know to check this thread.

SolarPowered
Joined: 14 Jan 22
Posts: 3
Credit: 240,634
RAC: 0
Message 71608 - Posted: 16 Jan 2022, 20:51:09 UTC

Hello,

MilkyWay@home sounds very interesting: "create a highly accurate three dimensional model of the Milkyway galaxy using data gathered by the Sloan Digital Sky Survey".

It sounds like you are building a "Google Maps" of the Milky Way? (I hope it is open source.)

Will it look like this? https://youtu.be/r97GWqMD7_c

https://skfb.ly/6QV7A

It is probably not an accurate 3D model, since a 3D hobbyist made it, but it looks great, runs in the user's browser, and supports VR.

Neat :)

Check out this software: http://universesandbox.com/scottmanley (discover the universe like a video game :)

It is more about inspiring people; whether VR brings any practical advancements is still in question: https://sg.news.yahoo.com/vr-milky-way-model-open-100210956.html

Have a good time despite everything.

Rick
Joined: 29 Aug 21
Posts: 24
Credit: 67,464,864
RAC: 556
Message 71612 - Posted: 18 Jan 2022, 8:09:30 UTC

Hi fellow number crunchers. I have backed off a bit and am not doing any work at this time because there has been no feedback as to whether all this effort is useful; for all I know, I could be crypto mining. Can somebody point me to where all our effort is to be found, or should I just go back to my telescope?

mikey
Joined: 8 May 09
Posts: 3339
Credit: 524,010,781
RAC: 0
Message 71613 - Posted: 18 Jan 2022, 11:55:13 UTC - in response to Message 71612.  

Hi fellow number crunchers. I have backed off a bit and am not doing any work at this time because there has been no feedback as to whether all this effort is useful; for all I know, I could be crypto mining. Can somebody point me to where all our effort is to be found, or should I just go back to my telescope?


It's your PCs and your choice where you crunch, but a simple look would have found this:

https://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=3145

Yes, it's from 2019, but this is about REAL science, not fake science where results are gathered today and dropped onto some fake website the next.

kotenok2000
Joined: 22 May 11
Posts: 71
Credit: 5,685,114
RAC: 0
Message 71616 - Posted: 18 Jan 2022, 12:03:28 UTC
Last modified: 18 Jan 2022, 12:11:41 UTC


Rick
Joined: 29 Aug 21
Posts: 24
Credit: 67,464,864
RAC: 556
Message 71618 - Posted: 18 Jan 2022, 22:10:59 UTC - in response to Message 71616.  

Thanks guys, it's what I was asking for.

SolarPowered
Joined: 14 Jan 22
Posts: 3
Credit: 240,634
RAC: 0
Message 71638 - Posted: 20 Jan 2022, 20:11:35 UTC - in response to Message 71612.  
Last modified: 20 Jan 2022, 20:27:39 UTC

Actually, you are: "MilkyWay@home is a whitelisted gridcoin project.[8] It is the second-largest manufacturer of gridcoins"

https://en.wikipedia.org/wiki/MilkyWay@home

OK, let's rephrase: why am I burning fossil fuel to power a server that runs MilkyWay@home instead of mining Monero?

Because a lemon tree is in need of heating X-D

(And this way the server at least does something useful (I hope) rather than just giving off heat.)

(Mining Monero would need a very, very fast 32-core or 64-core CPU, around $2000, to make sense: https://dwaves.de/2022/01/14/server-heating-a-lemon-tree-crypto-experiment-rechnet-es-sich-in-2022-01-monero-zu-minen-does-mining-monero-pay-off-maybe-better-of-boinc/ )

sensors; #says
Adapter: ISA adapter
Package id 0:  +56.0°C  (high = +85.0°C, crit = +105.0°C)
Core 0:        +47.0°C  (high = +85.0°C, crit = +105.0°C)
Core 1:        +56.0°C  (high = +85.0°C, crit = +105.0°C)
Core 2:        +53.0°C  (high = +85.0°C, crit = +105.0°C)
Core 3:        +55.0°C  (high = +85.0°C, crit = +105.0°C)


Hint: it might run much more efficiently on GPUs than on CPUs:

https://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=4379#68005

PS: What GPU would be recommended (in terms of watts per GFLOP)? It should cost well under $1000.

Tom Donlon
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Joined: 10 Apr 19
Posts: 408
Credit: 120,203,200
RAC: 0
Message 71874 - Posted: 4 Mar 2022, 19:01:54 UTC

I found some stale processes in the DB that were slowing things down. I cleared them out and the website is running faster now. It also looks like there are some more separation WUs in the well of jobs that are ready to be sent out.

I'm not sure about the validator backup, but I'm trying to figure out if something can be done. The same goes for the transitioner backup.

mikey
Joined: 8 May 09
Posts: 3339
Credit: 524,010,781
RAC: 0
Message 71876 - Posted: 5 Mar 2022, 11:14:36 UTC - in response to Message 71535.  
Last modified: 5 Mar 2022, 11:15:41 UTC

Work on GPU N-body is slow but still happening. Some major progress was made over the summer, but I wouldn't expect it anytime soon.


Maybe you could reach out to "Petri" through Keith Myers in this thread and get his assistance; since he's done several already, he may be able to get you through the sticking points.

Septimus
Joined: 8 Nov 11
Posts: 205
Credit: 2,900,464
RAC: 0
Message 72723 - Posted: 12 Apr 2022, 11:46:28 UTC - in response to Message 71876.  

Work on GPU N-body is slow but still happening. Some major progress was made over the summer, but I wouldn't expect it anytime soon.


Maybe you could reach out to "Petri" through Keith Myers in this thread and get his assistance; since he's done several already, he may be able to get you through the sticking points.


Hope it includes Intel GPUs as well.

Keith Myers
Joined: 24 Jan 11
Posts: 715
Credit: 556,844,997
RAC: 43,988
Message 72733 - Posted: 12 Apr 2022, 21:11:03 UTC - in response to Message 72723.  

Since the Separation app is OpenCL based, it would be compatible with all card types.

Generally Petri develops on Nvidia cards as a preference with CUDA being the preferred platform owing to its inherent performance advantages over compiled OpenCL.

mikey
Joined: 8 May 09
Posts: 3339
Credit: 524,010,781
RAC: 0
Message 72750 - Posted: 13 Apr 2022, 10:30:27 UTC - in response to Message 72733.  

Since the Separation app is OpenCL based, it would be compatible with all card types.

Generally Petri develops on Nvidia cards as a preference with CUDA being the preferred platform owing to its inherent performance advantages over compiled OpenCL.


With the possibility of new GPU tasks here at MilkyWay, maybe that would be the way to go if he decides to help.

Frank
Joined: 2 Nov 10
Posts: 25
Credit: 1,894,269,109
RAC: 0
Message 73082 - Posted: 23 Apr 2022, 14:34:51 UTC

I am sure you all remember the w/u=7 tasks that caused invalidations. Well, this morning I encountered the son of w/u=7. Its signature is w/u=5 and it causes validation errors. What happened?

San-Fernando-Valley
Joined: 13 Apr 17
Posts: 256
Credit: 604,411,638
RAC: 0
Message 73083 - Posted: 23 Apr 2022, 14:53:49 UTC - in response to Message 73082.  

I am sure you all remember the w/u=7 tasks that caused invalidations. Well, this morning I encountered the son of w/u=7. Its signature is w/u=5 and it causes validation errors. What happened?

No, I don't remember ...

HRFMguy
Joined: 12 Nov 21
Posts: 236
Credit: 575,038,236
RAC: 0
Message 73084 - Posted: 23 Apr 2022, 15:27:20 UTC - in response to Message 73082.  

I am sure you all remember the w/u=7 tasks that caused invalidations. Well, this morning I encountered the son of w/u=7. Its signature is w/u=5 and it causes validation errors. What happened?

Yes, I vaguely remember some vinegar associated with that. Can you please refresh us on how to find these and exterminate them? Actually, do we need to exterminate them? Or will the 'system' do that for us?

alanb1951
Joined: 16 Mar 10
Posts: 213
Credit: 108,975,556
RAC: 29,973
Message 73086 - Posted: 23 Apr 2022, 18:18:10 UTC - in response to Message 73084.  

I am sure you all remember the w/u=7 tasks that caused invalidations. Well, this morning I encountered the son of w/u=7. Its signature is w/u=5 and it causes validation errors. What happened?

Yes, I vaguely remember some vinegar associated with that. Can you please refresh us on how to find these and exterminate them? Actually, do we need to exterminate them? Or will the 'system' do that for us?

I'm only a user, not a BOINC SysAdmin, but I'll have a go at this...

To address the "extermination" question first - I believe there is some code in the existing validator that spots results that don't have the expected number of parameters and invalidates them (it is potentially bad science, after all!). I would hope that it also flags the entire BOINC workunit as bad so that no further retries are sent, but without looking at the code I can't be sure!

Regarding Frank's post: there's more than one reason a result can be declared invalid, so without being able to see the output from the Invalid task(s) it isn't possible to say whether there is a recurrence of that "too many WUs in a shipped task" problem. Current tasks have number_WUs = 5 and number_params_per_WU = 20, so that 5 isn't an issue!
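
As a rough illustration of the kind of shape check described above, here is a minimal Python sketch. It is not the MilkyWay@home validator code: the function names and the idea of a flat parameter list are assumptions made for the example, and only the values 5 and 20 come from the post.

```python
from typing import List, Sequence

# Values quoted in the post; everything else in this sketch is hypothetical.
NUMBER_WUS = 5
NUMBER_PARAMS_PER_WU = 20


def has_expected_shape(params: Sequence[float],
                       number_wus: int = NUMBER_WUS,
                       params_per_wu: int = NUMBER_PARAMS_PER_WU) -> bool:
    """Return True if a bundled task carries exactly the expected parameter count."""
    return len(params) == number_wus * params_per_wu


def split_into_wus(params: Sequence[float],
                   number_wus: int = NUMBER_WUS,
                   params_per_wu: int = NUMBER_PARAMS_PER_WU) -> List[List[float]]:
    """Split a flat parameter list into one list per bundled job, or raise."""
    if not has_expected_shape(params, number_wus, params_per_wu):
        raise ValueError(
            f"expected {number_wus * params_per_wu} parameters, got {len(params)}"
        )
    return [
        list(params[i * params_per_wu:(i + 1) * params_per_wu])
        for i in range(number_wus)
    ]
```

Under these assumptions, a task bundling 5 jobs of 20 parameters passes, while one carrying 7 jobs' worth of parameters (140 values) is rejected before any likelihoods are compared.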

In order to satisfy my curiosity on this, I went via Frank's profile to find his computers to see if I could spot the offending task(s). I found a relevant(*) Invalid task on two of the systems. In each case it was declared a Validation error because it had failed to calculate the likelihoods for the first of the five jobs in the task. Judging by the content of the result report, it appeared to try to start doing the first job twice and got confused about the state of the checkpoint file -- perhaps something interrupted BOINC whilst a checkpoint was being taken? The validator would spit this out at once because it couldn't find all the likelihood data!

I think it would be interesting to know whether the retry/retries also fail to validate - if they fail, then there's possibly a problem in the parameters, but if they don't fail the issue was specific to the computer(s) in question, not a work unit error.

Hope this helps clear things up a bit...

Cheers - Al.

* There were also some "completed, couldn't validate" tasks, and some orphaned invalid tasks from the big renumbering crash of late January 2021.

©2024 Astroinformatics Group