| log in |
Message boards : Number crunching : search progress visualization
| Author | Message |
|---|---|
|
So I've been thinking about make things a little more interesting on your end. Dave and myself are going to be trying to get some small visualizations available (that should be updated quasi-real time) that should show how our different searches are progressing. | |
| ID: 11115 | Rating: 0 | rate:
| |
|
For a little more information about what you're looking at: | |
| ID: 11117 | Rating: 0 | rate:
| |
|
Interesting stuff. Travis wrote: p[i][j] = w * v[i][j] + c1 * r1 * (g[j] - p[i][j]) + c2 * r2 * (l[i][j] - p[i][j]) So tell me, your equation looks similar to the SOR iterative method (but not). Is that what W is? (The relaxation constant). I have studied (introductory) iterative methods, so please be gentle >< | |
| ID: 11120 | Rating: 0 | rate:
| |
Interesting stuff. That's interesting. I've actually never heard of the SOR method, so I just looked it up ;) But just from looking, they do seem somewhat similar. What w is doing here is putting some kind of limit on the particles velocity. If w is too high, the particle's velocity will increase over time, while if it's too low the particle's velocity will slow down over time. Since the search space that we're looking at here isn't particularly pretty, having a high w can be nice in that it will help a particle escape a local minima by increasing it's velocity until it can jump out. However, if it's too high the particles wont ever converge to a decent value in any of the minima found (making it more or less a random search). So in that way it's rather similar to the temperature used by simulated annealing. You can pretty much see this happening in the graphs above. The searches with a low w (ps_s79_2, ps_s82_2 and ps_s86_2) are converging rather quickly, while the other searches are keeping a wider populuation, meaning they're searching a larger area and not converging as quickly. Basically the tradeoff is with a low w you can converge to a solution really quickly, however you run a much higher risk of the solution being a local minima instead of the globally best point. ____________ | |
| ID: 11125 | Rating: 0 | rate:
| |
|
Fun stuff! | |
| ID: 11128 | Rating: 0 | rate:
| |
Fun stuff! The particles are trying to find the optimum fit of a cylinder to the sagittarius stream in the milkyway galaxy in the corresponding stripe of the sky. The parameters to our search basically define things like the size and shape of the cylinder, it's density and other information about how the background stars (the stars not in the stream) are distributed. The stripes are thin enough that we can get away with using a cylinder :P Once we have a accurate fits of these cylinders across multiple stripes we can fit them together to get an image of the sagittarius stream. *edit* So basically, each particle is a set of parameters, and we use these different particles to generate new sets of parameters to try and find one that gives the best fit to the data. ____________ | |
| ID: 11129 | Rating: 0 | rate:
| |
|
I think that is the most thorough explanation of a project I've ever read. Thank you Travis, I knew there was a reason this is my favorite project. | |
| ID: 11171 | Rating: 0 | rate:
| |
|
I saw some interesting things in the last set of searches, so I started up a new batch. Each of these have a w = 0.8 so they're rather exploratory. | |
| ID: 11397 | Rating: 0 | rate:
| |
|
Bump, I would highly recommend you keep the little blurb about this thread on the main page. It is definitely more interesting than credit/code changes! | |
| ID: 11468 | Rating: 0 | rate:
| |
|
Hehe, seems I misunderstood the news about that a bit. | |
| ID: 11479 | Rating: 0 | rate:
| |
Hehe, seems I misunderstood the news about that a bit. Well, I don't know about a screen saver because what we're doing on the individual processors isn't particularly interesting... it's much more interesting when you put it all together as a whole. What we have in this post is pretty much preliminary work, so we'd really like to hear your ideas and suggestions as to what you'd like to know is going on server-side, and what we're doing as a whole. Right now we would like to get these graphs (and some other ones) automatically generated in their own webpage, with information like what users have given us the best fitness and things like that. Pretty much we could have whatever information you guys think would be interesting, and we'll also have things we're interested about and looking into on their for our own benefit. Basically, we'd like a webpage we could check to easily know the status of our different searches, and I thought it would be more interesting with some user input, so we could make that information more interesting to you guys, and also add things you'd be interested in as crunchers. ____________ | |
| ID: 11494 | Rating: 0 | rate:
| |
|
I'd like to know the definition of fitness. | |
| ID: 11557 | Rating: 0 | rate:
| |
Well, I don't know about a screen saver because what we're doing on the individual processors isn't particularly interesting... it's much more interesting when you put it all together as a whole. The screen saver does not have to be "perfectly" real if that is not possible. LHC for example has this "swarm" display (I think the UBW has an image of it) which is nonsesnse with the exception of the turn counter which is accurate. It is just a thought to consider, to make the application more "real" to those that like the graphics ... Priority wise I would put it below the change to the client to detect ATI cards and the OpenCL application (conserve resources, make the one additional application instead of four) ... | |
| ID: 11580 | Rating: 0 | rate:
| |
|
Screensavers won't be a part of a fully optimized app anyway. So you'll need to determine what percentage use the stock app and prioritize accordingly. | |
| ID: 11581 | Rating: 0 | rate:
| |
I'd like to know the definition of fitness. The astronomy application does a maximum likelihood fit of the stars in a stripe to a model of star stream(s) and the background stars. Also when you say the particles are 'swarming' - how do stars swarm? Or better - what are particles? My naive thoughts are - I look at the sky from time to time, and (from my perspective), those stars move reallllly slowly. So if you are looking at telescopic imagery, how are you getting motion? It's basically just terminology that the particular search uses. In essence each particle is a set of parameters that gets updated over time. The particles continue to move in their previous direction in the search space, but are also pulled to their own previous best found position, and the best found position by the entire swarm. Basically, we have some function: fitness = f(parameters) and we want to find parameters that give us the best fitness. What each of our applications is doing is calculating that function f with a different set of parameters. The higher the fitness is, the higher the likelihood of those set of parameters being the most accurate fit of the model to the data. Edit- I am being naive on purpose, as people that crunch aren't necessarily astrophysicists, so simple things are undefined to them, the galaxy just boils down to stars and space, and they may have little knowledge of anything more! That's good! Part of the point of this thread is to try and accurately describe what we're doing to all our crunchers, so if you guys have any questions please feel free to fire away :) ____________ | |
| ID: 11587 | Rating: 0 | rate:
| |
Well, I don't know about a screen saver because what we're doing on the individual processors isn't particularly interesting... it's much more interesting when you put it all together as a whole. Whenever we're looking for a new undergrad to work with us, usually one of the first things we throw out there is "you could make a screen saver for our application." And usually they want to do something else :P We have kind of a tight ship right here, so the real issue is finding someone with the knowledge and desire to make one. Personally, I have no experience doing any kind of graphical design, so I think my skills are best used elsewhere... reducing credit and things of that nature :D ____________ | |
| ID: 11588 | Rating: 0 | rate:
| |
Screensavers won't be a part of a fully optimized app anyway. So you'll need to determine what percentage use the stock app and prioritize accordingly. I want so say maybe 90% or more of the WUs returned aren't from a stock app :P ... it might actually be interesting to count what WUs are generated from what applications and make that information available for you guys. Would let whoever compiled whichever application strut their stuff a bit. :) ____________ | |
| ID: 11590 | Rating: 0 | rate:
| |
|
Isnt that interesting Hmmm!! So so the results from these are good one would presume?? | |
| ID: 11592 | Rating: 0 | rate:
| |
Isnt that interesting Hmmm!! So so the results from these are good one would presume?? So far so good, every now and then we're having a little trouble with the outlier detection (which is why you see the best fitness go up, then back down), but the results we're getting using particle swarm for stripes 20, 21, 79, 82 and 86 are comparable (or better!) than what we've gotten doing more traditional searches (conjugate gradient descent) on the bluegene at RPI. We don't have any information about the values that stripes 22 and 23 should come out to... these are entirely new and we don't have any results for them. So you guys are really doing new astronomy with those two :) ____________ | |
| ID: 11595 | Rating: 0 | rate:
| |
|
Im about 50% through s23 now have to see how it reports.. | |
| ID: 11596 | Rating: 0 | rate:
| |
|
As of this morning (feb 19) we started using Mersenne twister instead of drand48() to generate the random numbers for our particle swarm optimization. I'm hoping that this will improve the results a little bit, by adding some more variety to the random numbers we're generating. | |
| ID: 11605 | Rating: 0 | rate:
| |
|
Good call. | |
| ID: 11613 | Rating: 0 | rate:
| |
|
I also added median values to the output and graphs. It'll probably take them awhile to catch up with the median values however. | |
| ID: 11633 | Rating: 0 | rate:
| |
What we have in this post is pretty much preliminary work, so we'd really like to hear your ideas and suggestions as to what you'd like to know is going on server-side, and what we're doing as a whole. I would suggest, no urge that the project utilize this new webpage as a means to provide a vastly greater educational opportunity for the volunteer participants. Thus, I believe it should provide information on both the fundamental physics and the computational aspects of the project. By computational aspects, I most certainly do not mean any discussion of credits or who has the bigger, faster hardware--such discussions are all too prevalent as it is on the forum. Some suggested reading materials on both subject matters would be excellent additions. Allow your volunteers to learn if they so choose. The lack of such learning resources is, in my opinion, a great weakness in the BOINC concept. Best regards, EigenState | |
| ID: 11671 | Rating: 0 | rate:
| |
|
The current situation in ps_s21_9 looks interesting. Is this a case of the particles moving from one local maximum to another, or are we seeing some outliers? | |
| ID: 12245 | Rating: 0 | rate:
| |
The current situation in ps_s21_9 looks interesting. Is this a case of the particles moving from one local maximum to another, or are we seeing some outliers? Not quite sure :) That's why I'm letting it keep running. ____________ | |
| ID: 12345 | Rating: 0 | rate:
| |
|
Hehe, that's cool. What about ps_s23_9? You can still just about see when the 'best' values shot up. I'm still not sure what you're aiming for - do you want the worst values to converge to the best ones, while finding the highest average overall? (if so, why aren't you showing the worst values in your graph?) | |
| ID: 12368 | Rating: 0 | rate:
| |
|
By the way, if the goal is for the particles to converge (i.e. all of them moving toward the global maximum?), could you add the mean absolute deviation or the interquartile range? (I guess if you add them at the bottom of the graphs you'd need a second range for the vertical axis, which could go on the right) | |
| ID: 12468 | Rating: 0 | rate:
| |
|
I would suggest that the bottom line of numbers be made X10000, to eliminate the mess of 0's. | |
| ID: 12478 | Rating: 0 | rate:
| |
Hehe, that's cool. What about ps_s23_9? You can still just about see when the 'best' values shot up. I'm still not sure what you're aiming for - do you want the worst values to converge to the best ones, while finding the highest average overall? (if so, why aren't you showing the worst values in your graph?) I just was playing around and took out the worst values because you could see a bit better how the best was progressing that way. It would be nice for the worst values to converge to the best, because that's an indication that the search is over, but it's not really necessary. We're just looking to find the highest fitness By the way, if the goal is for the particles to converge (i.e. all of them moving toward the global maximum?), could you add the mean absolute deviation or the interquartile range? (I guess if you add them at the bottom of the graphs you'd need a second range for the vertical axis, which could go on the right) Yeah I have our searches generate standard deviation and a few other values to the logs but haven't made a graph for that yet. It should be pretty easy to add one :) ____________ | |
| ID: 12513 | Rating: 0 | rate:
| |
|
With regard to a screensaver, I have to admit that at times it is nice to be able to look at the data, but, with the runtimes so short now it does not seem to make much sense. My "slowest" system when I was running it last week I think was down to 10 minutes per task ... that is not much time to do much of anything of interest... | |
| ID: 12776 | Rating: 0 | rate:
| |
|
Looks like the graphs haven't been updated in a few days - I thought they were scripted to run automatically.. Perhaps you could add a timestamp to the bottom of the post so we know when it was last updated. | |
| ID: 13051 | Rating: 0 | rate:
| |
Looks like the graphs haven't been updated in a few days - I thought they were scripted to run automatically.. Perhaps you could add a timestamp to the bottom of the post so we know when it was last updated. Dave was supposed to have gotten them run automatically :P I guess he didn't... I'm going to bug him some more. ____________ | |
| ID: 13055 | Rating: 0 | rate:
| |
Dave was supposed to have gotten them run automatically :P I guess he didn't... I'm going to bug him some more. Keep that cattle prod handy! ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 13091 | Rating: 0 | rate:
| |
Dave was supposed to have gotten them run automatically :P I guess he didn't... I'm going to bug him some more. I'll click the Red X on him. | |
| ID: 13097 | Rating: 0 | rate:
| |
Dave was supposed to have gotten them run automatically :P I guess he didn't... I'm going to bug him some more. Dave will blame it on a freshman. ____________ | |
| ID: 13125 | Rating: 0 | rate:
| |
Dave was supposed to have gotten them run automatically :P I guess he didn't... I'm going to bug him some more. I think you have something there. A freshman prodding Dave with a cattle prod. Now that's an idea for a screensaver :P ____________ | |
| ID: 13126 | Rating: 0 | rate:
| |
Dave was supposed to have gotten them run automatically :P I guess he didn't... I'm going to bug him some more. I might have to enroll. ____________ | |
| ID: 13128 | Rating: 0 | rate:
| |
Dave was supposed to have gotten them run automatically :P I guess he didn't... I'm going to bug him some more. The next hit internet game. Prod Dave & the sequel Prod Dave Again!! ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 13136 | Rating: 0 | rate:
| |
|
Updated these to the newer searches which are using some limited redundancy to check new local/global best particles ;) Hopefully these should improve better than the last set. | |
| ID: 13160 | Rating: 0 | rate:
| |
Updated these to the newer searches which are using some limited redundancy to check new local/global best particles ;) Hopefully these should improve better than the last set. How was the prodding? ____________ | |
| ID: 13190 | Rating: 0 | rate:
| |
|
The median seems to generally be where the average is. I know they aren't the same but I think it seems kind of redundant to show both on the graphs, unless the graphs could be stretched to show the difference better. Is it possible to change the left column to show the difference between each fittness? Otherwise they all show the same number, which may be confusing. | |
| ID: 15018 | Rating: 0 | rate:
| |
|
The median is useful because it isn't effected as much by outliers, so it's probably not as useful now that the amount of outliers has been reduced. | |
| ID: 15029 | Rating: 0 | rate:
| |
The median is useful because it isn't effected as much by outliers, so it's probably not as useful now that the amount of outliers has been reduced. Purple is worst. I know the different marks are useful. It just isn't as easy to view with them both being about the same spots. ____________ Doesn't expecting the unexpected make the unexpected the expected? If it makes sense, DON'T do it. | |
| ID: 15033 | Rating: 0 | rate:
| |
The median is useful because it isn't effected as much by outliers, so it's probably not as useful now that the amount of outliers has been reduced. What rhymes with purple? And what rhymes with orange? And what TV series had an episode that refered to "purple and orange squadrons"? ____________ | |
| ID: 15052 | Rating: 0 | rate:
| |
The median is useful because it isn't effected as much by outliers, so it's probably not as useful now that the amount of outliers has been reduced. Roger Miller seemed to think it was Maple Surple | |
| ID: 15142 | Rating: 0 | rate:
| |
|
It looks like all the current searches have stopped improving - how do you determine when to stop a search? Do you have objective criteria for when a search is over? | |
| ID: 15359 | Rating: 0 | rate:
| |
|
Every time I turn in a work unit or check the latest visualization reports, I feel the urge to rush to my Newtonian telescope to check if the stars have changed much. ;) | |
| ID: 15924 | Rating: 0 | rate:
| |
It looks like all the current searches have stopped improving - how do you determine when to stop a search? Do you have objective criteria for when a search is over? Still working on Dave to automate the script that generates the graphs. The one's we're doing right now are pretty close to done. For some streams (79, 82, 86) they're the highest fitness we've ever seen. The other stripes are in the same boat, however the parameters are a little weird so we're looking into the cause of that. ____________ | |
| ID: 16849 | Rating: 0 | rate:
| |
The one's we're doing right now are pretty close to done. For some streams (79, 82, 86) they're the highest fitness we've ever seen. The other stripes are in the same boat, however the parameters are a little weird so we're looking into the cause of that. That's pretty cool - how much higher (closer?) are they? Sounds like we're really breaking some new ground here :) | |
| ID: 16863 | Rating: 0 | rate:
| |
|
Sorry, I know this is off-topic, but I can't resist! What rhymes with purple? curple: the rump, derriere, buttocks, fundament hirple: to walk with a limp And what rhymes with orange? sporange: A single-celled or many-celled structure in which spores are produced, as in fungi, algae, mosses, and ferns. Also called spore case. And what TV series had an episode that refered to "purple and orange squadrons"? I was going to guess Captain Scarlet with it's 'multi-coloured' characters, but since you mention 'an episode' I pass on this one. | |
| ID: 17155 | Rating: 0 | rate:
| |
And what TV series had an episode that refered to "purple and orange squadrons"? While google is your friend, Battlestar Galactica is the saviour of mankind :) ____________ | |
| ID: 17185 | Rating: 0 | rate:
| |
And what TV series had an episode that refered to "purple and orange squadrons"? And thank Gods for bittorent...the saviour of bad network programming decisions! | |
| ID: 17459 | Rating: 0 | rate:
| |
And what TV series had an episode that refered to "purple and orange squadrons"? Oh those poor trackers for getting hammered like that. ____________ | |
| ID: 17502 | Rating: 0 | rate:
| |
And what TV series had an episode that refered to "purple and orange squadrons"? Better than hammering the Milkyway schedular! | |
| ID: 17513 | Rating: 0 | rate:
| |
And what TV series had an episode that refered to "purple and orange squadrons"? Tell that to your demonoid script. ____________ | |
| ID: 17572 | Rating: 0 | rate:
| |
And what TV series had an episode that refered to "purple and orange squadrons"? I know, I have a script that does "update tracker" every 5 seconds... | |
| ID: 17660 | Rating: 0 | rate:
| |
And what TV series had an episode that refered to "purple and orange squadrons"? Maybe that's why I have all those repetitive PG2 log entries. ____________ | |
| ID: 17700 | Rating: 0 | rate:
| |
|
Just an update here, I found a little php script to display the plots of how our searches are performing, so to take a look and see how things are running you can go to: | |
| ID: 24685 | Rating: 0 | rate:
| |
Just an update here, I found a little php script to display the plots of how our searches are performing, so to take a look and see how things are running you can go to: BBCODE won't allow you to do that if I'm understanding you correctly. You'd have to show each individual image. To keep it updated you'd have to manually replace the image, with the exact same name, on your end. Even then some browsers may not automatically refresh the image. ____________ | |
| ID: 24687 | Rating: 0 | rate:
| |
|
See my APOTD thread in Science for a nice example. Create a thread, in the first post make a link to the php url. Sticky the thread. Then all you need to do is update the php page. Since you'll most likely be adding new (and removing old?) images with different filenames they should be properly displayed in the browser. | |
| ID: 24688 | Rating: 0 | rate:
| |
Just an update here, I found a little php script to display the plots of how our searches are performing, so to take a look and see how things are running you can go to: Yeah I was trying to find some way to automatically put the images in a post, as opposed to just linking the php page (as in the first post in this thread). ____________ | |
| ID: 24701 | Rating: 0 | rate:
| |
Just an update here, I found a little php script to display the plots of how our searches are performing, so to take a look and see how things are running you can go to: Well you do have an unlimited post edit time. So every time there is a new image you could edit your post with the new filename. ____________ | |
| ID: 24706 | Rating: 0 | rate:
| |
Well you do have an unlimited post edit time. So every time there is a new image you could edit your post with the new filename. The lazy computer scientist in my hates that solution :D ____________ | |
| ID: 24713 | Rating: 0 | rate:
| |
Well you do have an unlimited post edit time. So every time there is a new image you could edit your post with the new filename. Plan B. Find an unused monitor. Display the image on that monitor. Point a webcam at the monitor. Post a link to the webcam. If an image gets burned in get another monitor and repeat the steps. ____________ | |
| ID: 24717 | Rating: 0 | rate:
| |
|
Is it just me, or are those fitness values a lot better than before? :O New coordinate system helping? | |
| ID: 24741 | Rating: 0 | rate:
| |
|
I'm unstickying this because it's now on the main webpage and being automatically updated :) | |
| ID: 24833 | Rating: 0 | rate:
| |
Is it just me, or are those fitness values a lot better than before? :O New coordinate system helping? Each stripe will have a different best fitness. I think the reason the new ones are a bit lower is because of the coordinate system, but who knows? :) ____________ | |
| ID: 24834 | Rating: 0 | rate:
| |
|
This probably the wrong place to ask this but it seems appropriate since it talks of the visualization. | |
| ID: 24835 | Rating: 0 | rate:
| |
Each stripe will have a different best fitness. I think the reason the new ones are a bit lower is because of the coordinate system, but who knows? :) Are you intending to rerun the search of the previous stripe for the sake of comparison? (and uh, potentially improved results) | |
| ID: 24840 | Rating: 0 | rate:
| |
This probably the wrong place to ask this but it seems appropriate since it talks of the visualization. I think the astronomers are working on getting some interesting figures for you guys showing what you've already crunched. ____________ | |
| ID: 24850 | Rating: 0 | rate:
| |
Each stripe will have a different best fitness. I think the reason the new ones are a bit lower is because of the coordinate system, but who knows? :) I dont think the new coordinate system applies to the older streams. ____________ | |
| ID: 24852 | Rating: 0 | rate:
| |
|
I think the graphs would fit better under the Boinc logo at the bottom. They widen the page a bit. | |
| ID: 24900 | Rating: 0 | rate:
| |
|
The fitness gets better, especially the de_s222_opt_1_v01 evolution. I wonder about the maps astronomers work on. Is there an estimated release date? | |
| ID: 36217 | Rating: 0 | rate:
| |
|
Hi my name is Rob and I'm new to the MilkyWay@Home project, but I thought I would give my 2 cents anyways. | |
| ID: 38746 | Rating: 0 | rate:
| |
Hi my name is Rob and I'm new to the MilkyWay@Home project, but I thought I would give my 2 cents anyways. Hi Rob, If you're accepted to RPI let us know :) What graduate department did you apply to? One of our undergrads is currently working on some data visualization, I hope it goes somewhere :) ____________ | |
| ID: 38748 | Rating: 0 | rate:
| |
Message boards :
Number crunching :
search progress visualization