Monday, June 27, 2011

6/12 - 6/26


 These last two weeks I've been working on the applet to analyze fractal dimension on networks.

What I've done so far:
  • Figured out the mathematics of calculating fractal network dimension
  • Implemented algorithms for the calculation
  • Realized my understanding of the math was wrong and started over
  • Repeated the above actions several times
  • Made some simple graphics to display various information about the network and the algorithm results
  • Tested some graphs to see what would come out
 The fractal dimension of a network has to do with how fast the number of nodes within a distance from an arbitrary node grows with the distance.  So for a network represented by a straight line of vertex-edge-vertex-etc., I would expect to get a dimension of 1.  The program gives ~.93, so I'm guessing there's some experimental error happening.
I also tested a graph that looks like the Sierpinski triangle, with vertices for line intersections and edges for line segments.  When the graph is constructed to match the triangle at 7 levels of recursion, I get  ~1.51 for the dimension.  The dimension of the actual Sierpinski triangle is  ~1.5849  - I don't know if I should be getting that for the network or not.  But here's a cool thing:  this is the plot of estimated fractal dimension at different points during the algorithm's execution


It isn't all on a straight line, but the important thing is that it isn't trending anywhere.  I'm not sure if this is sufficient to show a network is fractal, but it is necessary.  Compare to the plot for a random (therefore non-fractal) graph with the same number of vertices and edges
Definitely going somewhere.

I also put together a small synonym graph and analyzed it - the plot isn't as steady as for the triangle but it's much better than for a random graph of the same size.

Next steps:
  • Test networks with known dimension
  • Make it read and write graphs in XGMML
  • Improve graphics - readable coordinates, optional logarithmic scaling
  • Make gui more interactive (right now graph loading is still hard-coded)

Monday, June 13, 2011

Work 6/5 - 6/12

This week I continued working on puzzling out the dimension-calculation.  I realized that whenever there is a permanent, inactive line in the pattern, the dimension has to be at least 1 because that line is 1-dimensional.  But the question is, when these inactive lines get replicated ad infinitum,  when does the dimension of the set of them grow to be >1?  The approach we're looking at right now is to assume it is no greater than 1 when the dimension of the active part (given by the solution to the Moran equation) is <1, and no greater than the dimension of the active part otherwise; this lets me put in a simple guard making sure that whenever there are inactive segments the dimension is at least one.  It's a little counter-intuitive because of situations like this:  
One would think the dimension of this shape would be something >1, but since the Moran equation yields 0.97 which is <1, my algorithm in its current form settles for just plain 1.
The FractaSketch program that was the original inspiration for this applet does this as well.
___

I fixed a bug with the grid resizing feature; the version on the site would make the current shape invisible after resizing, and the code I was working with disabled grid-resizing altogether after starting to build a shape.  Now users can change the grid size at any time without losing their work.
___

I also made a change in how the different orientations get treated through replication steps.  It's hard to explain concisely but for an example:  before, a yellow (y-axis reflection) segment reproduced by any other segment would be a yellow segment still, but now it gets composed with the segment reproducing it so when a purple (x-axis reflection) reproduces a yellow, it turns it into a blue (pi-rotation), and when a blue reproduces a yellow it turns it into a purple and so on.  For illustration, take this seed and its second iteration (which looks the same in both versions):

Before, the third iteration looked like this:

After, the third iteration looks like this:

and after a while it looks like this:
which is irrelevant but pretty neat.  I don't think either is more theoretically sound than the other, but the new way is how I would expect the program to behave (it's also how the FractaSketch program behaves).

Also this week I've started a new project - an applet to measure fractal dimension on networks.

Sunday, June 5, 2011

Work progress 5/29 - 6/5


First up this week, I think I figured out what was causing the incompatibilty problem where seed shape files saved by one version were unloadable by the other.  It's an issue with serialization - some changes must have been made to the SeedShape and Segment classes between the two versions, so the saved (serialized) instances are not recognized as the same class.  Since I'll be introducing a version 3, I might make a translating program to read all the old saved shapes, create instances of the newest version, and re-serialize them.
___

The next thing I did was notice and fix a problem with two of the types of replacement lines.  Both yellow and blue lines replicate the shape reflected across an axis perpendicular to the baseline.  In order to do that, the code reversed the order of the points in the shape from left to right; however, the segments were still iterated through in the old order, so the segment information was not lining up with the correct endpoint information.
___

The most interesting task this week was to recreate the fractal dimension calculating part of the program (since we didn't have the source code for that part).  Dr. Krishnamoorthy gave me the C code originally used by my predecessor as a basis - it approximates solutions to polynomial equations, such as Moran equations (equations relating the dimension of a self-similar fractal with the various ratios of scaling used in replication).  So all I had to do was translate this to java and add a bit to construct the Moran equation of a given fractal, and voila, I can compute the dimension of Sierpinski's triangle to be 1.58496 +- .00001.  That's even better than wikipedia.  I still need to do some work making it display prettily and figuring out when to calculate it for maximum efficiency.

Also, there are some problems with this approach (though I think it's good enough for its purpose).
First, the algorithm doesn't take into account how much the lines of the shape overlap each other.  For instance, if the shape is made up of several replicating lines positioned so that it looks like one straight line (and continues to look so in all iterations), one would think the dimension should be 1.  But the algorithm treats it the same as if all the lines were skew and non-intersecting, in which case the dimension is usually > 1.  The algorithm gives this dimension to both cases.  However, since nobody's really interested in making replicating shapes that look like line segments for eternity, I don't think this is a real problem.  (If even part of a seed shape overlaps itself, or if subsequent iterations layer lines over each other exactly, the same problem occurs to a less noticable extent, but again, I don't think it happens often enough to be a problem and it is nearly always avoidable.)

More worrisome is what happens when there is only one replicating segment:  if there is no other information in the shape and the segment's scaling ratio is <1, the algorithm gives a dimension of 0 which as far as I've been able to read seems to be ok; however, if there is other information (inactive, non-replicating segments that will get replicated by the one active segment), the dimension certainly shouldn't drop to 0, but since the algorithm ignores inactive information it still does.  An example of this is a logarithmic spiral:


The applet version currently on the site gives it a dimension of 2, my code gives it a dimension of 0, and I've read it should really have a dimension of 1, though I don't know the reasoning behind that.  A more intuitive example is this one:

It's pretty obvious this should be 1-dimensional, but again the old version gives 2 and my version gives 0.
Also, the Moran equation is only valid when all of the scaling ratios are < 1; if a fractal spreads without limit this method is not applicable.
___

Another priority for this week was to fix an inconvenience with entering a value for line thickness:  one used to hit "enter" after typing in the value to make it update, which was mightily convenient but entirely unintuitive.  We settled on making it auto-update with every keystroke in the textfield as the best solution, and I think it works beautifully.
___

One more thing I did was fix an annoyance where the edit-panel would lose the keyboard focus after certain buttons (like "Clear") were clicked and therefore wouldn't be able to receive keyboard input again till a double-click point-creating action was made; this made deleting points at certain times impossible.  Now the panel requests the focus again every time the mouse moves over it.
___