 # An effective crossing minimisation heuristic based on star insertion

We present a new heuristic method for minimising crossings in a graph. The method is based upon repeatedly solving the so-called star insertion problem and has several desirable characteristics for practical use. We introduce the method, discuss some aspects of algorithm design for our implementation, and provide some experimental results. The results indicate that our method compares well to existing methods, particularly for dense graphs, and is able to obtain some of the best known heuristic solutions to well-known instances.

## Authors

##### This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

## 1 Introduction

The crossing number of a graph , denoted by , is defined as the minimum number of pairwise edge intersections of any drawing of in the plane. Crossing numbers have been the source of many challenging problems since their introduction by Turán in the 1950s . Determining the crossing number of a graph is NP-hard  and remains NP-hard even for restricted versions of the problem, such as finding for a planar graph with the addition of a single edge 

. In recent years, an integer linear programming approach for finding

has been developed in ,  and . This is successful for sparse graphs of moderate size, however ILPs and other exact methods are very limited when it comes to dense graphs. For example, even remains unknown . Therefore, heuristic methods are currently of interest and are the subject of our present work. Interested readers are referred to  for a description of the current methods for computing crossing numbers, or to  for a comprehensive bibliography of results up until 2014 related to crossing numbers.

This manuscript is organised as follows. For the remainder of this section, we give some necessary definitions. In Section 2 we discuss some related approaches to crossing minimisation, most notably including the planarisation method and its variants, which are regarded as the current best practical methods for minimising crossings . We propose our new heuristic method in Section 3 and discuss some implementation and design aspects in Section 4. In Section 5 the runtime is shown to be time per iteration, where is the number of crossings in a current drawing of . Lastly, in Section 6, we benchmark the new heuristic on several collections of graphs which have been used in the past for testing other crossing minimisation algorithms. A fully featured implementation of our heuristic, which we call QuickCross, is available at http://fhcp.edu.au/quickcross in both C and MATLAB format.

### 1.1 Definitions

For an undirected graph with vertex set and edge set , let and . Let be the degree of vertex and denote the maximum degree of any vertex in . For a vertex , let denote the neighbourhood of ; that is, the set of vertices such that there exist edges in connecting to those vertices.

An embedding of a graph onto a surface (a compact, connected 2-manifold) is a representation of onto such that vertices are distinct points on and each edge is a simple arc on connecting the points associated with the end vertices of . The embedding must also satisfy: An arc of edge does not include any points associated with vertices other than the end vertices of , and two arcs never intersect at a point which is interior to either of the arcs. Two embeddings are equivalent if there is a homeomorphism of which transforms one into the other. The equivalence class of all such embeddings is a topological embedding of . In this present work, we are only concerned with embeddings in which is the surface of a sphere and this is assumed to be the case for the rest of this paper. A topological embedding of onto the sphere uniquely defines a cyclic ordering of the edges incident to each vertex of and the collection of these cyclic orderings is a combinatorial embedding for . A combinatorial embedding defines a set of cycles in which bound the faces of any embedding belonging to the associated topological embedding, and so we may talk about the set of ‘faces’ of . Similarly, defines a dual graph which is isomorphic to the dual graph of any embedding belonging to the associated topological embedding. Note that the edges of are in one-to-one correspondence with edges of the dual graph .

A drawing is a representation of a graph onto the plane with similar conditions to an embedding. Vertices are represented as distinct points and each edge is represented by a simple arc between the points associated with the end vertices of . The drawing must also satisfy: An arc of edge does not include any points associated with vertices other than the end vertices of , and any intersection between the interiors of arcs involves at most two arcs. Given a drawing of , the intersections which occur in the interiors of arcs are the crossings of the drawing and the number of crossings is denoted by . The crossing number of a graph is denoted by and is the minimum number of crossings over all possible drawings of . If , then is planar, and we say that is a planar drawing of . A planarisation of a drawing is a planar drawing of the planar graph obtained by replacing crossings of the initial drawing with dummy vertices of degree 4. Hence the graph corresponding to the planarisation of has vertices and edges.

In what follows, when no confusion is possible, we shall refer to the arcs of a drawing or embedding as ‘edges’ of the drawing (or embedding) and the points associated with vertices as ‘vertices’ of the drawing (or embedding).

Throughout this paper, we will often consider the situation where we have a combinatorial embedding of a graph, and then need to add an edge to the graph and obtain an updated combinatorial embedding. In such cases, we will say that we are inserting an edge into , as follows. Suppose that , where , and let be an embedding which realises the cyclic orderings in . A simple arc connecting and may be added to , such that the interior of the arc intersects only with the interiors of some (possibly empty) ordered set of edges already present in . Clearly, for any embedding which realises the cyclic orderings in , such an arc can be found which intersects exactly the same set of edges . We also refer to these intersections as ‘crossings’.

In the following discussion, we will be defining the star insertion problem (SIP). Prior to that, we first consider the edge insertion problem (EIP) studied in , which has two variations depending on the definition of optimality used; the fixed embedding variation and the variable embedding variation:

###### Definition 1

(EIP – fixed embedding) Given a combinatorial embedding of a graph and a pair of vertices , insert the edge into in such a way that the number of crossings is minimised.

###### Definition 2

(EIP – variable embedding) Given a planar graph and a pair of vertices , find a combinatorial embedding of such that inserting the edge into so as to minimise the crossings results in the minimal number of crossings among all embeddings of .

The fixed embedding problem can be solved in time by finding a shortest path in a modified dual graph, and this is explained in detail in . In  it is shown that the variable embedding problem can also be solved in time by taking advantage of the properties of maximal tri-connected components and SPQR trees. Note that solving the edge insertion problem is different from computing the crossing number of (the graph with the addition of edge ). However, it is shown in  that the number of crossings introduced in a solution to the variable embedding version approximates to within some factor and the best possible factor is proved in  to be .

A natural extension to the above is the star insertion problem (SIP) where instead of a single edge, the object to be added to is a vertex along with a set of incident edges of (collectively, a star). As before, there are fixed embedding and variable embedding versions of SIP:

###### Definition 3

(SIP – fixed embedding) Given a combinatorial embedding of a graph and a vertex (along with a set of incident edges whose other endpoints are all in ), insert along with its incident edges into in such a way that the number of crossings is minimised.

###### Definition 4

(SIP – variable embedding) Given a planar graph and a vertex (along with a set of incident edges whose other endpoints are all in ), find a combinatorial embedding of such that inserting along with its incident edges into so as to minimise the crossings results in the minimal number of crossings among all embeddings of .

The fixed embedding version can be solved in time using a method similar to the single edge insertion version . We will make use of this approach during our heuristic, and we briefly outline our implementation of this method in Section 3. The complexity of the variable embedding version was in question for a short time but was resolved by Chimani et al  who showed it to be solvable in polynomial time by a method which is briefly outlined in Section 2. Again, the number of crossings introduced in a solution to the variable embedding version is shown in Chimani, Hliněný and Mutzel  to approximate the crossing number of the graph to within a factor of .

## 2 Related work

Crossing minimisation has been considered in a number of contexts. For example, in the field of automated graph drawing, heuristics have been developed to construct drawings of graphs or networks with desirable characteristics, which often includes a low number of crossings. Approaches including force-directed drawing algorithms [18, 26, 28]

[3, 19, 1] have been developed for this purpose. When crossing minimisation is the sole aim, arguably the most successful heuristics to date have been based on edge insertion procedures.

### 2.1 Planarisation method

The planarisation method, a highly effective crossing minimisation heuristic, is based upon repeatedly solving the edge insertion problem. In particular, the planarisation method involves attempting to solve two separate problems:

1. Compute a planar subgraph of - ideally a maximum planar subgraph.

2. Iteratively re-insert the remaining edges of into a combinatorial embedding of while striving to keep number of crossings as small as possible.

Computing a maximum planar subgraph is NP-hard , so instead a locally maximal planar subgraph is usually used for step 1, which can be computed in time . To achieve step 2, given a planar subgraph of , EIP (in the fixed or variable embedding) is solved for one of the missing edges. Then any introduced crossings are replaced by degree 4 dummy vertices to obtain a new planar graph, and EIP is solved again for another missing edge, and so on until an embedding of the full graph is obtained.

The planarisation method was first described in the context of EIP-fixed by Batini et al . Later, in Gutwenger , the method was rigorously developed for EIP-variable, along with an implementation and experimental results which were also reported in Gutwenger and Mutzel . In most cases, the method based on EIP-variable provided superior solutions for the tested graphs. However, it was observed that the EIP-variable method often suffered in runtime in comparison to EIP-fixed implementations, due to the many SPQR trees which need computed (a new SPQR tree for every edge inserted). Later, in Chimani and Gutwenger , implementations were also reported on which focused on improving the post processing schemes that can be utilised when running these methods and again improved results were obtained from those previously reported.

A related approach to the planarisation method is to solve the multiple edge insertion problem (MEI), which involves inserting several edges simultaneously into a planar graph. Let be the set of edges being inserted into some planar graph . For general , solving MEI to optimality is NP-Hard , and approximation algorithms have been developed in  and . An approximate solution to MEI is known to approximate the crossing number of the graph  and so for graphs of bounded degree and bounded , the algorithm in  constitutes a multiplicative factor approximation algorithm for and the algorithm in  constitutes an additive factor approximation algorithm for . Among implementations based on MEI, only the algorithm of Chimani  has been experimentally reported on. In particular, it was considered in Chimani and Gutwenger , which is the most recent analysis on the practical usage of various crossing minimisation heuristics. Chimani and Gutwenger  claim that the MEI implementation from  achieves roughly comparable solution quality to the best iterative EIP-variable method, with the benefit of significantly reduced runtimes. If runtimes are disregarded, the iterative EIP-variable method with the addition of a significant post processing step usually produced the best solutions, however overall (in terms of both solution quality and runtime) Chimani and Gutwenger  advocate that the MEI implementation from  was the best heuristic for practical use.

Recently, in , another variation of EIP was investigated. Given a combinatorial embedding of and , the task is to find a straight line drawing of which realises the cyclic orderings of , and such that a straight line between can be added with minimal number of crossings among all straight line drawings of which realise the cyclic orderings of . This problem is known as geometric edge insertion and for the case , can be solved in linear time by the algorithm in .

### 2.2 Methods based on star-insertion

The approach to solving SIP-variable, described in Chimani et al , can be summarised as follows for a given graph and vertex to be inserted.

1. Compute an SPQR tree of , and consider a face in one of the skeleton graphs of ( belongs to a set of ‘interesting’ faces).

2. Solve a dynamic program whose solution advises the best combinatorial embedding which admits the minimal number of crossings when inserting into .

3. Repeat the above for all ‘interesting’ faces and select the solution which results in the fewest crossings.

Although the runtime of the algorithm provided in  is polynomial, it is considerably higher than solving EIP-variable, and experimental results have yet to be reported on. Nonetheless, a heuristic analogous to the planarisation method, but using star insertion rather than edge insertion, could be proposed. Indeed, in Chimani and Gutwenger , it is asked whether a heuristic based on star insertion could compare to the proven practical performance of the heuristic methods based on edge insertion. This present work seeks to answer this question, at least for SIP-fixed, but the approach we advocate is different in character to the planarisation method.

In particular, the approach that we advocate is to iteratively obtain improved drawings of a graph in the following way. For a given drawing of a graph , we attempt to find a vertex in satisfying the following: if we remove , and then reintroduce by solving the star insertion problem in a corresponding (fixed) combinatorial embedding, a drawing can be obtained such that . If there are no vertices in the graph for which this is possible, we say that the drawing is locally crossing-optimal. In what follows, we will prove the following.

###### Theorem 1

Let be a graph containing vertices and edges, and let be a drawing of which contains crossings. There exists an algorithm that finds a locally crossing-optimal drawing of in time.

It is our contention that the number of crossings in such a found by our algorithm is, typically, close to the crossing number of . We provide experimental results justifying this assertion in Section 6.

## 3 Proposed heuristic method

While the philosophy of the planarisation method is to start with a planar subgraph and increase the number of crossings at each iteration as the full graph is rebuilt, our approach works in the opposite direction; we start with a combinatorial embedding corresponding to a, presumably suboptimal, drawing of the full graph and at each iteration we attempt to find a combinatorial embedding corresponding to a drawing with fewer crossings. Unlike the planarisation method, the heuristic we propose does not require a planar subgraph to be computed. Instead it relies upon iteratively solving the star insertion problem in a combinatorial embedding which corresponds to the current (non-planar) drawing of . With the intention of keeping the new heuristic highly practical, each iteration is performed on a fixed combinatorial embedding; this is discussed further in Section 4.

Let be some drawing of and let be its planarisation. Then can be mapped to an embedding on the sphere, and this realises a particular combinatorial embedding. In this sense, we say that the combinatorial embedding ‘corresponds’ to the drawing . Note that given such a combinatorial embedding, a drawing which is equivalent to can be retrieved by using any planar graph drawing techniques, such as  or .

Let be a drawing of and let be a combinatorial embedding corresponding to . Consider deleting from a vertex and its set of incident edges; it is clear that a subdrawing can be easily obtained from . Then a combinatorial embedding corresponding to the subdrawing can be computed by repeatedly merging faces of which share an edge associated with one of the deleted edges. We shall call this the reduced combinatorial embedding corresponding to subdrawing and denote it as .

We define a star insertion into a combinatorial embedding by utilising definitons similar to those in . Let be a combinatorial embedding of , let be a face of and let be a vertex of . Then is an insertion path for and if either and is on the boundary of , or the following conditions are satisfied:

1. .

2. There is a face of with both and on its boundary.

3. is on the boundary of .

4. is a path in the dual graph .

Given an insertion path, an edge can be inserted into starting from an arbitrary point in face

(consider this a ‘dummy vertex’ for the moment) and ending at vertex

in such a way that it crosses precisely the edges .

Then, suppose we have a collection of insertion paths whose associated end vertices are . If they can all be inserted into in the above fashion, such that they are pairwise non-crossing, then we say that they collectively constitute a star insertion path. By inserting a dummy vertex into face and attaching the beginnings of each insertion path to , the star comprising of and the edges can be inserted into in such a way that they cross precisely the edges in . For a fixed face , and a fixed set of end vertices , we say that a star insertion path which crosses the fewest edges with respect to all possible star insertion paths into with the end vertices , is a crossing minimal star insertion path for and .

At each iteration we begin with a combinatorial embedding corresponding to a some drawing of . The processes within an iteration are summarised in the following procedure, for a given vertex :

### Procedure 1:

• Compute the reduced combinatorial embedding .

• Intelligently (see below) choose a face of . Compute the number of crossings resulting from a crossing minimal star insertion path into face for the star comprising of and its incident edges.

• If the total number of crossings has reduced, then insert the star comprised of and its incident edges into according to a crossing minimal star insertion path.

• Replace each introduced crossing with a dummy vertex of degree 4, and obtain a new combinatorial embedding. Call this new embedding and begin the next iteration.

Note that Step P2 is equivalent to solving the fixed embedding star insertion problem for the vertex (and its incident edges) in . To achieve this, we use the algorithm described in Chimani et al  on page 376. Since this is an important step in our heuristic, we include its description here. We begin by utilising a simple merging procedure in the dual graph of . For each vertex , we perform the following steps:

1. Contract the cycle in the dual graph that is formed by dual vertices of those faces that are incident to , into a single vertex (see Figure 1 for an example.) Remove any resulting multi-edges.

2. Find shortest paths in the dual graph with as the source.

3. Store the distances to each dual vertex, and for those dual vertices that were contracted in step 1, set their distance to zero.

4. Discard changes to the dual graph so that the above steps can be repeated with a different neighbor of .

After the above procedure, the dual vertex possessing the minimum sum of distances (over all ) corresponds to the optimal face for the new placement of , and the optimal insertion paths can be determined from the shortest path trees. Figure 1: The contraction operation in the dual graph for the vertex w∈NG(v). Dual edges are dashed and dual vertices are squares. The thick dashed edges which form a cycle around w are contracted and multi-edges are discarded. The resulting ‘merged’ dual vertex is dw. In this particular case, the resulting dual graph is just a K2.

## 4 Design methodology

In this section we outline some of the design choices and data structures of the highly practical implementation which is used for the experiments described in Section 6.

### 4.1 Initial embedding schemes

Since we focus on a fixed embedding at each iteration, the initial combinatorial embedding of obviously plays a large role in the performance of the heuristic. Any drawing method can be used to compute an initial embedding, and we discuss below just three possiblities. The first method produces an embedding quickly, however the initial number of crossings can be as large as . The second method is slower to compute but the initial number of crossings is usually much smaller for the case of sparse graphs. The third method is an implementation of a force-directed graph drawing algorithm. We will refer to these three initial embedding schemes as circle, planar and spring, respectively.

The “circle” initial embedding scheme, produces an embedding using the following procedure. We first assign each vertex a coordinate on the unit circle. Specifically, we place each vertex at coordinate . Then, the edges are drawn as straight lines, and the crossings can be easily computed. An upper bound on the number of crossings for a drawing obtained by this method can be seen by following a simple counting argument:

###### Lemma 1

The maximum number of crossings in a drawing obtained by the circle embedding scheme is .

The maximum number of crossings is attained by the complete graph . In , label the vertices from 1 to in a clockwise fashion, then any set of 4 vertices , where , corresponds to exactly one crossing involving the edges and . Thus the total number of crossings is .

The second initial embedding scheme, which we call “planar”, utilises a sequence of solutions to the star insertion problem. This idea has been considered as a heuristic for crossing minimisation in its own right (e.g. see Chimani et al ), and involves constructing an embedding in a way which is similar to the planarisation method. We begin by finding any chordless cycle of (if none exist then is acyclic and ) along with an embedding of this cycle, then iteratively perform the following:

1. Find a vertex which is not yet in , and such that there exists at least one edge in which connects to a vertex already present in . Denote by the set of all edges between and any vertices already present in .

2. Find a face of such that a crossing minimal star insertion path, into , of the star comprising of and the edges in , introduces the least number of crossings among all faces of .

3. Insert, into , the star comprising of and the edges in according to a crossing minimal star insertion path.

4. Replace each introduced crossing with a dummy vertex of degree 4 to obtain a planar graph, and compute a new combinatorial embedding. Call this new embedding and begin the next iteration.

At each step of the procedure we are building upon the embedding, one vertex at a time, until we have an embedding corresponding to some drawing of the full graph . As will be demonstrated in Section 6, this method, although still computationally efficient, is in practice slower than the circle embedding, particularly for dense instances. However, in Section 6 it will also be seen that this method tends to result in many fewer crossings, and hence substantial processing time is saved in the subsequent iterations of the main heuristic. For this reason, this is the default embedding choice in our implementation of the heuristic.

The third initial embedding scheme, which we call “spring”, comes from the area of force-directed graph drawing. In  Kamada and Kawai describe a method for drawing a graph which minimises the energy of a spring model representation of the graph. The resulting number of edge crossings is not taken into consideration in the spring model, however, especially for the case of sparse graphs, it will be demonstrated in Section 6 that the resulting drawings often provide a initial embedding with relatively few crossings. Of course, there are other force-directed graph drawing algorithms which could be used (e.g. see [18, 26]) and we make no claim here that  is the best for use in our heuristic.

It should be noted that, technically, any combinatorial embedding corresponding to a valid drawing of can serve as an initial embedding. Indeed, our heuristic could be applied as a post-processing step of the planarisation method, or any other similar heuristic which results in a valid drawing. To accommodate this, we have included in our implementation an option for user to specify their own initial combinatorial embedding, or to provide vertex coordinates for a straight-line drawing obtained from any drawing routine.

### 4.2 Minimisation schemes

There is a certain amount of freedom in the choice of how the heuristic descends towards its solution and we call these choices minimisation schemes. In particular, we discuss three possible minimisation schemes here.

The first minimisation scheme, which we call “first” works as follows. We consider vertices one at a time, in the order of their labels. In the first iteration, the first vertex considered is the one with the earliest label, and in subsequent iterations the first vertex considered is the one that follows the vertex that was re-inserted in the previous iteration. As soon as a vertex is found which can be re-inserted in such a way that the number of crossings is reduced, we fix this improved position and begin the next iteration.

The second minimisation scheme, which we call “best”, works as follows. We consider each of the vertices, and determine which should be re-inserted so as to gain the greatest reductions in crossings. Then, we fix the improved position of that vertex and begin the next iteration.

The third minimisation scheme, which we call “biggest face”, comes from an observation made during experimentation; re-inserting a vertex into the face of with the most edges (the ‘biggest face’) often provides an improvement. Intuitively this makes sense as the biggest face is ‘close’ to a relatively large number of vertices. This scheme allows for a significant speed increase during the early iterations because we may assume that the vertex can be placed in the biggest face and then find the shortest paths only once, using the dual vertex corresponding to the biggest face as the source, as opposed to the other schemes which require shortest paths to be computed up to times. As will be shown in Section 5, computing the shortest paths is the most time-consuming process in our heuristic and hence for dense graphs, where , we gain a significant speed increase. If the biggest face does not provide an improvement, other faces can then be checked according to one of the other minimisation schemes. In our implementation, if it happens that the biggest face does not provide an improvement for a certain number (specified by the user) of consecutive iterations, we stop checking the biggest face first and instead continue with a different minimisation scheme from that point forward.

### 4.3 Efficiently handling the dual graph

In each iteration, and for each vertex considered, the steps of the heuristic require the dual graph of the current embedding minus one vertex. It is possible that we may need to consider many or even all of the vertices, particularly if we use the “best” minimisation scheme. It is obviously undesirable to reconstruct this dual graph for every vertex, and so we use a simple updating procedure to avoid this. We compute the dual graph once per iteration, with all vertices present. Then, each time a vertex (along with its incident edges) is deleted from , the result in the embedding is that some pairs of faces (on either side of the planarised edges being deleted) are merged into single faces. In the computed dual graph, this corresponds to contracting the dual edge connecting the two faces on either side of each of these planarised edges (see Figure 2). Recall that each edge of the embedding corresponds precisely to an edge of the dual graph. We keep these edge indices consistent in our implementation to help simplify the above process. Figure 2: Edge deletions and the corresponding edge contractions in the dual graph. Dual edges are dashed and dual vertices are squares. Vertex v is to be deleted from G. The middle picture has v deleted and the thick dashed edges are the dual edges which are to be contracted. The right picture is the result after multi-edges have been discarded.

### 4.4 Pre- and post-processing schemes

Pre-processing schemes for crossing number heuristics are well understood and are reported in  and . We briefly outline the usual pre-processing schemes. The crossing number of a disconnected graph is the sum of crossing numbers of each of its connected components. Similarly, the crossing number of a 1-connected graph is the sum of crossing numbers of its maximal bi-connected components. Therefore, we can decompose any input graph into its biconnected components and handle them individually. One benefit is that this allows us to assume that any graph submitted to our heuristic is biconnected, and hence we can assume that any graph with one vertex removed is connected.

Due to the desire to compare our heuristic to current methods, we have not currently implemented any post-processing schemes. However several effective post-processing strategies are discussed in  and could be appended to our heuristic if desired.

### 4.5 Data structures

To store a combinatorial embedding , a list structure containing the following information is utilised: For each edge , this list stores and along with four indices; the edge index of the edge immediately clockwise from around vertex , the edge index of the edge immediately anti-clockwise from around vertex , and then likewise for vertex . An example of this list can be seen in Figure 3.

The following list structures allow for the efficient modifcations of the embedding at each iteration. The crossing order of an edge where is a list of the edges which currently cross in the order starting from the closest crossing to . Along with the crossing order list, there is the crossing orientation list. The crossing orientation is essentially the cyclic order of edges around a dummy vertex in the embedding. Suppose that within the crossing order entries of edge , we have the entry where and . Then the corresponding crossing orientation entry is stored as 1 to indicate that the order of the edges when traversing clockwise around the dummy vertex have the end-vertices , or -1 to indicate that the order is . Note that these are the only two possible orders (see Figure 4 for an example).

One difficulty arising from the combination of using these data structures and working in a fixed embedding scheme is that a pair of edges may cross each other more than once. Of course, it is known that in an optimal embedding this is never the case. However, it can arise during an intermediate step of the heuristic. If this happens, the crossing order list has no information about which entry corresponds to which crossing. To avoid this confusion, if edges and cross each other more than once, then is subdivided into a chain of edges such that none of the resulting edges cross more than once. A check is then performed in future iterations to see if the set of edges resulting from an earlier subdivision still cross any edge more than once. If not, those subdivisions are removed and the edges are merged back into a single edge. It is a simple excercise to show that by the time the heuristic concludes, all previous subdivisions have been reverted. Note that, in practice, these subdivisions are a rare occurance. Figure 3: A drawing associated with a combinatorial embedding and an example of the data structures utilised to store the embedding. For an edge e=(u,v) where u Figure 4: If e1=(u1,v1) crosses e2=(u2,v2) where u1

## 5 Runtime and implementation

We now discuss the runtime of each of the procedures and show that the iterations of the heuristic run in time where is the number of crossings in the drawing associated with the current embedding of . Pseudocode for the main loop and two subroutines are displayed in Algorithms 1-3. The code demonstrates the first minimisation scheme discussed in Section 4.2 where an improvement is taken as soon as it is found. There is a level of abstraction left in the pseudocode due to the numerous ways that one could perform the required operations; highly efficient C and MATLAB implementations of the heuristic are available at http://fhcp.edu.au/quickcross. In the discussion below we refer to the pseudocode and summarise the methods used in our implementation.

### 5.1 Implementation

First we discuss the operations involved in Algorithms 1-3. During the main loop procedure, we remove vertex and its incident edges, which possibly reduces the current number of crossings. Then, after identifying the best possible new placement for using the sip procedure, we have a new number of crossings for ’s potential placement and this number is new cr. Hence if then we have found a drawing with fewer crossings. Once an improvment has been found, is updated to reflect the new placement and this involves updating each of the data structures discussed in Section 4.5. If an edge is drawn such that it crosses some other edge multiple times, then we subdivide that edge to avoid confusion in the data structures as also discussed in Section 4.5. Similarly, if a set of edges resulting from an earlier subdivision no longer crosses any edge multiple times, then the previous subdivisions are reverted.

From the current combinatorial embedding , we compute the dual graph (which is then copied into for modifications). Step P1 of Procedure 1 asks to compute the reduced combinatorial embedding . This corresponds to removing from and a set of planarised edges. Because at this stage, it is unknown if the embedding will be utilised for the next iteration, it is quicker to instead modify the dual graph to reflect the removal of . This process is done inside of the procedure remove according to the discussion on contractions in the dual graph in Section 4.3. Later, if will be utilised for the next iteration, then it is computed, along with the new placement of .

The procedure sip solves the fixed embedding star insertion problem for the vertex (into ). The contractions in , discussed in Section 3, reduce the number of times that shortest paths need to be computed, which is the most costly process of the heuristic. Then the optimal placement for is given by newface, and shortest paths are computed once more with newface as the source vertex. The list shortest paths stores the tree paths from newface to each .

### 5.2 Runtime

In this subsection we work through the lines of the main loop

pseudocode and discuss the time complexity of each operation. The majority of the work is simple vector manipulation and so some detail is left out here. As will be seen, in each iteration, the steps performed take no more than

time.

At lines 5 and 6 we find the faces and dual graph of . This can be achieved by scanning the edges of in a clockwise manner and time required for this is .

Next, during the loop at line 7, we delete a vertex and search for a better placement for . Potentially every vertex may be tried before the algorithm moves on. So the following procedures may be repeated up to times per iteration.

In the procedure remove, which is entered at line 9, a number of edge contractions are performed. In the drawing of which is associated with the current embedding , let denote the number of crossings on the edges incident to vertex . Then the time required for the corresponding edge contractions is for each . Summing over all vertices in the aforementioned loop, this becomes a worst case of .

In the procedure sip, the contractions at line 4 can be performed in
time, and summing over all vertices, this becomes . At line 5 we find shortest paths on an unweighted planar graph (a simple breadth-first search) which can be done in time and this is repeated for each by the loop at line 2. Then, summing over all vertices, this becomes .

Back in the main loop the following procedures happen only once an improvement has been found, so only once per iteration. At line 17 we fix the new placement and update the existing data to reflect the new placement. Updating the crossing order list and crossing orientation list discussed in Section 4.5 can be performed in time. Updating the 4 clockwise and anticlockwise numbers discussed in Section 4.5 can be done in time.

Any required subdivisions are checked for at line 18 by scanning the crossings on every edge to check whether it crosses the same edge more than once. This scan can be performed in time. If a subdivision is required then the corresponding lists need to be updated and this also happens in time. Note that these subdivisions are a very rare occurance in practice and when they do occur, a check is put in place at each iteration thereafter to see if the subdivision can be undone. This additional check can be performed in time. If a subdivision is required to be undone, the corresponding lists need to be updated and this happens in time. We remark that any subdivisions do have an effect on the runtime of future iterations because they cause to grow, and bounding the time increase is difficult. Because these subdivisions are rare cases which are usually removed swiftly in subsequent iterations, we conclude that for practical purposes the additional runtime is negligible. We also remark that the total number of iterations is at most the number of crossings in the initial drawing of . Hence a naïve bound on the total runtime is where is the initial number of crossings. This emphasises the dependency between the quality of the initial drawing and the overall performance of the heuristic.

## 6 Experiments

### 6.1 Experimental setup

In this section, we consider the performance of our proposed heuristic on various sets of instances. As mentioned previously, we have implemented our heuristic in both C and MATLAB, and here we report on the C implementation, which we call QuickCross.

Each of the experiments reported on here were conducted on a 2.6GHz AMD Opteron 6282 SE with 4GB RAM and running Centos 6.7 OS. In order to compare the various schemes discussed in Section 4, each experiment is repeated nine times, once for each combination of the three initial embedding schemes (circle, planar, spring), and the three minimisation schemes (first, best, biggest face (bf)). Then, for each of these nine parameter settings, we try 100 different random permutations of the vertex labels and record the result with the least number of crossings. In such a case, we shall say that the graph was run with 100 random permutations.

We will consider four sets of instances, the first two of which contain sparse graphs, and the latter two of which contain dense graphs. In particular, the sparse instances considered are two of the sets of instances which were used for benchmarking crossing minimisation heuristics in ,  and . They are known respectively as the KnownCR graphs and the Rome graphs. The dense instances considered are sets of complete graphs, and complete bipartite graphs. We now briefly describe the experiments that will be carried out for each of the sets.

• KnownCR graphs - these are a set of instances containing between 9 and 250 vertices, first collected by Gutwenger , which can be further partitioned into four families of graphs as follows:

• : the Cartesian product of the cycle on vertices with the cycle on vertices. The instances contain graphs with and such that .

• : the cartesian product of the path on vertices with one of the 21 non-isomorphic connected graphs on 5 vertices, where denotes which of the 21. The instances contain graphs with .

• : the cartesian product of the cycle on vertices with one of the 21 non-isomorphic connected graphs on 5 vertices, where denotes which of the 21. The crossing number of these graphs are only known for some of the and only these cases are included. The instances contain graphs with .

• The Generalised Petersen graphs P and P, on vertices. We shall only use those of type P as P (studied in ) are easy for heuristics to solve as has already been observed in . The instances contain graphs with .

Unlike the other sets of instances in this section, all of the crossing numbers for the KnownCR instances are known, and hence we can compare how close QuickCross gets to the correct value for various scheme combinations. In particular, we report on the average relative deviation between the crossing numbers and the values obtained by QuickCross. In order to illustrate the work performed during the main loop of QuickCross, we also report the average relative deviation after only the initial embedding is finished. We also compare the runtimes of the various scheme combinations, separated into the time spent producing the initial embedding, and the time spent in the main loop of the heuristic. Finally, results on solution quality for other crossing minimisation heuristics have been reported in , and so we compare our results to theirs.

• Rome graphs - these are a set of 11,528 graphs which have been constructed from real-life applications, first described by Di Battista et al . They contain between 10 and 100 vertices, and are very sparse with average edge density of 1.35. The larger graphs in this set have unknown crossing numbers, since they are too large for the current exact methods to solve. Hence, it is impossible to report on how close QuickCross gets to the true crossing number. However, in  and , the largest graphs in the Rome graph set were considered, that is, the 140 graphs with exactly 100 vertices. For these graphs, the average numbers of crossings found for various crossing minimisation heuristics were reported. We compare the results of QuickCross to these values, and report on the runtimes for each of the scheme combinations. The runtimes are separated into time spent producing the initial embedding, and time spent in the main loop of the heuristic

• Complete graphs - Although the crossing number of the complete graph is not known for for , the value is conjectured, and typically assumed to be correct. We compare the nine combinations of schemes to see how close to the conjectured value each of them is able to get, for various sizes of complete graphs up to . We indicate how many crossings are obtained after the initial embedding, as well as at the conclusion of the heuristic. We also provide the runtimes, again separated into time spent producing the initial embedding, and time spent in the main loop of the heuristic.

• Complete bipartite graphs - Much like the complete graphs, the crossing number of the complete bipartite graph is only known in general for , but the value is conjectured and typically assumed to be correct. Again, we compare the nine combinations of schemes to see how close to the conjectured value they can get for values up to , and report the same data as for the Complete graphs.

It should be noted that there are two other sets of instances which were considered in , namely the AT&T graphs and the ISCA graphs. However, we have chosen not to include them in our experiments for the following reasons. First, the crossing numbers of the instances contained in them are unknown, so we cannot report on how close QuickCross gets to the crossing number. This is also true for the Rome graphs, however for that set there have been experiments reported on in  that list the average number of crossings found, and so we can do a meaningful comparison. In contrast, for the AT&T and ISCA graphs, the only reported results (e.g. see ) perform comparisons to the best solutions found by the heuristics involved within that experiment. There is no meaningful way to compare the results of QuickCross to the results in , and hence we omit these two sets of instances from consideration here.

### 6.2 KnownCR results

We partitioned the graphs into the four families described above and ran each with the 9 possible combinations of schemes. Each graph was run with 100 random permutations and the minimum found solution was compared to the actual crossing number by computing the percent relative deviation. Let denote the minimum found solution, then the percent relative deviation from is: . The average of these numbers was then taken over each of the four families of graphs and these results are displayed in Figure 5, which we now describe in detail.

For each of the nine scheme combinations in Figure 5, there are two bars displayed, specifically a light grey and a dark grey bar. The dark grey bar indicates the average percent relative deviation once the initial embedding is completed (but before the main loop of the heuristic is run), while the light grey bar indicates the average percent relative deviation at the conclusion of the heuristic. Therefore a large difference between the dark grey and light grey bars represents a large reduction in crossings achieved during the heuristic. We append the five best reported methods from  to Figure 5 for comparison. Each of these five methods also utilised 100 random permutations and then chose the minimum found solution.

We observe that for the graphs of type and , the circle embedding and the planar embedding perform very well and they outperform the other results by approximately , including those from . For the and graphs, the spring embedding performs relatively poorly. On the other hand, for the graphs of type and P, the circle and planar embeddings perform poorly and the spring embedding performs better. For the P graphs the spring embedding produced average relative deviations which are approximately equal to the best reported results in , while they are slightly worse for the graphs. The best scheme performed worse than first and bf under the same initial embedding scheme in almost all cases, with the sole exception of and planar embedding.

Runtimes were analysed by taking an average over the 100 random permutations for each graph. In Figure 6 we display the average runtimes to complete the initial embedding, while in Figure 7 we display the average runtimes for the remainder of the heuristic.

We observe that, as indicated in Section 4.1, the circle embedding computes an initial embedding the quickest, however it creates many additional crossings (see the dark grey bars in Figure 5), and consequently has a longer heuristic runtime. Alternatively, the planar embedding scheme computes an embedding almost as quick and the embedding has far fewer crossings, which results in a significantly lower heuristic runtime. Figure 5: Average percent relative deviations from the crossing numbers for four families within the KnownCR graphs. Figure 6: Average runtime to produce an initial embedding (sec.) compared to number of vertices for the KnownCR graphs. Figure 7: Average heuristic runtime per random permutation (sec.) compared to number of vertices for the KnownCR graphs.

### 6.3 Rome graphs

For the 140 graphs on 100 vertices in the Rome graph set, we repeat two experiments that have been previously performed in . In the first experiment, for each of the 9 possible combinations of schemes, each graph was run with 100 random permutations. In the second experiment, the number of random permutations is increased to 500. In each case, we record the smallest number of crossings found for each graph, and report on the average minimum number of crossings over the 140 graphs in Tables 1 and 2 respectively. In order to demonstrate the work performed by the main part of the heuristic, we provide the number of crossings at the completion of the initial embeddings as well. We also include the numbers obtained from the previous experiments in  for comparison of solution quality.

In addition, we include average runtimes in Table 1, separated into time spent in the main part of the heuristic, and time spent in the initial embedding. Since the experiments in  are from 2010, it is not meaningful to compare runtimes, and so we report only on the runtimes of QuickCross. Also, since the average runtime after 100 permutations is effectively the same as after 500 permutations, we omit the runtimes from Table 2.

For these experiments, we observe that after 100 random permutations, the planar embedding scheme outperforms both circle and spring in both solution quality and average runtime. Interestingly, the best minimisation scheme outperforms the other minimisation schemes under the same embedding scheme in each case. This result is different to the KnownCR graphs in which the best scheme was usually the worst performing scheme. With the exception of npc-var-inc-100, every configuration compares favourably to the experiments in . After 500 random permutations, the circle,best configuration resulted in the smallest average crossings, and each of our nine schemes outperform the experiments in . We do not include the average runtimes in Table 2 as they are almost identical to those in Table 1.

Finally, we note that there was an additional experiment conducted on the Rome graphs contained in . However, in that experiment, rather than reporting on average numbers of crossings, the results were instead compared to the best known results discovered by the heuristics used in the experiment. Hence, there is no meaningful way to compare the results of QuickCross to these results.

### 6.4 Complete graphs

The crossing number of the complete graph is conjectured (e.g. see Guy ) to be equal to

Although this conjecture is widely believed to be correct, it has only been confirmed for despite considerable effort to extend the results further . We ran the graphs for . Each graph was run with 100 random permutations and the minimum found solution was compared to by computing the percent relative deviation from . These results are displayed in Tables 3 for some selected values of , and the runtimes are provided in Table 4.

For these graphs, we observe that when

was odd, every scheme combination was able to obtain a drawing with

crossings. However, when was even, each scheme reached a value which was usually very close but not equal to . The average runtime under the best scheme is significantly higher than the other minimisation schemes simply due to the vast amount of additional work required to consider every vertex each iteration.

We now briefly look at the effect of the initial embedding schemes for these instances. In Table 5, we display the percent relative deviation at the conclusion of each of the initial embedding schemes. As can be seen, the planar initial embedding scheme often provides an embedding for which the number of crossings is very close or even equal to , and hence very little additional work is required by the heuristic. Since the planar initial embedding scheme is similar in character to the planarisation method, we conclude that heuristics based on the planarisation method would also be effective for these instances. Notably, unlike the sparser KnownCR and Rome instances, for these dense instances the planar initial embedding scheme takes much longer than the other two initial embedding schemes. This is because the amount of work performed by the circle and spring initial embedding schemes depends primarily on the number of vertices, rather than edges. However, the price paid by the circle and spring initial embedding schemes schemes is that they result in many more crossings than the planar scheme, and so the heuristic has to perform much more additional work to descend to a solution; also, the individual iterations (whose time depends on the current number of crossings) take longer as well. Overall, the planar initial embedding scheme performed the best in terms of both solution quality and total execution time for these instances.

### 6.5 Complete bipartite graphs

The crossing number of the complete bipartite graph is conjectured (e.g. see Zarankiewicz ) to be equal to

We ran the graphs for . Each graph was run with 100 random permutations and the minimum found solution was compared to . For the sake of space, we only report on the cases where and are multiples of five. As can be seen in Table 6, QuickCross was successful in obtaining the conjectured optimum in all cases and for all scheme combinations, except and under the circle, best combination. We conclude that these graphs are relatively easy for this heuristic to obtain a high-quality solution, and we suspect that this is the case for other heuristic methods as well. However, although the conjectured optimum is easily reached for these graphs, the runtimes in Tables 7 and 8 are comparable to those for the complete graphs, due to edge density. Again, the planar initial embedding takes a long time compared to the other initial embedding schemes, but nonetheless results in the shortest overall execution time.

## 7 Conclusion

We have presented a new heuristic approach to minimising crossings, based on repeatedly solving the star insertion problem. There are a number of parameters and scheme choices that can be utilised and these often result in markedly different performance.

The experiments conducted consistently demonstrate that the planar initial embedding scheme results in the fastest total execution time for the heuristic, compared to the other initial embedding schemes. This appears to remain true even despite taking considerably longer to complete the initial embedding than the other schemes when the instances are dense. The planar initial embedding scheme also typically produces a high quality solution, although this depends on the character of the instance considered. In particular, we found that some highly structured sparse instances (for example, the Generalized Petersen graphs considered in Section 6.2) responded better to other initial embedding schemes, albeit at a cost to execution time.

Regarding the minimisation schemes, our experiments indicate that the first minimisation scheme typically provides the best balance between a high quality solution and a fast run-time. For very sparse graphs, the best minimisation scheme sometimes provides marginally higher quality solutions, but its relatively slow runtime makes it unsuitable for large or very dense graphs. The biggest face minimisation scheme is the most efficient in the early iterations, but our experiments indicate that the first minimisation scheme often reaches a locally optimal solution in fewer iterations, and hence it is commonly quicker.

Our experiments indicate that the heuristic performs relatively well on dense graphs, albeit with a slower runtime due to the increased edge density. However, the circle initial embedding scheme may become impractical for dense graphs since the initial number of crossings is likely to be very large, rendering the early iterations very slow.

Overall, our recommendation for practical use is to rely primarily on the planar, first setting, and if the highest quality solutions are desired, also consider the circle, first setting.

For the fixed embedding setting, this work answers the question posed by Chimani and Gutwenger in  about the performance of a heuristic based upon the star/vertex insertion problem. It would be interesting to transfer these methods into a variable embedding setting, eliminating the dependence on the initial embedding which has a significant impact on the quality of the solutions.

## Acknowledgements

We are indebted to the two anonymous referees whose thoughtful and detailed suggestions greatly improved the clarity and quality of this manuscript.

## References

•  André da Motta Salles Barreto and Helio JC Barbosa. Graph Layout Using a Genetic Algorithm. In

Proc. Sixth Brazilian Symposium on Neural Networks

, pages 179–184. IEEE, 2000.
•  Carlo Batini, Maurizio Talamo, and Roberto Tamassia. Computer aided layout of entity relationship diagrams. Journal of Systems and Software, 4(2-3):163–173, 1984.
•  Jürgen Branke, Frank Bucher, and Hartmut Schmeck. Using Genetic Algorithms for Drawing Undirected Graphs. In Proc. The Third Nordic Workshop on Genetic Algorithms and their Applications. Citeseer, 1996.
•  Christoph Buchheim, Markus Chimani, Carsten Gutwenger, Michael Jünger, and Petra Mutzel. Crossings and Planarization. In R. Tamassia, editor, Handbook of Graph Drawing and Visualization, pages 43–80. Chapman and Hall/CRC, 2013.
•  Christoph Buchheim, Dietmar Ebner, Michael Jünger, Gunnar W Klau, Petra Mutzel, and René Weiskircher. Exact Crossing Minimization. In Proc. 13th International Symposium on Graph Drawing, volume 3843 of LNCS, pages 37–48. Springer, 2005.
•  Sergio Cabello and Bojan Mohar. Crossing number and weighted crossing number of near-planar graphs. Algorithmica, 60(3):484–504, 2011.
•  Sergio Cabello and Bojan Mohar. Adding One Edge To Planar Graphs Makes Crossing Number And 1-Planarity Hard. SIAM Journal on Computing, 42(5):1803–1829, 2013.
•  Markus Chimani. Computing Crossing Numbers. PhD thesis, Technical University of Dortmund, 2008.
•  Markus Chimani and Carsten Gutwenger. Advances in the Planarization Method: Effective Multiple Edge Insertions. Journal of Graph Algorithms and Applications, 16(3):729–757, 2012.
•  Markus Chimani, Carsten Gutwenger, and Petra Mutzel. Experiments on Exact Crossing Minimization Using Column Generation. Journal of Experimental Algorithmics (JEA), 14(4):303–315, 2009.
•  Markus Chimani, Carsten Gutwenger, Petra Mutzel, and Christian Wolf. Inserting a Vertex into a Planar Graph. In Proc. of the twentieth annual ACM-SIAM symposium on Discrete algorithms, pages 375–383. Society for Industrial and Applied Mathematics, 2009.
•  Markus Chimani and Petr Hliněnỳ. A Tighter Insertion-based Approximation of the Crossing Number. In Proc. International Colloquium on Automata, Languages, and Programming, volume 1443 of LNCS, pages 122–134. Springer, 2011.
•  Markus Chimani, Petr Hliněnỳ, and Petra Mutzel. Vertex insertion approximates the crossing number of apex graphs. European Journal of Combinatorics, 33(3):326–335, 2012.
•  Julia Chuzhoy, Yury Makarychev, and Anastasios Sidiropoulos. On graph crossing number and edge planarization. In Proc. of the twenty-second annual ACM-SIAM symposium on Discrete algorithms, pages 1050–1069. SIAM, 2011.
•  Hubert De Fraysseix, János Pach, and Richard Pollack. How to draw a planar graph on a grid. Combinatorica, 10(1):41–51, 1990.
•  Giuseppe Di Battista, Ashim Garg, Giuseppe Liotta, Roberto Tamassia, Emanuele Tassinari, and Francesco Vargiu. An experimental comparison of four graph drawing algorithms. Computational Geometry, 7(5-6):303–325, 1997.
•  Hristo N Djidjev. A Linear-Time Algorithm For Finding a Maximal Planar Subgraph. SIAM journal on Discrete Mathematics, 20(2):444–462, 2006.
•  Peter Eades. A Heuristic for Graph Drawing. Congressus Numerantium, 42:149–160, 1984.
•  Timo Eloranta and Erkki Mäkinen. TimGA: A Genetic Algorithm for Drawing Undirected Graphs. Divulgaciones Matematicas, 9(2):155–171, 2001.
•  Geoffrey Exoo, Frank Harary, and Jerald Kabell. The crossing numbers of some generalized petersen graphs. Mathematica Scandinavica, 43:184–188, 1981.
•  Michael R Garey and David S Johnson. Crossing Number is NP-Complete. SIAM Journal on Algebraic Discrete Methods, 4(3):312–316, 1983.
•  Carsten Gutwenger and Petra Mutzel. An Experimental Study of Crossing Minimization Heuristics. In Proc. 11th International Symposium on Graph Drawing, volume 2912 of LNCS, pages 13–24. Springer, 2003.
•  Carsten Gutwenger, Petra Mutzel, and René Weiskircher. Inserting an Edge into a Planar Graph. Algorithmica, 41(4):289–308, 2005.
•  Cartsten Gutwenger. Application of SPQR-Trees in the Planarization Approach for Drawing Graphs. PhD thesis, Technical University of Dortmund, 2010.
•  Richard K Guy. A combinatorial problem. Nabla (Bulletin of the Malayan Mathematical Society), 7:68–72, 1960.
•  Stefan Hachul and Michael Jünger. Drawing Large Graphs with a Potential-Field-Based Multilevel Algorithm. In Proc. 12th International Symposium on Graph Drawing, volume 3383 of LNCS, pages 285–295. Springer, 2004.
•  Petr Hliněnỳ and Gelasio Salazar. On the Crossing Number of Almost Planar Graphs. In Proc. 14th International Symposium on Graph Drawing, volume 4372 of LNCS, pages 162–173. Springer, 2006.
•  Tomihisa Kamada and Satoru Kawai. An algorithm for drawing general undirected graphs. Information Processing Letters, 31(1):7–15, 1989.
•  P.C Liu and R.V Geldmacher. On the deletion of nonplanar edges of a graph. Congressus Numerantium, 24(1):729–738, 1979.
•  Shengjun Pan and R Bruce Richter. The crossing number of is 100. Journal of Graph Theory, 56(2):128–134, 2007.
•  Marcel Radermacher and Ignaz Rutter. Inserting an Edge into a Geometric Embedding. In Proc. 26th International Symposium on Graph Drawing, volume 11282 of LNCS, pages 402–415. Springer, 2018.
•  Walter Schnyder. Embedding planar graphs on the grid. In Proc. of the first annual ACM-SIAM symposium on Discrete algorithms, pages 138–148. Society for Industrial and Applied Mathematics, 1990.
•  Paul Turán. A note of welcome. Journal of Graph Theory, 1(1):7–9, 1977.
•  Imrich Vrt’o. Crossing Numbers of Graphs: A Bibliography. Available electronically at ftp://ifi.savba.sk/pub/imrich/crobib.pdf, 2008.
•  Casimir Zarankiewicz. On a problem of P. Turán concerning graphs. Fundamenta Mathematicae, 1(41):137–145, 1955.
•  Thomas Ziegler. Crossing Minimization in Automatic Graph Drawing. PhD thesis, Saarland University, Germany, 2001.