# Objectives

• Learn the different ways of measuring node centrality
• Understand the reasons for choosing one measure over another
• Learn the computational obstacles involved in some measures of centrality

# Introduction

We know from experience that different people have different positions within a group, community, or organization. Furthermore, some positions are more essential to aspects of the organization and functioning of a community than others. The concept of centrality provides us a way of quantifying the importance of individual nodes in a network. While these measures use only the connectivity of the network (the patterns of edges), in many cases centrality can be an enlightening approximation for the actual importance of an individual to the group to which they belong. Because importance can be difficult to directly measure, centrality has become a valuable way of assessing node importance in an objective way.

The problem of assessing importance is made harder by the fact that importance can mean many things. This problem remains when we consider network centrality. The fundamental idea is to identify nodes that are deeply embedded in the network, important to the overall connectivity of the network, as highly central. This, however, is still too vague to be operationalized. In the following sections, we will examine a number of different ways of defining and implementing "embeddedness" in the network. Depending on the kind of importance you are interested in, one measure will often be more valuable than another.

There are two related, but distinct camps to which centrality measures belong. Some use only information about the immediate neighborhood of a node to determine it's importance. These methods implement the idea that a node's local connectivity tells us about its embeddedness. The other camp considers centrality to be an issue of proximity to other parts of the network. In this way of thinking, distances to various parts of the network reveal how important a node is to the network.

# Neighborhood-based Methods

## Degree Centrality

Degree centrality is the simplest of all centrality measures. It implements the quite intuitive idea that more important people know more people (i.e., have more connections to other people). Thus, the degree centrality measure is defined as

$C_D(v) = \frac{k_v}{\sum_{u\in V} k_u}$

where $k_v$ is the degree (number of edges) of node v. $C_D(v)$ will range between 0 and 1 - being larger for nodes that have higher degree.

In addition to being simple, degree centrality is also very fast to compute since it only requires visiting each node and counting the number of edges it has. We'll see that all other measures of centrality are significantly more computationally expensive than this one. This explains why, particularly for large networks, degree centrality is a favored measure.

## Eigenvector Centrality

While degree centrality is intuitively appealing, it has a glaring flaw: it assumes that connections to different people are equally valuable. Consider the utility of a poorly connected person (a homeless beggar) versus a well-connected person (a powerful CEO). Are these connections of equal utility? Under most circumstances, no. We'd like to modify degree centrality to take this into account - assigning greater importance to connections that lead to other well-connected individuals.

This modification is called eigenvector centrality and is the basis for, among other things, the algorithm that Google uses to rank the importance of web pages. We don't go into the math for this measure here. However, it can be a very useful measure to use.

# References

<references group=""></references>

• [[Def: ]]
• [[Def: ]]
• [[Def: ]]