Estimating Homophily in Social Networks Using Dyadic Predictions

George Berry, Antonio Sirianni, Ingmar Weber, Jisun An, Michael Macy

Sociological Science August 2, 2021
10.15195/v8.a14


Predictions of node categories are commonly used to estimate homophily and other relational properties in networks. However, little is known about the validity of using predictions for this task. We show that estimating homophily in a network is a problem of predicting categories of dyads (edges) in the graph. Homophily estimates are unbiased when predictions of dyad categories are unbiased. Node-level prediction models, such as the use of names to classify ethnicity or gender, do not generally produce unbiased predictions of dyad categories and therefore produce biased homophily estimates. Bias comes from three sources: sampling bias, correlation between model errors and node degree, and correlation between node-level model errors along dyads. We examine three methods for estimating homophily: predicting node categories, predicting dyad categories, and a hybrid “ego–alter” approach. This analysis indicates that only the dyadic prediction approach is unbiased, whereas the node-level approach produces both high bias and high overall error. We find that node-level classification performance is not a reliable indicator of accuracy for homophily. Although this article focuses on a particular version of homophily, results generalize to heterophilous cases and other dyadic measures. We conclude with suggestions for research design. Code for this article is available at https://github.com/georgeberry/autocorr.
Creative Commons LicenseThis work is licensed under a Creative Commons Attribution 4.0 International License.

George Berry: Department of Sociology, Cornell University
E-mail: geb97@cornell.edu

Antonio Sirianni: Department of Sociology, Dartmouth College
E-mail: antonio.d.sirianni@dartmouth.edu

Ingmar Weber: Qatar Computing Research Institute
E-mail: iweber@hbku.edu.qa

Jisun An: School of Computer and Information Systems, Singapore Management University
E-mail: jisun.an@acm.org

Michael Macy: Department of Sociology, Cornell University
E-mail: mwm14@cornell.edu

Acknowledgments: We thank Thomas Davidson, Mario Molina, Pablo Barberá, Christopher Cameron, Rebecca A. Johnson, Benjamin Cornwell, and Steven Strogatz; participants in the 2020 American Sociological Association section on Mathematical Sociology; the members of the Cornell Social Dynamics Lab; and the members of the Dartmouth Junior Faculty Writing Group for helpful comments and discussions.

  • Citation: Berry, George, Antonio Sirianni, Ingmar Weber, Jisun An, and Michael Macy. 2021. “Estimating Homophily in Social Networks Using Dyadic Predictions.” Sociological Science 8: 285-307.
  • Received: January 24, 2021
  • Accepted: April 4, 2021
  • Editors: Jesper Sørensen, Filiz Garip
  • DOI: 10.15195/v8.a14


, , ,

No reactions yet.

Write a Reaction


The reCAPTCHA verification period has expired. Please reload the page.

SiteLock