Estimating Homophily in Social Networks Using Dyadic Predictions

Berry, George; Sirianni, Antonio; Weber, Ingmar; An, Jisun; Macy, Michael

doi:10.15195/v8.a14

Estimating Homophily in Social Networks Using Dyadic Predictions

By Parker Webservices on August 2, 2021 in Articles

George Berry, Antonio Sirianni, Ingmar Weber, Jisun An, Michael Macy

Sociological Science August 2, 2021
10.15195/v8.a14

Abstract

PDF (4553 views)

0 Citation

Abstract
Author Information
Supplemental Material
Process Info

Predictions of node categories are commonly used to estimate homophily and other relational properties in networks. However, little is known about the validity of using predictions for this task. We show that estimating homophily in a network is a problem of predicting categories of dyads (edges) in the graph. Homophily estimates are unbiased when predictions of dyad categories are unbiased. Node-level prediction models, such as the use of names to classify ethnicity or gender, do not generally produce unbiased predictions of dyad categories and therefore produce biased homophily estimates. Bias comes from three sources: sampling bias, correlation between model errors and node degree, and correlation between node-level model errors along dyads. We examine three methods for estimating homophily: predicting node categories, predicting dyad categories, and a hybrid “ego–alter” approach. This analysis indicates that only the dyadic prediction approach is unbiased, whereas the node-level approach produces both high bias and high overall error. We find that node-level classification performance is not a reliable indicator of accuracy for homophily. Although this article focuses on a particular version of homophily, results generalize to heterophilous cases and other dyadic measures. We conclude with suggestions for research design. Code for this article is available at https://github.com/georgeberry/autocorr.

This work is licensed under a Creative Commons Attribution 4.0 International License.

George Berry: Department of Sociology, Cornell University
E-mail: geb97@cornell.edu

Antonio Sirianni: Department of Sociology, Dartmouth College
E-mail: antonio.d.sirianni@dartmouth.edu

Ingmar Weber: Qatar Computing Research Institute
E-mail: iweber@hbku.edu.qa

Jisun An: School of Computer and Information Systems, Singapore Management University
E-mail: jisun.an@acm.org

Michael Macy: Department of Sociology, Cornell University
E-mail: mwm14@cornell.edu

Acknowledgments: We thank Thomas Davidson, Mario Molina, Pablo Barberá, Christopher Cameron, Rebecca A. Johnson, Benjamin Cornwell, and Steven Strogatz; participants in the 2020 American Sociological Association section on Mathematical Sociology; the members of the Cornell Social Dynamics Lab; and the members of the Dartmouth Junior Faculty Writing Group for helpful comments and discussions.

Supplemental Material

Citation: Berry, George, Antonio Sirianni, Ingmar Weber, Jisun An, and Michael Macy. 2021. “Estimating Homophily in Social Networks Using Dyadic Predictions.” Sociological Science 8: 285-307.
Received: January 24, 2021
Accepted: April 4, 2021
Editors: Jesper Sørensen, Filiz Garip
DOI: 10.15195/v8.a14

Homophily, Machine Learning, Networks, Quantitative Methodology

Navigation

Estimating Homophily in Social Networks Using Dyadic Predictions

Sociological Science August 2, 2021
10.15195/v8.a14

No reactions yet.

Write a Reaction Click here to cancel reply.

Navigation

Sociological Science August 2, 2021 10.15195/v8.a14

Abstract

No reactions yet.

Write a Reaction Click here to cancel reply.

Sociological Science August 2, 2021
10.15195/v8.a14