Theory of computation

Aggregate (composite)

Algorithm

Image (mathematics)

Similarity (geometry)

Support vector machine

Data mining

Geodesy

Psychoanalysis

World Wide Web

Machine learning

AdaBoost

Benchmark (surveying)

Artificial intelligence

Psychology

Composite material

Geography

Materials science

Contrast (vision)

Measure (data warehouse)

Computer science

Social media

Romance

Annals of Operations Research

The purpose of this paper is to evaluate which communication types on social media are most informative for romantic tie prediction. Rather than analyzing communication as a composite measure, we take a disaggregated approach, modeling separate measures for commenting, liking, and tagging on an alter’s status updates, photos, videos, check-ins, locations, and links. To ensure that we have the best possible model, we benchmark eight classifiers using different data sampling techniques. The results indicate that romantic ties can be predicted with very high accuracy. The top-performing classification algorithm is AdaBoost, with an accuracy of up to 97.89%, an AUC of up to 97.56%, a G-mean of up to 81.81%, and an F-measure of up to 81.45%. The top drivers of romantic ties were socio-demographic similarity and the frequency and recency of commenting, liking, and tagging on photos, albums, videos, and statuses. Previous research has largely focused on aggregate measures, whereas this study focuses on disaggregate measures. To the best of our knowledge, this study is therefore the first to provide such an extensive analysis of romantic tie prediction on social media.
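The gap between the reported accuracy and the reported G-mean and F-measure is typical of imbalanced classification, where one class (here, presumably non-romantic ties) dominates. The sketch below shows how these metrics are commonly computed from a confusion matrix. It assumes the standard textbook definitions, which the abstract does not spell out, and the counts are hypothetical values chosen for illustration, not data from the paper.

```python
import math


def imbalance_metrics(tp, fn, fp, tn):
    """Compute accuracy, G-mean, and F-measure from confusion-matrix counts.

    Assumes the usual definitions:
      G-mean    = sqrt(sensitivity * specificity)
      F-measure = harmonic mean of precision and recall (F1)
    """
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    recall = tp / (tp + fn)          # sensitivity: share of positives found
    specificity = tn / (tn + fp)     # share of negatives correctly rejected
    precision = tp / (tp + fp)
    g_mean = math.sqrt(recall * specificity)
    f_measure = 2 * precision * recall / (precision + recall)
    return {
        "accuracy": accuracy,
        "g_mean": g_mean,
        "f_measure": f_measure,
    }


# Hypothetical counts: 100 positive pairs among 1,000 candidate ties.
metrics = imbalance_metrics(tp=70, fn=30, fp=10, tn=890)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

With these illustrative counts, accuracy stays high because the majority class is classified well, while G-mean and F-measure are pulled down by the missed positives. That is why such metrics are usually reported alongside accuracy when classes are imbalanced.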

Evaluating the importance of different communication types in romantic tie prediction on social media