Language Geography from Microblogging Platforms

ORAL

Abstract

Microblogging platforms have become major open source indicators for complex social interactions. With the advent of smartphones, ever-increasing mobile Internet traffic gives us an unprecedented opportunity to complement studies of complex social phenomena with real-time location information. In this work, we show that the data now accessible allows for detailed studies at different scales, ranging from country-level aggregate analysis to the analysis of linguistic communities within specific neighborhoods. The high resolution and coverage of this data permit us to investigate such issues as the linguistic homogeneity of different countries, seasonal tourism patterns within countries, and the geographical distribution of different languages in bilingual regions. This work highlights the potential of geolocalized studies of open data sources, which can provide an extremely detailed picture of language geography.

Authors

  • Delia Mocanu

    Northeastern University

  • Andrea Baronchelli

    Northeastern University, MoBS Lab - Northeastern University

  • Nicola Perra

    Northeastern University, Laboratory for the Modeling of Biological and Socio-Technical Systems, Northeastern University

  • Bruno Goncalves

    Northeastern University, Aix-Marseille Université

  • Alessandro Vespignani

    Department of Physics, College of Computer and Information Sciences, Bouvé College of Health Sciences Northeastern University, Northeastern University