Analyzing Refugee Migration Patterns Using Geo-tagged Tweets
Abstract
:1. Introduction
- Extract individual and aggregated trajectories that reflect refugee migration movements from crisis-ridden countries in the Middle East and Africa to Europe; and
- Identify spatio-temporal event clusters of refugee related tweets to determine likely locations of refugees along migration routes and areas of elevated tweet activities related to refugees in destination countries.
2. Related Work
3. Data and Methods
3.1. Data Extraction
3.2. Methodology
4. Trajectory Based Movement Analysis
4.1. Filtering Datasets
4.2. Extracting Movement Patterns
- Extract characteristic trajectory pointsThese include start and end points, points of significant turns and points of significant stops (i.e., pauses in the movement).
- Group extracted points by spatial proximityThe implemented cluster algorithm is capable of producing convex spatial clusters with desired spatial extents. The desired radius of a group needs to be provided as a parameter.
- Partition the areaThe study area is subdivided into Voronoi cells using centroids of groups found in Step 2 and additional points that are generated in a regular manner if they are more than twice the desired radius away from centroid points. The Voronoi cells are used as the locations for aggregating movement data and building flow maps.
- Divide trajectories into segmentsA place-based division is devised where a trajectory is represented as a sequence of Voronoi cells generated in Step 3. A trajectory is stored as a dual representation, namely as a sequence of cell visits and a sequence of moves between cells.
- Data aggregationData are aggregated in two complementary ways. First, visits to each cell are aggregated in the form of counts, statistics of durations, path lengths, etc. Second, moves between cell pairs are aggregated and for each aggregation, some summary statistics (e.g., number of elementary moves, statistics of lengths) are computed. Aggregated moves can then be represented by arrows with width proportional to the counts of elementary moves, upon which temporal, spatial, or attribute filters can be applied.Using a set of tweet sequences that already underwent regional, speed (bot), activity, and directional filters as described in Section 3.1, Figure 5 shows a generalized flow map of Twitter based trajectories that have at least one stop in Germany or Austria. Occasional arrows connecting other countries (e.g., the Netherlands and France) can also be found since the entire travel route of users crossing Germany or Austria is shown. Different aggregation filter settings were explored, using settings from other studies as guidelines [42]. A visually appealing map was achieved by allowing aggregate moves at the regional level (with segment lengths between 10 and 1000 km) and setting a minimum threshold of 10 identical moves between two cells from different trajectories to avoid cluttering. More specifically, following parameter settings were used to generate the map:
- Minimum number of trajectories per arrow = 10
- Maximum number of trajectories per arrow = 250
- Minimum angle between consecutive trajectory segments to be considered as a significant turn = 30 degrees
- Minimum distance to next position = 10 km
- Maximum distance to next position = 1000 km
4.3. User Types and Movement Patterns
5. Local Twitter Activities
5.1. Keyword Lists and Hashtag Extraction
5.2. Cluster Detection
“(1) Density-based clustering detects densely populated regions in space-time with arbitrary shape […]. The number of clusters is not pre-determined and isolated points are optionally discarded as noise, therefore this method is suited for an initial overview and detection of (significant) event candidates.”
“(2) Distance-bounded spatio-temporal event clustering […] can be applied to time-dynamic data sets (data streams) and thus can detect emerging spatio-temporal clusters and track their evolution in real-time. This method additionally reconstructs trajectories of clusters, i.e., the evolution of the centre of a cluster’s spatial footprint over time.”
5.3. Local Analysis of Activity Patterns
5.3.1. Austria
5.3.2. Germany
5.3.3. Greece
6. Summary and Discussion
Acknowledgments
Author Contributions
Conflicts of Interest
References
- Saarinen, V.; Ojala, J. The Flow towards Europe. 2017. Available online: http://www.lucify.com/the-flow-towards-europe/ (accessed on 5 July 2017).
- UNHCR. Global Trends—Forced Displacement in 2015. 2015. Available online: http://www.unhcr.org/576408cd7.pdf (accessed on 10 August 2017).
- Robinson, D. How the EU Plans to Overhaul “Dublin Regulation” on Asylum Claims. 2016. Available online: https://www.ft.com/content/d08dc262-bed1-11e5-9fdb-87b8d15baec2 (accessed on 29 July 2017).
- The Telegraph. Refugee Crisis: Many Migrants Falsely Claim to be Syrians, Germany Says as EU Tries to Ease Tensions. 2015. Available online: http://www.telegraph.co.uk/news/worldnews/europe/germany/11891219/Refugee-crisis-Many-migrants-falsely-claim-to-be-Syrians-Germany-says-as-EU-tries-to-ease-tensions.html (accessed on 29 July 2017).
- Wood, D. The Power of Maps; The Guilford Press: New York, NY, USA, 1992. [Google Scholar]
- Monmonier, M. How to Lie with Maps; The University of Chicago Press: Chicago, IL, USA, 1991. [Google Scholar]
- Quam, L.O. The Use of Maps in Propaganda. J. Geogr. 1943, 42, 21–32. [Google Scholar] [CrossRef]
- Li, L.; Goodchild, M.F.; Xu, B. Spatial, temporal, and socioeconomic patterns in the use of Twitter and Flickr. Cartogr. Geogr. Inf. Sci. 2013, 40, 61–77. [Google Scholar] [CrossRef]
- Bittner, C. Diversity in volunteered geographic information: Comparing OpenStreetMap and Wikimapia in Jerusalem. GeoJournal 2016. [Google Scholar] [CrossRef]
- Lotan, G.; Graeff, E.; Ananny, M.; Gaffney, D.; Pearce, I.; Boyd, D. The Revolutions Were Tweeted: Information Flows during the 2011 Tunisian and Egyptian Revolutions. Int. J. Commun. 2011, 5, 1375–1406. [Google Scholar]
- Pei, S.; Muchnik, L.; Andrade José, S.J.; Zheng, Z.; Makse, H.A. Searching for superspreaders of information in real-world social media. Sci. Rep. 2014, 4, 5547. [Google Scholar] [CrossRef] [PubMed]
- Graham, M.; Hale, S.A.; Gaffney, D. Where in the World Are You? Geolocation and Language Identification in Twitter. Prof. Geogr. 2014, 66, 568–578. [Google Scholar] [CrossRef]
- Azmandian, M.; Singh, K.; Gelsey, B.; Chang, Y.-H.; Maheswaran, R. Following Human Mobility Using Tweets. In Agents and Data Mining Interaction (LNCS Volume 7607); Cao, L., Zeng, Y., Symeonidis, A.L., Gorodetsky, V.I., Yu, P.S., Singh, M.P., Eds.; Springer: Berlin, Germay, 2013; pp. 139–149. [Google Scholar] [CrossRef]
- Krumm, J.; Caruana, R.; Counts, S. Learning Likely Locations. In User Modeling, Adaptation, and Personalization—Proceedings of UMAP 2013 (LNCS 7899); Carberry, S., Weibelzahl, S., Micarelli, A., Semeraro, G., Eds.; Springer: Berlin, Germay, 2013; pp. 64–76. [Google Scholar] [CrossRef]
- Valle, D.; Cvetojevic, S.; Robertson, E.P.; Reichert, B.E.; Hochmair, H.H.; Fletcher, R.J. Individual Movement Strategies Revealed through Novel Clustering of Emergent Movement Patterns. Sci. Rep. 2017, 7, 44052. [Google Scholar] [CrossRef] [PubMed]
- Lenormand, M.; Gonçalves, B.; Tugores, A.; Ramasco, J.J. Human diffusion and city influence. J. R. Soc. Interface 2015, 12, 20150473. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Hawelka, B.; Sitko, I.; Beinat, E.; Sobolevsky, S.; Kazakopoulos, P.; Ratti, C. Geo-located Twitter as proxy for global mobility patterns. Cartogr. Geogr. Inf. Sci. 2014, 41, 260–271. [Google Scholar] [CrossRef] [PubMed]
- Andrienko, N.; Andrienko, G.; Fuchs, G.; Rinzivillo, S.; Betz, H.-D. Detection, Tracking, and Visualization of Spatial Event Clusters for Real Time Monitoring. In Proceedings of the IEEE International Conference on Data Science and Advanced Analytics (DSAA); IEEE: Paris, France, 2015. [Google Scholar] [CrossRef]
- Andrienko, G.; Andrienko, N.; Bak, P.; Kisilevich, S.; Keim, D. Analysis of Community-Contributed Space- and Time-Referenced Data (Example of Flickr and Panoramio Photos). In Proceedings of the 2009 IEEE Symposium on Visual Analytics Science and Technology, Atlantic City, NJ, USA, 12–13 October 2009; pp. 213–214. [Google Scholar] [CrossRef]
- Andrienko, G.; Andrienko, N.; Bak, P.; Keim, D.; Wrobel, S. Visual Analytics of Movement; Springer: Heidelberg, Germany, 2013. [Google Scholar]
- von Landesberger, T.; Bremm, S.; Schreck, T.; Fellner, D.W. Feature-based automatic identification of interesting data segments in group movement data. Inf. Vis. 2013, 13, 190–212. [Google Scholar] [CrossRef]
- Romanillos, G.; Zaltz Austwick, M.; Ettema, D.; De Kruijf, J. Big Data and Cycling. Transp. Rev. 2016, 36, 114–133. [Google Scholar] [CrossRef]
- Alivand, M.; Hochmair, H.H.; Srinivasan, S. Analyzing how travelers choose scenic routes using route choice models. Comput. Environ. Urban Syst. 2015, 50, 41–52. [Google Scholar] [CrossRef]
- Beiró, M.G.; Panisson, A.; Tizzoni, M.; Cattuto, C. Predicting human mobility through the assimilation of social media traces into mobility models. EPJ Data Sci. 2016, 5, 30. [Google Scholar] [CrossRef]
- Sun, Y.; Fan, H.; Bakillah, M.; Zipf, A. Road-based travel recommendation using geo-tagged images. Comput. Environ. Urban Syst. 2015, 53, 110–122. [Google Scholar] [CrossRef]
- Rösler, R.; Liebig, T. Using Data from Location Based Social Networks for Urban Activity Clustering. In Geographic Information Science at the Heart of Europe; Vandenbroucke, D., Bucher, B., Crompvoets, J., Eds.; Springer International Publishing: Cham, Switzerland, 2013; pp. 55–72. [Google Scholar] [CrossRef]
- Lenormand, M.; Tugores, A.; Colet, P.; Ramasco, J.J. Tweets on the road. PLoS ONE 2014, 9, e105407. [Google Scholar] [CrossRef] [PubMed]
- Steiger, E.; de Albuquerque, J.P.; Zipf, A. An Advanced Systematic Literature Review on Spatiotemporal Analyses of Twitter Data. Trans. GIS 2015, 19, 809–834. [Google Scholar] [CrossRef]
- Senaratne, H.; Broering, A.; Schreck, T.; Lehle, D. Moving on Twitter: Using Episodic Hotspot and Drift Analysis to Detect and Characterise Spatial Trajectories. In Proceedings of the 7th ACM SIGSPATIAL International Workshop on Location-Based Social Networks; ACM Press: New York, NY, USA, 2014; pp. 23–30. [Google Scholar]
- Shelton, T.; Poorthuis, A.; Graham, M.; Zook, M. Geoforum Mapping the data shadows of Hurricane Sandy: Uncovering the sociospatial dimensions of ‘big data’. Geoforum 2014, 52, 167–179. [Google Scholar] [CrossRef]
- Crooks, A.; Croitoru, A.; Stefanidis, A.; Radzikowski, J. #Earthquake: Twitter as a Distributed Sensor System. Trans. GIS 2013, 17, 124–147. [Google Scholar] [CrossRef]
- Cassa, C.A.; Chunara, R.; Mandl, K.; Brownstein, J.S. Twitter as a Sentinel in Emergency Situations: Lessons from the Boston Marathon Explosions. PLOS Curr. Disasters 2013, 2, 1–11. [Google Scholar] [CrossRef] [PubMed]
- Sakaki, T.; Okazaki, M.; Matsuo, Y. Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors. In Proceedings of the 19th International Conference on World Wide Web; ACM: New York, NY, USA, 2010; pp. 851–860. [Google Scholar] [CrossRef]
- Zagheni, E.; Garimella, V.R.K.; Weber, I.; State, B. Inferring international and internal migration patterns from twitter data. In Proceedings of the 23rd International Conference on World Wide Web; ACM: New York, NY, USA, 2014; pp. 439–444. [Google Scholar] [CrossRef]
- Rüegger, S.; Bohnet, H. The Ethnicity of Refugees (ER): A new dataset for understanding flight patterns. Confl. Manag. Peace Sci. 2015. [Google Scholar] [CrossRef]
- Iqbal, Z. The Geo-Politics of Forced Migration in Africa, 1992—2001. Confl. Manag. Peace Sci. 2007, 24, 105–119. [Google Scholar] [CrossRef]
- Rettberg, J.W.; Gajjala, R. Terrorists or cowards: Negative portrayals of male Syrian refugees in social media. Fem. Media Stud. 2016, 16, 178–181. [Google Scholar] [CrossRef] [Green Version]
- Darwish, K.; Magdy, W. Attitudes towards Refugees in Light of the Paris Attacks. 2015. Available online: https://arxiv.org/abs/1512.04310 (accessed on 20 July 2017).
- Roesslein, J. Tweepy Documentation [Internet]. 2009. Available online: http://docs.tweepy.org/en/v3.5.0/ (accessed on 20 June 2017).
- Uddin, M.M.; Imran, M.; Sajjad, H. Understanding Types of Users on Twitter. arXiv Prepr. 2014. Available online: https://arxiv.org/abs/1406.1335 (accessed on 11 July 2017).
- Zhang, C.M.; Paxson, V. Detecting and Analyzing Automated Activity on Twitter. In Passive and Active Measurement, PAM 2011; Spring, N., Riley, G., Eds.; Springer: Berlin, Germany, 2011; pp. 102–111. [Google Scholar] [CrossRef]
- Andrienko, N.; Andrienko, G. Spatial generalisation and aggregation of massive movement data. IEEE Trans. Vis. Comput. Graph. 2011, 17, 205–219. [Google Scholar] [CrossRef] [PubMed]
- Andrienko, G.; Andrienko, N.; Wrobel, S. Visual analytics tools for analysis of movement data. ACM SIGKDD Explor. Newsl. 2007, 9, 38. [Google Scholar] [CrossRef]
- Chong, M. Sentiment analysis and topic extraction of the twitter network of #prayforparis. Proc. Assoc. Inf. Sci. Technol. 2016, 53, 1–4. [Google Scholar] [CrossRef]
- Guzman, E.; Alkadhi, R.; Seyff, N. A Needle in a Haystack: What Do Twitter Users Say about Software? In Proceedings of the 2016 IEEE 24th International Requirements Engineering Conference (RE), Beijing, China, 12–16 September 2016; pp. 96–105. [Google Scholar] [CrossRef]
- UNHCR. UNHCR Population Statistics—Data—Time Series. 2017. Available online: http://popstats.unhcr.org/en/time_series (accessed on 7 August 2017).
- Cerutti, V.; Fuchs, G.; Andrienko, G.; Andrienko, N.; Ostermann, F. Identification of Disaster-Affected Areas Using Exploratory Visual Analysis of Georeferenced Tweets: Application to a Flood Event; Association of Geographic Information Laboratories in Europe: Helsinki, Finland, 2016; p. 5. [Google Scholar]
- Andrienko, G.; Andrienko, N.; Rinzivillo, S.; Nanni, M.; Pedreschi, D.; Giannotti, F. Interactive visual clustering of large collections of trajectories. In Visual Analytics Science and Technology (VAST); IEEE: Atlantic City, NJ, USA, 2009; pp. 3–10. [Google Scholar] [CrossRef]
- The Guardian. Hungary to Take Thousands of Refugees to Austrian Border by Bus. 2015. Available online: https://www.theguardian.com/world/2015/sep/04/hundreds-refugees-march-austria-budapest-hungary-syrians (accessed on 31 July 2017).
- BBC. Migrant Crisis: Thousands Enter Slovenia after Hungary Closes Border. 2015. Available online: http://www.bbc.com/news/world-europe-34564830 (accessed on 30 July 2017).
- The Local. Few Freigners in Eastern Germany but Xenophobia is Rife. 2017. Available online: https://www.thelocal.de/20170326/few-foreigners-in-eastern-germany-but-xenophobia-is-rife (accessed on 29 July 2017).
- Leadbeater, C. Which Greek Islands are Affected by the Refugee Crisis? 2016. Available online: http://www.telegraph.co.uk/travel/destinations/europe/greece/articles/greek-islands-affected-by-refugee-crisis/ (accessed on 31 July 2017).
- Associated Newspapers Ltd. Italian Coastguard Seizes cargo Ship Carrying 600 Illegal Migrants after the Crew Programmed the Vessel to Crash into Coast before Fleeing. 2014. Available online: http://www.dailymail.co.uk/news/article-2891118/Ship-coast-Corfu-carrying-700-passengers-issues-SOS-armed-men-board.html (accessed on 22 April 2017).
- Telegraph Media Group Ltd. Mysterious Migrant “Ghost Ship” Arrives in Italy. 2014. Available online: http://www.telegraph.co.uk/news/worldnews/europe/italy/11318586/Mysterious-migrant-ghost-ship-arrives-in-Italy.html (accessed on 22 April 2017).
- ORF. Wichtige Flüchtlingsrouten. 2016. Available online: http://orf.at/stories/2307356/2307294/ (accessed on 5 August 2017).
- Lovelace, R.; Birkin, M.; Cross, P.; Clarke, M. From Big Noise to Big Data: Toward the Verification of Large Data sets for Understanding Regional Retail Flows. Geogr. Anal. 2016, 48, 59–81. [Google Scholar] [CrossRef]
- Cvetojevic, S.; Juhász, L.; Hochmair, H.H. Positional Accuracy of Twitter and Instagram Images in Urban Environments. GI_Forum 2016, 1, 191–203. [Google Scholar] [CrossRef]
- Cheng, Z.; Caverlee, J.; Lee, K. You Are Where You Tweet : A Content-Based Approach to Geo-locating Twitter Users. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, Toronto, ON, Canada, 26–30 October 2010; pp. 759–768. [Google Scholar] [CrossRef]
- Kotzias, D.; Lappas, T.; Gunopulos, D. Addressing the Sparsity of Location Information on Twitter. In Proceedings of the Workshop of the EDBT/ICDT 2014 Joint Conference, Athens, Greece, 28 March 2014. [Google Scholar]
- Sagl, G.; Delmelle, E.; Delmelle, E. Mapping collective human activity in an urban environment based on mobile phone data. Cartogr. Geogr. Inf. Sci. 2014, 41, 272–285. [Google Scholar] [CrossRef]
- Lenormand, M.; Picornell, M.; Cantú-Ros, O.G.; Tugores, A.; Louail, T.; Herranz, R.; Barthelemy, M.; Frías-Martínez, E.; Ramasco, J.J. Cross-Checking Different Sources of Mobility Information. PLoS ONE 2014, 9, e105184. [Google Scholar] [CrossRef] [PubMed]
- Lu, X.; Wrathall, D.J.; Sundsøy, P.R.; Nadiruzzaman, M.; Wetter, E.; Iqbal, A.; Qureshi, T.; Tatem, A.; Canright, G.; Engø-Monsen, K.; et al. Unveiling hidden migration and mobility patterns in climate stressed regions: A longitudinal study of six million anonymous mobile phone users in Bangladesh. Glob. Environ. Chang. 2016, 38, 1–7. [Google Scholar] [CrossRef]
- Gonzalez, M.C.; Hidalgo, C.A.; Barabasi, A.-L. Understanding individual human mobility patterns. Nature 2008, 453, 779–782. [Google Scholar] [CrossRef] [PubMed]
- Dwibhasi, S.; Jami, D.; Lanka, S. Analyzing and Visualizing the Sentiments of Ebola Outbreak Via Tweets. In Proceedings of the SAS Global Forum, Dallas, TX, USA, 26–29 April 2015; pp. 1–12. [Google Scholar]
- Mitchell, L.; Frank, M.R.; Harris, K.D.; Dodds, P.S.; Danforth, C.M. The Geography of Happiness: Connecting Twitter Sentiment and Expression, Demographics, and Objective Characteristics of Place. PLoS ONE 2013, 8, e64417. [Google Scholar] [CrossRef] [PubMed]
- Steiger, E.; Resch, B.; Zipf, A. Exploration of spatiotemporal and semantic clusters of Twitter data using unsupervised neural networks. Int. J. Geogr. Inf. Sci. 2016, 30, 1694–1716. [Google Scholar] [CrossRef]
Coefficient | Std. Err. | t | Sig. | |
---|---|---|---|---|
Constant | 0.630 | 0.009 | 6.937 | 0.000 ** |
Distance to Hungary (in 1000s of km) | −0.149 | 0.039 | −3.848 | 0.006 ** |
N | 9 | |||
Adjusted R2 | 0.633 |
© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hübl, F.; Cvetojevic, S.; Hochmair, H.; Paulus, G. Analyzing Refugee Migration Patterns Using Geo-tagged Tweets. ISPRS Int. J. Geo-Inf. 2017, 6, 302. https://doi.org/10.3390/ijgi6100302
Hübl F, Cvetojevic S, Hochmair H, Paulus G. Analyzing Refugee Migration Patterns Using Geo-tagged Tweets. ISPRS International Journal of Geo-Information. 2017; 6(10):302. https://doi.org/10.3390/ijgi6100302
Chicago/Turabian StyleHübl, Franziska, Sreten Cvetojevic, Hartwig Hochmair, and Gernot Paulus. 2017. "Analyzing Refugee Migration Patterns Using Geo-tagged Tweets" ISPRS International Journal of Geo-Information 6, no. 10: 302. https://doi.org/10.3390/ijgi6100302
APA StyleHübl, F., Cvetojevic, S., Hochmair, H., & Paulus, G. (2017). Analyzing Refugee Migration Patterns Using Geo-tagged Tweets. ISPRS International Journal of Geo-Information, 6(10), 302. https://doi.org/10.3390/ijgi6100302