Hrvatska znanstvena Sekcija img
3 gif
 About the project
4 gif
Basic search
Advanced search
Statistical data
Other bibliographies
Similar projects
 Catalogues and databases

Bibliographic record number: 771082


Authors: Schatten, Markus; Ševa, Jurica; Okreša Đurić, Bogdan
Title: Big Data Analytics and the Social Web - A Tutorial for the Social Scientist
Source: European Quarterly of Political Attitudes and Mentalities (EQPAM) (2285-4916) 4 (2015), 3; 30-81
Paper type: article
Keywords: big data analytics; social web; web mining; social and conceptual network analysis; natural language processing; social science; Croatian political blogging site
The social web or web 2.0 has become the biggest and most accessible repository of data about human (social) behaviour in history. Due to a knowledge gap between big data analytics and established social science methodology, this enormous source of information, has yet to be exploited for new and interesting studies in various social and humanities related fields. To make one step towards closing this gap, we provide a detailed step-by-step tutorial on some of the most important web mining and analytics methods on a real-world study of Croatia’s biggest political blogging site. The tutorial covers methods for data retrieval, data conversion, cleansing and organization, data analysis (natural language processing, social and conceptual network analysis) as well as data visualization and interpretation. All tools that have been implemented for the sake of this study, data sets through the various steps as well as resulting visualizations have been published on-line and are free to use. The tutorial is not meant to be a comprehensive overview and detailed description of all possible ways of analyzing data from the social web, but using the steps outlined herein one can certainly reproduce the results of the study or use the same or similar methodology for other datasets. Results of the study show that a special kind of conceptual network generated by natural language processing of articles on the blogging site, namely a conceptual network constructed by the rule that two concepts (keywords) are connected if they were extracted from the same article, seem to be the best predictor of the current political discourse in Croatia when compared to the other constructed conceptual networks. These results indicate that a comprehensive study has to be made to investigate this conceptual structure further with an accent on the dynamic processes that have lead to the construction of the network.
Project / theme: HRZZ-UIP-2013-11-8537
Original language: ENG
Category: Znanstveni
Research fields:
Political science,Information and communication sciences,Sociology
Full paper text: 771082.Big_Data_Analytics_and_the_Social_Web_-_Tutorial.pdf (tekst priložen 25. Srp. 2015. u 17:26 sati)
URL cjelovitog rada:
Journal in electronic form only:: DA
Google Scholar: Big Data Analytics and the Social Web - A Tutorial for the Social Scientist
Contrib. to CROSBI by: Markus Schatten (, 25. Srp. 2015. u 17:26 sati

  Print version   za tiskati