A context model for collecting diversity-aware data (2306.09753v1)
Abstract: Diversity-aware data are essential for robust modeling of human behavior in context. In addition, because human behavior is of interest to numerous applications, data must also be reusable across domains, to ensure diversity of interpretations. Current data collection techniques allow only a partial representation of the diversity of people and often generate data that are difficult to reuse. To fill this gap, we propose a data collection methodology, within a hybrid machine-artificial intelligence approach, and its related dataset, based on a comprehensive ontological notion of context which enables data reusability. The dataset covers a sample of 158 participants and was collected via the iLog smartphone application. It contains more than 170 GB of subjective and objective data, which come from 27 smartphone sensors and are associated with 168,095 self-reported annotations on the participants' context. The dataset is highly reusable, as demonstrated by its diverse applications.