Qualitative Coding in the Computational Era: A Hybrid Approach to Improve Reliability and Reduce Effort for Coding Ethnographic Interviews
dc.contributor.author | Li, Zhuofan | |
dc.contributor.author | Dohan, Daniel | |
dc.contributor.author | Abramson, Corey M. | |
dc.date.accessioned | 2021-12-09T01:55:44Z | |
dc.date.available | 2021-12-09T01:55:44Z | |
dc.date.issued | 2021-12-06 | |
dc.identifier.citation | Zhuofan Li, Daniel Dohan, and Corey M Abramson. 2021. “Qualitative Coding in the Computational Era: A Hybrid Approach to Improve Reliability and Reduce Effort for Coding Ethnographic Interviews.” Socius 7. https://doi.org/10.1177/2378023121106 2345 | en_US |
dc.identifier.issn | 2378-0231 | |
dc.identifier.doi | 10.1177/23780231211062345 | |
dc.identifier.uri | http://hdl.handle.net/10150/662481 | |
dc.description.abstract | Sociologists have argued that there is value in incorporating computational tools into qualitative research, including using machine learning to code qualitative data. Yet standard computational approaches do not neatly align with traditional qualitative practices. The authors introduce a hybrid human-machine learning approach (HHMLA) that combines a contemporary iterative approach to qualitative coding with advanced word embedding models that allow contextual interpretation beyond what can be reliably accomplished with conventional computational approaches. The results, drawn from an analysis of 87 human-coded ethnographic interview transcripts, demonstrate that HHMLA can code data sets at a fraction of the effort of human-only strategies, saving hundreds of hours labor in even modestly sized qualitative studies, while improving coding reliability. The authors conclude that HHMLA may provide a promising model for coding data sets where human-only coding would be logistically prohibitive but conventional computational approaches would be inadequate given qualitative foci. | en_US |
dc.description.sponsorship | University of Arizona (Research, Discover, and Innovation Faculty Seed Grant) National Institute of Health (DP1AG069809, R01CA152195) | en_US |
dc.language.iso | en | en_US |
dc.publisher | SAGE Publications | en_US |
dc.rights | © The Author(s) 2021. This article is distributed under the terms of the Creative Commons Attribution NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/). | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by-nc/4.0/ | en_US |
dc.subject | computational social science | en_US |
dc.subject | machine learning | en_US |
dc.subject | natural language processing | en_US |
dc.subject | coding reliability | en_US |
dc.subject | computational ethnography | en_US |
dc.subject | qualitative methods | en_US |
dc.title | Qualitative Coding in the Computational Era: A Hybrid Approach to Improve Reliability and Reduce Effort for Coding Ethnographic Interviews | en_US |
dc.type | Article | en_US |
dc.identifier.eissn | 2378-0231 | |
dc.contributor.department | University of Arizona, School of Sociology | en_US |
dc.identifier.journal | Socius | en_US |
dc.description.note | Open access journal | en_US |
dc.description.collectioninformation | This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu. | en_US |
dc.eprint.version | Final published version | en_US |
dc.identifier.pii | 10.1177/23780231211062345 | |
dc.source.journaltitle | Socius: Sociological Research for a Dynamic World | |
dc.source.volume | 7 | |
dc.source.beginpage | 237802312110623 | |
refterms.dateFOA | 2021-12-09T01:55:48Z |