Semi-Automated Coding for Qualitative Research: A User-Centered Inquiry and Initial Prototypes

論文URL:http://dl.acm.org/citation.cfm?doid=3173574.3173922

論文アブストラクト:Qualitative researchers perform an important and painstaking data annotation process known as coding. However, much of the process can be tedious and repetitive, becoming prohibitive for large datasets. Could coding be partially automated, and should it be? To answer this question, we interviewed researchers and observed them code interview transcripts. We found that across disciplines, researchers follow several coding practices well-suited to automation. Further, researchers desire automation after having developed a codebook and coded a subset of data, particularly in extending their coding to unseen data. Researchers also require any assistive tool to be transparent about its recommendations. Based on our findings, we built prototypes to partially automate coding using simple natural language processing techniques. Our top-performing system generates coding that matches human coders on inter-rater reliability measures. We discuss implications for interface and algorithm design, meta-issues around automating qualitative research, and suggestions for future work.

日本語のまとめ:

定性的研究を行う研究者は「コーディング」と呼ばれるデータアノテーションを行う。我々は自動化に適したいくつかのコーディング手法に従うことを発見した。我々はコーディングを自動化するプロトタイプを作成し、このシステムは人間と同程度のコーディングを行う。

(135文字)

発表スライド: