Please use this identifier to cite or link to this item:
Title: Text Analysis with R for Students of Literature
Authors: Jockers, M Atthew L
Keywords: Statistics;Computational linguistics;Mathematical statistics;Statistics;Statistics and Computing/Statistics Programs;Computational Linguistics;Statistics for Social Science, Behavorial Science, Education, Public Policy, and Law;519.5 23;QA276-280
Issue Date: 2014
Publisher: Springer International Publishing;Imprint Springer,
place: Cham
Series/Report no.: Quantitative Methods in the Humanities and Social Sciences,;2199-0956.;Quantitative Methods in the Humanities and Social Sciences,;2199-0956.
Abstract: Text Analysis with R for Students of Literature is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological tool kit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that we simply cannot gather using traditional qualitative methods of close reading and human synthesis. Text Analysis with R for Students of Literature provides a practical introduction to computational text analysis using the open source programming language R. R is extremely popular throughout the sciences and because of its accessibility, R is now used increasingly in other research areas. Readers begin working with text right away and each chapter works through a new technique or process such that readers gain a broad exposure to core R procedures and a basic understanding of the possibilities of computational text analysis at both the micro and macro scale. Each chapter builds on the previous as readers move from small scale 'microanalysis' of single texts to large scale 'macroanalysis' of text corpora, and each chapter concludes with a set of practice exercises that reinforce and expand upon the chapter lessons. The book's focus is on making the technical palatable and making the technical useful and immediately gratifying.
Description: Printed edition:;9783319031637.
Table Of Contents: R Basics.-First Foray into Text Analysis with R.-Accessing and Comparing Word Frequency Data.-Token Distribution Analysis.-Correlation -- Measures of Lexical Variety -- Hapax Richness -- Do It KWIC -- Do It KWIC (Better) -- Text Quality, Text Variety, and Parsing XML -- Clustering -- Classification -- Topic Modeling -- Appendix A: Variable Scope Example -- Appendix B: The LDA Buffet -- Appendix C: Code Repository -- Appendix D: R Resources -- Practice Exercise Solutions -- Index.
ISBN: 9783319031644
9783319031637 (print)
Format: [electronic resource] /
Type Of Material: Book
Appears in Collections:Mathematics

Files in This Item:
File Description SizeFormat 
9783319031644.pdf2.21 MBAdobe PDFThumbnail
Download    Request a copy

Items in HannanDL are protected by copyright, with all rights reserved, unless otherwise indicated.