This Element offers intermediate and experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes, which provide a fast, efficient, and intuitive set of methods for working with large, complex datasets such as corpora. It demonstrates principles of dataframe programming applied to CL analyses and presents complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is also presented, including methods for tokenizing, part-of-speech tagging, and lemmatizing with spaCy. Together these provide a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.
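To give a rough sense of what a dataframe corpus and a concordance might look like, here is a minimal sketch in pandas. It is not the book's algorithm: the function names `build_corpus` and `concordance`, the column names, and the sample texts are all hypothetical, and plain whitespace splitting stands in for the spaCy tokenization the description mentions.

```python
import pandas as pd


def build_corpus(texts):
    """Return a one-token-per-row dataframe from a {text_id: raw_text} dict.

    Assumption: whitespace splitting is a stand-in for a real tokenizer.
    """
    rows = []
    for text_id, raw in texts.items():
        for position, token in enumerate(raw.split()):
            rows.append({"text_id": text_id, "position": position, "token": token})
    return pd.DataFrame(rows, columns=["text_id", "position", "token"])


def concordance(corpus, node, window=3):
    """Return keyword-in-context lines for `node`, with up to `window`
    tokens of context on each side (case-insensitive match)."""
    lines = []
    # Process each text separately so context never crosses text boundaries.
    for text_id, text in corpus.groupby("text_id", sort=False):
        tokens = text.sort_values("position")["token"].tolist()
        for i, token in enumerate(tokens):
            if token.lower() == node.lower():
                lines.append({
                    "text_id": text_id,
                    "left": " ".join(tokens[max(0, i - window):i]),
                    "node": token,
                    "right": " ".join(tokens[i + 1:i + 1 + window]),
                })
    return pd.DataFrame(lines, columns=["text_id", "left", "node", "right"])


# Hypothetical two-text corpus, for illustration only.
texts = {
    "a": "the cat sat on the mat",
    "b": "a dog chased the cat away",
}
corpus = build_corpus(texts)
kwic = concordance(corpus, "cat", window=2)
print(kwic)
```

Storing one token per row means that annotations such as part-of-speech tags or lemmas could be added later as extra columns without changing the concordance logic.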
Programming for Corpus Linguistics with Python and Dataframes, by Daniel Keller.