Programming for Corpus Linguistics with Python and Dataframes

Daniel Keller (Western Kentucky University)

$96.95

Hardback

Not in-store but you can order this
How long will it take?

Availability Information

We source books from suppliers in Australia and overseas. For books we don't currently have in stock, the time it takes to get them from our suppliers can vary widely - from a few days to a few months - so we check each book with each supplier to determine the expected time it will take to be supplied to us.

We then advise you accordingly. If the time taken to get any book is too long for you, you can let us know and we will cancel or adjust your order, and refund as required.

To find out the anticipated arrival time for specific items prior to ordering, please contact us by phone or email:

Phone +61 2 9264 3111, or 1800 4 BOOKS (1800 4 26657) if outside Sydney:
option 1 Abbey's Bookshop (Crime, History, Science, Kids & more) • info@abbeys.com.au
option 2 Language Book Centre (ESL & Foreign Languages) • language@abbeys.com.au
option 3 Galaxy Bookshop (Sci-fi, Fantasy, Romance, Graphic Novels) • sf@galaxybooks.com.au

QTY:

English

Cambridge University Press
20 June 2024

Computational linguistics; Programming & scripting languages: general

Series: Elements in Corpus Linguistics

This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

By: Daniel Keller (Western Kentucky University)
Imprint: Cambridge University Press
Country of Publication: United Kingdom
Dimensions: Height: 229mm, Width: 152mm, Spine: 8mm
Weight: 306g
ISBN: 9781009486781
ISBN 10: 1009486780
Series: Elements in Corpus Linguistics
Pages: 114
Publication Date: 20 June 2024
Audience: Professional and scholarly , Undergraduate
Format: Hardback
Publisher's Status: Active

1. Data frame corpora; 2. Python basics for corpus linguistics; 3. Working with data frames; 4. Algorithms for common corpus linguistic tasks; 5. Creating data frame corpora; 6. Conclusion; References.