Creating And Digitizing Language Corpora

eBook Download

BOOK EXCERPT:

This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to the importance of developing standards for enriching and preserving other types of corpus data, such as that which captures the nuances of regional dialects, for example. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.

Product Details :

Genre : Language Arts & Disciplines
Author : Karen P. Corrigan
Publisher : Springer
Release : 2016-09-19
File : 378 Pages
ISBN-13 : 9781137386458


Creating And Digitizing Language Corpora

eBook Download

BOOK EXCERPT:

A range of electronic corpora has become accessible via the WWW and CD-ROM. This coincides with improvements in standards governing the collecting, encoding and archiving of such data. This book develops similar standards for enriching and preserving 'unconventional' data': the fragmentary texts and voices left to us as accidents of history.

Product Details :

Genre : Language Arts & Disciplines
Author : J. Beal
Publisher : Springer
Release : 2007-07-12
File : 270 Pages
ISBN-13 : 9780230223202


Creating And Digitizing Language Corpora

eBook Download

BOOK EXCERPT:

A range of electronic corpora is increasingly accessible via the WWW and CD-ROM. This development coincided with improved standards governing the collecting, encoding and archiving of such data. This book looks at developing similar standards for enriching and preserving unconventional data: dialects, child language and bilingual databases.

Product Details :

Genre : Language Arts & Disciplines
Author : J. Beal
Publisher : Springer
Release : 2007-06-27
File : 266 Pages
ISBN-13 : 9780230223936


Corpus Design And Construction In Minoritised Language Contexts Cynllunio A Chreu Corpws Mewn Cyd Destunau Ieithoedd Lleiafrifoledig

eBook Download

BOOK EXCERPT:

This bilingual book provides a detailed overview of the project to construct a National Corpus of Contemporary Welsh (CorCenCC), addressing the conceptual and methodological challenges faced when developing language corpora for minoritised languages. A conceptual framework is presented for the user-driven design that underpinned the CorCenCC project, along with a detailed blueprint that can function as a scaffold for other researchers embarking on projects of this nature. This book will be of value to those working in language teaching, learning and assessment, language policy and planning, translation, corpus linguistics and language technology, and to anyone with an interest in Welsh and other minoritised languages. Mae'r llyfr dwyieithog hwn yn rhoi trosolwg manwl o'r prosiect i greu Corpws Cenedlaethol Cymraeg Cyfoes (CorCenCC), ac yn mynd i'r afael â'r heriau cysyniadol a methodolegol a wynebir wrth ddatblygu corpora iaith ar gyfer ieithoedd lleiafrifoledig. Cyflwynir fframwaith cysyniadol ar gyfer y cynllun wedi'i yrru gan ddefnyddwyr sy'n greiddiol i brosiect CorCenCC, ynghyd â glasbrint manwl a all weithredu fel sgaffald i ymchwilwyr eraill sy'n dechrau ar brosiectau o'r fath. Bydd y llyfr hwn o werth i'r rhai sy'n gweithio ym meysydd addysgu, dysgu ac asesu ieithoedd, polisi iaith a chynllunio ieithyddol, cyfieithu, ieithyddiaeth gorpws a thechnoleg iaith, ac unrhyw un â diddordeb yn y Gymraeg ac ieithoedd lleiafrifoledig eraill.

Product Details :

Genre : Language Arts & Disciplines
Author : Dawn Knight
Publisher : Springer Nature
Release : 2021-07-05
File : 178 Pages
ISBN-13 : 9783030724849


The Open Handbook Of Linguistic Data Management

eBook Download

BOOK EXCERPT:

A guide to principles and methods for the management, archiving, sharing, and citing of linguistic research data, especially digital data. "Doing language science" depends on collecting, transcribing, annotating, analyzing, storing, and sharing linguistic research data. This volume offers a guide to linguistic data management, engaging with current trends toward the transformation of linguistics into a more data-driven and reproducible scientific endeavor. It offers both principles and methods, presenting the conceptual foundations of linguistic data management and a series of case studies, each of which demonstrates a concrete application of abstract principles in a current practice. In part 1, contributors bring together knowledge from information science, archiving, and data stewardship relevant to linguistic data management. Topics covered include implementation principles, archiving data, finding and using datasets, and the valuation of time and effort involved in data management. Part 2 presents snapshots of practices across various subfields, with each chapter presenting a unique data management project with generalizable guidance for researchers. The Open Handbook of Linguistic Data Management is an essential addition to the toolkit of every linguist, guiding researchers toward making their data FAIR: Findable, Accessible, Interoperable, and Reusable.

Product Details :

Genre : Language Arts & Disciplines
Author : Andrea L. Berez-Kroeker
Publisher : MIT Press
Release : 2022-01-18
File : 687 Pages
ISBN-13 : 9780262045261


Crossing Boundaries Through Corpora

eBook Download

BOOK EXCERPT:

This volume illustrates new trends in corpus linguistics and shows how corpus approaches can be used to investigate new datasets and emerging areas in linguistics and related fields. It addresses innovative research questions, for example how prosodic analyses can increase the accuracy of syntactic segmentation, how tolerant English language teachers are about language variation, or how natural language can be translated into corpus query language. The thematic scope encompasses four types of ‘boundary crossings’. These include the incorporation of innovative scientific methods, specifically new statistical techniques, acoustic analysis and stylistic investigations. Additionally, temporal boundaries are crossed through the use of new methods and corpora to study diachronic data. New methodologies are also explored through the analysis of prosody, variety-specific approaches, and teacher attitudes. Finally, corpus users can cross boundaries by employing a more user-friendly corpus query language.

Product Details :

Genre : Language Arts & Disciplines
Author : Sarah Buschfeld
Publisher : John Benjamins Publishing Company
Release : 2024-10-15
File : 273 Pages
ISBN-13 : 9789027246486


Corpus Linguistics

eBook Download

BOOK EXCERPT:

Throughout history, linguists and literary scholars have been impelled by curiosity about particular linguistic or literary phenomena to seek to observe them in action in original texts. The fruits of each earlier enquiry in turn nourish the desire to continue to acquire knowledge, through further observation of newer linguistic facts. As time goes by, the corpus linguist operates increasingly in the awareness of what has gone before. Corpus Linguistics, thirty years on, is less an innocent sortie into corpus territory on the basis of a hunch than an informed, critical reassessment of existing analytical orthodoxy, in the light of new data coming on stream. This volume comprises twenty-two articles penned by members of the ICAME (International Computer Archive of Modern and Mediaeval English) association, which together provide a critical and informed reappraisal of the facts, data, methods and tools of Corpus Linguistics which are available today. Authors reconsider the boundaries of the discipline, exploring its areas of commonality with Sociolinguistics, Language Variation, Discourse Linguistics, and Lexical Statistics and showing how that commonality is potentially of immense benefit to practitioners in the fields concerned. The volume culminates in the report of a timely and novel expert panel discussion on the role of Corpus Linguistics in the study of English as a global language. This encompasses issues such as English as an international lingua franca, ‘norms’ for global English, and the question of ‘ownership’, or who qualifies as a native speaker.

Product Details :

Genre : Language Arts & Disciplines
Author :
Publisher : BRILL
Release : 2015-06-29
File : 470 Pages
ISBN-13 : 9789042025981


Data Collection In Sociolinguistics

eBook Download

BOOK EXCERPT:

The second edition of Data Collection in Sociolinguistics: Methods and Applications continues to provide up-to-date, succinct, relevant, and informative discussion about methods of data collection in sociolinguistic research. Written by a range of top sociolinguists, both veteran and emerging scholars, it covers the main areas of research design, conducting research, and sharing data findings. In addition to revisions of original material, this edition includes nine new vignettes covering such topics as collecting data from social media, conducting linguistic landscape research, forensic linguistic data collection, and working with transgender communities. A companion website, http://sociolinguisticdatacollection.com, provides enhanced pedagogical features such as discussion questions, activities, end-of-chapter exercises, and contributor videos. This volume is the one-stop, go-to guide for the numerous quantitative, qualitative, and mixed methods used in sociolinguistic research; it is the ideal resource for undergraduate and graduate courses in sociolinguistic research, field methods and data collection.

Product Details :

Genre : Language Arts & Disciplines
Author : Christine Mallinson
Publisher : Routledge
Release : 2017-11-22
File : 414 Pages
ISBN-13 : 9781315535234


The Routledge Handbook Of Language And Superdiversity

eBook Download

BOOK EXCERPT:

The Routledge Handbook of Language and Superdiversity provides an accessible and authoritative overview of this growing area, the linguistic analysis of interaction in superdiverse cities. Developed as a descriptive term to account for the increasingly stratified processes and effects of migration in Western Europe, ‘superdiversity’ has the potential to contribute to an enhanced understanding of mobility, complexity, and change, with theoretical, practical, global, and methodological reach. With seven sections edited by leading names, the handbook includes 35 state-of-the art chapters from international authorities. The handbook adopts a truly interdisciplinary approach, covering: Cultural heritage Sport Law Education Business and entrepreneurship. The result is a truly comprehensive account of how people live, work and communicate in superdiverse spaces. This volume is key reading for all those engaged in the study and research of Language and Superdiversity within Applied Linguistics, Linguistic Anthropology and related areas.

Product Details :

Genre : Language Arts & Disciplines
Author : Angela Creese
Publisher : Routledge
Release : 2018-02-21
File : 547 Pages
ISBN-13 : 9781317444671


A Practical Handbook Of Corpus Linguistics

eBook Download

BOOK EXCERPT:

This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.

Product Details :

Genre : Philosophy
Author : Magali Paquot
Publisher : Springer Nature
Release : 2021-05-04
File : 686 Pages
ISBN-13 : 9783030462161