Linguistics

Library and Research Help

Milan Simić
He/Him/His
Contact:
Walter C. Koerner Library
1958 Main Mall
Vancouver BC V6T 1Z2
604-822-3748

Corpora

A corpus (pl. corpora) is a collection of written or spoken material in machine-readable form, assembled to study linguistic structures.

Corpora are generally assembled according to predefined criteria that fit their intended purpose, such as studying linguistic structures, machine translation, or natural language processing. Building a corpus is a time-consuming task.

This section introduces the following resources:


The following titles provide a good starting point for students who are new to corpus-based research: