4. CORPUS LINGUISTIC APPROACHES FOR DISCOURSE ANALYSIS
This chapter provides an overview of approaches within corpus linguistics that address discourse-level phenomena. It first reviews the characteristics shared by all corpus-based research. It then covers four major approaches: (1) investigating characteristics associated with the use of a language feature, for example, analyzing the factors that affect the omission or retention of "that" in complement clauses; (2) examining the realizations of a particular function of language, such as describing all the constructions used in English to express stance; (3) characterizing a variety of language, for example, conducting a multi-dimensional analysis to investigate relationships among the registers used in different settings at universities; and (4) mapping the occurrences of a feature through entire texts, for example, tracing how writers refer to themselves and their audience as they construct authority in memos. For each approach, a variety of studies are reviewed to illustrate the diverse perspectives that corpus linguistics can bring to our understanding of discourse. The chapter concludes with a brief overview of some other foci in corpus linguistics and suggests that two needs require particular attention for the advancement of discourse-oriented corpus studies: more computer tools and computer programmers for corpus linguistics, and further studies of how best to represent language varieties in a corpus.