Developing a corpus of the nursing literature: A pilot study
Article first published online: 15 MAY 2007
DOI: 10.1111/j.1742-7924.2007.00071.x
Additional Information
How to Cite
BUDGELL, B., MIYAZAKI, M., O’BRIEN, M., PERKINS, R. and TANAKA, Y. (2007), Developing a corpus of the nursing literature: A pilot study. Japan Journal of Nursing Science, 4: 21–25. doi: 10.1111/j.1742-7924.2007.00071.x
Publication History
- Issue published online: 15 MAY 2007
- Article first published online: 15 MAY 2007
- Received 13 December 2006; accepted 7 February 2007.
- Abstract
- Article
- References
- Cited By
Keywords:
- communication;
- literacy;
- nursing education;
- nursing research;
- nursing students
Abstract
Aim: The purpose of this project was to develop and analyze a pilot corpus of the nursing literature.
Methods: The first issues for 2005 of six representative nursing journals were used to create a corpus of ≈ 250,000 words. This corpus was analyzed for word frequency using a concordance software package.
Results: The process of developing and analyzing the corpus took the equivalent of ≈ 2 weeks’ work for a single person. Approximately 7000 unique words were identified and sorted according to frequency.
Conclusions: In terms of time and cost, the development and analysis of a corpus is an efficient way of identifying the vocabulary which the nursing profession uses in its research publications. Even this brief pilot exercise was able to identify previously unexpected patterns of usage.

1742-7924/asset/JJNS_left.gif?v=1&s=394e77ddbe109cd2cdc1a571e3c575c65251c72a)
1742-7924/asset/JJNS_right.gif?v=1&s=120e497f08ceb97a927b0598fc61b720ae2e5c72)
