The analysis of n-gram does not require a reference corpus. However, if you are looking for a reference corpus for keyword analysis, you can go to Laurence Anthony's website for a word list. Www.laurenceanthony.net
These terms are used in many contexts. In corpus linguistics, they refer to the same phenomenon but "N-gram" is used more generally while "cluster" is used in WordSmith Tool. Moreover, those who use the word "cluster" seem to indicate their departure from Douglas Biber's framework of lexical bundle analysis as well (this is my observation). However, in NLP and statistics "cluster" has another meaning. It means the items that tend to co-occur together, which can be determined by some statistical tests like cluster analysis, for example.
thank you for your video and where is the reference corpus?how to download it?
The analysis of n-gram does not require a reference corpus. However, if you are looking for a reference corpus for keyword analysis, you can go to Laurence Anthony's website for a word list. Www.laurenceanthony.net
whats the difference between cluster and N-gram?
These terms are used in many contexts. In corpus linguistics, they refer to the same phenomenon but "N-gram" is used more generally while "cluster" is used in WordSmith Tool. Moreover, those who use the word "cluster" seem to indicate their departure from Douglas Biber's framework of lexical bundle analysis as well (this is my observation). However, in NLP and statistics "cluster" has another meaning. It means the items that tend to co-occur together, which can be determined by some statistical tests like cluster analysis, for example.