Sunday, January 31, 2016

What are N-Grams?

N-grams of text are widely used in text mining and natural language processing tasks. They are essentially sets of co-occurring words within a given window; when computing n-grams you typically move the window one word forward at a time (although you can move X words forward in more advanced scenarios). For example, take the sentence "The cow jumps over the moon". If N=2 (known as bigrams), then the n-grams would be: -
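
The sliding window described above can be sketched in a few lines of Python. This is only a minimal illustration, not the original article's code: the function name `ngrams`, its `step` parameter (standing in for the "X words forward" case), and plain whitespace tokenization are all assumptions made for the example.

```python
def ngrams(text, n=2, step=1):
    """Return word n-grams of `text` as space-joined strings.

    Assumptions (not from the original post): tokens are obtained by
    splitting on whitespace, and `step` is how many words the window
    advances each time (1 is the typical case).
    """
    if n < 1 or step < 1:
        raise ValueError("n and step must be positive integers")
    tokens = text.split()
    # Slide a window of n tokens across the sentence, advancing by `step`.
    return [
        " ".join(tokens[i:i + n])
        for i in range(0, len(tokens) - n + 1, step)
    ]


bigrams = ngrams("The cow jumps over the moon", n=2)
print(bigrams)
# ['The cow', 'cow jumps', 'jumps over', 'over the', 'the moon']
```

A sentence of K words yields K - N + 1 n-grams when the window moves one word at a time, so the six-word sentence here gives five bigrams. A sentence shorter than N words yields none.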

See more at: http://www.text-analytics101.com/2014/11/what-are-n-grams.html#sthash.BPbGDKfN.dpuf

